Designing Staged Evaluation Workflows for LLMs: Integrating Domain Experts, Lay Users, and Model-Generated Evaluation Criteria

Published in Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 2026), 2026