Human Expertise for AI Red-Teaming and Scalable Evaluation

Alice Qian; Srravya Chandhiramowuli; Laura A. Dabbish; Hong Shen; Alex S Taylor; Ding Wang; Theodora Skeadas; Bolor-Erdene Jagdagdorj

Human Expertise for AI Red-Teaming and Scalable Evaluation

Alice Qian ,
Srravya Chandhiramowuli ,
Laura A. Dabbish ,
Hong Shen ,
Alex S Taylor ,
Ding Wang ,
Theodora Skeadas ,
Bolor-Erdene Jagdagdorj

CHI 2026 | April 2026

Download BibTex

Rapid adoption of generative AI has outpaced the infrastructure needed to red team systems responsibly. This workshop tackles a core tension: scaling AI red teaming while centering human expertise and well-being. We convene academic, industry, and nonprofit practitioners for two threads. (A) Vision: surface high-level goals and principles for effective, humane red teaming. (B) Build: identify opportunities to support human-AI red teaming, such as scenario libraries, role prompts for red teamers, and calibration methods that align automated efforts with human expertise. Through this workshop, we will develop a vision for the future of effective AI red teaming that leverages and protects human expertise while meeting the needs of evaluation at scale.