Human Expertise for AI Red-Teaming and Scalable Evaluation
- Alice Qian ,
- Srravya Chandhiramowuli ,
- Laura A. Dabbish ,
- Hong Shen ,
- Alex S Taylor ,
- Ding Wang ,
- Theodora Skeadas ,
- Bolor-Erdene Jagdagdorj
CHI 2026 |
Rapid adoption of generative AI has outpaced the infrastructure needed to red team systems responsibly. This workshop tackles a core tension: scaling AI red teaming while centering human expertise and well-being. We convene academic, industry, and nonprofit practitioners for two threads. (A) Vision: surface high-level goals and principles for effective, humane red teaming. (B) Build: identify opportunities to support human-AI red teaming, such as scenario libraries, role prompts for red teamers, and calibration methods that align automated efforts with human expertise. Through this workshop, we will develop a vision for the future of effective AI red teaming that leverages and protects human expertise while meeting the needs of evaluation at scale.