Human Expertise for AI Red-Teaming and Scalable Evaluation

  • Alice Qian ,
  • Srravya Chandhiramowuli ,
  • Laura A. Dabbish ,
  • Hong Shen ,
  • Alex S Taylor ,
  • Ding Wang ,
  • Theodora Skeadas ,
  • Bolor-Erdene Jagdagdorj

CHI 2026 |

Rapid adoption of generative AI has outpaced the infrastructure needed to red team systems responsibly. This workshop tackles a core tension: scaling AI red teaming while centering human expertise and well-being. We convene academic, industry, and nonprofit practitioners for two threads. (A) Vision: surface high-level goals and principles for effective, humane red teaming. (B) Build: identify opportunities to support human-AI red teaming, such as scenario libraries, role prompts for red teamers, and calibration methods that align automated efforts with human expertise. Through this workshop, we will develop a vision for the future of effective AI red teaming that leverages and protects human expertise while meeting the needs of evaluation at scale.