Nouvelles et reportages
BlueCodeAgent: A blue teaming agent enabled by automated red teaming for CodeGen AI
| Chengquan Guo , Yuzhou Nie, Chulin Xie, Zinan Lin, Wenbo Guo, et Bo Li
BlueCodeAgent is an end-to-end blue-teaming framework built to boost code security using automated red-teaming processes, data, and safety rules to guide LLMs’ defensive decisions. Dynamic testing reduces false positives in vulnerability detection.
Prix | ACM SIGMICRO
Esha Choukse receives 2025 SIGMICRO Early Career Award
Choukse was recognized for her foundational contributions to hardware memory compression and to sustainable and efficient datacenter systems.
RedCodeAgent: Automatic red-teaming agent against diverse code agents
| Chengquan Guo , Chulin Xie, Yu Yang, Zhaorun Chen, Zinan Lin, Xander Davies, Yarin Gal, Dawn Song, et Bo Li
Code agents help streamline software development workflows, but may also introduce critical security risks. Learn how RedCodeAgent automates and improves “red-teaming” attack simulations to help uncover real-world threats that other methods overlook.
Dans l’actualité | What will AI Mean for Humanity?
What will AI Mean for Humanity?
E. Glen Weyl appeared on a panel at Harvard University about the implications of AI for the human soul.
Dans l’actualité | Open to Debate
E. Glen Weyl debate Curtis Yarvin
Glen Weyl debate prominent anti-democratic technologist and philosopher Curtis Yarvin on whether the US should be ruled by a CEO dictator.
Applicability vs. job displacement: further notes on our recent research on AI and occupations
| Kiran Tomlinson, Sonia Jaffe, Will Wang, Scott Counts, et Siddharth Suri
Recently, we released a paper Working with AI: Measuring the Occupational Implications of Generative AI that studied what occupations might find AI chatbots useful, and to what degree. The paper sparked significant discussion, which is no surprise since people care deeply about the future of AI and jobs–that’s part of why we think it’s important to…
Project Ire autonomously identifies malware at scale
| Brian Caswell, Dustin Fraze, Sarah Smith, Rodrigo Racanicci, Tim Middleton-Sally, Shelby Hayes, Stanley He, Katy Smith, Bhakta Pradhan, et Mike Walker
Designed to classify software without context, Project Ire replicates the gold standard in malware analysis through reverse engineering. It streamlines a complex, expert-driven process, making large-scale malware detection faster & more consistent.
VeriTrail: Detecting hallucination and tracing provenance in multi-step AI workflows
| Dasha Metropolitansky
VeriTrail, new from Microsoft Research, can detect AI-generated content that is not supported by the source text, trace the provenance of content from final output back to the source, and locate where errors were likely introduced.
Dans l’actualité | National Academy of Engineering
Kevin Hsieh selected to participate in 2025 Grainger Foundation Frontiers of Engineering Symposium
The signature activity of the National Academy of Engineering brings the next generation of engineering leaders together to share new techniques and approaches, facilitate collaboration, and build professional networks.