Project
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
One of the principal challenges in building VLM-powered GUI agents is visual grounding, i.e., localizing the appropriate screen region for action execution based on both the visual content and…
Video
Agentic AI Ecosystems: Navigating Cultural-Awareness, Biases and Misinformation in Multi-agent and Human-agent Interactions
In an era where artificial intelligence (AI) increasingly mediates human communication, understanding the dynamics of human-AI interaction is critical. This talk explores the potential of multi-agent AI systems in fostering inclusive and culturally aware human-AI…