LLaVA: Large Language and Vision Assistant
LLaVA is an open-source project that collaborates with the research community to advance the state of the art in AI. LLaVA represents the first end-to-end trained large multimodal model (LMM) that achieves impressive chat capabilities mimicking the spirit of the multimodal…
Agent AI
Agent-based multimodal AI systems are becoming a ubiquitous presence in our everyday lives. A promising direction for making these systems more interactive is to embody them as agents within specific environments. The grounding of large…
End-to-End Encrypted Group Chats with MLS: Design, Implementation and Verification
MLS is a new IETF standard that deals with secure, end-to-end encrypted group messaging. In this work, recently awarded the Internet Defense Prize and a Distinguished Paper Award at USENIX, Théophile will describe how the…
Microsoft Research India – who we are.
Employees from Microsoft Research India talk about their work, their aspirations to change the world, and what makes MSR India such a great place to work.
Research Focus: Week of September 11, 2023
In this issue: Efficient polyglot analytics on semantic data aids query performance; generative retrieval for conversational question answering improves dialogue-based interfaces; a new tool uses ML to address capacity degradation in lithium-ion batteries.