Tool
LayoutLM
LayoutLM is a simple but effective multi-modal pre-training method that jointly models text, layout, and image for visually rich document understanding and information extraction tasks, such as form understanding and receipt understanding. LayoutLM achieves SOTA results on…
Project
ezPitch: Connecting Salespersons and Customers through Relevant News
The goal of ezPitch is to connect salespersons and customers through relevant news. Why is this important? In their daily work, salespersons need to search, track, and explore news related to their customers before…
Microsoft Research Blog
VinVL: Advancing the state of the art for vision-language models
Humans understand the world by perceiving and fusing information from multiple channels, such as images viewed by the eyes, voices heard by the ears, and other forms of sensory input. One of the core aspirations…