Meeting Intelligence: Task Rephrasing
An exploratory project to ‘decontextualize’ tasks and to-do items identified in meeting transcripts and rewrite each one as a single sentence for a separate to-do list. We build upon pretrained seq2seq transformer models,…
Odia Speech Data and Model
As part of this release, Navana Tech and Microsoft Research India are open-sourcing 1648 hours of validated Odia speech data and a baseline model for Odia speech recognition. The speech dataset consists of recordings in…
LiST (Lite Self-Training)
We present LiST, a new method for efficient fine-tuning of large pre-trained language models (PLMs) in few-shot learning settings. Using two key techniques, LiST significantly improves over recent methods that adopt prompt fine-tuning. The first…