Rajasthani Hindi Speech Data

This dataset consists of audio recordings of participants reading stories aloud in Rajasthani Hindi, one sentence at a time. The 98 participants, all from Soda, Rajasthan, each read 30 stories, for a total of 426,873 recordings. Roughly 58 participants were male and 40 were female.

Download
  • Version: 1.0

    Date Published: 12/07/2023

    File Name: Rajasthani-Hindi-Speech-Data.zip

    File Size: 2.7 GB

    The archive contains a set of recordings in the 3gp format. For each 3gp file, there is a corresponding txt file with the same name that contains the prompt shown to the user for that recording. Each file is named <sentence-id>_<user-id>.3gp or <sentence-id>_<user-id>.txt. Random sampling suggests that most users made a best effort to read the sentences accurately, but we have not performed any quality analysis on the data, so some recordings may contain errors.
  • Supported Operating Systems

    Linux, Apple Mac OS X, Windows 7, Windows 10, Windows 11

    • Windows 7, Windows 8, Windows 10, or Windows 11
    • Click Download and follow the instructions.
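
The file layout described above (one txt prompt per 3gp recording, both named <sentence-id>_<user-id>) can be walked with a short script. The following is a minimal Python sketch, not an official loader. It assumes the archive has already been extracted to a local folder, that the prompt files are UTF-8 encoded, and that the user id is the part after the last underscore; none of these details are confirmed by the dataset description. The names Recording, parse_stem, and iter_recordings are hypothetical helpers introduced only for illustration.

```python
from pathlib import Path
from typing import Iterator, NamedTuple, Tuple


class Recording(NamedTuple):
    """One audio file paired with the prompt text shown to the reader."""
    sentence_id: str
    user_id: str
    audio_path: Path
    prompt: str


def parse_stem(stem: str) -> Tuple[str, str]:
    """Split '<sentence-id>_<user-id>' into its two parts.

    Assumption: the user id follows the LAST underscore. The dataset
    description does not say whether ids may themselves contain underscores.
    """
    sentence_id, sep, user_id = stem.rpartition("_")
    if not sep or not sentence_id or not user_id:
        raise ValueError(f"unexpected file name: {stem!r}")
    return sentence_id, user_id


def iter_recordings(root: Path) -> Iterator[Recording]:
    """Yield every 3gp file under root that has a matching txt prompt."""
    for audio_path in sorted(root.rglob("*.3gp")):
        prompt_path = audio_path.with_suffix(".txt")
        if not prompt_path.exists():
            # No quality analysis was done on the data, so tolerate gaps.
            continue
        sentence_id, user_id = parse_stem(audio_path.stem)
        # Assumption: prompt files are UTF-8 text.
        prompt = prompt_path.read_text(encoding="utf-8").strip()
        yield Recording(sentence_id, user_id, audio_path, prompt)
```

A typical use would be `for rec in iter_recordings(Path("Rajasthani-Hindi-Speech-Data")): ...`, where the folder name is whatever location the archive was extracted to. Decoding the 3gp audio itself is left out, since it depends on which audio tooling is available.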