Business

Microsoft releases Indian language 'Speech Corpus' for researchers

Thu, Sep 06 2018 03:38:37 PM

Bengaluru, Sep 6 (IANS): To help researchers and academia build Indian language speech recognition for all applications where speech is used, Microsoft India on Thursday launched its Indian language "Speech Corpus", offering speech training and test data for Telugu, Tamil and Gujarati.

This is the largest publicly available Indian language speech dataset which includes audio and corresponding transcripts, Microsoft said in a statement.

This Indian language "Speech Corpus" content is provided by Microsoft Research Open Data initiative, a collection of free datasets from Microsoft Research to advance research in areas such as natural language processing, computer vision, and domain specific sciences.

"Microsoft Indian Language Speech Corpus is an extension of our on-going efforts to reduce language barriers and empower Indians to harness the full potential of the Internet," said Sundar Srinivasan, General Manager, Artificial Intelligence and Research, Microsoft India.

"Using our technology expertise, we want to accelerate innovation in voice based computing for India by supporting researchers and academia," Srinivasan said.

Microsoft's Indian Language Speech Corpus was tested at Interspeech 2018 conference in Hyderabad this month.

In a Low Resource Speech Recognition Challenge, participants used data from Microsoft Indian language speech corpus to build Automatic Speech Recognition (ASR) systems.

They were able to create high quality speech recognition models using this data, thus validating the efficacy of the Corpus, Microsoft said.

Microsoft has been working with Indian languages for over two decades since the launch of Project Bhasha in 1998, allowing users to input localised text easily and quickly using the Indian Language Input tool.

Follow Daijiworld News Network on

Latest

Naver to launch JV for digital twin projects in Saudi Arabia

Indian economy capable of handling global shocks: RBI Governor

Sagarmanthan maritime meet kicks off in Delhi on Monday

India has become a market that you can’t ignore: Global experts

Egypt's state-run automaker resumes production after 15-year suspension

India clocks 12 pc rise in deal volume in Jan-Oct, China sees 23 pc decline

Global pharma sector stares at lack of talent, specific skills: Report

Business

Microsoft releases Indian language 'Speech Corpus' for researchers

Top Stories

Vitu Realty wins Economic Times' ‘Innovative Plot Developer of Year’ award

Leave a Comment Your Email address will not be published.

Title: Microsoft releases Indian language 'Speech Corpus' for researchers

You might also like

Sudanese army says 150 paramilitary fighters killed in Sudan

Delhi chokes under ‘severe plus’ air quality amid dense fog

South Korean President Yoon arrives in Brazil to attend G20 summit

Deeply touched, says PM Modi after meeting Indian community in Brazil

Manipur: HM Amit Shah reviews situation, directs officials to take proactive steps

PM Modi arrives in Brazil to participate in G20 Summit

North Korea launches trash balloons toward South Korea

North Korea's Kim calls for bolstering nuclear forces 'without limitation,' completing war preparati

APEC members expect fully, well-functioning WTO dispute settlement system

Philippines: Four killed, three injured in road crash