Bookmark Kaumudi Online  Bookmark this site  Editor@Kaumudi  |  Marketing  Print Advt rates  |  Calendar 2018        Go!    
 
 
September 25, Tuesday 2018 8:21 PM       

       HEADLINES: P K Sasi controversy: Evidence collected from 6 persons                                              Violinist Balabhaskar’s daughter dies in accident                                              Tamil movie incites murder in Thiruvananthapuram                                              Abhilash Tomy brought to Amsterdam                                              Govt promise turns farce; farmer commits suicide in Wayanad                                              Rahul Gandhi a 'directionless leader', Cong creating campaign based on lies: Javadekar                                              Amit Shah to attend BJP workers meet in Jaipur on Wednesday                                              Pak won't abandon peace efforts despite India's reluctance: Qureshi                                              Maldives government acknowledges opposition victory                                              Trump's latest tariffs on China begin to bite                                              China says can't hold US trade talks with 'knife to the throat'                                              I'm a 'scapegoat' says sacked Sri Lanka captain Mathews                                              Test cricket should not be tinkered with, says Kohli                                              Kaumudi Facebook
       SCI&TECH Next Article: Facebook likely to run on 100% renewable energy by 2020  
       Microsoft releases Speech Corpus for three Indian languages
 
         Posted on :10:16:28 Sep 7, 2018
   
A A
       Last edited on:10:16:28 Sep 7, 2018
         Tags: Microsoft, Speech Corpus
 

 

NEW DELHI: Microsoft India on Thursday announced the availability of Microsoft Indian language Speech Corpus, offering speech training and test data for Telugu, Tamil and Gujarati.

Offering the largest publicly available Indian language speech dataset, Microsoft aims to help researchers and academia build Indian language speech recognition for all applications where speech is used.

This Indian language Speech Corpus content is provided by Microsoft Research Open Data initiative, a collection of free datasets from Microsoft Research to advance state-of-the-art research in areas such as natural language processing, computer vision, and domain specific sciences.

Today, there is a scarcity of adequate digital data for text, speech and linguistic resources - which are imperative in building large machine learning models for many vernacular languages across the world. Moreover, the differences in enunciation, accent, diction, and slang across various regions in India are very subtle.

As a result of these complexities, development of accurate digital tools in Indian languages has been slow. Microsoft is working to address this lack of data and catalyze the development of machine learning based models that can help in building systems for low resource languages, thus enabling the eco system of researchers, academia and tech companies working on India language models and to accelerate the needs of Indian users. The launch of Microsoft Indian Language Speech Corpus is a part of this effort.

"We believe India's increasing digital literacy needs to be supported by a multi-lingual digital world. Microsoft Indian Language Speech Corpus is an extension of our on-going efforts to reduce language barriers and empower Indians to harness the full potential of the Internet," said GM, Artificial Intelligence & Research, Microsoft India, Sundar Srinivasan.

"Using our technology expertise, we want to accelerate innovation in voice based computing for India by supporting researchers and academia," added Srinivasan.

Microsoft's Indian Language Speech Corpus was tested at Interspeech 2018, the world's largest and most comprehensive conference on the science and technology of spoken language processing. In a Low Resource Speech Recognition Challenge, participants used data from Microsoft Indian language speech corpus to build Automatic Speech Recognition (ASR) systems. They were able to create high quality speech recognition models using this data, thus validating the efficacy of the Corpus.

Microsoft has been working with Indian languages for over two decades since the launch of Project Bhasha in 1998, allowing users to input localized text easily and quickly using the Indian Language Input tool. With the help of AI and Deep Neural Networks, Microsoft is working on improving real-time language translation for Hindi, Bengali, Tamil and now expanding it to real-time language translation for Telugu.

It also recently announced support for email addresses in multiple Indian languages across most of its email apps and services. Also, as part of the latest Windows update, Microsoft added Tamil 99 virtual keyboard to Windows 10. Through its global Local Language Program (LLP), Microsoft provides people access to technology in their native language. This includes Language Interface Packs for Indian languages like Hindi, Kannada, Bengali, Malayalam, amongst others.

 

A A
       SCI&TECH
Next Article: Facebook likely to run on 100% renewable energy by 2020
 
 
SCI&TECH HEADLINES
Pluto should be reclassified as a planet, reveals study  
Shared responsibility essential for conserving migratory species  
Microsoft releases Speech Corpus for three Indian languages  
Facebook likely to run on 100% renewable energy by 2020  
Can brain suppress the act of revenge?  
Kerala techies launch portal to facilitate relief measures  
Here's what you may not know about H2O  
Alexa will tell you when it has done its homework  
Students recreate horrific atomic bombings of Hiroshima using VR technology  
Study discovers compounds that can reverse cell ageing  
Online interactive courses on AI in trading, first time on internet  
Apple to fix devices damaged by Japan's floods for free  
IT industry should focus on developing new technologies  
Skype adds read receipts to chats  
Mobile app for replacement of transformers in Raj  
Kashmiri students make solar boat for Dal Lake  
NASA prepares to fly probe into Sun's scorching atmosphere  
Yoga helps against non-communicable diseases: WHO  
Spironolactone can help prevent acne: Study  
Older Amazonian forests help regulate global climate  
Goal conflict linked to depressive symptoms  
A new world: Top 10 new species for 2018  
Beat the risk of frailty with healthy heart  
Twitter to hide trolls that hurl abuse: Twitter CEO  
Fortnite is finally coming to Android  
 
Do you agree with KCBC's statement against nuns who protested?
yes
 
no
 
no opinion
 
 
 
Home Kerala India World Business Sports Sci&Tech Education Automobile CityNews Movies Environment Letters 
© Copyright keralakaumudi Online 2011  |  Reproduction in whole or in part without written permission is prohibited.
Head Office Address: Kaumudi Buildings, Pettah P.O, Trivandrum - 695024, India.
Online queries talk to Deepu Sasidharan, + 91 98472 38959 or Email deepu[at]kaumudi.com
Customer Service -Advertisement Disclaimer Statement   |  Copyright Policy