Ankit Sehrawat
Mr. Ankit Kumar is currently working as an assistant professor in A.B.E.S. Engineering College, Ghaziabad. He completed his M.Tech from National Institute of Technology, Kurukshetra. Currently his area of great interest is robust Hindi Speech Recognition. He has authored and co-authored of various publication in major referred scientific conferences and journals.
ICNLPSRS 2018: 20th Int. Conference on Natural Language Computing, Speech Recognition and Synthesis
Oct 8-9, 2018
Tokyo, Japan
Publication: Journal
Deadline : 30 Mar 2018
UPCOMING EVENTS
SLT : IEEE Spoken Language Technology Workshop
Dec 18, 2018 - Dec 21, 2018
Athens , Greece
Deadline : 2 July 2018
INTERSPEECH-2018:Speech Research for Emerging Markets in Multilingual Societies
Sep 2-6, 2018
IIIT-Hyderabad
Publication: Interspeech
Deadline : 23 Mar 2018
LATEST RESEARCH TOOLS ON SPEECH RECOGNITION
HTK 3.5
HTK 3.5
The Hidden Markov Model Toolkit (HTK 3.4.1) is a open source speech recognition toolkit developed by Cambridge University Engineering department (CUED). The tools provide sophisticated facilities for speech analysis, HMM training, testing and result analysis.
KALDI
KALDI
Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.Kaldi is similar in aims and scope to HTK. The goal is to have modern and flexible code, written in C++, that is easy to modify and extend.
SPHINX
CMUSphinx toolkit is powerful tool for speech recognition. Such applications could include voice control of desktop, various automotive devices, intelligent houses by this. Other possible applications are speech transcription, closed captioning, speech translation, voice search and language learning
MATLAB VOICEBOX
VOICEBOX is a speech processing toolbox consists of MATLAB routines that are maintained by and mostly written by Mike Brookes, Department of E.C.E., Imperial College, Exhibition Road, London SW7 2BT, UK. In particular it contains a number of default directory paths indicating where temporary files should be created, where speech data normally resides, etc.