Home | ankitprofile

Ankit Sehrawat

Mr. Ankit Kumar is currently working as an assistant professor in A.B.E.S. Engineering College, Ghaziabad. He completed his M.Tech from National Institute of Technology, Kurukshetra. Currently his area of great interest is robust Hindi Speech Recognition. He has authored and co-authored of various publication in major referred scientific conferences and journals.

ICNLPSRS 2018: 20th Int. Conference on Natural Language Computing, Speech Recognition and Synthesis

Oct 8-9, 2018

Tokyo, Japan

Publication: Journal

Deadline : 30 Mar 2018

UPCOMING EVENTS

SLT : IEEE Spoken Language Technology Workshop

Dec 18, 2018 - Dec 21, 2018

Athens , Greece

Deadline : 2 July 2018

DETAILS

Discover More

INTERSPEECH-2018:Speech Research for Emerging Markets in Multilingual Societies

Sep 2-6, 2018

IIIT-Hyderabad

Publication: Interspeech

Deadline : 23 Mar 2018

LATEST RESEARCH TOOLS ON SPEECH RECOGNITION

HTK 3.5

The Hidden Markov Model Toolkit (HTK 3.4.1) is a open source speech recognition toolkit developed by Cambridge University Engineering department (CUED). The tools provide sophisticated facilities for speech analysis, HMM training, testing and result analysis.

KALDI

Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.Kaldi is similar in aims and scope to HTK. The goal is to have modern and flexible code, written in C++, that is easy to modify and extend.

SPHINX

CMUSphinx toolkit is powerful tool for speech recognition. Such applications could include voice control of desktop, various automotive devices, intelligent houses by this. Other possible applications are speech transcription, closed captioning, speech translation, voice search and language learning

MATLAB VOICEBOX

VOICEBOX is a speech processing toolbox consists of MATLAB routines that are maintained by and mostly written by Mike Brookes, Department of E.C.E., Imperial College, Exhibition Road, London SW7 2BT, UK. In particular it contains a number of default directory paths indicating where temporary files should be created, where speech data normally resides, etc.