Medical Intelligence & Language Engineering Lab : Document Analysis & Recognition | Speech Synthesis | Indian Language | Medical Image Processing | Medical Signal Processing

Medical Intelligence and Language Engineering Lab

Speech Processing

Journal Publications

T V Ananthapadmanabha, A G Ramakrishnan, “Intrinsic-cum-extrinsic normalization of formant data of vowels,” Journal of the Acoustical Society of America Express Letters, Vol. 140(5), Nov. 2016, pp. EL446-EL451, DOI: 10.1121/1.4967311 (Download)

A P Prathosh, P Sujith, A G Ramakrishnan and Prasanta K Ghosh, “Cumulative Impulse Strength for Epoch Extraction,” IEEE Signal Processing Letters, 2015. (Download)

A G Ramakrishnan, B Abhiram and S R Mahadeva Prasanna, “Voice source characterization using pitch synchronous discrete cosine transform for speaker identification,” Journal of the Acoustical Society of America Express Letters, Vol. 137, 2015. (ASA Download link) (Download)

A P Prathosh, A G Ramakrishnan, T V Ananthapadmanabha, “Estimation of voice-onset time in continuous speech using temporal measures,” Journal of the Acoustical Society of America Express Letters, Vol. 136(2), Aug. 2014, pp. EL122-EL128. (ASA Download link) (Download)

T V Ananthapadmanabha, A P Prathosh, A G Ramakrishnan, “Detection of the closure-burst transitions of stops and affricates in continuous speech using the plosion index,” Journal of the Acoustical Society of America, Vol. 135(1), 2014. (ASA Download link) (Download)

A. P. Prathosh, T. V. Ananthapadmanabha, and A. G. Ramakrishnan, “Epoch extraction based on integrated linear prediction residual using plosion index,” IEEE Transactions on Audio, Speech and Language Processing, 2013, Vol. 21, Iss. 12, pp. 2471-2480. (Download)

R. Muralishankar and A.G. Ramakrishnan, “Pseudo Complex Cepstrum Using Discrete Cosine Transform,” International Journal of Speech Technology, 2005, Vol. 8, pp. 181-191. (Download)

R Muralishankar, A.G.Ramakrishnan and P Prathibha, “Modification of Pitch using DCT in the Source Domain,” Speech Communication, 2004, Vol. 42/2, pp. 143-154. (Download)

Conference Publications

T V Ananthapadmanabha, A G Ramakrishnan and Shubham Sharma, “An objective critical distance measure based on the relative level of spectral valley,” Proc. 18th Annual Conference of the International Speech Communication Association (INTERSPEECH 2017), Stockholm, Sweden, Aug. 20-24, 2017. (Download)

Nazreen PM, A G Ramakrishnan and Prasanta K Ghosh, “A class-specific speech enhancement for phoneme recognition: a dictionary learning approach,” Proc. 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), San Francisco, USA, Sept. 8-12, 2016. (Download)

K V Vijay Girish, A G Ramakrishnan and T V Ananthapadmanabha, “Hierarchical classification of speaker and background noise and estimation of SNR using sparse representation,” Proc. 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), San Francisco, USA, Sept. 8-12, 2016. (Download)

K V Vijay Girish, A G Ramakrishnan and T V Ananthapadmanabha, “Cosine similarity based dictionary learning and source recovery for classification of diverse audio sources,” Proc. 13th International IEEE India Conference (INDICON 2016), Bangalore, India, Dec. 16-18, 2016. (Download)

K V Vijay Girish, Veena Vijai and A G Ramakrishnan, “Relationship between spoken Indian languages by clustering of long distance bigram features of speech,” Proc. 13th International IEEE India Conference (INDICON 2016), Bangalore, India, Dec. 16-18, 2016. (Download)

A P Prathosh, A G Ramakrishnan and T V Ananthapadmanabha, “Classification of place-of-articulation of stop consonants using temporal analysis,” Proc. 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015), Dresden, Germany, Sept. 6-10, 2015. (Download)

P Sujith, AP Prathosh, AG Ramakrishnan and PK Ghosh, “An Error Correction Scheme for GCI Detection Algorithms using Pitch Smoothness Criterion,” Proc. 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015), Dresden, Germany, Sept. 6-10, 2015. (Download)

Rohan Kumar Das, Abhiram B, S R Mahadeva Prasanna and A G Ramakrishnan, “Combining Source and System Information for Limited Data Speaker Verification,” Proc. 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, Sept. 14-18, 2014. (Download)

Rajaram B S R, Shiva Kumar H R and A G Ramakrishnan, “MILE TTS for Tamil for Blizzard Challenge 2014,” Proc. of Blizzard Challenge Workshop, Singapore, Sept. 19, 2014. (Download)

Bhanuprakash Abhiram, Prathosh A P, A G Ramakrishnan, “A fast algorithm for speech polarity detection using long-term linear prediction,” Proc. Tenth International Conference on Signal Processing and Communications (SPCOM 2014), Bangalore, July 22-24, 2014. (Download)

Vikram R L, K V Vijay Girish, Harshavardhan S, A G Ramakrishnan and T V Ananthapadmanabha, “Subband analysis of linear prediction residual for the estimation of glottal closure instants,” Proc. ICASSP 2014, Florence, Italy, May 4-9, 2014. (Download)

A G Ramakrishnan, “Speech Technology and Tamil,” Proc. National Conference on Tamil Internet, Chennai, Jan. 6, 2014. Organized by World Tamil Sangam, Madurai, India. (Download)

Shiva Kumar H R, Ashwini J K, Rajaram B S R and A G Ramakrishnan, “MILE TTS for Tamil and Kannada for Blizzard Challenge 2013,” Proc. of Blizzard Challenge Workshop, Barcelona, Spain, Sept. 3, 2013. (Download)

A G Ramakrishnan and Lakshmi Chithambaran, “Modeling basic emotions for Tamil speech synthesis,” Proc. 12th International Tamil Internet Conf., Kuala Lumpur, Malaysia, Aug. 15-18, 2013. (Download)

Vikram Ramesh Lakkavalli, K V Vijay Girish and A G Ramakrishnan, “Sub-band Envelope Approach to Obtain Instants of Significant Excitation in Speech,” Proc. 18th National Conference on Communications (NCC 2012), 2012, Kharagpur, India. (Download)

A G Ramakrishnan, L R Vikram, Abhinava, H R Shiva Kumar, “Bilingual TTS for Tamil and English,” Proc. Tamil Internet 2010, Coimbatore, June 23-26, 2010, pp. 303-305. (Download)

K Shashikiran, Abhinava, Swapnil Belhe, A G Ramakrishnan, “Speaking tool in Tamil for vocally disabled,” Tamil Internet 2010, Coimbatore, June 23-26, 2010, pp. 327-329. (Download)

Vikram Ramesh Lakkavalli, Arulmozhi P and A G Ramakrishnan, “Continuity Metric for Unit Selection based Text-to-Speech Synthesis,” IEEE International Conference On Signal Processing & Communications (SPCOM 2010), 2010, Bangalore, India. (Download)

K.Partha Sarathy and A.G.Ramakrishnan, “A Research bed for unit selection based text to speech synthesis,” Proc. IEEE Workshop on Spoken Language Technology (SLT 08), Dec. 15-18, 2008, Goa, India. (Download)

Rangarao Muralishankar, Mamindapalli Ravi Shanker and A. G. Ramakrishnan, “Perceptual-MVDR based analysis-synthesis of pitch synchronous frames for pitch modification,” Proc. 2008 IEEE International Conf. Multimedia and Expo (ICME 2008), June 23-26, 2008, Hannover, Germany, pp. 81-84. (Download)

K.Partha Sarathy and A.G.Ramakrishnan, “Text to speech synthesis system for mobile applications,” Proc. Workshop in Image and Signal Processing (WISP-2007), IIT Guwahati, Dec 28-29, 2007, pp. 74-77. (Download)

H.G.Ranjani, G.Ananthakrishnan and A.G.Ramakrishnan, “Sinusoidal analysis and music inspired filter bank for training-free speech segmentation for TTS,” Proc. Workshop in Image and Signal Processing (WISP-2007), IIT Guwahati, Dec 28-29, 2007, pp. 78-81. (Download)

H.G.Ranjani, G.Ananthakrishnan and A.G.Ramakrishnan, “Explicit segmentation of speech signals using Bach filter banks,” Proc. Workshop in Image and Signal Processing (WISP-2007), IIT Guwahati, Dec 28-29, 2007, pp. 47-50. (Download)

Ravishanker, R.Muralishankar and A.G.Ramakrishnan, “Bauer Method of MVDR Spectral Factorization for Pitch Modification in the Source Domain,” Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2007), New Paltz, New York, October 21-24, 2007, pp. 263-266. (Download)

Muralishankar, Ravishanker and A.G.Ramakrishnan, “MVDR spectral estimation for DCT based pitch modification,” Proc. 3rd Language and Technology Conference, Poznan, Poland, October 5-7, 2007, pp. 241-245. (Download)

Sreekanth Majji and A G Ramakrishnan, “Festival Based Maiden TTS System for Tamil Language,” Proc. 3rd Language and Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics, Poznan, Poland, Oct 5-7, 2007, pp. 187-191. (Download)

G.Ananthakrishnan, H.G.Ranjani and A.G.Ramakrishnan, “Comparative study of filter-bank mean-energy distance for automated segmentation of speech signals,” Proc. Intern. Conf. Sig. Proc. Commn. Networking, Chennai, Feb 22-24, 2007, pp. 6-10. (Download)

G.Ananthakrishnan and A.G.Ramakrishnan, “Relative pitch tracking for singing voice as application in query by humming systems,” Proc. IV IASTED Intern. Conf. Sig. Proc. Pattern Recog. Applications SPPRA 2007, Austria, Feb. 14-16, 2007, pp. 275-280. (Download)

G.Ananthakrishnan, H.G.Ranjani and A.G.Ramakrishnan, “Language independent automated segmentation of speech using Bach scale filter-banks,” Proc. IV Intl. Conf. on Intelligent Sensing Info. Proc., (ICISIP 2006), Dec 15-18, 2006, pp. 115-120. (Download)

R.Murali Shankar, Lakshmish.N.Kaushik, and A.G.Ramakrishnan, “Time Scaling of Speech and Music using Independent Subspace Analysis,” Proc. Intern. Conf. Signal Processing and Communication, Bangalore, Dec 11-14, 2004, pp. 310-314. (Download)

R.Murali Shankar, A.G.Ramakrishnan and Lakshmish.N.Kaushik, “Time Scaling of Speech using Independent Subspace Analysis,” Proc. INTERSPEECH 2004 8th Intern. Conf. Spoken Language Processing, Oct 4 - 8, 2004, Vol 3, pp. 2465 - 2468. (Download)

Joel Pinto, R.Muralishankar and A.G.Ramakrishnan, “ICA in Speech Recognition using HMM's,” Proc. V International Conf Adv in Pattern Recognition (ICAPR 2003), ISI, Kolkata, Dec 10-13, 2003. (Download)

R.Muralishankar, Srikanth and A.G.Ramakrishnan, “Subspace and hypothesis based effective segmentation of co-articulated basic-units for concatenative speech synthesis,” IEEE TENCON, Oct 15-17, Bangalore, 2003, Vol. 1, pp. 388-392. (Download)

R.Muralishankar, A.Vijay Krishna and A.G.Ramakrishnan, “Subspace based Vowel Consonant Segmentation,” Proc. IEEE Workshop on Statistical Signal Processing, Sept 28 - Oct 1, 2003, St.Louis, Missouri, pp. 589- 592. (Download)

P.Prathibha and A.G. Ramakrishnan, “Web-enabled Speech Synthesizer for Tamil,” Proc. Tamil Internet 2002, San Francisco, California, USA, Sept 27-29, 2002, pp. 134-140. (Download)

P.Prathibha, A.G. Ramakrishnan and R.Muralishankar, “Thirukkural II - A Text-to-Speech Synthesis System,” Proc. Tamil Internet 2002, San Francisco, California, USA, Sept 27-29, 2002, pp. 126-133. (Download)

R.Muralishankar, A.G.Ramakrishnan and P.Prathibha, “Warped-LP residual resampling using DCT for pitch modification,” Proc. ICSLP, Denver, Colorado, Sept. 22-25, 2002.

G.L.Jayavardhana Rama, A.G.Ramakrishnan, R.Muralishankar and P.Prathibha, “A Complete Text-to-Speech Synthesis System in Tamil,” Proc. IEEE 2002 Workshop Speech Synthesis, Santa Monica, CA USA, Sep. 11-13, 2002, pp. 191-194. (Download)

R.Muralishankar and A.G.Ramakrishnan, “DCT based pseudo complex cepstrum,” Proc. IEEE ICASSP 2002, Florida, USA, May 13-17, 2002, Vol. 1, pp. 521-524. (Download)

G.L.Jayavardhana Rama, A.G.Ramakrishnan, M.Vijay Venkatesh, and R.Muralishankar, “Thirukkural - a text-to-speech synthesis system,” Proc. Tamil Internet 2001, Kuala Lumpur, August 26-28, 2001, pp. 92-97. (Download)

R. MuraliSankar, A.G.Ramakrishnan, A.K.Rohitprasad and M.Anoop, “DCT based pitch modification,” Sixth Biennial Conference on Signal Processing and Communication SPCOM 01, IISc, Bangalore, July 15-18, 2001, pp. 114-117. (Download)

R.Murali Shankar and A.G.Ramakrishnan, “Synthesis of Speech with Emotions,” Proc. Intern. Conf. Commn., Computers and Devices, Kharagpur, Dec. 14-16, 2000, pp. 767-770. (Download)

K.Suresh and A.G.Ramakrishnan, “A DCT based approach to Estimation of Pitch,” Proc. Intern. Conf. Multimedia Processing and Systems, Chennai, Aug. 13-15, 2000, pp. 54-57. (Download)

R. Murali shankar and A. G. Ramakrishnan, “Robust Pitch Detection using DCT based Spectral Autocorrelation,” Proc. Intern. Conf. Multimedia Processing and Systems (ICMPS), Chennai, Aug. 13-15, 2000, pp. 129-132. (Download)

A.G.Ramakrishnan and V.Karthigeyan, “A preliminary speech synthesis system in Tamil,” TamilNet 99 - International Conf. on Tamil in Information Technology, Chennai, Feb. 7-8, 1999, http://www.tamilnet99.org

© 2012 Medical Intelligence and Language Engineering Lab - IISc Campus, Bangalore.