InterSpeech 历年最佳论文

2019-12-24 26 min read # InterSpeech # ISCA

InterSpeech

（主要数据来源：每年会议的 AbstractBook.pdf）

2023 年
时间：2023 年 8 月 20 日 ~ 24 日
地点：爱尔兰都柏林（线下）
共收到 2293 篇投稿，其中 2207 篇被审稿，最终接收 1097 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Multimodal Turn-taking Model Using Visual Cues for End-of-Utterance Prediction in Spoken Dialogue Systems	Fuma Kurata, Mao Saeki, Shinya Fujie, Yoichi Matsuyama	最佳学生论文
2	Transvelar Nasal Coupling Contributing to Speaker Characteristics in Non-nasal Vowels	Ziyu Zhu, Yujie Chi, Zhao Zhang, Kiyoshi Honda, Jianguo Wei
3	Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model	Xiang Li, Songxiang Liu, Max W. Y. Lam, Zhiyong Wu, Chao Weng, Helen Meng
4	MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets	Ziyang Ma, Zhisheng Zheng, Changli Tang, Yujin Wang, Xie Chen
5	Neural Model Reprogramming with Similarity Based Mapping for Low-Resource Spoken Command Recognition	Hao Yen, Pin-Jui Ku, Chao-Han Huck Yang, Hu Hu, Sabata Marco Siniscalchi, Pin-Yu Chen, Yu Tsao
6	Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?	Salah Zaiem, Youcef Kemiche, Titouan Parcollet, Slim Essid, Mirco Ravanelli
7	Speaker Tracking using Graph Attention Networks with Varying Duration Utterances across Multi-Channel Naturalistic Data: Fearless Steps Apollo-11 Audio Corpus	Meena M. C. Shekar, John H. L. Hansen
8	Two-Stage Voice Anonymization for Enhanced Privacy	Francesco Nespoli, Joerg Bitzer, Daniel P. Barreda, Patrick A. Naylor
9	Sociodemographic and Attitudinal Effects on Dialect Speakers’ Articulation of the Standard Language: Evidence from German-Speaking Switzerland	Carina Steiner, Dieter Studer-Joho, Corinne Lanthemann, Andrin Büchler, Adrian Leemann
10	Which aspects of motor speech disorder are captured by Mel Frequency Cepstral Coefficients? Evidence from the change in STN-DBS conditions in Parkinson’s disease	Vojtěch Illner, Petr Krýže, Jan Švihlík, Mário Souza, Paul Krack, Elina Tripoliti, Robert Jech, Jan Rusz
11	An Automatic Multimodal Approach to Analyze Linguistic and Acoustic Cues on Parkinson’s Disease Patients	Daniel Escobar-Grisales, Tomas Arias-Vergara, Cristian David Ríos Urrego, Elmar Noeth, Adolfo Garcia, Juan Rafael Orozco-Arroyave
12	CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice	Juan Pablo Zuluaga Gomez, Sara Ahmed, Danielius Visockas, Cem Subakan
13	Automatic Prediction of Language Learners’ Listenability Using Speech and Text Features Extracted from Listening Drills	Yingxiang Gao, Jaehyun Choi, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito

2022 年
时间：2022 年 9 月 18 日 ~ 22 日
地点：韩国仁川（线下/远程混合会议）
共收到 2490 篇投稿，其中 2140 篇被审稿，最终接收 1102 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Trajectories Predicted by Optimal Speech Motor Control Using LSTM Networks	Tsiky Rakotomalala, Pierre Baraduc, Pascal Perrier
2	Transfer Learning Framework for Low-Resource Text-toSpeech Using a Large-Scale Unlabeled Speech Corpus	Minchan Kim, Myeonghun Jeong, Byoung Jin ChoiA, Sunghwan Ahn, Joun Yeop Lee, Nam Soo Kim	最佳学生论文
3	Pharyngealization in Amazigh: Acoustic and Articulatory Marking Over Time	Tsiky Rakotomalala, Pierre Baraduc, Pascal Perrier
4	Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition	Guangzhi Sun, Chao Zhang, Phil Woodland	最佳学生论文
5	Where's the Uh, Hesitation? the Interplay between Filled Pause Location, Speech Rate and Fundamental Frequency in Perception of Confidence	Ambika Kirkland, Harm Lameris, Éva Székely, Joakim Gustafson
6	Deep Residual Spiking Neural Network for Keyword Spotting in Low-Resource Settings	Qu Yang, Qi Liu, Haizhou Li
7	Attentive Feature Fusion for Robust Speaker Verification	Bei Liu, Zhengyang Chen, Yanmin Qian
8	Robust Self-Supervised Audio-Visual Speech Recognition	Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed
9	Distance-Based Sound Separation	Katharine Patterson, Kevin Wilson, Scott Wisdom, John R. Hershey
10	Investigating Perception of Spoken Dialogue Acceptability through Surprisal	Sarenne Carrol Wallbridge, Catherine Lai, Peter Bell	最佳学生论文
11	Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation	Vinay Kothapally, John H.L. Hansen
12	Learning Audio-Text Agreement for Open-vocabulary Keyword Spotting	Hyeon-Kyeong Shin, Hyewon Han, Doyeon Kim, SooWhan Chung, Hong-Goo Kang

2021 年
时间：2021 年 8 月 30日 ~ 9 月 3 日
地点：捷克布尔诺（远程会议）
共收到 2277 篇投稿，其中 1990 篇被审稿，最终接收 963 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for NaturalSounding Voice Conversion	Yinghao Li, Ali Zare, Nima Mesgarani	最佳学生论文
2	Stochastic Process Regression for Cross-Cultural Speech Emotion Recognition	Mani Kumar Tellamekala, Enrique Sanchez, Georgios Tzimiropoulos, Timo Giesbrecht, Michel Valstar
3	Multilingual Transfer of Acoustic Word Embeddings Improves When Training on Languages Related to the Target Zero-Resource Language	Christiaan Jacobs, Herman Kamper
4	Effective Phase Encoding for End-to-end Speaker Verification	Junyi Peng, Xiaoyang Qu, Rongzhi Gu, Jianzong Wang, Jing Xiao, Lukas Burget, Jan Černocký
5	Dialogue Situation Recognition for Everyday Conversation Using Multimodal Information	Yuya Chiba, Ryuichiro Higashinaka
6	Audio Retrieval with Natural Language Queries	Andreea-Maria Oncescu, A. Sophia Koepke, João Henriques, Zeynep Akata, Samuel Albanie
7	Optimally Encoding Inductive Biases into the Transformer Improves End-to-End Speech Translation	Piyush Vyas, Anastasia Kuznetsova, Donald Williamson	最佳学生论文
8	WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution	Kexun Zhang, Yi Ren, Changliang Xu, Zhou Zhao
9	Acoustic Indicators of Speech Motor Coordination in Adults With and Without Traumatic Brain Injury	Tanya Talkar, Nancy Solomon, Douglas Brungart, Stefanie Kuchinsky, Megan Eitel, Sara Lippa, Tracey Brickell, Louis French, Rael Lange, Thomas Quatieri
10	A Discriminative Entity-Aware Language Model for Virtual Assistants	Mandana Saebi, Ernest Pusateri, Aaksha Meghawat, Christophe Van Gysel
11	An Automatic, Simple Ultrasound Biofeedback Parameter for Distinguishing Accurate and Misarticulated Rhotic Syllables	Sarah Li, Colin Annand, Sarah Dugan, Sarah Schwab, Kathryn Eary, Michael Swearengen, Sarah Stack, Suzanne Boyce, Michael Riley, T. Mast
12	Active Speaker Detection as a Multi-Objective Optimization with Uncertainty-based Multimodal Fusion	Baptiste Pouthier, Laurent Pilati, Leela Gudupudi, Charles Bouveyron, Frederic Precioso
13	Exploring the Potential of Lexical Paraphrases for Mitigating Noise-Induced Comprehension Errors	Anupama Chingacham, Vera Demberg, Dietrich Klakow	最佳学生论文
14	Automatic Analysis of the Emotional Content of Speech in Daylong Child-Centered Recordings from a Neonatal Intensive Care Unit	Einari Vaaras, Sari Ahlqvist-Björkroth, Konstantinos Drossos, Okko Räsänen
15	Leveraging Real-time MRI for Illuminating Linguistic Velum Action	Miran Oh, Dani Byrd, Shrikanth Narayanan

2020 年
时间：2020 年 10 月 25 日 ~ 29 日
地点：中国上海（远程会议）
共收到 2258 篇投稿，其中 2103 篇被审稿，最终接收 1021 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Nonlinear ISA with Auxiliary Variables for Learning Speech Representations	Amrith Setlur, Barnabas Poczos, Alan W Black
2	Low Latency End-to-End Streaming Speech Recognition with a Scout Network	Chengyi Wang, Yu Wu, Liang Lu, Shujie Liu, Jinyu Li, Guoli Ye, Ming Zhou
3	A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences	Pranay Manocha, Adam Finkelstein, Richard Zhang, Nicholas Bryan, Gautham Mysore, Zeyu Jin
4	Speaker discrimination in humans and machines: Effects of speaking style variability	Amber Afshan, Jody Kreiman, Abeer Alwan
5	FaceFilter: Audio-visual speech separation using still images	Soo-Whan Chung, Soyeon Choe, Joon Son Chung, Hong-Goo Kang	最佳学生论文
6	ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification	Brecht Desplanques, Jenthe Thienpondt, Kris Demuynck
7	Distilling the Knowledge of BERT for Sequence-to-Sequence ASR	Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara
8	Vector-Quantized Autoregressive Predictive Coding	Yu-An Chung, Hao Tang, James Glass	最佳学生论文
9	Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder	Si-Ioi Ng, Tan Lee
10	Phonetic Accommodation of L2 German Speakers to the Virtual Language Learning Tutor Mirabella	Iona Gessinger, Bernd Möbius, Bistra Andreeva, Eran Raveh, Ingmar Steiner	最佳学生论文
11	Abstractive Spoken Document Summarization using Hierarchical Model with Multi-stage Attention Diversity Optimization	Potsawee Manakul, Mark Gales, Linlin Wang

2019 年
时间：2019 年 9 月 15 日 ~ 19 日
地点：奥地利格拉茨
共收到 2180 篇投稿，其中 1855 篇被审稿，最终接收 914 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems	Yuan Gong, Jian Yang, Jacob Huber, Mitchell MacKnight, Christian Poellabauer
2	Curriculum-Based Transfer Learning for an Effective End-to-End Spoken Language Understanding and Domain Portability	Antoine Caubrière, Natalia Tomashenko, Antoine Laurent, Emmanuel Morin, Nathalie Camelin and Yannick Estève
3	Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks	Santiago Pascual, Mirco Ravanelli, Joan Serrà, Antonio Bonafonte and Yoshua Bengio
4	Evaluating Near End Listening Enhancement Algorithms in Realistic Environments	Carol Chermaz, Cassia Valentini-Botinhao, Henning Schepker, Simon King	最佳学生论文
5	Adversarially Trained End-to-end Korean Singing Voice Synthesis System	Juheon Lee, Hyeong-Seok Choi, Chang-Bin Jeon, Junghyun Koo, Kyogu Lee	最佳学生论文
6	The Contribution of Acoustic Features Analysis to Model Emotion Perceptual Process for Language Diversity	Xingfeng Li, Masato Akagi
7	CycleGAN-based Emotion Style Transfer as Data Augmentation for Speech Emotion Recognition	Fang Bao, Michael Neumann, Ngoc Thang Vu
8	Exploiting Visual Features using Bayesian Gated Neural Networks for Disordered Speech Recognition	Shansong Liu, Shoukang Hu, Yi Wang, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng
9	On the Role of Style in Parsing Speech with Neural Models	Trang Tran, Jiahong Yuan, Yang Liu, Mari Ostendorf
10	An Effective Deep Embedding Learning Architecture for Speaker Verification	Yiheng Jiang, Yan Song, Ian McLoughlin, Zhifu Gao, Lirong Dai
11	Towards the Speech Features of Mild Cognitive Impairment: Universal Evidence from Structured and Unstructured Connected Speech of Chinese	Tianqi Wang, Chongyuan Lian, Jingshen Pan, Quanlei Yan, Feiqi Zhu, Manwa L. Ng, Lan Wang,Nan Yan
12	Semi-supervised Sequence-to-sequence ASR using Unpaired Speech and Text	Murali Karthick Baskar, Shinji Watanabe, Ramón Astudillo, Takaaki Hori, Lukas Burget, Jan Černocký
13	Language Modeling with Deep Transformers	Kazuki Irie, Albert Zeye, Ralf Schlüter, Hermann Ney	最佳学生论文

2018 年
时间：2018 年 9 月 2 日 ~ 6 日
地点：印度海得拉巴
共收到 1668 篇投稿，最终接收 749 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network	Achuth Rao M V, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prasanta Kumar Ghosh
2	Effects of User Controlled Speech Rate on Intelligibility in Noisy Environments	John S. Novak, III, Robert V. Kenyon
3	An Interlocutor-Modulated Attentional LSTM for Differentiating between Subgroups of Autism Spectrum Disorder	Yun-Shao Lin, Susan Shur-Fen Ga, Chi-Chun Lee
4	An Improved Deep Embedding Learning Method for Short Duration Speaker Verification	Zhifu Gao, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai
5	Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network	Weipeng He, Petr Motlicek, Jean-Marc Odobez
6	A Priori SNR Estimation Based on a Recurrent Neural Network for Robust Speech Enhancement	Yangyang Xia and Richard M. Stern
7	Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations	Ju-chieh Chou, Cheng-chieh Yeh, Hung-yi Lee and Lin-shan Lee
8	Multi-Modal Data Augmentation for End-to-End ASR	Adithya Renduchintala, Shuoyang Ding, Matthew Wiesner and Shinji Watanabe	最佳学生论文
9	A GPU-based WFST Decoder with Exact Lattice Generation	Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey and Sanjeev Khudanpur
10	User-centric Evaluation of Automatic Punctuation in ASR Closed Captioning	Ákos Máté Tündik, György Szaszák, Gábor Gosztolya and András Bek
11	Detecting Depression with Audio/Text Sequence Modeling of Interview	Tuka Alhanai, Mohammad Ghassemi and James Glass	最佳学生论文
12	Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator	Pei-Hung Chung, Kuan Tung, Ching-Lun Tai and Hung-Yi Lee	最佳学生论文

2017 年
时间：2017 年 8 月 20 日 ~ 24 日
地点：瑞典斯德哥尔摩
共收到 1711 篇投稿，其中 1582 篇被审稿，最终接收 799 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Relating Unsupervised Word Segmentation to Reported Vocabulary Acquisition	Elin Larsen, Alejandrina Cristia, Emmanuel Dupoux
2	Mind the Peak: When Museum Is Temporarily Understood as Musical in Australian English	Katharina Zahner, Heather Kember, Bettina Braun
3	Jointly Predicting Arousal, Valence and Dominance with Multi-Task Learning	Srinivas Parthasarathy, Carlos Busso
4	VoxCeleb: A Large-scale Speaker Identification Dataset	Arsha Nagrani, Joon Son Chung, Andrew Zisserman	最佳学生论文
5	Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery	Janek Ebbers, Jahn Heymann, Lukas Drude, Thomas Glarner, Reinhold Haeb-Umbach, Bhiksha Raj	最佳学生论文
6	Tight Integration of Spatial and Spectral Features for BSS with Deep Clustering Embeddings	Lukas Drude, Reinhold Haeb-Umbach
7	VCV Synthesis using Task Dynamics to Animate a Factor-based Articulatory Model	Rachel Alexander, Tanner Sorensen, Asterios Toutios, Shrikanth Narayanan
8	CTC in the Context of Generalized Full-Sum HMM Training	Albert Zeyer, Eugen Beck, Ralf Schlüter, Hermann Ney
9	Residual Memory Networks in Language Modeling: Improving the Reputation of Feed-Forward Networks	Karel Beneš, Murali Baskar, Lukáš Burget	最佳学生论文
10	Experiments in Character-level Neural Network Models for Punctuation	William Gale, Sarangarajan Parthasarathy
11	Entrainment in Multi-Party Spoken Dialogues at Multiple Linguistic Levels	Zahra Rahimi, Anish Kumar, Diane Litman, Susannah Paletz, Mingzhi Yu
12	Topic Identification for Speech without ASR	Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur

2016 年
时间：2016 年 9 月 8 日 ~ 12 日
地点：美国旧金山
共收到 1644 篇投稿，其中 1585 篇被审稿，最终接收 779 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Characterizing Vocal Tract Dynamics Across Speakers Using Real-Time MRI	Tanner Sorensen, Asterios Toutios, Louis Goldstein, Shrikanth Narayanan	最佳学生论文
2	Towards Machine Comprehension of Spoken Content: Initial TOEFL Listening Comprehension Test by Machine	Bo-Hsiang Tseng, Sheng-Syun Shen, Hung-Yi Lee, Lin-Shan Lee
3	A DNN-HMM Approach to Story Segmentation	Jia Yu, Xiong Xiao, Lei Xie, Eng Siong Chng, Haizhou Li
4	Head Motion Generation with Synthetic Speech: A Data Driven Approach	Najmeh Sadoughi, Carlos Busso
5	The Rhythmic Constraint on Prosodic Boundaries in Mandarin Chinese Based on Corpora of Silent Reading and Speech Perception	Wei Lai, Jiahong Yuan, Ya Li, Xiaoying Xu, Mark Liberman	最佳学生论文
6	Is Deception Emotional? An Emotion-Driven Predictive Approach	Shahin Amiriparian, Jouni Pohjalainen, Erik Marchi, Sergey Pugachevskiy, Björn Schuller
7	Probabilistic Approach Using Joint Clean and Noisy I-Vectors Modeling for Speaker Recognition	Waad Ben Kheder, Driss Matrouf, Ajili Moez, Jean-François Bonastre
8	Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering	Lauri Juvela, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku
9	Local Sparsity Based Online Dictionary Learning for Environment-Adaptive Speech Enhancement with Nonnegative Matrix Factorization	Kwang Myung Jeon, Hong Kook Kim
10	GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis	Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku	最佳学生论文
11	Stimulated Deep Neural Network for Speech Recognition	Chunyang Wu, Penny Karanasou, Mark Gales, Khe Chai Sim
12	Contextual Prediction Models for Speech Recognition	Yoni Halpern, Keith Hall, Vlad Schogol, Michael Riley, Brian Roark, Gleb Skobeltsyn, Martin Baeuml

2015 年
时间：2015 年 9 月 6 日 ~ 10 日
地点：德国德累斯顿
共 1458 篇被审稿，最终接收 746 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Low-Frequency Components Analysis in Running Speech for the Automatic Detection of Parkinson's Disease	T. Villa-Cañas, J.D. Arias-Londoño, J.R. Orozco-Arroyave, J.F. Vargas-Bonilla, E. Nöth
2	Speech Planning in 4-Year-Old Children Versus Adults: Acoustic and Articulatory Analyses	Guillaume Barbier, Pascal Perrier, Lucie Ménard, Yohan Payan, Mark Tiede, Joseph Perkell
3	Using Representation Learning and Out-of-Domain Data for a Paralinguistic Speech Task	Benjamin Milde, Chris Biemann
4	Under-Resourced Speech Recognition Based on the Speech Manifold	Reza Sahraeian, Dirk Van Compernolle, Febe de Wet
5	Estimation of the Air-Tissue Boundaries of the Vocal Tract in the Mid-Sagittal Plane from Electromagnetic Articulograph Data	Satyabrata Parida, Pattem Ashok Kumar, Prasanta Kumar Ghosh
6	Adapting Machine Translation Models toward Misrecognized Speech with Text-To-Speech Pronunciation Rules and Acoustic Confusability	Nicholas Ruiz, Qin Gao, William Lewis, Marcello Federico	最佳学生论文
7	A Universal VAD Based on Jointly Trained Deep Neural Networks	Qing Wang, Jun Du, Xiao Bao, Zi-Rui Wang, Li-Rong Dai, Chin-Hui Lee
8	Semi-supervised Maximum Mutual Information Training of Deep Neural Network Acoustic Models	Vimal Manohar, Daniel Povey, Sanjeev Khudanpur
9	Salient Dimensions in Implicit Phonotactic Learning	Elise Michon, Emmanuel Dupoux , Alejandrina Cristia
10	A Time Delay Neural Network Architecture for Efficient Modeling of Long Temporal Contexts	Vijayaditya Peddinti, Daniel Povey, Sanjeev Khudanpur	最佳学生论文
11	Representing Nonspeech Audio Signals through Speech Classification Models	Huy Phan, Lars Hertel, Marco Maass, Radoslaw Mazur, Alfred Mertins
12	Objective Intelligibility Assessment of Text-To-Speech Systems through Utterance Verification	Raphael Ullmann, Ramya Rasipuram, Mathew Magimai.-Doss, Hervé Bourlard	最佳学生论文

2014 年
时间：2014 年 9 月 14 日 ~ 18 日
地点：新加坡
共 1173 篇被审稿，最终接收 614 篇论文

序号	最佳学生论文（提名）	作者	获奖情况
1	Investigating the Effect of F0 and Vocal Intensity on Harmonic Magnitudes: Data from Laryngeal High-Speed Video Endoscopy	Gang Chen, Soo Jin Park, Jody Kreiman, Abeer Alwan
2	Automatic Estimation of the Lip Radiation Effect in Glottal Inverse Filtering	Manu Airaksinen, Tom Bäckström, Paavo Alku
3	Modeling Therapist Empathy through Prosody in Drug Addiction Counseling	Bo Xiao, Daniel Bone, Maarten Van Segbroeck, Zac Imel, David Atkins, Panayiotis Georgiou, Shrikanth Narayanan
4	Intrinsic Spectral Analysis Based on Temporal Context Features for Query by Example Spoken Term Detection	Peng Yang, Cheung-Chi Leung, Lei Xie, Bin Ma, Haizhou Li
5	Direct F0 Control of an Electrolarynx Based on Statistical Excitation Feature Prediction and Its Evaluation through Simulation	Tomoki Toda, Graham Neubig, Sakriani Sakti, Satoshi Nakamura, Ko Tanaka
6	Towards a Neural Measure of Perceptual Distance - Classification of Electroencephalographic Responses to Synthetic Vowels	Manson Cheuk-Man Fong, James William Minett, Thierry Blu, William Shi-Yuan Wang
7	Exploring Modulation Spectrum Features for Speech-Based Depression Level Classification	Elif Bozkurt, Orith Toledo - Ronen, Alexander Sorin, Ron Hoory
8	Lexical Representation of Consonant, Vowels and Tones in Early Childhood	Hwee Hwee Goh, Charlene Fu, Kheng Hui Yeo
9	Speech Synthesis in Various Communicative Situations: Impact of Pronunciation Variations	Sandrine Brognaux, Benjamin Picart, Thomas Drugman	最佳学生论文
10	Adaptive Speech Recognition and Dialogue Management for Users with Speech Disorders	Iñigo Casanueva, Heidi Christensen, Thomas Hain, Phil Green
11	Word-level Invariant Representations from Acoustic Waveforms	Stephen Voinea, Chiyuan Zhang, Georgios Evangelopoulos, Lorenzo Rosasco, Tomaso Poggio	最佳学生论文
12	Acoustic Modeling with Deep Neural Networks Using Raw Time Signal for LVCSR	Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney	最佳学生论文

2013 年
时间：2013 年 9 月 14 日 ~ 18 日
地点：法国里昂

序号	最佳学生论文	作者
1	Speaker and Noise Independent Voice Activity Detection	François Germain, Dennis Sun, Gautham Mysore
2	A Two-Step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis	Colin Vaz, Vikram Ramanarayanan, Shrikanth Narayanan
3	Using Text and Acoustic Features to Diagnose Progressive Aphasia and Its Subtypes	Kathleen Fraser, Frank Rudzicz, Elizabeth Rochon

2012 年
时间：2012 年 9月 9 日 ~ 13 日
地点：美国波特兰

序号	最佳学生论文	作者
1	Discriminatively Learning Factorized Finite State Pronunciation Models from Dynamic Bayesian Networks	Preethi Jyothi, Eric Fosler-Lussier, Karen Livescu
2	MAP Estimation of Whole-Word Acoustic Models with Dictionary Priors	Keith Kintzley, Aren Jansen, Hynek Hermansky
3	Age Estimation from Telephone Speech using i-vectors	Mohamad Hasan Bahari, Mitchell McLaren, Hugo Van hamme, David Van Leeuwen

2011 年
时间：2011 年 8 月 27 日 ~ 31 日
地点：意大利佛罗伦萨

序号	最佳学生论文	作者
1	Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model	Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle Palomäki, Mikko Kurimo, Paavo Alku
2	One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space	Daisuke Saito, Keisuke Yamamoto, Nobuaki Minematsu, Keikichi Hirose
3	Modelling Novelty Preference in Word Learning	Maarten Versteegh, Louis ten Bosch, Lou Boves

2010 年
时间：2010 年 9 月 26 日 ~ 30 日
地点：日本千葉市

序号	最佳学生论文	作者
1	Did you say susi or shushi? Measuring the Emergence of Robust Fricative Contrasts in English- and Japanese-Acquiring Children	Jeffrey J. Holliday, Mary E. Beckman, Chanelle Mays
2	Using Non-Native Error Patterns to Improve Pronunciation Verification	Joost van Doremalen, Catia Cucchiarini, Helmer Strik
3	Reliable Tracking Based on Speech Sample Salience of Vocal Cycle Length Perturbations	Christophe Mertens, Francis Grenez, Lise Crevier-Buchman, Jean Schoentgen

2009 年
时间：2009 年 9 月 6 日 ~ 10 日
地点：英国布莱顿

序号	最佳学生论文	作者
1	Sequencing of Articulatory Gestures using Cost Optimization	Juraj Simko, F. Cummins.
2	A Deterministic plus Stochastic Model of the Residual Signal for Improved Parametric Speech Synthesis	Thomas Drugman, G. Wilfart, T.Dutoit
3	On the Semi-Supervised Learning of Multi-Layered Perceptrons	Jonathan Malkin, Amarnag Subramanya, J. Bilmes

2008 年
时间：2008 年 9 月 22 日 ~ 26 日
地点：澳大利亚布里斯班

序号	最佳学生论文	作者
1	On the Equivalence of Gaussian and Log-linear HMMS	Georg Heigold, Lehnen, P., Schlueter, R., Ney, H.
2	Combining Continuous Progressive Model Adaptation and Factor Analysis for Speaker Verification	Mitchell McLaren, Matrouf, D., Vogt R., Bonastre, J.
3	Effect of Intonational Phrase Boundaries on Pitch-Accented Syllables in American English	Yen-Liang Shue, Shuttuck-Hufnagel, S., Iseli, M., Jun S., Veilleux N., Alwan, A.

2007 年
时间：2007 年 8 月 27 日 ~ 31 日
地点：比利时安特卫普

序号	最佳学生论文	作者
1	Speech Recognition Techniques for a Sign Language Recognition System	Philippe Dreuw, David Rybach, Thomas Deselaers, Morteza Zahedi, Hermann Ney
2	An Empirical Investigation of the Nonuniqueness in the Acoustic-to-Articulatory Mapping	Chao Qin, Miguel A. Carreira-Perpinan

2006 年
时间：2006 年 9 月 17 日 ~ 21 日
地点：美国匹兹堡

序号	最佳学生论文	作者
1	Detecting Question-Bearing Turns in Spoken Tutorial Dialogues	Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg
2	Soft Margin Estimation of Hidden Markov Model Parameters	Jinyu Li, Ming Yuan, Chin-Hui Lee
3	Acoustic Cues for the Classification of Regular and Irregular Phonation	Kushan Surana, Janet Slifka

2005 年
时间：2005 年 9 月 4 日 ~ 8 日
地点：葡萄牙里斯本

序号	最佳学生论文	作者
1	On the Integration of Speech Recognition and Statistical Machine Translation	E. Matusov, S. Kanthak, H. Ney.

2004 年
时间：2004 年 10 月 4 日 ~ 8 日
地点：韩国济州岛

序号	最佳学生论文	作者
1	Hot Discussion or Frosty Dialogue? Towards a Temperature Metric for Conversational Interactivity	Peter Reichl, Florian Hammer

InterSpeech