


SLT 2014: South Lake Tahoe, NV, USA
- 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014. IEEE 2014, ISBN 978-1-4799-7129-9

- Ali Orkan Bayer, Giuseppe Riccardi:
  Semantic language models for Automatic Speech Recognition. 7-12
- Anna Schmidt, Youssef Oualil, Oliver Ohneiser, Matthias Kleinert, Marc Schulder, Arif Khan, Hartmut Helmke, Dietrich Klakow:
  Context-based recognition network adaptation for improving on-line ASR in Air Traffic Control. 13-18
- Seyed Hamidreza Mohammadi, Alexander Kain:
  Voice conversion using deep neural networks with speaker-independent pre-training. 19-23
- Masahiro Saiko, Hitoshi Yamamoto, Ryosuke Isotani, Chiori Hori:
  Efficient multi-lingual unsupervised acoustic model training under mismatch conditions. 24-29
- Vincent Renkens, Steven Janssens, Bart Ons, Jort F. Gemmeke, Hugo Van hamme:
  Acquisition of ordinal words using weakly supervised NMF. 30-35
- Basil Abraham, Neethu Mariam Joy, Navneeth K. S. Umesh:
  A data-driven phoneme mapping technique using interpolation vectors of phone-cluster adaptive training. 36-41
- Md. Akmal Haidar, Douglas D. O'Shaughnessy:
  Document-based Dirichlet class language model for speech recognition using document-based n-gram events. 42-47
- Frantisek Grézl, Ekaterina Egorova, Martin Karafiát:
  Further investigation into multilingual training and adaptation of stacked bottle-neck neural network structure. 48-53
- Weiran Wang, Raman Arora, Karen Livescu:
  Reconstruction of articulatory measurements with smoothed low-rank matrix completion. 54-59
- Hiroaki Sugiyama, Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami:
  Open-domain utterance generation using phrase pairs based on dependency relations. 60-65
- Bing Zhao, Yik-Cheung Tam:
  Bilingual Recurrent Neural Networks for improved statistical machine translation. 66-70
- Maryam Siahbani, Ramtin Mehdizadeh Seraj, Baskaran Sankaran, Anoop Sarkar:
  Incremental translation using hierarchichal phrase-based translation system. 71-76
- Alan Wisler, Visar Berisha, Julie Liss, Andreas Spanias:
  Domain invariant speech features using a new divergence measure. 77-82
- Zhiyang He, Ji Wu, Ping Lv:
  Label correlation mixture model for multi-label text categorization. 83-88
- Jose Sousa, Fabiola Araujo, Aldebaro Klautau:
  Utterance copy for Klatt's speech synthesizer using genetic algorithm. 89-94
- Lara J. Martin, Matthew Stone, Florian Metze, Jack Mostow:
  A methodology for using crowdsourced data to measure uncertainty in natural speech. 95-99
- Herman Kamper, Aren Jansen, Simon King, Sharon Goldwater:
  Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings. 100-105
- Gabriel Synnaeve, Thomas Schatz, Emmanuel Dupoux:
  Phonetics embedding learning with side information. 106-111
- Heriberto Cuayáhuitl, Nina Dethlefs, Helen F. Hastie, Xingkun Liu:
  Training a statistical surface realiser from automatic slot labelling. 112-117
- Oscar Saz, Mortaza Doulaty, Thomas Hain:
  Background-tracking acoustic features for genre identification of broadcast shows. 118-123
- Steven J. Rennie, Vaibhava Goel, Samuel Thomas:
  Deep Order Statistic Networks. 124-128
- Murali Karthick B, Srinivasan Umesh:
  Improving deep neural networks using state projection vectors of subspace Gaussian mixture model as features. 129-134
- Romain Serizel, Diego Giuliani:
  Vocal tract length normalisation approaches to DNN-based children's and adults' speech recognition. 135-140
- Pengyuan Zhang, Yulan Liu, Thomas Hain:
  Semi-supervised DNN training in meeting recognition. 141-146
- Jen-Tzung Chien, Tsai-Wei Lu:
  Tikhonov regularization for deep neural network acoustic modeling. 147-152
- Ryan Price, Ken-ichi Iso, Koichi Shinoda:
  Speaker adaptation of deep neural networks using a hierarchy of output layers. 153-158
- Steven J. Rennie, Vaibhava Goel, Samuel Thomas:
  Annealed dropout training of deep networks. 159-164
- Yajie Miao, Lu Jiang, Hao Zhang, Florian Metze:
  Improvements to speaker adaptive training of deep neural networks. 165-170
- Pawel Swietojanski, Steve Renals:
  Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models. 171-176
- Yuzong Liu, Katrin Kirchhoff:
  Graph-based semi-supervised acoustic modeling in DNN-based speech recognition. 177-182
- George Saon:
  A distributed architecture for fast SGD sequence discriminative training of DNN acoustic models. 183-188
- Kaisheng Yao, Baolin Peng, Yu Zhang, Dong Yu, Geoffrey Zweig, Yangyang Shi:
  Spoken language understanding using long short-term memory neural networks. 189-194
- Xiaohu Liu, Ruhi Sarikaya:
  A discriminative model based entity dictionary weighting approach for spoken language understanding. 195-199
- Kai Hong, Pengjun Pei, Ye-Yi Wang, Dilek Hakkani-Tür:
  Entity ranking for descriptive queries. 200-205
- Jen-Tzung Chien, Yuan-Chu Ku:
  Bayesian recurrent neural network language model. 206-211
- Mickael Rouvier, Benoît Favre, Frédéric Béchet:
  Joint decoding of complementary utterances. 212-217
- Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linarès:
  Author-topic based representation of call-center conversations. 218-223
- Xiang Li, Gökhan Tür, Dilek Hakkani-Tür, Qi Li:
  Personal knowledge graph population from user utterances in conversational understanding. 224-229
- Ji He, Alex Marin, Mari Ostendorf:
  Effective data-driven feature learning for detecting name errors in automatic speech recognition. 230-235
- Gina-Anne Levow, Valerie Freeman, Alena Hrynkevich, Mari Ostendorf, Richard A. Wright, Julian Chan, Yi Luan, Trang Tran:
  Recognition of stance strength and polarity in spontaneous speech. 236-241
- Yun-Nung Chen, Dilek Hakkani-Tür, Gökhan Tür:
  Deriving local relational surface forms from dependency-based entity embeddings for unsupervised spoken language understanding. 242-247
- Jort F. Gemmeke, Siddharth Sehgal, Stuart P. Cunningham, Hugo Van hamme:
  Dysarthric vocal interfaces with minimal training data. 248-253
- Heidi Christensen, I. Casanueva, Stuart P. Cunningham, Phil D. Green, Thomas Hain:
  Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. 254-259
- Xiaodan Zhuang, Viktor Rozgic, Michael Crystal, Brian Marx:
  Improving speech-based PTSD detection via multi-view learning. 260-265
- Emily Prud'hommeaux, Eric Morley, Masoud Rouhizadeh, Laura Silverman, Jan P. H. van Santen, Brian Roark, Richard Sproat, Sarah Kauper, Rachel DeLaHunta:
  Computational analysis of trajectories of linguistic development in autism. 266-271
- Mahsa Sadat Elyasi Langarani, Jan P. H. van Santen:
  Modeling fundamental frequency dynamics in hypokinetic dysarthria. 272-276
- Verena Venek, Stefan Scherer, Louis-Philippe Morency, Albert A. Rizzo, John Pestian:
  Adolescent suicidal risk assessment in clinician-patient interaction: A study of verbal and acoustic behaviors. 277-282
- Kyusong Lee, Seonghan Ryu, Hongsuck Seo, Seokhwan Kim, Gary Geunbae Lee:
  Grammatical error correction based on learner comprehension model in oral conversation. 283-287
- Nichola Lubold, Heather Pon-Barry:
  A comparison of acoustic-prosodic entrainment in face-to-face and remote collaborative learning dialogues. 288-293
- Jidong Tao, Keelan Evanini, Xinhao Wang:
  The influence of automatic speech recognition accuracy on the performance of an automated speech assessment system. 294-299
- Xuesong Yang, Anastassia Loukina, Keelan Evanini:
  Machine learning approaches to improving pronunciation error detection on an imbalanced corpus. 300-305
- Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
  Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification. 306-311
- Lihong Li, He He, Jason D. Williams:
  Temporal supervised learning for inferring a dialog policy from example conversations. 312-317
- Yi Ma, Eric Fosler-Lussier:
  A discriminative sequence model for dialog state tracking using user goal change detection. 318-323
- Matthew Henderson, Blaise Thomson, Jason D. Williams:
  The third Dialog State Tracking Challenge. 324-329
- Kai Sun, Lu Chen, Su Zhu, Kai Yu:
  A generalized rule based tracker for dialogue state tracking. 330-335
- Su Zhu, Lu Chen, Kai Sun, Da Zheng, Kai Yu:
  Semantic parser enhancement for dialogue domain extension with little data. 336-341
- Hang Ren, Weiqun Xu, Yonghong Yan:
  Markovian discriminative modeling for cross-domain dialog state tracking. 342-347
- Rudolf Kadlec, Miroslav Vodolán, Jindrich Libovický, Jan Macek, Jan Kleindienst:
  Knowledge-based Dialog State Tracking. 348-353
- Dongho Kim, Matthew Henderson, Milica Gasic, Pirros Tsiakoulis, Steve J. Young:
  The use of discriminative belief tracking in POMDP-based dialogue systems. 354-359
- Matthew Henderson, Blaise Thomson, Steve J. Young:
  Robust dialog state tracking using delexicalised recurrent neural networks and unsupervised adaptation. 360-365
- Sebastian Schuster, Stephanie Pancoast, Milind Ganjoo, Michael C. Frank, Dan Jurafsky:
  Speaker-independent detection of child-directed speech. 366-371
- Abhinav Misra, John H. L. Hansen:
  Spoken language mismatch in speaker verification: An investigation with NIST-SRE and CRSS Bi-Ling corpora. 372-377
- Daniel Garcia-Romero, Xiaohui Zhang, Alan McCree, Daniel Povey:
  Improving speaker recognition performance in the domain adaptation challenge using deep neural networks. 378-383
- Qian Zhang, John H. L. Hansen:
  Training candidate selection for effective rejection in open-set language identification. 384-389
- Xavier Bost, Georges Linarès:
  Constrained speaker diarization of TV series based on visual patterns. 390-395
- Maria Joana Correia, Alberto Abad, Isabel Trancoso:
  Exploiting magnitude and phase spectral information for converted speech detection. 396-401
- Sree Harsha Yella, Andreas Stolcke, Malcolm Slaney:
  Artificial neural network features for speaker diarization. 402-406
- Brian Thompson:
  Discrimination between singing and speech in real-world audio. 407-412
- Gregory Sell, Daniel Garcia-Romero:
  Speaker diarization with PLDA i-vector scoring and unsupervised calibration. 413-417
- Gang Liu, Chengzhu Yu, Navid Shokouhi, Abhinav Misra, Hua Xing, John H. L. Hansen:
  Utilization of unlabeled development data for speaker verification. 418-423
- Di Xu, Yun Wang, Florian Metze:
  EM-based phoneme confusion matrix generation for low-resource spoken term detection. 424-429
- Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Engsiong Chng, Haizhou Li:
  System and keyword dependent fusion for spoken term detection. 430-435
- Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh:
  Effective combination of heterogeneous subword-based spoken term detection systems. 436-441
- Jonathan Wintrode, Sanjeev Khudanpur:
  Combining local and broad topic context to improve term detection. 442-447
- Khe Chai Sim:
  A multimodal stroke-based predictive input for efficient Chinese text entry on mobile devices. 448-453
- Yuan Liang, Koji Iwano, Koichi Shinoda:
  An efficient error correction interface for speech recognition on mobile touchscreen devices. 454-459
- Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel:
  On-the-fly user modeling for cost-sensitive correction of speech transcripts. 460-465
- Nurul Lubis, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti, Satoshi Nakamura:
  Emotion recognition on Indonesian television talk shows. 466-471
- Mohammed Abdel-Wahab, Carlos Busso:
  Evaluation of syllable rate estimation in expressive speech and its contribution to emotion recognition. 472-477
- Mostafa Ali Shahin, Beena Ahmed, Kirrie J. Ballard:
  Classification of lexical stress patterns using deep neural network architecture. 478-482
- Zhipeng Chen, Teng Zhang, Ji Wu:
  Subword scheme for keyword search. 483-488
- Hang Su, James Hieronymus, Yanzhang He, Eric Fosler-Lussier, Steven Wegmann:
  Syllable based keyword search: Transducing syllable lattices to word lattices. 489-494
- Matti Varjokallio, Mikko Kurimo:
  A word-level token-passing decoder for subword n-gram LVCSR. 495-500
- Martin Karafiát, Karel Veselý, Igor Szöke, Lukás Burget, Frantisek Grézl, Mirko Hannemann, Jan Cernocký:
  BUT ASR system for BABEL Surprise evaluation 2014. 501-506
- Seyedmahdad Mirsamadi, John H. L. Hansen:
  Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms. 507-512
- Christos Koniaris, Saikat Chatterjee:
  A sparsity based preprocessing for noise robust speech recognition. 513-518
- Deepak Baby, Tuomas Virtanen, Jort F. Gemmeke, Tom Barker, Hugo Van hamme:
  Exemplar-based noise robust automatic speech recognition using modulation spectrogram features. 519-524
- Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James R. Glass:
  A complete KALDI recipe for building Arabic speech recognition systems. 525-529
- Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
  A keyword search system using open source software. 530-535
- Pallavi Baljekar, Jill Fain Lehman, Rita Singh:
  Online word-spotting in continuous speech with recurrent neural networks. 536-541
- Rui Zhao, Jinyu Li, Yifan Gong:
  Variable-activation and variable-input deep neural network for robust speech recognition. 542-547
- Vikramjit Mitra, Wen Wang, Horacio Franco:
  Deep convolutional nets and robust features for reverberation-robust speech recognition. 548-553
- Zhaohan Daniel Guo, Gökhan Tür, Wen-tau Yih, Geoffrey Zweig:
  Joint semantic utterance classification and slot filling with recursive neural networks. 554-559
- Mandy Korpusik, Nicole Schmidt, Jennifer Drexler, Scott Cyphers, James R. Glass:
  Data collection and language understanding of food descriptions. 560-565
- Qi Li, Gökhan Tür, Dilek Hakkani-Tür, Xiang Li, Tim Paek, Asela Gunawardana, Chris Quirk:
  Distributed open-domain conversational understanding framework with domain independent extractors. 566-571
- Anna Prokofieva, Dilek Hakkani-Tür, Malcolm Slaney:
  Eye gaze for understanding conversational speech. 572-577
- Agustín Gravano, Stefan Benus, Rivka Levitan, Julia Hirschberg:
  Three ToBI-based measures of prosodic entrainment and their correlations with speaker engagement. 578-583
- Yun-Nung Chen, William Yang Wang, Alexander I. Rudnicky:
  Leveraging frame semantics and distributional semantics for unsupervised semantic slot induction in spoken dialogue systems. 584-589
- Yun-Nung Chen, Alexander I. Rudnicky:
  Dynamically supporting unexplored domains in conversational interactions by enriching semantics with neural word embeddings. 590-595
- Georgia Athanasopoulou, Ioannis Klasinas, Spiros Georgiladakis, Elias Iosif, Alexandros Potamianos:
  Using lexical, syntactic and semantic features for non-terminal grammar rule induction in Spoken Dialogue Systems. 596-601
- Deepak Ramachandran, Peter Z. Yeh, William Jarrold, Benjamin Douglas, Adwait Ratnaparkhi, Ronald Provine, Jeremy Mendel, Adam Emfield:
  An end-to-end dialog system for TV program discovery. 602-607
- Nobal B. Niraula, Amanda Stent, Hyuckchul Jung, Giuseppe Di Fabbrizio, I. Dan Melamed, Vasile Rus:
  Forms2Dialog: Automatic dialog generation for Web tasks. 608-613















