2016 |
Naoya Takahashi and Tofigh Naghibi and Beat Pfister: Automatic pronunciation generation by utilizing a semi-supervised deep neural networks Proceedings of the Interspeech, San Francisco (USA), September 2016. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Naoya Takahashi and Tofigh Naghibi and Beat Pfister and Luc Van Gool: Deep convolutional neural networks and data augmentation for acoustic event recognition Proceedings of the Interspeech, San Francisco (USA), September 2016. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Hui Liang: Detecting emphasised spoken words by considering them prosodic outliers and taking advantage of HMM-based TTS framework Speech Prosody 2016, Boston, USA, May 2016. Inproceedings [Details] [BibTeX] [Paper as PDF] |
2015 |
Tofigh Naghibi and Sarah Hoffmann and Beat Pfister: A semidefinite programming based search strategy for feature selection with mutual information measure IEEE Transactions on Pattern Analysis and Machine Intelligence Volume 37, Issue 8, p. 1529-1541, August 2015. Article [Details] [BibTeX] |
Tofigh Naghibi: Towards Robust Audio-Visual Speech Recognition 2015. PhD Thesis [Details] [BibTeX] |
2014 |
Tofigh Naghibi and Beat Pfister: A boosting framework on grounds of online learning In Proceedings of NIPS, Montréal (Canada), December 2014. Inproceedings [Details] [BibTeX] |
Hui Liang: Investigation into Transferability of Duration of Emphasised Words from Original Expression to Spoken Translation Gloriastrasse 35, 8092 Zurich, Switzerland, November 2014. Techreport [Details] [BibTeX] [Paper as PDF] |
Pratyush Kumar and Lothar Thiele: p-YDS Algorithm: An Optimal Extension of YDS Algorithm to Minimize Expected Energy For Real-Time Jobs 14th International Conference on Embedded Software (EMSOFT), New Delhi, India, October 2014. Inproceedings [Details] [BibTeX] [External Link] [Paper as PDF] |
Hui Liang and Sarah Hoffmann: Capturing Speaker-Independent Prosodic Information by Syntax Tree-Based Prosody Modelling Gloriastrasse 35, 8092 Zurich, Switzerland, June 2014. Techreport [Details] [BibTeX] [Paper as PDF] |
Sarah Hoffmann: A Data-driven Model for the Generation of Prosody from Syntactic Sentence Structures Zurich, 2014. PhD Thesis [Details] [BibTeX] |
2013 |
Tofigh Naghibi, Sarah Hoffmann and Beat Pfister: An efficient method to estimate pronunciation from multiple utterances Proceedings of Interspeech, Lyon (France), p. 1951-1955, September 2013. Inproceedings [Details] [BibTeX] [External Link] |
Sarah Hoffmann and Beat Pfister: Text-to-speech alignment of long recordings using universal phone models Proceedings of Interspeech, Lyon (France), p. 1520-1524, September 2013. Inproceedings [Details] [BibTeX] [External Link] |
Tofigh Naghibi, Sarah Hoffmann and Beat Pfister: Convex approximation of the NP-hard search problem in feature subset selection Proceedings of ICASSP, Vancouver (Canada), p. 3273-3277, May 2013. Inproceedings [Details] [BibTeX] [External Link] |
2012 |
Thomas Ewender: Automatic Selection of Speech Segments for Concatenative Speech Synthesis December 2012. PhD Thesis [Details] [BibTeX] [Paper as PDF] |
Sarah Hoffmann and Beat Pfister: Employing sentence structure: Syntax trees as prosody generators Proceedings of Interspeech, Portland, Oregon (USA), September 2012. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Tobias Kaufmann and Beat Pfister: Syntactic language modeling with formal grammars Speech Communication (Elsevier) Volume 54, Issue 6, p. 715-731, July 2012. Article [Details] [BibTeX] [Paper as PDF] |
Tofigh Naghibi and Beat Pfister: Beamformer design for nonstationary signals by means of interfrequency correlations Proceedings of SAM, Hoboken, NJ (USA), p. 261-264, June 2012. Inproceedings [Details] [BibTeX] [External Link] [Paper as PDF] |
Tofigh Naghibi and Beat Pfister: An approach to prevent adaptive beamformers from cancelling the desired signal In Proceedings of ICASSP, Kyoto (Japan), p. 205-208, March 2012. Inproceedings [Details] [BibTeX] [Paper as PDF] |
2011 |
Thomas Ewender and Beat Pfister: Automatically creating a diphone set from a speech database In Proceedings of Interspeech, Florence (Italy), p. 2169-2172, August 2011. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Michael Gerber, Tobias Kaufmann and Beat Pfister: Extended Viterbi algorithm for optimized word HMMs In Proceedings of ICASSP, Prague (Czech Republic), p. 4932-4935, May 2011. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Michael Gerber: Speech Recognition Techniques for Languages with Limited Linguistic Resources Zurich, 2011. PhD Thesis [Details] [BibTeX] [Paper as PDF] |
2010 |
Tobias Kaufmann and Beat Pfister: Semi-automatic extension of morphological lexica In Workshop Computational Linguistics - Applications, Wisla (Poland), 2010. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Sarah Hoffmann and Beat Pfister: Fully automatic segmentation for prosodic speech corpora In Proceedings of Interspeech, Makuhari (Japan), p. 1389-1392, 2010. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Thomas Ewender and Beat Pfister: Accurate pitch marking for prosodic modification of speech segments In Proceedings of Interspeech, Makuhari (Japan), p. 178-181, 2010. Inproceedings [Details] [BibTeX] [Paper as PDF] |
2009 |
Tobias Kaufmann: A Rule-based Language Model for Speech Recognition Zurich, October 2009. PhD Thesis [Details] [BibTeX] |
Tobias Kaufmann, Thomas Ewender and Beat Pfister: Improving broadcast news transcription with a precision grammar and discriminative reranking Proceedings of Interspeech, Brighton (UK), p. 356-359, September 2009. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Harald Romsdorfer: Weighted Neural Network Ensemble Models for Speech Prosody Control Proceedings of Interspeech 2009, Brighton (UK), p. 492-495, September 2009. Inproceedings [Details] [BibTeX] |
Harald Romsdorfer: Polyglot Speech Prosody Control Proceedings of Interspeech 2009, Brighton (UK), p. 488-491, September 2009. Inproceedings [Details] [BibTeX] |
Harald Romsdorfer: Combining Weighted Neural Network Ensembles with Factor Relevance Determination for Speech Prosody Control Proceedings of MLSP Workshop, Grenoble (France), September 2009. Inproceedings [Details] [BibTeX] |
Harald Romsdorfer: Polyglot Text-to-Speech Synthesis: Text Analysis & Prosody Control Zurich, January 2009. PhD Thesis [Details] [BibTeX] |
Thomas Ewender, Sarah Hoffmann and Beat Pfister: Nearly Perfect Detection of Continuous F0 Contour and Frame Classification for TTS Synthesis Proceedings of Interspeech, Brighton (UK), p. 100-103, 2009. Inproceedings [Details] [BibTeX] [Paper as PDF] |
2008 |
Michael Gerber and Beat Pfister: Fast search for common segments in speech signals for speaker verification In Proceedings of Interspeech, Brisbane (Australia), p. 375-378, 2008. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Beat Pfister and Tobias Kaufmann: Sprachverarbeitung: Grundlagen und Methoden der Sprachsynthese und Spracherkennung 2008. Book [Details] [BibTeX] [External Link] |
2007 |
Harald Romsdorfer and Beat Pfister: Text analysis and language identification for polyglot text-to-speech synthesis Speech Communication Volume 49, Issue 9, p. 697-724, September 2007. Article [Details] [BibTeX] [External Link] [Paper as PDF] |
René Beutler: Improving Speech Recognition through Linguistic Knowledge January 2007. PhD Thesis [Details] [BibTeX] |
Tobias Kaufmann and Beat Pfister: Applying licenser rules to a grammar with continuous constituents In Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, Stanford, p. 150-162, 2007. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Michael Gerber, René Beutler and Beat Pfister: Quasi text-independent speaker verification based on pattern matching In Proceedings of Interspeech, Antwerp, p. 1993-1996, 2007. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Michael Gerber, Tobias Kaufmann and Beat Pfister: Perceptron-based class verification In Proceedings of NOLISP (ISCA Workshop on non linear speech processing), Paris, 2007. Inproceedings [Details] [BibTeX] [Paper as PDF] |
2006 |
Tobias Kaufmann and René Beutler: A Hybrid Language Model for Speech Recognition Poster session at the NCCR IM2 Review Meeting, Martigny, Switzerland, November 2006. Misc [Details] [BibTeX] |
Michael Gerber and Beat Pfister: Quasi text-independent speaker verification with neural networks Poster session at the NCCR IM2 Review Meeting, Martigny, Switzerland, November 2006. Misc [Details] [BibTeX] |
Beat Pfister and René Beutler: Improving Speech Recognition thru Linguistics February 2006. Techreport [Details] [BibTeX] |
Harald Romsdorfer and Beat Pfister: Character Stream Parsing of Mixed-lingual Text ISCA - MultiLing 2006, Stellenbosch, South Africa, 2006. Inproceedings [Details] [BibTeX] |
2005 |
René Beutler, Tobias Kaufmann and Beat Pfister: Integrating a Non-Probabilistic Grammar into Large Vocabulary Continuous Speech Recognition 2005 IEEE Automatic Speech Recognition and Understanding Workshop, San Juan, Puerto Rico, p. 104-109, November 2005. Inproceedings [Details] [BibTeX] |
Harald Romsdorfer and Beat Pfister: Phonetic Labeling and Segmentation of Mixed-Lingual Prosody Databases Proceedings of Interspeech 2005, Lisbon, Portugal, p. 3281-3284, September 2005. Inproceedings [Details] [BibTeX] |
Michael Gerber and Beat Pfister: Quasi text-independent speaker verification with neural networks Extended Abstract for MLMI 05 Workshop in Edinburgh, July 2005. Misc [Details] [BibTeX] [Paper as PDF] |
René Beutler, Tobias Kaufmann and Beat Pfister: Using rule-based knowledge to improve LVCSR Proceedings of ICASSP 2005, Philadelphia, p. 829-832, March 2005. Inproceedings [Details] [BibTeX] |
Harald Romsdorfer, Beat Pfister and René Beutler: A Mixed-lingual Phonological Component which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System Machine Learning for Multimodal Interaction, Berlin Heidelberg New York, p. 263-276, January 2005. Incollection [Details] [BibTeX] |
2004 |
Michael Gerber: Evaluation von Vektorquantisierungsmethoden für das Finden lautlich ähnlicher Abschnitte in Sprachsignalen Report zum NCCR Projekt IM2.ACP, ETH, Zürich, December 2004. Misc [Details] [BibTeX] |
Jozsef Szakos and Ulrike Glavitsch: Seamless Speech Indexing and Retrieval: Developing a New Technology for the Documentation and Teaching of Endangered Formosan Aboriginal Languages Proceedings of the Intl. Conference on Education and Information Systems: Technologies and Applications (EISTA04), Orlando, Florida, Volume 4, p. 88-93, July 2004. Inproceedings [Details] [BibTeX] |
Jozsef Szakos and Ulrike Glavitsch: Portability, modularity and seamless speech-corpus indexing and retrieval: A new software for documenting (not only) the endangered Formosan aboriginal languages E-MELD Language Digitization Project Conference 2004 on Linguistic Databases and Best Practice, Wayne State University, Detroit, Michigan, July 2004. Misc [Details] [BibTeX] |
René Beutler, Tobias Kaufmann and Beat Pfister: Can grammars improve speech recognition accuracy? MLMI04 Workshop, Hotel du Parc, Martigny, Switzerland, June 2004. Misc [Details] [BibTeX] |
Harald Romsdorfer: An Approach to an Improved Segmentation of Speech May 2004. Techreport [Details] [BibTeX] |
Urs Niesen and Beat Pfister: Speaker verification by means of ANNs In Proceedings of the ESANN 04, Bruges (Belgium), p. 145-150, April 2004. Inproceedings [Details] [BibTeX] [Paper as PDF] |
Harald Romsdorfer and Beat Pfister: Multi-Context Rules for Phonological Processing in Polyglot TTS Synthesis Proceedings of Interspeech-ICSLP 2004, Jeju, Korea, p. 845-848, 2004. Inproceedings [Details] [BibTeX] |
2003 |
Ulrike Glavitsch: Speaker normalization with respect to F0: a perceptual approach December 2003. Techreport [Details] [BibTeX] |
Harald Romsdorfer and Beat Pfister: Mixed-lingual text analysis for polyglot TTS synthesis Poster Presentation for (IM)2 Summer Institute, (IM)2 Summer Institute, Crans-Montana, Switzerland, October 2003. Misc [Details] [BibTeX] |
Harald Romsdorfer and Beat Pfister: Mixed-lingual text analysis for polyglot TTS synthesis Poster Presentation for (IM)2 NCCR Review Panel, Hotel du Parc, Martigny, Switzerland, October 2003. Misc [Details] [BibTeX] |
Beat Pfister and René Beutler: Estimating the weight of evidence in forensic speaker verification Proceedings of Eurospeech 2003, Geneva, p. 701-704, September 2003. Inproceedings [Details] [BibTeX] |
Beat Pfister and Harald Romsdorfer: Mixed-lingual Text Analysis for Polyglot TTS Synthesis Proceedings of Eurospeech 2003, Geneva, Switzerland, p. 2037-2040, September 2003. Inproceedings [Details] [BibTeX] [Paper as PDF] |