ETH Zurich
Computer Engineering and Networks Laboratory (TIK)
 

List of Publications in Traditional Format by Group "Computer Engineering" containing Keyword "SPE" sorted by "Year"


59 entries found.

2016

Naoya Takahashi and Tofigh Naghibi and Beat Pfister:
Automatic pronunciation generation by utilizing a semi-supervised deep neural networks
Proceedings of the Interspeech, San Francisco (USA), September 2016.
Inproceedings
Naoya Takahashi and Tofigh Naghibi and Beat Pfister and Luc Van Gool:
Deep convolutional neural networks and data augmentation for acoustic event recognition
Proceedings of the Interspeech, San Francisco (USA), September 2016.
Inproceedings
Hui Liang:
Detecting emphasised spoken words by considering them prosodic outliers and taking advantage of HMM-based TTS framework
Speech Prosody 2016, Boston, USA, May 2016.
Inproceedings


2015

Tofigh Naghibi and Sarah Hoffmann and Beat Pfister:
A semidefinite programming based search strategy for feature selection with mutual information measure
IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume 37, Issue 8, p. 1529-1541, August 2015.
Article
Tofigh Naghibi:
Towards Robust Audio-Visual Speech Recognition
2015.
PhD Thesis


2014

Tofigh Naghibi and Beat Pfister:
A boosting framework on grounds of online learning
In Proceedings of NIPS, Montréal (Canada), December 2014.
Inproceedings
Hui Liang:
Investigation into Transferability of Duration of Emphasised Words from Original Expression to Spoken Translation
Gloriastrasse 35, 8092 Zurich, Switzerland, November 2014.
Techreport
Pratyush Kumar and Lothar Thiele:
p-YDS Algorithm: An Optimal Extension of YDS Algorithm to Minimize Expected Energy For Real-Time Jobs
14th International Conference on Embedded Software (EMSOFT), New Delhi, India, October 2014.
Inproceedings
Hui Liang and Sarah Hoffmann:
Capturing Speaker-Independent Prosodic Information by Syntax Tree-Based Prosody Modelling
Gloriastrasse 35, 8092 Zurich, Switzerland, June 2014.
Techreport
Sarah Hoffmann:
A Data-driven Model for the Generation of Prosody from Syntactic Sentence Structures
Zurich, 2014.
PhD Thesis


2013

Tofigh Naghibi, Sarah Hoffmann and Beat Pfister:
An efficient method to estimate pronunciation from multiple utterances
Proceedings of Interspeech, Lyon (France), p. 1951-1955, September 2013.
Inproceedings
Sarah Hoffmann and Beat Pfister:
Text-to-speech alignment of long recordings using universal phone models
Proceedings of Interspeech, Lyon (France), p. 1520-1524, September 2013.
Inproceedings
Tofigh Naghibi, Sarah Hoffmann and Beat Pfister:
Convex approximation of the NP-hard search problem in feature subset selection
Proceedings of ICASSP, Vancouver (Canada), p. 3273-3277, May 2013.
Inproceedings


2012

Thomas Ewender:
Automatic Selection of Speech Segments for Concatenative Speech Synthesis
December 2012.
PhD Thesis
Sarah Hoffmann and Beat Pfister:
Employing sentence structure: Syntax trees as prosody generators
Proceedings of Interspeech, Portland, Oregon (USA), September 2012.
Inproceedings
Tobias Kaufmann and Beat Pfister:
Syntactic language modeling with formal grammars
Speech Communication (Elsevier)
Volume 54, Issue 6, p. 715-731, July 2012.
Article
Tofigh Naghibi and Beat Pfister:
Beamformer design for nonstationary signals by means of interfrequency correlations
Proceedings of SAM, Hoboken, NJ (USA), p. 261-264, June 2012.
Inproceedings
Tofigh Naghibi and Beat Pfister:
An approach to prevent adaptive beamformers from cancelling the desired signal
In Proceedings of ICASSP, Kyoto (Japan), p. 205-208, March 2012.
Inproceedings


2011

Thomas Ewender and Beat Pfister:
Automatically creating a diphone set from a speech database
In Proceedings of Interspeech, Florence (Italy), p. 2169-2172, August 2011.
Inproceedings
Michael Gerber, Tobias Kaufmann and Beat Pfister:
Extended Viterbi algorithm for optimized word HMMs
In Proceedings of ICASSP, Prague (Czech Republic), p. 4932-4935, May 2011.
Inproceedings
Michael Gerber:
Speech Recognition Techniques for Languages with Limited Linguistic Resources
Zurich, 2011.
PhD Thesis


2010

Tobias Kaufmann and Beat Pfister:
Semi-automatic extension of morphological lexica
In Workshop Computational Linguistics - Applications, Wisla (Poland), 2010.
Inproceedings
Sarah Hoffmann and Beat Pfister:
Fully automatic segmentation for prosodic speech corpora
In Proceedings of Interspeech, Makuhari (Japan), p. 1389-1392, 2010.
Inproceedings
Thomas Ewender and Beat Pfister:
Accurate pitch marking for prosodic modification of speech segments
In Proceedings of Interspeech, Makuhari (Japan), p. 178-181, 2010.
Inproceedings


2009

Tobias Kaufmann:
A Rule-based Language Model for Speech Recognition
Zurich, October 2009.
PhD Thesis
Tobias Kaufmann, Thomas Ewender and Beat Pfister:
Improving broadcast news transcription with a precision grammar and discriminative reranking
Proceedings of Interspeech, Brighton (UK), p. 356-359, September 2009.
Inproceedings
Harald Romsdorfer:
Weighted Neural Network Ensemble Models for Speech Prosody Control
Proceedings of Interspeech 2009, Brighton (UK), p. 492-495, September 2009.
Inproceedings
Harald Romsdorfer:
Polyglot Speech Prosody Control
Proceedings of Interspeech 2009, Brighton (UK), p. 488-491, September 2009.
Inproceedings
Harald Romsdorfer:
Combining Weighted Neural Network Ensembles with Factor Relevance Determination for Speech Prosody Control
Proceedings of MLSP Workshop, Grenoble (France), September 2009.
Inproceedings
Harald Romsdorfer:
Polyglot Text-to-Speech Synthesis: Text Analysis & Prosody Control
Zurich, January 2009.
PhD Thesis
Thomas Ewender, Sarah Hoffmann and Beat Pfister:
Nearly Perfect Detection of Continuous F0 Contour and Frame Classification for TTS Synthesis
Proceedings of Interspeech, Brighton (UK), p. 100-103, 2009.
Inproceedings


2008

Michael Gerber and Beat Pfister:
Fast search for common segments in speech signals for speaker verification
In Proceedings of Interspeech, Brisbane (Australia), p. 375-378, 2008.
Inproceedings
Beat Pfister and Tobias Kaufmann:
Sprachverarbeitung: Grundlagen und Methoden der Sprachsynthese und Spracherkennung
2008.
Book


2007

Harald Romsdorfer and Beat Pfister:
Text analysis and language identification for polyglot text-to-speech synthesis
Speech Communication
Volume 49, Issue 9, p. 697-724, September 2007.
Article
René Beutler:
Improving Speech Recognition through Linguistic Knowledge
January 2007.
PhD Thesis
Tobias Kaufmann and Beat Pfister:
Applying licenser rules to a grammar with continuous constituents
In Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, Stanford, p. 150-162, 2007.
Inproceedings
Michael Gerber, René Beutler and Beat Pfister:
Quasi text-independent speaker verification based on pattern matching
In Proceedings of Interspeech, Antwerp, p. 1993-1996, 2007.
Inproceedings
Michael Gerber, Tobias Kaufmann and Beat Pfister:
Perceptron-based class verification
In Proceedings of NOLISP (ISCA Workshop on Non-Linear Speech Processing), Paris, 2007.
Inproceedings


2006

Tobias Kaufmann and René Beutler:
A Hybrid Language Model for Speech Recognition
Poster session at the NCCR IM2 Review Meeting, Martigny, Switzerland, November 2006.
Misc
Michael Gerber and Beat Pfister:
Quasi text-independent speaker verification with neural networks
Poster session at the NCCR IM2 Review Meeting, Martigny, Switzerland, November 2006.
Misc
Beat Pfister and René Beutler:
Improving Speech Recognition thru Linguistics
February 2006.
Techreport
Harald Romsdorfer and Beat Pfister:
Character Stream Parsing of Mixed-lingual Text
ISCA - MultiLing 2006, Stellenbosch, South Africa, 2006.
Inproceedings


2005

René Beutler, Tobias Kaufmann and Beat Pfister:
Integrating a Non-Probabilistic Grammar into Large Vocabulary Continuous Speech Recognition
2005 IEEE Automatic Speech Recognition and Understanding Workshop, San Juan, Puerto Rico, p. 104-109, November 2005.
Inproceedings
Harald Romsdorfer and Beat Pfister:
Phonetic Labeling and Segmentation of Mixed-Lingual Prosody Databases
Proceedings of Interspeech 2005, Lisbon, Portugal, p. 3281-3284, September 2005.
Inproceedings
Michael Gerber and Beat Pfister:
Quasi text-independent speaker verification with neural networks
Extended Abstract for MLMI 05 Workshop in Edinburgh, July 2005.
Misc
René Beutler, Tobias Kaufmann and Beat Pfister:
Using rule-based knowledge to improve LVCSR
Proceedings of ICASSP 2005, Philadelphia, p. 829-832, March 2005.
Inproceedings
Harald Romsdorfer, Beat Pfister and René Beutler:
A Mixed-lingual Phonological Component which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System
Machine Learning for Multimodal Interaction, Berlin Heidelberg New York, p. 263-276, January 2005.
Incollection


2004

Michael Gerber:
Evaluation von Vektorquantisierungsmethoden für das Finden lautlich ähnlicher Abschnitte in Sprachsignalen
Report zum NCCR Projekt IM2.ACP, ETH, Zürich, December 2004.
Misc
Jozsef Szakos and Ulrike Glavitsch:
Seamless Speech Indexing and Retrieval: Developing a New Technology for the Documentation and Teaching of Endangered Formosan Aboriginal Languages
Proceedings of the Intl. Conference on Education and Information Systems: Technologies and Applications (EISTA04), Orlando, Florida, Volume 4, p. 88-93, July 2004.
Inproceedings
Jozsef Szakos and Ulrike Glavitsch:
Portability, modularity and seamless speech-corpus indexing and retrieval: A new software for documenting (not only) the endangered Formosan aboriginal languages
E-MELD Language Digitization Project Conference 2004 on Linguistic Databases and Best Practice, Wayne State University, Detroit, Michigan, July 2004.
Misc
René Beutler, Tobias Kaufmann and Beat Pfister:
Can grammars improve speech recognition accuracy?
MLMI04 Workshop, Hotel du Parc, Martigny, Switzerland, June 2004.
Misc
Harald Romsdorfer:
An Approach to an Improved Segmentation of Speech
May 2004.
Techreport
Urs Niesen and Beat Pfister:
Speaker verification by means of ANNs
In Proceedings of the ESANN 04, Bruges (Belgium), p. 145-150, April 2004.
Inproceedings
Harald Romsdorfer and Beat Pfister:
Multi-Context Rules for Phonological Processing in Polyglot TTS Synthesis
Proceedings of Interspeech-ICSLP 2004, Jeju, Korea, p. 845-848, 2004.
Inproceedings


2003

Ulrike Glavitsch:
Speaker normalization with respect to F0: a perceptual approach
December 2003.
Techreport
Harald Romsdorfer and Beat Pfister:
Mixed-lingual text analysis for polyglot TTS synthesis
Poster Presentation for (IM)2 Summer Institute, Crans-Montana, Switzerland, October 2003.
Misc
Harald Romsdorfer and Beat Pfister:
Mixed-lingual text analysis for polyglot TTS synthesis
Poster Presentation for (IM)2 NCCR Review Panel, Hotel du Parc, Martigny, Switzerland, October 2003.
Misc
Beat Pfister and René Beutler:
Estimating the weight of evidence in forensic speaker verification
Proceedings of Eurospeech 2003, Geneva, p. 701-704, September 2003.
Inproceedings
Beat Pfister and Harald Romsdorfer:
Mixed-lingual Text Analysis for Polyglot TTS Synthesis
Proceedings of Eurospeech 2003, Geneva, Switzerland, p. 2037-2040, September 2003.
Inproceedings
