Michael Pucher

Priv.-Doz. Mag.phil. Dipl.-Ing. Dr.techn. Michael Pucher (michael dot pucher at oeaw dot ac dot at)

Senior Research Scientist

Acoustics Research Institute (ARI), Austrian Academy of Sciences (ÖAW)

Wohllebengasse 12-14 / 1st Floor, Vienna A-1040, Austria






I obtained my doctoral degree (Dr.techn.) in Electrical and Information Engineering from Graz University of Technology in 2007. In 2017 I received the venia docendi in Speech Communication at Graz University of Technology with a habilitation thesis on Speech Processing for Multimodal and Adaptive Systems. I hold a master's degree (Dipl.-Ing.) in Computer Science from Vienna University of Technology (TUW) and a diploma degree (Mag.phil.) in Philosophy from the University of Vienna. In recent years my work has focused on improving state-of-the-art speech synthesis technologies for the synthesis of language varieties and audio-visual speech. I have also made significant contributions in the area of speaker verification spoofing, where we showed how adaptive synthesizers can spoof a speaker verification system. Currently I am working on multimodal dialect synthesis and the synthesis of singing speech. From 2007 to 2015 I was Senior Researcher at the Telecommunications Research Center Vienna (FTW). Since 2016 I have been Senior Research Scientist at the Acoustics Research Institute (ARI) of the Austrian Academy of Sciences (ÖAW).











Projects

Ongoing Research Projects

[NII - National Institute of Informatics, Japan] OPERA - Acoustic analysis and statistical modelling of Vienna opera singers (as external collaborator) 2013 - 2017

Completed Research Projects

[FWF: P23821-N23] AMTV - Acoustic modeling and transformation of varieties for speech synthesis (as principal investigator) 2012 - 2016
[BMWF - Sparkling Science] SALB - Speech synthesis of auditory lecture books for blind children (as principal investigator) 2013 - 2015
[FWF: P22890-N23] AVDS - Adaptive Audio-Visual Dialect Synthesis (as principal investigator) 2011 - 2014
[EU-NET] EUCOG III: European Network for the Advancement of Artificial Cognitive Systems, Interaction and Robotics (as member)
[EU-COST] Cost 2102: Cross-Modal Analysis of Verbal and Non-verbal Communication (as representative)
[WWTF] VSDS - Viennese Sociolect and Dialect Synthesis (as principal investigator)
[COMET] HI-MONI - Highway Monitoring (as project manager)
[T-LABS] TIDE - Testbed for Interactive Dialog System Evaluation (as project manager)
[EU-FP6] AMI - Augmented Multiparty Interaction
[K-PLUS] MONA - Mobile Multimodal Next Generation Applications
[K-PLUS] Service Platform and Interoperability
[K-PLUS] Speech and More

Development Projects

December 2014: Bad Goisern and Innervillgraten Audio-Visual Dialect Speech Corpus (GIDS).
December 2014: Release of SALB - a frontend for speech synthesis using HTS voice models.
May 2014: Release of Multi-Modal Annotated Synchronous Corpus of Speech (MMASCS).
October 2013: Release of Austrian German open source HTS voice.
April 2012: New demo website with new HTS-44kHz voices (Austrian German, Viennese dialect, Viennese Standard).
September 2010: "Leopold" available for Windows and Mac OS X from the webshop of CereProc, UK.
May 2010: Development of "Leopold", the first synthetic voice for Austrian German, together with company partners; the voice was integrated into a web reading service for the website of the City of Vienna.
May 2010: Open source release of 3 Viennese voices for the Festival Speech Synthesis System, presented at the 7th International Conference on Language Resources and Evaluation (LREC) [conference paper].





Publications

See also Google Scholar citations

Invited talks

Invited talk on Interpolation of language varieties in HMM-based speech synthesis, 23 May 2014, NII SMG group, Tokyo, Japan.
Keynote talk on Acoustic modeling, interpolation, and transformation of language varieties for speech synthesis at the International Dagstuhl Workshop on Multilinguality in Speech Research: Data, Methods and Models, 9-11 April 2014, Dagstuhl, Germany.


Journal articles

2017, Michael Pucher, Bettina Zillinger, Markus Toman, Dietmar Schabus, Cassia Valentini-Botinhao, Junichi Yamagishi, Erich Schmid, Thomas Woltron, Influence of speaker familiarity on blind and visually impaired children's and young adults' perception of synthetic voices. Computer Speech and Language (accepted).
2017, Michael Pucher, Sylvia Moosmüller, Michaela Rausch-Supola, Aufnahme von hochwertigen authentischen Dialektdaten im Feld für die Verwendung in der Sprachsynthese. Germanistische Linguistik (accepted).
2015, Cassia Valentini-Botinhao, Markus Toman, Michael Pucher, Dietmar Schabus, Junichi Yamagishi, Intelligibility of time-compressed synthetic speech: compression method and speaking style. Speech Communication, Volume 74, pp. 52-64, November 2015.
2015, Markus Toman, Michael Pucher, Sylvia Moosmüller, Dietmar Schabus, Unsupervised and phonologically controlled interpolation of Austrian German language varieties for speech synthesis. Speech Communication, Volume 72, pp. 176-193, September 2015 (Samples).
2014, Dietmar Schabus, Michael Pucher, Gregor Hofer, Joint Audiovisual Hidden Semi-Markov Model-based Speech Synthesis. IEEE Journal of Selected Topics in Signal Processing. Vol. 8, No. 2, pp. 336-347, April 2014 (Samples).
2012, Phillip L. De Leon, Michael Pucher, Junichi Yamagishi, Inma Hernaez, Ibon Saratxaga, Evaluation of Speaker Verification Security and Detection of HMM-Based Synthetic Speech. IEEE Transactions on Audio, Speech, and Language Processing, Volume 20, Issue 8, October 2012, Pages 2280-2290.
2010, Michael Pucher, Dietmar Schabus, Junichi Yamagishi, Friedrich Neubarth, Volker Strom, Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis. Speech Communication, Volume 52, Issue 2, February 2010, Pages 164-179.
2002, Georg Niklfeld, Michael Pucher, Robert Finan, Wolfgang Eckhart, Kombinierte Sprache/Display-Schnittstellen für mobile Datendienste. PIK - Praxis der Informationsverarbeitung und Kommunikation, 25 (4), pages 196-201.


Conference and workshop papers

2017

2017, Michael Pucher, Carina Lozo, Sylvia Moosmüller, Phone mapping and prosodic transfer in speech synthesis of similar dialect pairs. 28th Conference on Electronic Speech Signal Processing, Saarbrücken, Germany, 2017, pp. 180-185.

2016

2016, Michael Pucher, Michaela Rausch-Supola, Sylvia Moosmüller, Markus Toman, Dietmar Schabus, Friedrich Neubarth, Open data for speech synthesis of Austrian German language varieties. 12. Tagung Phonetik und Phonologie im deutschsprachigen Raum, München, 2016, pp. 147-150.
2016, Michael Pucher, Fernando Villavicencio, Junichi Yamagishi, Development of a statistical parametric synthesis system for operatic singing in German. 9th ISCA Speech Synthesis Workshop (SSW9), Sunnyvale, CA, USA, pp. 64-69. (Samples).
2016, Michael Pucher, Sylvia Moosmüller, Michaela Rausch-Supola, Aufnahme von hochwertigen authentischen Dialektdaten im Feld. 13. Bayerisch-österreichische Dialektologentagung, Erlangen, Germany.
2016, Michael Pucher, Sylvia Moosmüller, Analysis of phonetic dialect/standard relations in model interpolation. Experimental Approaches to Perception and Production of Language Variation, Vienna, Austria.

2015

2015, Fernando Villavicencio, Jordi Bonada, Junichi Yamagishi, Michael Pucher, Efficient Pitch Estimation on Natural Opera-Singing by a Spectral Correlation based Strategy. Information Processing Society of Japan SIG Technical Report, Number 1, pp. 1-6.
2015, Michael Pucher, Dietmar Schabus, Visio-articulatory to acoustic conversion of speech. FAAVSP 2015, Vienna, Austria, Article No. 6.
2015, Dietmar Schabus, Michael Pucher, Comparison of dialect models and phone mappings in HSMM-based visual dialect speech synthesis. FAAVSP 2015, Vienna, Austria, pp. 84-87.
2015, Michael Pucher, Markus Toman, Dietmar Schabus, Cassia Valentini-Botinhao, Junichi Yamagishi, Bettina Zillinger, Erich Schmid, Influence of speaker familiarity on blind and visually impaired children's perception of synthetic voices in audio games. Proceedings of Interspeech 2015, Dresden, Germany, pp. 1625-1629.
2015, Markus Toman, Michael Pucher, Evaluation of state mapping based foreign accent conversion. Proceedings of Interspeech 2015, Dresden, Germany, pp. 304-308.
2015, Michael Pucher, Valon Xhafa, Agni Dika, Markus Toman, Adaptive speech synthesis of Albanian dialects. Text, Speech, and Dialogue (TSD) 2015, Pilsen, Czech Republic, pp. 158-164.
2015, Markus Toman, Michael Pucher, An Open Source Speech Synthesis Frontend for HTS. Text, Speech, and Dialogue (TSD) 2015, Pilsen, Czech Republic, pp. 291-298.

2014

2014, Cassia Valentini-Botinhao, Markus Toman, Michael Pucher, Dietmar Schabus, Junichi Yamagishi, Intelligibility analysis of fast synthesized speech. In Proceedings of Interspeech 2014, Singapore, pp. 2922-2926.
2014, Dietmar Schabus, Michael Pucher, Phil Hoole, The MMASCS multi-modal annotated synchronous corpus of audio, video, facial motion and tongue motion data of normal, fast and slow speech. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland, pp. 3411-3416.

2013

2013, Jakob Hollenstein, Michael Pucher, Dietmar Schabus, Visual Control of Hidden-Semi-Markov-Model based Acoustic Speech Synthesis. International Conference on Auditory-Visual Speech Processing (AVSP 2013), Annecy, France, pp. 31-35 (Samples).
2013, Dietmar Schabus, Michael Pucher, Gregor Hofer, Objective and Subjective Feature Evaluation for Speaker-Adaptive Visual Speech Synthesis. International Conference on Auditory-Visual Speech Processing (AVSP 2013), Annecy, France, pp. 37-42.
2013, Markus Toman, Michael Pucher, Dietmar Schabus, Multi-variety adaptive acoustic modeling in HSMM-based speech synthesis. 8th ISCA Speech Synthesis Workshop (SSW8), Barcelona, Spain, pp. 83-87.
2013, Markus Toman, Michael Pucher, Dietmar Schabus, Cross-variety speaker transformation in HSMM-based speech synthesis. 8th ISCA Speech Synthesis Workshop (SSW8), Barcelona, Spain, pp. 77-81.
2013, Markus Toman, Michael Pucher, Structural KLD for cross-variety speaker adaptation in HMM-based speech synthesis. 10th IASTED International Conference on Signal Processing, Pattern Recognition and Applications (SPPRA2013), Innsbruck, Austria.

2012

2012, Dietmar Schabus, Michael Pucher, Gregor Hofer, Speaker-adaptive visual speech synthesis in the HMM-framework. 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), Portland, USA, pp. 979-982 (Samples).
2012, Ibon Saratxaga, Inma Hernaez, Michael Pucher, Eva Navas, Inaki Sainz, Perceptual Importance of the Phase Related Information in Speech. 13th Annual Conference of the International Speech Communication Association (INTERSPEECH 2012), Portland, USA, pp. 1448-1451.
2012, Michael Pucher, Dietmar Schabus, Gregor Hofer, Nadja Kerschhofer-Puhalo, Sylvia Moosmüller, Regionalizing Virtual Avatars - Towards Adaptive Audio-Visual Dialect Speech Synthesis. CogSys 2012, 5th International Conference on Cognitive Systems, Vienna, Austria, p. 95.
2012, Michael Pucher, Nadja Kerschhofer-Puhalo, Dietmar Schabus, Sylvia Moosmüller, Gregor Hofer, Language resources for the adaptive speech synthesis of dialects. 7. Kongress der Internationalen Gesellschaft für Dialektologie und Geolinguistik (SIDG), Vienna, Austria, pp. 174-175. [presentation]
2012, Michael Pucher, Dietmar Schabus, Gregor Hofer, From Viennese to Austrian German and back again - An algorithm for the realization of a variety-slider. 7. Kongress der Internationalen Gesellschaft für Dialektologie und Geolinguistik (SIDG), Vienna, Austria, pp. 176-177. [presentation]
2012, Dietmar Schabus, Michael Pucher, Gregor Hofer, Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC 2012), pp. 3313-3316, Istanbul, Turkey.

2011

2011, Michael Pucher, Nadja Kerschhofer-Puhalo, Dietmar Schabus, Phone set selection for HMM-based dialect speech synthesis. 1st Workshop on Algorithms and Resources for Modelling of Dialects and Language Varieties (DIALECTS 2011). EMNLP 2011: Conference on Empirical Methods in Natural Language Processing, Edinburgh, UK, pp. 65-69.
2011, Dietmar Schabus, Michael Pucher, Gregor Hofer, Simultaneous Speech and Animation Synthesis. Poster at 38th International Conference and Exhibition on Computer Graphics and Interactive Techniques (SIGGRAPH 2011), Vancouver, Canada. [video]
2011, Phillip L. De Leon, Inma Hernaez, Ibon Saratxaga, Michael Pucher, Junichi Yamagishi, Detection of synthetic speech for the problem of imposture. In Proceedings of the 36th International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, pp. 4844-4847.
2011, Dietmar Schabus, Thomas Zemen, Michael Pucher, Distributed Field Estimation Algorithms in Vehicular Sensor Networks. IEEE 73rd Vehicular Technology Conference (VTC2011-Spring), Budapest, Hungary, pp. 1-5.

2010

2010, Michael Pucher, Dietmar Schabus, Junichi Yamagishi, Synthesis of fast speech with interpolation of adapted HSMMs and its evaluation by blind and sighted listeners. 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010), Makuhari, Japan, pp. 2186-2189.
2010, Michael Pucher, Friedrich Neubarth, Volker Strom, Sylvia Moosmüller, Gregor Hofer, Christian Kranzler, Gudrun Schuchmann, Dietmar Schabus, Resources for speech synthesis of Viennese varieties. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC), Valletta, Malta, pp. 105-108. [presentation]
2010, Phillip L. De Leon, Michael Pucher, Junichi Yamagishi, Evaluation of the Vulnerability of Speaker Verification to Synthetic Speech. In Proceedings of Odyssey 2010 - The Speaker and Language Recognition Workshop, Brno, Czech Republic, pp. 151-158.
2010, Phillip L. De Leon, Vijendra Raj Apsingekar, Michael Pucher, Junichi Yamagishi, Revisiting the security of speaker verification systems against imposture using synthetic speech. In Proceedings of the 35th International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, USA, pp. 1798-1801.
2010, Michael Pucher, Dietmar Schabus, Peter Schallauer, Yuriy Lypetskyy, Franz Graf, Harald Rainer, Michael Stadtschnitzer, Sabine Sternig, Josef Birchbauer, Wolfgang Schneider, Bernhard Schalko, Multimodal Highway Monitoring for Robust Incident Detection. 13th International IEEE Conference on Intelligent Transportation Systems (ITSC), Madeira, Portugal, pp. 837-842.
2010, Michael Pucher, Friedrich Neubarth, Dietmar Schabus, Design and development of spoken dialog systems incorporating speech synthesis of Viennese varieties. In Proceedings of the 12th International Conference on Computers Helping People with Special Needs (ICCHP 2010), Vienna, Austria, pp. 361-366.

2009

2009, Michael Pucher, Friedrich Neubarth, Volker Strom, Optimizing phonetic encoding for Viennese dialect unit selection speech synthesis. COST 2102 conference, Dublin, 2009, LNCS 5967, pp. 207-216, 2010.
2009, Christian Kranzler, Franz Pernkopf, Rudolf Muhr, Michael Pucher, Friedrich Neubarth, Text-to-Speech Engine with Austrian German Corpus. In Proceedings of the XIII International conference Speech and Computer (SPECOM 2009), St. Petersburg, Russia.

2008

2008, Michael Pucher, Gudrun Schuchmann, Peter Fröhlich, Regionalized Text-to-Speech Systems: Persona Design and Application Scenarios. In Lecture Notes in Artificial Intelligence (LNAI), volume 5398, pages 216-222. COST Action 2102 School, Vietri sul Mare, Italy.
2008, Friedrich Neubarth, Michael Pucher, Christian Kranzler, Modeling Austrian dialect varieties for TTS. In Proceedings of the 9th Annual Conference of the International Speech Communication Association (INTERSPEECH 2008), pages 1877-1880, Brisbane, Australia.

2007

2007, Michael Pucher, Andreas Türk, Jitendra Ajmera, Natalie Fecher, Phonetic distance measures for speech recognition vocabulary and grammar optimization. In Proceedings of the 3rd Congress of the Alps Adria Acoustics Association, Graz, Austria.
2007, Sebastian Möller, Klaus-Peter Engelbrecht, Michael Pucher, Peter Fröhlich, Lu Huo, Ulrich Heute, Frank Oberle, TIDE: A testbed for interactive spoken dialogue system evaluation. In Proceedings of the XII International Conference Speech and Computer (SPECOM 2007), Moscow, Russia.
2007, Michael Pucher, WordNet-based semantic relatedness measures in automatic speech recognition for meetings. In Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), pages 129-132, Prague, Czech Republic.

2006

2006, Michael Pucher, Yan Huang, Özgür Çetin, Combination of latent semantic analysis based language models for meeting recognition. In Proceedings of the Second IASTED International Conference on Computational Intelligence (CI 2006), pages 465-469, San Francisco, USA.
2006, Michael Pucher, Yan Huang, Özgür Çetin, Optimization of latent semantic analysis based language model interpolation for meeting recognition. In Proceedings of the 5th Slovenian and 1st International Language Technologies Conference, pages 74-78, Ljubljana, Slovenia.

2005

2005, Michael Pucher, Peter Fröhlich, A user study on the influence of mobile device class, synthesis method, data rate and lexicon on speech synthesis quality. In Proceedings of the 9th European Conference on Speech Communication and Technology (EUROSPEECH 2005), pages 2501-2504, Lisboa, Portugal.
2005, Georg Niklfeld, Hermann Anegg, Michael Pucher, Raimund Schatz, Rainer Simon, Florian Wegscheider, Alexander Gassner, Michael Jank, Günther Pospischil, Device independent mobile multimodal user interfaces with the MONA Multimodal Presentation Server. In Proceedings of the Eurescom summit 2005 on Ubiquitous Services and Applications, Heidelberg, Germany.
2005, Michael Pucher, Performance evaluation of WordNet-based semantic relatedness measures for word prediction in conversational speech. In Proceedings of the 6th International Workshop on Computational Semantics (IWCS 6), pages 332-342, Tilburg, the Netherlands.
2005, Michael Pucher, Yan Huang, Latent semantic analysis based language models for meetings. 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI 2005), Edinburgh, UK.

2004

2004, Hermann Anegg, Thomas Dangl, Michael Jank, Georg Niklfeld, Michael Pucher, Raimund Schatz, Rainer Simon, Florian Wegscheider, Multimodal interfaces in mobile devices - the MONA project. In Proceedings of the Workshop on Emerging Applications for Wireless and Mobile Access. 13th International World Wide Web Conference (WWW 2004), New York, USA.
2004, Lynne Baillie, Michael Pucher, Marian Kepesi, A multimodal mobile robot for the home. In Proceedings of the IADIS International Conference e-Society 2004, Avila, Spain.
2004, Lynne Baillie, Michael Pucher, Marian Kepesi, A supportive multimodal mobile robot for the home. In Lecture Notes in Computer Science (LNCS), volume 3196, pages 375-383. 8th ERCIM Workshop on User Interfaces for All, Vienna, Austria.

2003

2003, Michael Pucher, Friedrich Neubarth, Erhard Rank, Georg Niklfeld, Qi Guan, Combining non-uniform unit selection with diphone based synthesis. In Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH 2003), pages 1329-1332, Geneva, Switzerland.
2003, Michael Pucher, Marian Kepesi, Multimodal Mobile Robot Control using Speech Application Language Tags. In Lecture Notes in Computer Science (LNCS), volume 2875, pages 56-64. European Symposium on Ambient Intelligence, Eindhoven, the Netherlands.
2003, Michael Pucher, Julia Tertyshnaya, Florian Wegscheider, Personal voice call assistant: SIP and VoiceXML in a distributed environment. In Proceedings of the Workshop on Emerging Applications for Wireless and Mobile Access. 12th International World Wide Web Conference (WWW 2003), Budapest, Hungary.

2002

2002, Georg Niklfeld, Michael Pucher, Robert Finan, Wolfgang Eckhart, Mobile multi-modal data services for GPRS phones and beyond. In Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), Pittsburgh, USA.
2002, Georg Niklfeld, Michael Pucher, Robert Finan, Wolfgang Eckhart, Steps towards multi-modal data services in GPRS and in UMTS or WLAN networks. In Proceedings of the ISCA Tutorial and Research Workshop on Multi-Modal Dialogue in Mobile Environments, Irsee, Germany.

2001

2001, Georg Niklfeld, Robert Finan, Michael Pucher, Multimodal interface architecture for mobile data services. In Proceedings of the Workshop on Wearable Computing (TCMC 2001), Graz, Austria.
2001, Georg Niklfeld, Robert Finan, Michael Pucher, Architecture for adaptive multimodal dialog systems based on VoiceXML. In Proceedings of the 7th European Conference on Speech Communication and Technology (EUROSPEECH 2001), pages 2341-2344, Aalborg, Denmark.
2001, Georg Niklfeld, Robert Finan, Michael Pucher, Component-based multimodal dialog interfaces for mobile knowledge creation. In Proceedings of the Workshop on Human Language Technology and Knowledge Management, pages 103-110. 39th Annual Meeting of the Association for Computational Linguistics (ACL 2001), Toulouse, France.

Book chapters

2008, Sebastian Möller, Klaus-Peter Engelbrecht, Michael Pucher, Peter Fröhlich, Lu Huo, Ulrich Heute, Frank Oberle, A New Testbed for Semi-automatic Usability Evaluation and Optimization of Spoken Dialogue Systems. In Usability of Speech Dialog Systems - Listening to the Target Audience (T. Hempel, ed.), pages 81-103, Springer, Berlin, Germany.
2005, Georg Niklfeld, Michael Pucher, Robert Finan, Wolfgang Eckhart, Wolfgang Minker, A Path to Multimodal Data Services for Telecommunications. In Spoken Multimodal Human-Computer Dialogue in Mobile Environments, pages 149-167, Springer, Netherlands.


Theses

2017, Michael Pucher, Speech processing for multimodal and adaptive systems, Habilitation thesis, Venia docendi in Speech Communication, Graz University of Technology.
2015, Michael Pucher, A Hidden-Markov-Model (HMM) based Opera Singing Synthesis System for German, Master's thesis, Computer Science, Vienna University of Technology.
2007, Michael Pucher, Semantic Similarity in Automatic Speech Recognition for Meetings, Doctoral Thesis, Electrical and Information Engineering, Graz University of Technology.
2001, Michael Pucher, Formale Wahrheitstheorien nach Alfred Tarski, Diploma Thesis, Philosophy, University of Vienna.





Teaching

Winter semester 2016/2017: Lecture on Computational Semantics at the Institute of Computer Languages at Vienna University of Technology
Summer semester 2016: Lecture on Cognitive User Interfaces at the Institute of Computer Languages at Vienna University of Technology
Summer semester 2015: Lecture on Cognitive User Interfaces at the Institute of Computer Languages at Vienna University of Technology
Winter semester 2014/2015: Lecture on Computational Semantics at the Institute of Computer Languages at Vienna University of Technology
Summer semester 2014: Lecture on Cognitive User Interfaces at the Institute of Computer Languages at Vienna University of Technology
Winter semester 2013/2014: Lecture on Computational Semantics at the Institute of Computer Languages at Vienna University of Technology
Summer semester 2013: Lecture on Cognitive User Interfaces at the Institute of Computer Languages at Vienna University of Technology
Winter semester 2011/2012: Lecture on Cognitive User Interfaces at the Institute of Computer Languages at Vienna University of Technology
July 2011: Seminar on Audio-Visual Speech Synthesis at the Signal Processing Laboratory (Aholab) of the University of the Basque Country
Summer semester 2008: Seminar on Speech Synthesis at the Signal Processing and Speech Communication Laboratory (SPSC Lab) at Graz University of Technology


I co-supervised the following PhD theses:
2016, Markus Toman, Acoustic modeling and transformation of varieties for speech synthesis (Vienna University of Technology).
2014, Dietmar Schabus, Audiovisual speech synthesis based on hidden Markov models (Graz University of Technology).


I co-supervised the following diploma theses:
2013, Jakob Hollenstein, Visual Control of Audio-Visual Speech Synthesis. Master's thesis. Vienna University of Technology.
2009, Dietmar Schabus, Interpolation of Austrian German and Viennese Dialect / Sociolect in HMM-based Speech Synthesis. Diploma thesis. Vienna University of Technology.
2008, Christian Kranzler, Text-to-Speech Engine with Austrian German corpus. Diploma thesis. Graz University of Technology.
2008, Michael Bruss, Quantitative und phonetische Analyse von nicht-linguistischen Partikeln in spontan gesprochener Sprache der Wiener Soziolekte. Magisterarbeit. Universität des Saarlandes, Saarbrücken.





Curriculum Vitae

Professional Experience

Since 2017: Priv.-Doz. at the Signal Processing and Speech Communication Laboratory (SPSC Lab) at Graz University of Technology
Since 2016: Senior Research Scientist at the Acoustics Research Institute (ARI), Austrian Academy of Sciences (ÖAW)
Since 2011: Lecturer at Vienna University of Technology (TUW)
2007 to 2015: Senior Researcher at the Telecommunications Research Center Vienna (FTW)
Since 2001: Researcher at the Telecommunications Research Center Vienna (FTW)
1999 to 2002: Software/database design and development with Java2/Oracle
1999: Teaching assistant at the Institute for Database Systems and Artificial Intelligence (DBAI) at Vienna University of Technology (TUW)
1989 to 1993: Worked as a chef in restaurants in Austria and Liechtenstein


Education

2017: Habilitation (venia docendi) in Speech Communication at Graz University of Technology with a thesis on Speech Processing for Multimodal and Adaptive Systems
February to September 2017: Paternity leave
2015: Master's degree (Dipl.-Ing.) in Computer Science from Vienna University of Technology
2010 to 2015: Master's studies in Computer Science (Computational Intelligence) at Vienna University of Technology
2007: Doctoral degree (Dr.techn.) in Electrical and Information Engineering (with distinction) from Graz University of Technology
2004 to 2007: Doctoral studies in Electrical Engineering (Speech Communication) at Graz University of Technology
2001: Diploma degree (Mag.phil.) in Philosophy (with distinction) from the University of Vienna
1995 to 2000: Diploma studies in Computer Science (Computational Logic) at Vienna University of Technology
1994 to 2001: Diploma studies in Philosophy, Logic, and Mathematics at the University of Vienna
1994: Studienberechtigungsprüfung (university entrance qualification examination)
1994: Studies in Interdisciplinary Art at Wiener Kunstschule
January to April 1992: French language course in Paris
1984 to 1988: Cook apprenticeship
1979 to 1984: High school in Judenburg, Austria
1975 to 1979: Primary school in Trieben, Austria


Research Visits

April to May 2014: National Institute of Informatics (NII), Tokyo, Japan
August to September 2008: Centre for Speech Technology Research (CSTR), University of Edinburgh, UK
August 2006: Telekom Innovation Laboratories (T-Labs), Berlin, Germany
February to July 2005: International Computer Science Institute (ICSI), Berkeley, California





Professional activities

Organizing

Area chair for Speech Synthesis and Spoken Language Generation of INTERSPEECH 2015
Organizing committee member of FAAVSP 2015 - The 1st Joint Conference on Facial Analysis, Animation and Auditory-Visual Speech Processing
Program committee member of ACM Multimedia 2014
Organizing committee member of FAA 2012 - The ACM 3rd International Symposium on Facial Analysis and Animation
Organizing committee member of ICAD 2005 Workshop - Combining Speech and Sound in the User Interface


Awards

WINTEC 2016 Preis für Inklusion durch Wissenschaft und Technik (WINTEC 2016 prize for inclusion through science and technology)


Reviewing

Speech Communication, Elsevier
Computer Speech and Language, Elsevier
IEEE Transactions on Audio, Speech, and Language Processing
IEEE Journal of Selected Topics in Signal Processing
IEEE Signal Processing Letters
IEEE Transactions on Systems, Man, and Cybernetics (Part B - Cybernetics)
Journal of the Acoustical Society of America
Cognitive Computation, Springer
Computer Methods and Programs in Biomedicine, Elsevier
The Computer Journal, Oxford University Press


Memberships

IEEE
ACM
International Speech Communication Association (ISCA)
European Network for the Advancement of Artificial Cognitive Systems, Interaction and Robotics (EUCOG III)
Cross-Modal Analysis of Verbal and Nonverbal Communication (COST 2102)