Informaatio- ja tietoliikennetekniikan laitos

Puheen vuorovaikutusteknologia

Tavoitteenamme on parantaa puheeseen perustuvaa vuorovaikutusta esimerkiksi tietoliikennesovelluksissa ja puherajapintoja käytettäessä. Kehitämme tehokkaita ja resurssien kannalta kestäviä menetelmiä, tarjoamme korkean äänenlaadun ja intuitiivisen vuorovaikutuksen säilyttäen samalla käyttäjien yksityisyyden. Erityisen mielenkiinnon kohteena ovat ympäristöt, joissa useat ihmiset ovat vuorovaikutuksessa useiden laitteiden kanssa, mikä edellyttää edistyneitä viestintä-, todentamis- ja käsittelymenetelmiä.
Speech Interaction Technology

Team Photos
Silas Rech

Silas Rech

Doctoral Researcher
Speech Interaction Technology

Teaching 

Our department provides the following courses in speech and language technology:  

Project topics for Bachelor theses, Master’s theses, and special assignments 

We are always open to suggestions of topics for projects, especially when they are related to our current research described above. To aid in finding exciting topics, we maintain a list of suggested project topics at the Special Assignment –page. Note that even if that page is about special assignment projects, most topics can be scaled also to bachelor and master’s theses.  

Resources 

Viimeisimmät julkaisut

Privacy in Speech Technology

Tom Bäckström 2024

Evaluating privacy, security, and trust perceptions in conversational AI: A systematic review

Anna Leschanowsky, Silas Rech, Birgit Popp, Tom Bäckström 2024

User Perspective on Anonymity in Voice Assistants – A comparison between Germany and Finland

Ingo Siegert, Silas Rech, Tom Bäckström, Matthias Haase 2024 Legal and Ethical Issues in Human Language Technologies 2024, LEGAL 2024 at LREC-COLING 2024 - Workshop Proceedings

Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings using a Joint Loss Function

Joseph Attieh, Abraham Zewoudie, Vladimir Vlassov, Adrian Flanagan, Tom Bäckström 2023 Document Analysis and Recognition – ICDAR 2023 - 17th International Conference, Proceedings

Low-complexity Real-time Neural Network for Blind Bandwidth Extension of Wideband Speech

Esteban Gómez Mellado, Mohammadhassan Vali, Tom Bäckström 2023 31st European Signal Processing Conference, EUSIPCO 2023 - Proceedings

Privacy and Quality Improvements in Open Offices Using Multi-Device Speech Enhancement

Silas Rech, Mohammadhassan Vali, Tom Bäckström 2023 3rd Symposium on Security and Privacy in Speech Communication

The Internet of Sounds: Convergent Trends, Insights and Future Directions

Luca Turchet, Mathieu Lagrange, Cristina Rottondi, György Fazekas, Nils Peters, Jan Østergaard, Frederic Font, Tom Bäckström, Carlo Fischione 2023

Interpretable Latent Space Using Space-Filling Curves for Phonetic Analysis in Voice Conversion

Mohammadhassan Vali, Tom Bäckström 2023 Proceedings of Interspeech Conference

Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing

Mohammadhassan Vali, Tom Bäckström 2023 International Conference on Acoustics, Speech, and Signal Processing

Speech Localization at Low Bitrates in Wireless Acoustics Sensor Networks

Mariem Bouafif, Pablo Perez Zarazaga, Tom Bäckström, Zied Lachiri 2022
Lisää tietoa tutkimuksestamme löytyy Aallon tutkimusportaalista.
Tutkimusportaali
  • Julkaistu:
  • Päivitetty: