News

Audiobooks can today be created with AI

The first audiobooks were created in 1932 on vinyl records. Noticeable audiobook market growth started later in the 1960s by the advance of technology and cassette tapes and in the 1980s by compact discs. Today audiobooks can be created with artificial intelligence text to speech tools and applications.
The Fifth Wave - BRIE-ETLA Collection of Articles book cover

The Department of Industrial Engineering and Management (TUTA) has published its first AI audiobook (four chapters of the book) conducted as an auto-ethnographic study project during the summer 2023 from the book ‘The Fifth Wave – BRIE-ETLA Collection of Articles´ edited by Timo Seppälä, Tomasz Mucha and Juri Mattila. Autoethnography is an autobiographical way of conducting research, in which the researcher's own experiences and the field notes written or otherwise produced from them form the central material of the research.

The motivation behind the AI-based audiobook project was quite straightforward. A colleague of senior lecturer Timo Seppälä from the machinery industry once mentioned that he didn't have the time to read Timo’s book, but he could listen to it while commuting to work or returning home. 

How was the AI-based audiobook done? 

“I read 20 and my colleague Tomasz 50 sentences to the AI tool. We then discovered that 50 sentences produced much better sound quality, so I went back to the studio and read another 60 sentences. Based on these, the AI tool cloned our voices and used those as narrator for the book”, explains Timo.  

The producer of the book, Laura Loikas, is a student of industrial engineering and management at Aalto. She was hired for a month during the summer to work on the project. Laura compared and chose the technology to be used, did the cloning of the voices and edited the articles of the book so that AI could read them. Then she produced the mp3 files and edited the parts where AI made mistakes. Altogether the production costs were around 200 Euros without salary costs. 

“Educational materials are used to be produced in written form, but we see that in the future any written teaching material can be produced in audio by using AI tools and applications. This will also give new tools to the student to learn things regardless of location. As a result of this project, we can now transform written materials into audio with reasonable quality, reduced costs, and increased efficiency. Next time, we will provide instructions for this process”, says Timo. 

Listen to the audiobook

Read more about the book 

  • Published:
  • Updated:

Read more news

Researchers in front of Dipoli in a snowy landscape in Otaniemi
Cooperation, Research & Art Published:

Preserving intangible cultural heritage through immersive XR experiences

Aalto University’s Department of Art and Media is coordinating a European wide project on preserving intangible cultural heritage and using it to address societal challenges with the help of immersive XR environments.
In leaf-like bubbles, different icons refering to e.g. books, gardening, and bicycling
Research & Art, Studies Published:

Find our doctoral community at Aalto Social

Follow the posts, share and discuss anything related to doctoral education at Aalto Social.
Research & Art Published:

Aalto computer scientists in CHI 2024

Ten papers from Aalto CS were accepted to the CHI 2024 conference
Photo of female researcher on bright green background
Awards and Recognition, Research & Art Published:

"Keep an open mind and step outside of your comfort zone"

Read what double award winner Kukka-Emilia Huhtinen wants to share about her doctoral studies in theoretical physics