News

Audiobooks can today be created with AI

The first audiobooks were created in 1932 on vinyl records. Noticeable audiobook market growth started later in the 1960s by the advance of technology and cassette tapes and in the 1980s by compact discs. Today audiobooks can be created with artificial intelligence text to speech tools and applications.
The Fifth Wave - BRIE-ETLA Collection of Articles book cover

The Department of Industrial Engineering and Management (TUTA) has published its first AI audiobook (four chapters of the book) conducted as an auto-ethnographic study project during the summer 2023 from the book ‘The Fifth Wave – BRIE-ETLA Collection of Articles´ edited by Timo Seppälä, Tomasz Mucha and Juri Mattila. Autoethnography is an autobiographical way of conducting research, in which the researcher's own experiences and the field notes written or otherwise produced from them form the central material of the research.

The motivation behind the AI-based audiobook project was quite straightforward. A colleague of senior lecturer Timo Seppälä from the machinery industry once mentioned that he didn't have the time to read Timo’s book, but he could listen to it while commuting to work or returning home. 

How was the AI-based audiobook done? 

“I read 20 and my colleague Tomasz 50 sentences to the AI tool. We then discovered that 50 sentences produced much better sound quality, so I went back to the studio and read another 60 sentences. Based on these, the AI tool cloned our voices and used those as narrator for the book”, explains Timo.  

The producer of the book, Laura Loikas, is a student of industrial engineering and management at Aalto. She was hired for a month during the summer to work on the project. Laura compared and chose the technology to be used, did the cloning of the voices and edited the articles of the book so that AI could read them. Then she produced the mp3 files and edited the parts where AI made mistakes. Altogether the production costs were around 200 Euros without salary costs. 

“Educational materials are used to be produced in written form, but we see that in the future any written teaching material can be produced in audio by using AI tools and applications. This will also give new tools to the student to learn things regardless of location. As a result of this project, we can now transform written materials into audio with reasonable quality, reduced costs, and increased efficiency. Next time, we will provide instructions for this process”, says Timo. 

Listen to the audiobook

Read more about the book 

  • Published:
  • Updated:

Read more news

Event poster with a young researcher looking down with lighst and code reflected around her.
Cooperation, Research & Art, Studies Published:

Unite! Research Week in Grenoble-Autrans, 14-18 October

A networking event focusing on AI, Energy and Industry 4.0 for faculty and doctoral students
Cover
Research & Art Published:

The Tentative Program of ITICAT2024

The tentative program of ITICAT2024 has been published.
Artistic illustration: Algorithms over a computer chip
Research & Art Published:

Aalto computer scientists in STOC 2024

Two papers from the field of computer science were accepted to STOC 2024.
Harald Herlin Learning Center
Research & Art Published:

Learning Centre to pilot new opening hours as of 19 August 2024

The change gives Aalto students and staff more extensive access rights to the Learning Centre spaces and collections.