Towards direct speech synthesis from ECoG: A pilot study

Christian Herff, Garett Johnson, Lorenz Diener, Jerry Shih, Dean Krusienski, Tanja Schultz

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Scopus citations

Abstract

Most current Brain-Computer Interfaces (BCIs) achieve high information transfer rates using spelling paradigms based on stimulus-evoked potentials. Despite the success of this interfaces, this mode of communication can be cumbersome and unnatural. Direct synthesis of speech from neural activity represents a more natural mode of communication that would enable users to convey verbal messages in real-time. In this pilot study with one participant, we demonstrate that electrocoticography (ECoG) intracranial activity from temporal areas can be used to resynthesize speech in real-time. This is accomplished by reconstructing the audio magnitude spectrogram from neural activity and subsequently creating the audio waveform from these reconstructed spectrograms. We show that significant correlations between the original and reconstructed spectrograms and temporal waveforms can be achieved. While this pilot study uses audibly spoken speech for the models, it represents a first step towards speech synthesis from speech imagery.

Original languageEnglish (US)
Title of host publication2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2016
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1540-1543
Number of pages4
Volume2016-October
ISBN (Electronic)9781457702204
DOIs
StatePublished - Oct 13 2016
Event38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2016 - Orlando, United States
Duration: Aug 16 2016Aug 20 2016

Other

Other38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2016
CountryUnited States
CityOrlando
Period8/16/168/20/16

ASJC Scopus subject areas

  • Signal Processing
  • Biomedical Engineering
  • Computer Vision and Pattern Recognition
  • Health Informatics

Fingerprint Dive into the research topics of 'Towards direct speech synthesis from ECoG: A pilot study'. Together they form a unique fingerprint.

  • Cite this

    Herff, C., Johnson, G., Diener, L., Shih, J., Krusienski, D., & Schultz, T. (2016). Towards direct speech synthesis from ECoG: A pilot study. In 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2016 (Vol. 2016-October, pp. 1540-1543). [7591004] Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/EMBC.2016.7591004