elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition

Helmke, Hartmut and Kleinert, Matthias and Ahrenhold, Nils and Klamert, Lucas and Madikeri, Srikanth and Motlicek, Petr and Helmke, Hartmut and Kleinert, Matthias (2024) Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition. In: 49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024-04-14 - 2024-04-19, Seoul, Republic of Korea. doi: 10.1109/ICASSP48485.2024.10447465. ISBN 979-835034485-1. ISSN 1520-6149.

[img] PDF - Only accessible within DLR
1MB

Abstract

In specialized domains like Air Traffic Control (ATC), a notable challenge in porting a deployed Automatic Speech Recognition (ASR) system from one airport to another is the alteration in the set of crucial words that must be accurately detected in the new environment. Typically, such words have limited occurrences in training data, making it impractical to retrain the ASR system. This paper explores innovative word-boosting techniques to improve the detection rate of such rare words in the ASR hypotheses for the ATC domain. Two acoustic models are investigated: a hybrid CNN-TDNNF model trained from scratch and a pre-trained wav2vec2-based XLSR model fine-tuned on a common ATC dataset. The word boosting is done in three ways. First, an out-of-vocabulary word addition method is explored. Second, G-boosting is explored, which amends the language model before building the decoding graph. Third, the boosting is performed on the fly during decoding using lattice re-scoring. The results indicate that the G-boosting method performs best and provides an approximately 30-43% relative improvement in recall of the boosted words. Moreover, a relative improvement of up to 48% is obtained upon combining G-boosting and lattice-rescoring.

Item URL in elib:https://elib.dlr.de/204829/
Document Type:Conference or Workshop Item (Speech)
Title:Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Helmke, HartmutIdiapUNSPECIFIEDUNSPECIFIED
Kleinert, MatthiasIdiap, University of ZurichUNSPECIFIEDUNSPECIFIED
Ahrenhold, NilsIdiap, BUTUNSPECIFIEDUNSPECIFIED
Klamert, LucasIdiapUNSPECIFIEDUNSPECIFIED
Madikeri, SrikanthIdiapUNSPECIFIEDUNSPECIFIED
Motlicek, PetrIdiap, BUTUNSPECIFIEDUNSPECIFIED
Helmke, HartmutHartmut.Helmke (at) dlr.dehttps://orcid.org/0000-0002-1939-0200UNSPECIFIED
Kleinert, MatthiasMatthias.Kleinert (at) dlr.dehttps://orcid.org/0000-0002-0782-4147UNSPECIFIED
Date:April 2024
Journal or Publication Title:49th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2024
Refereed publication:Yes
Open Access:No
Gold Open Access:No
In SCOPUS:Yes
In ISI Web of Science:No
DOI:10.1109/ICASSP48485.2024.10447465
ISSN:1520-6149
ISBN:979-835034485-1
Status:Published
Keywords:Automatic speech recognition;air traffic control, domain adaptation, contextual biasing, rare word recognition
Event Title:ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Event Location:Seoul, Republic of Korea
Event Type:international Conference
Event Start Date:14 April 2024
Event End Date:19 April 2024
Organizer:IEEE Signal Processing Society
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Aeronautics
HGF - Program Themes:Air Transportation and Impact
DLR - Research area:Aeronautics
DLR - Program:L AI - Air Transportation and Impact
DLR - Research theme (Project):L - Integrated Flight Guidance
Location: Braunschweig
Institutes and Institutions:Institute of Flight Guidance > Controller Assistance
Institute of Flight Guidance > ATM-Simulation
Deposited By: Diederich, Kerstin
Deposited On:25 Jul 2024 13:53
Last Modified:26 Jul 2024 13:31

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.