Prediction and extraction of tower controller commands for speech recognition applications |
| |
Affiliation: | 1. Faculty of Fisheries Sciences, Hokkaido University, 3-1-1 Minato-cho, Hakodate, Hokkaido 041-8611, Japan;2. College of Food Science and Engineering, Ocean University of China, No.5 Yu Shan Road, Qingdao, Shandong Province 266003, China;1. Biology of Marine Organisms and Biomimetics Unit, University of Mons - UMONS, Mons 7000, Belgium;2. Haute Ecole du Hainaut, Mons 7000, Belgium;3. Proteomics and Microbiology Lab, University of Mons - UMONS, Mons 7000, Belgium;4. Marine Biology Laboratory, Free University of Brussels, Bruxelles 1050, Belgium;5. Polyaquaculture Research Unit, Institut Halieutique et des Sciences Marines, University of Toliaria, Tuléar 601, Madagascar |
| |
Abstract: | Air traffic controllers' (ATCos) workload often is a limiting factor for air traffic capacity. Thus, electronic support systems intend to reduce ATCos' workload. Automatic speech recognition can extract controller command elements from verbal clearances to deliver automatic input for air traffic control systems, thereby avoiding manual input. Assistant Based Speech Recognition (ABSR) with high command recognition rates and low error rates has proven to dramatically reduce ATCos’ workload and increase capacity in approach scenarios. However, ABSR needs accurate hypotheses on expected commands and accurate extractions of command annotations from utterance transcriptions to achieve the required performance. Based on the experience of implementation for approach control, a hypotheses generator and a command extractor have been developed for speech recognition applications regarding tower control communication to face current and future challenges in the aerodrome environment. Three human-in-the-loop multiple remote tower simulation studies were performed with 16 ATCos from Hungary, Lithuania, and Finland at DLR Braunschweig from 2017 to 2019. Roughly 100 h of speech with corresponding radar data were recorded. Around 6000 speech utterances resulting in 16,000 commands have been manually transcribed and annotated. Some parts of the data have been used for training prediction models and command extraction algorithms. Other parts were used for evaluation of command prediction and command extraction. The automatic command extractor achieved a command extraction rate of 96.7%. The hypotheses generator showed operational feasibility with a sufficiently low command prediction error rate of 7.3%. |
| |
Keywords: | Air traffic controller Multiple remote tower commands Command prediction Command extraction Assistant based speech recognition |
本文献已被 ScienceDirect 等数据库收录! |
|