elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Contact | Deutsch
Fontsize: [-] Text [+]

Learning to Tag Environmental Sounds in Nightlong Audio

Khan, Mohd Saif (2022) Learning to Tag Environmental Sounds in Nightlong Audio. Master's, Bauhaus-Universität Weimar / DLR Institut für Datenwissenschaften.

[img] PDF - Only accessible within DLR
3MB

Abstract

Environmental sound events are defined as sounds occurring naturally or produced due to human activity. Devices need to identify the environmental sounds for better perception of the environment. The emergence of environmental sound classification using machine learning has led to the development of more context-aware technologies like smart homes, multimedia search, etc. In this thesis, different sound events and their starting and ending times are determined using classification models. The audio data is provided by the German Aerospace Center and is composed of night-long audio clips that are recorded near an airport and contain different environmental sounds. Many challenges like noise in data, sounds of different lengths, and their effects on the classifier are also discussed. We investigated if different models prefer to identify sounds of different lengths in the same audio. An overlapping window approach is devised to improve the identification of starting and ending times of the predicted events. The results of the thesis are encouraging as the best model is able to identify various sound events with an accuracy of 0.94 on a balanced dataset of 4 different classes. Finally, a system is also conceptualized where the environmental sounds are identified, and their spans are visualized against time.

Item URL in elib:https://elib.dlr.de/188886/
Document Type:Thesis (Master's)
Title:Learning to Tag Environmental Sounds in Nightlong Audio
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Khan, Mohd SaifBauhaus-Universität WeimarUNSPECIFIEDUNSPECIFIED
Date:2022
Refereed publication:Yes
Open Access:Yes
Number of Pages:54
Status:Published
Keywords:Audio Analysis, Environmental Sound Classification
Institution:Bauhaus-Universität Weimar / DLR Institut für Datenwissenschaften
Department:Faculty of Media / Datengewinnung und -mobilisierung
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Space System Technology
DLR - Research area:Raumfahrt
DLR - Program:R SY - Space System Technology
DLR - Research theme (Project):R - Environment, Health and Big Data
Location: Jena
Institutes and Institutions:Institute of Data Science > Data Acquisition and Mobilisation
Deposited By: Kersten, Dr.-Ing. Jens
Deposited On:02 Nov 2022 11:34
Last Modified:11 Nov 2022 12:38

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.