Opin vísindi

Speed-Up of Machine Learning for Sound Localization via High-Performance Computing


Title: Speed-Up of Machine Learning for Sound Localization via High-Performance Computing
Author: Sumner, Eric Michael
Aach, Marcel
Lintermann, Andreas
Unnthorsson, Runar   orcid.org/0000-0002-1960-0263
Riedel, Morris   orcid.org/0000-0003-1810-9330
Date: 2022-02-16
Language: English
Scope: 1-4
University/Institute: University of Iceland (Háskóli Íslands)
School: School of Engineering and Natural Sciences, UI (Verkfræði- og náttúruvísindasvið, HÍ)
Department: Faculty of Industrial Eng., Mechanical Eng. and Computer Science, UI (Iðnaðarverkfræði-, vélaverkfræði- og tölvunarfræðideild, HÍ)
ISBN: 978-1-6654-2127-0
Series: 2022 26th International Conference on Information Technology (IT)
DOI: 10.1109/IT54280.2022.9743519
Subject: Computational models; Software development; Artificial intelligence
URI: https://hdl.handle.net/20.500.11815/3304

Citation:

E. M. Sumner, M. Aach, A. Lintermann, R. Unnthorsson and M. Riedel, "Speed-Up of Machine Learning for Sound Localization via High-Performance Computing," 2022 26th International Conference on Information Technology (IT), 2022, pp. 1-4, doi: 10.1109/IT54280.2022.9743519.

Abstract:

Sound localization is the ability of humans to determine the source direction of sounds that they hear. Emulating this capability in virtual environments has various societally relevant applications, enabling more realistic virtual acoustics. We use a variety of artificial intelligence methods, such as machine learning via an Artificial Neural Network (ANN) model, to emulate human sound localization abilities. This paper addresses the particular challenge that training and optimizing these models is computationally intensive when working with audio signal datasets. It describes the successful porting of our novel ANN model code for sound localization from limiting serial CPU-based systems to powerful, cutting-edge High-Performance Computing (HPC) resources to obtain significant speed-ups of the training and optimization process. Selected details of the code refactoring and HPC porting are described, such as adapting hyperparameter optimization algorithms to use the available HPC resources efficiently and replacing third-party libraries responsible for audio signal analysis and linear algebra. This study demonstrates that using innovative HPC systems at the Jülich Supercomputing Centre, equipped with high-tech Graphics Processing Unit (GPU) resources and based on the Modular Supercomputing Architecture, enables significant speed-ups and reduces the time-to-solution for sound localization from three days to three hours per ANN model.

Description:

Post-print (authors' final version).

Rights:

© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
