Titill: | Speed-Up of Machine Learning for Sound Localization via High-Performance Computing |
Höfundur: |
|
Útgáfa: | 2022-02-16 |
Tungumál: | Enska |
Umfang: | 1-4 |
Háskóli/Stofnun: | Háskóli Íslands University of Iceland |
Svið: | Verkfræði- og náttúruvísindasvið (HÍ) School of Engineering and Natural Sciences (UI) |
Deild: | Iðnaðarverkfræði-, vélaverkfræði- og tölvunarfræðideild (HÍ) Faculty of Industrial Eng., Mechanical Eng. and Computer Science (UI) |
ISBN: | 978-1-6654-2127-0 |
Birtist í: | 2022 26th International Conference on Information Technology (IT); |
DOI: | 10.1109/IT54280.2022.9743519 |
Efnisorð: | Reiknilíkön; Hugbúnaðargerð; Gervigreind |
URI: | https://hdl.handle.net/20.500.11815/3304 |
Tilvitnun:E. M. Sumner, M. Aach, A. Lintermann, R. Unnthorsson and M. Riedel, "Speed-Up of Machine Learning for Sound Localization via High-Performance Computing," 2022 26th International Conference on Information Technology (IT), 2022, pp. 1-4, doi: 10.1109/IT54280.2022.9743519.
|
|
Útdráttur:Sound localization is the ability of humans to determine the source direction of sounds that they hear. Emulating this capability in virtual environments can have various societally relevant applications enabling more realistic virtual acoustics. We use a variety of artificial intelligence methods, such as machine learning via an Artificial Neural Network (ANN) model, to emulate human sound localization abilities. This paper addresses the particular challenge that the training and optimization of these models is very computationally-intensive when working with audio signal datasets. It describes the successful porting of our novel ANN model code for sound localization from limiting serial CPU-based systems to powerful, cutting-edge High-Performance Computing (HPC) resources to obtain significant speed-ups of the training and optimization process. Selected details of the code refactoring and HPC porting are described, such as adapting hyperparameter optimization algorithms to efficiently use the available HPC resources and replacing third-party libraries responsible for audio signal analysis and linear algebra. This study demonstrates that using innovative HPC systems at the Jülich Supercomputing Centre, equipped with high-tech Graphics Processing Unit (GPU) resources and based on the Modular Supercomputing Architecture, enables significant speed-ups and reduces the time-to-solution for sound localization from three days to three hours per ANN model.
|
|
Athugasemdir:Post-print (lokaútgáfa höfunda).
|
|
Leyfi:© 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
|