Hyppää sisältöön
    • Suomeksi
    • In English
Trepo
  • Suomeksi
  • In English
  • Kirjaudu
Näytä viite 
  •   Etusivu
  • Trepo
  • TUNICRIS-julkaisut
  • Näytä viite
  •   Etusivu
  • Trepo
  • TUNICRIS-julkaisut
  • Näytä viite
JavaScript is disabled for your browser. Some features of this site may not work without it.

Binaural source localization using deep learning and head rotation information

Garcia Barrios, Guillermo; Krause, Daniel Aleksander; Politis, Archontis; Mesaros, Annamaria; Gutierrez-Arriola, Juana M.; Fraile, Ruben (2022-10-18)

 
Avaa tiedosto
Binaural_source_localization_using_deep_learning_and_head_rotation_information.pdf (332.1Kt)
Lataukset: 

URI
https://eurasip.org/Proceedings/Eusipco/Eusipco2022/pdfs/0000036.pdf


Garcia Barrios, Guillermo
Krause, Daniel Aleksander
Politis, Archontis
Mesaros, Annamaria
Gutierrez-Arriola, Juana M.
Fraile, Ruben
18.10.2022

This publication is copyrighted. You may download, display and print it for Your own personal use. Commercial use is prohibited.
doi:10.23919/EUSIPCO55093.2022.9909764
Näytä kaikki kuvailutiedot
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:tuni-202302082145

Kuvaus

Peer reviewed
Tiivistelmä
This work studies learning-based binaural sound source localization, under the influence of head rotation in reverberant conditions. Emphasis is on whether knowledge of head rotation can improve localization performance over the non-rotating case for the same acoustic scene. Simulations of binaural head signals of a static and rotating head were conducted, for 5 different rotation speeds and a wide range of reverberant conditions. Several convolutional recurrent neural network models were evaluated including a static head scenario, a model without rotation information, and distinct models differentiated on the way of manipulating the quaternions. The results were analyzed based on the direction-of-arrival error, and they show the importance of using quaternions as additional features, with the best localization accuracy obtained when using an additional convolutional branch that merges the features through addition or concatenation. Nevertheless, raw quaternion features presented lower performance than the static baseline model. Additionally, the study shows the importance of the analysis time window length when using information about head rotation.
Kokoelmat
  • TUNICRIS-julkaisut [23847]
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste
 

 

Selaa kokoelmaa

TekijätNimekkeetTiedekunta (2019 -)Tiedekunta (- 2018)Tutkinto-ohjelmat ja opintosuunnatAvainsanatJulkaisuajatKokoelmat

Omat tiedot

Kirjaudu sisäänRekisteröidy
Kalevantie 5
PL 617
33014 Tampereen yliopisto
oa[@]tuni.fi | Tietosuoja | Saavutettavuusseloste