


18th IWAENC 2024: Aalborg, Denmark
- 18th International Workshop on Acoustic Signal Enhancement, IWAENC 2024, Aalborg, Denmark, September 9-12, 2024. IEEE 2024, ISBN 979-8-3503-6185-8

- Erik Fleischhauer, Sebastian Nagel, Peter Jax: Binaural Direction-of-Arrival Estimation Incorporating Head Movement Information. 1-5
- Srikanth Korse, Oliver Thiergart, Emanuël A. P. Habets: Sample Rate Offset Compensated Acoustic Echo Cancellation for Multi-Device Scenarios. 1-5
- Esteban Gómez, Tom Bäckström: Real-Time Joint Noise Suppression and Bandwidth Extension of Noisy Reverberant Wideband Speech. 6-10
- Aleksej Chinaev, Till Spitz, Stefan Thaleiser, Gerald Enzner: Matrix Study of Feature Compression Types and Instrumental Speech Quality Metrics in Ultra-Light DNN-Based Spectral Speech Enhancement. 11-15
- Mohamed F. Mansour: Maximum Likelihood Estimation of the Direction of Sound in a Reverberant Noisy Environment. 16-20
- Christoph Weyer, Peter Jax: Analysis of Earbud-Mounted Bone-Conduction Microphones. 21-25
- Jonas Van Damme, Stijn Kindt, Siyuan Song, Jasper Maes, Nilesh Madhu: Investigation On System Bandwidth For DNN-Based Binaural Sound Localisation For Hearing Aids. 26-30
- Wei-Ting Lai, Lachlan Birnie, Xingyu Chen, Amy Bastine, Thushara D. Abhayapala, Prasanga N. Samarasinghe: Source Localization by Multidimensional Steered Response Power Mapping with Sparse Bayesian Learning. 31-35
- Shrishti Saha Shetu, Emanuël A. P. Habets, Andreas Brendel: Comparative Analysis of Discriminative Deep Learning-Based Noise Reduction Methods in Low SNR Scenarios. 36-40
- Alexis Favrot, Christof Faller: Direction of Arrival Estimation on a Sphere. 41-44
- Huajian Fang, Timo Gerkmann: Uncertainty-Based Remixing for Unsupervised Domain Adaptation in Deep Speech Enhancement. 45-49
- Mhd Modar Halimeh, Matteo Torcoli, Emanuël A. P. Habets: ConcateNet: Dialogue Separation Using Local and Global Feature Concatenation. 50-54
- Shahan Nercessian, Alexey Lukin, Johannes Imort: DSP-Informed Bandwidth Extension using Locally-Conditioned Excitation and Linear Time-Varying Filter Subnetworks. 55-59
- Zohre Foroushi, Richard M. Dansereau: Dynamic Audio-Visual Speech Enhancement using Recurrent Variational Autoencoders. 60-64
- Tomohiro Nakatani, Naoyuki Kamo, Marc Delcroix, Shoko Araki: Multi-Stream Diffusion Model for Probabilistic Integration of Model-Based and Data-Driven Speech Enhancement. 65-69
- Maurice Oberhag, Yan Zeng, Rainer Martin: On the Impact of Frequency Resolution on Female and Male Speech in DNN-Based Noise Reduction Systems. 70-74
- Svantje Voit, Gerald Enzner: Tiny Neural-Network Control of Frequency-Domain Adaptive Filtering for Linear System Identification in Acoustic Echo Cancellation. 75-79
- Alexander Bohlender, Ann Spriet, Wouter Tirry, Nilesh Madhu: Weakly DOA Guided Speaker Separation with Random Look Directions and Iteratively Refined Target and Interference Priors. 80-84
- Xingyu Chen, Hanwen Bi, Wei-Ting Lai, Fei Ma: Monaural Speech Enhancement on Drone via Adapter Based Transfer Learning. 85-89
- Danilo de Oliveira, Eric Grinstein, Patrick A. Naylor, Timo Gerkmann: LASER: Language-Queried Speech Enhancer. 90-94
- Zbynek Koldovský, Jirí Málek, Jaroslav Cmejla, Stephen O'Regan: Informed FastICA: Semi-Blind Minimum Variance Distortionless Beamformer. 95-99
- Shuai Tao, Pejman Mowlaee, Jesper Rindom Jensen, Mads Græsbøll Christensen: Learning-Based Multi-Channel Speech Presence Probability Estimation using A Low-Parameter Model and Integration with MVDR Beamforming for Multi-Channel Speech Enhancement. 100-104
- Yu Morinaga, Naoto Kotake, Iori Hashimoto, Suehiro Shimauchi, Shigeaki Aoki: Spherical Mapping of Short-Time Spectral Components. 105-109
- Mahdi Amiri, Ina Kodrasi: Suppressing Noise Disparity in Training data for Automatic Pathological Speech Detection. 110-114
- Michal Svento, Pavel Rajmic, Ondrej Mokrý: Plug-and-Play Audio Restoration with Diffusion Denoiser. 115-119
- Eloi Moliner, Jean-Marie Lemercier, Simon Welker, Timo Gerkmann, Vesa Välimäki: BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models. 120-124
- Anselm Lohmann, Toon van Waterschoot, Jörg Bitzer, Simon Doclo: Reference Microphone Selection for the Weighted Prediction Error Algorithm using the Normalized L-P Norm. 125-129
- Jiawen Chua, Longfei Felix Yan, W. Bastiaan Kleijn: An Effective MVDR Post-Processing Method for Low-Latency Convolutive Blind Source Separation. 130-134
- Gal Itzhak, Simon Doclo, Israel Cohen: Joint Optimization of Microphone Array Geometry and Region-of-Interest Beamforming with Sparse Circular Sector Arrays. 135-139
- YingWei Tan, XueFeng Ding: Split-Attention Mechanisms with Graph Convolutional Network for Multi-Channel Speech Separation. 140-144
- Yoshiaki Sumura, Diego Di Carlo, Aditya Arie Nugraha, Yoshiaki Bando, Kazuyoshi Yoshii: Joint Audio Source Localization and Separation with Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF. 145-149
- Frank Jiarui Wang, Prasanga N. Samarasinghe, Thushara D. Abhayapala, Jihui Aimee Zhang: The Acoustic Velocity Vectors of the Outgoing Sound Field. 150-154
- Boris Rubenchik, Elior Hadad, Eli Tzirkel, Ethan Fetaya, Sharon Gannot: Low-Latency Single-Microphone Speaker Separation with Temporal Convolutional Networks Using Speaker Representations. 155-159
- Satoru Emura: Estimation of Output SI-SDR of Speech Signals Separated From Noisy Input by Conv-Tasnet. 160-164
- Benjamin Lentz, Rainer Martin: Utilizing Head Rotation Data in DNN-based Multi-Channel Speech Enhancement for Hearing Aids. 165-169
- Shinya Furunaga, Hiroshi Sawada, Rintaro Ikeshita, Tomohiro Nakatani, Shoji Makino: Accurate Delayed Source Model for Multi-Frame Full-Rank Spatial Covariance Analysis. 170-174
- Yaakov Buchris, Israel Cohen, Alon Amar: Greedy Design of Circular Concentric Arrays for Broadband MVDR. 175-179
- Manan Mittal, Ryan M. Corey, Yongjie Zhuang, Andrew C. Singer: Low Latency Two Stage Beamforming with Distributed Microphone Arrays Using a Planewave Decomposition. 180-184
- Martin Strauss, Okan Köpüklü: Efficient Area-Based and Speaker-Agnostic Source Separation. 185-189
- Srikanth Raj Chetupalli, Emanuël A. P. Habets: A Unified Approach to Speaker Separation and Target Speaker Extraction Using Encoder-Decoder Based Attractors. 190-194
- Shaoheng Xu, Jihui Aimee Zhang, Thushara D. Abhayapala, Amy Bastine, Prasanga N. Samarasinghe: Iterative and Complex Orthogonal Matching Pursuit for Broadband Sparse Sound Field Reconstruction. 195-199
- Alina Mannanova, Kristina Tesch, Jean-Marie Lemercier, Timo Gerkmann: Meta-Learning For Variable Array Configurations in End-to-End Few-Shot Multichannel Speech Enhancement. 200-204
- Kohei Saijo, Gordon Wichern, François G. Germain, Zexu Pan, Jonathan Le Roux: TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement. 205-209
- Carlos Hernandez-Olivan, Marc Delcroix, Tsubasa Ochiai, Naohiro Tawara, Tomohiro Nakatani, Shoko Araki: Interaural Time Difference Loss for Binaural Target Sound Extraction. 210-214
- Federico Miotello, Ferdinando Terminiello, Mirco Pezzoli, Alberto Bernardini, Fabio Antonacci, Augusto Sarti: A Physics-Informed Neural Network-Based Approach for the Spatial Upsampling of Spherical Microphone Arrays. 215-219
- Jingli Xie, Xudong Zhao, Junqing Zhang, Jacob Benesty, Jingdong Chen: On Limitations and Improvement of Differential Beam Forming Via Quadratic Eigenvalue Optimization. 220-224
- Bunlong Lay, Sebastian Zaczek, Kristina Tesch, Timo Gerkmann: Robustness of Speech Separation Models for Similar-Pitch Speakers. 225-229
- Shekhar Kumar Yadav, Nithin V. George: Third-Order Tensor Decomposition Based Multichannel Linear Prediction for Robust Dereverberation. 230-234
- Emilie D'Olne, Vincent W. Neo, Patrick A. Naylor: Latency-Agnostic Speech Enhancement for Wireless Acoustic Sensor Networks Using Polynomial Eigenvalue Decomposition. 235-239
- Sebastian Braun, Hannes Gamper: Multi-Label Audio Classification with a Noisy Zero-Shot Teacher. 240-244
- Stijn Kindt, Jihyun Kim, Hong-Goo Kang, Nilesh Madhu: Efficient, Cluster-Informed, Deep Speech Separation with Cross-Cluster Information in Ad-Hoc Wireless Acoustic Sensor Networks. 245-249
- Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen: Multi-Label Zero-Shot Audio Classification with Temporal Attention. 250-254
- Luca Becker, Kamel Naame, Rainer Martin: Source Signal Capture in Acoustic Sensor Networks based on Robust Beamforming and Source-Related Cluster Estimation. 255-259
- Paul Didier, Pourya Behmandpoor, Toon van Waterschoot, Marc Moonen: One-Shot Distributed Node-Specific Signal Estimation with Non-Overlapping Latent Subspaces in Acoustic Sensor Networks. 260-264
- Mohamed F. Mansour: Sound Field Synthesis with Acoustic Waves. 265-269
- Dushyant Sharma, James Fosburgh, Sri Harsha Dumpala, Chandramouli Shama Sastri, Stanislav Yu. Kruchinin, Patrick A. Naylor: XANE Background Acoustic Embeddings: Ablation and Clustering Analysis. 270-273
- H. Nazim Bicer, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets: Evaluation of Data-Driven Room Geometry Inference Methods Using a Smart Speaker Prototype. 274-278
- Tobias Gburrek, Adrian Meise, Joerg Schmalenstroeer, Reinhold Haeb-Umbach: Diminishing Domain Mismatch for DNN-Based Acoustic Distance Estimation via Stochastic Room Reverberation Models. 279-283
- Zeyu Xu, Emanuël A. P. Habets, Albert G. Prinn: Simulating Sound Fields in Rooms with Arbitrary Geometries Using the Diffraction-Enhanced Image Source Method. 284-288
- Philipp Götz, Cagdas Tuna, Andreas Brendel, Andreas Walther, Emanuël A. P. Habets: Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations. 289-293
- Vinal Patel, Sankha Subhra Bhattacharjee, Constantin Paleologu, Mads Græsbøll Christensen, Jacob Benesty, Jesper Rindom Jensen: A Third-Order Tensor Decomposition Based Linear-In-The-Parameters Nonlinear Adaptive Filter. 294-298
- Philipp Götz, Georg Götz, Nils Meyer-Kahlen, Kyung Yun Lee, Karolina Prawda, Emanuël A. P. Habets, Sebastian J. Schlecht: A Multi-Room Transition Dataset for Blind Estimation of Energy Decay. 299-303
- Junqing Zhang, Jingli Xie, Wen Zhang, Jingdong Chen: Directivity Analysis of A Vibrating Spherical Cap on A Rigid Sphere. 304-308
- Sankha Subhra Bhattacharjee, Andreas Jonas Fuglsig, Jesper Rindom Jensen, Liming Shi, Guoli Ping, Hao Shen, Mads Græsbøll Christensen: Low Complexity Signal Adaptive Sound Zone Control Using Subspace Tracking. 309-313
- James Brooks-Park, Steven van de Par, Jan Østergaard, Søren Bech, Martin Bo Møller: Room Impulse Response Prototyping Using Receiver Distance Estimations for High Quality Room Equalisation Algorithms. 314-318
- David Sundström, Shoichi Koyama, Andreas Jakobsson: Sound Field Estimation Using Deep Kernel Learning Regularized by the Wave Equation. 319-323
- Shihori Kozuka, Shoichi Koyama, Hiroaki Itou, Noriyoshi Kamado: Sound Field Estimation in Region Including Scattering Objects based on Kernel Interpolation: Evaluation for Various Scatterers. 324-328
- Jesper Brunnström, Martin Bo Møller, Jan Østergaard, Marc Moonen: Bayesian Sound Field Estimation Using Uncertain Data. 329-333
- Yosef Soussana, Elior Hadad, Sharon Gannot: Multi-Speaker DOA Tracking Algorithm Utilizing Probability Hypothesis Density Filter and Weighted Histogram of SRP-PHAT. 334-338
- Till Hardenbicker, Peter Jax: Online System Identification on Learned Acoustic Manifolds Using an Extended Kalman Filter. 339-343
- Zhengpu Zhang, Jianyuan Feng, Yongjian Mao, Yehang Zhu, Junjie Shi, Xuzhou Ye, Shilei Liu, Derong Liu, Chuanzeng Huang: High-Fidelity Diffusion-Based Audio Codec. 344-348
- Shrishti Saha Shetu, Naveen Kumar Desiraju, Jose Miguel Martinez Aponte, Emanuël A. P. Habets, Edwin Mabande: A Hybrid Approach for Low-Complexity Joint Acoustic Echo and Noise Reduction. 349-353
- Patrick Kechichian, Akshaya Ravi, Erik Schuijers: A Cross-Domain Approach to Temporal Envelope Shaping in Parametric Stereo Coding Using Deep Learning. 354-358
- Renzheng Shi, Andreas Bär, Marvin Sach, Wouter Tirry, Tim Fingscheidt: Non-Causal to Causal SSL-Supported Transfer Learning: Towards A High-Performance Low-Latency Speech Vocoder. 359-363
- Amir Ivry, Israel Cohen: E-URES: Efficient User-Centric Residual-Echo Suppression Framework with a Data-Driven Approach to Reducing Computational Costs. 364-368
- Xianrui Wang, Kaien Mo, Yichen Yang, Liyuan Zhang, Shoji Makino, Jingdong Chen: A Cascaded Semi-Blind Source Separation Method for Joint Acoustic Echo Cancellation, Interference Suppression, and Noise Reduction. 369-373
- Eloi Moliner, Sebastian Braun, Hannes Gamper: Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data. 374-378
- Yichen Yang, Xianrui Wang, Andreas Brendel, Wen Zhang, Jacob Benesty, Shoji Makino, Jingdong Chen: A Data-Reuse Semi-Blind Source Separation Approach for Nonlinear Acoustic Echo Cancellation. 379-383
- Ryu Kato, Natsuki Ueno, Nobutaka Ono, Ryo Matsuda, Kazunobu Kondo: Complexity Reduction for Classification of Musical Instruments Using Element Selection. 384-388
- Arunava Kr. Kalita, Christian Dittmar, Paolo Sani, Frank Zalkow, Emanuël A. P. Habets, Rusha Patra: PAD-VC: A Prosody-Aware Decoder for Any-to-Few Voice Conversion. 389-393
- Florian Hilgemann, Peter Jax: Low-Order Controllers for Active Noise Cancellation Based on Hankel Matrix Rank Minimization. 399-403
- Jule Pohlhausen, Francesco Nespoli, Jörg Bitzer: Long-Term Conversation Analysis: Privacy-Utility Trade-Off Under Noise and Reverberation. 404-408
- Giovanni Bologni, Richard Heusdens, Richard C. Hendriks: Harmonics to the Rescue: Why Voiced Speech is Not a WSS Process. 409-413
- Yile Angela Zhang, Thushara D. Abhayapala, Huiyuan Sun, Prasanga N. Samarasinghe, Amy Bastine: A Multi-Noise Multi-Channel ANC System using Relative Transfer Matrix-Based Approach. 414-418
- Iori Hashimoto, Yu Morinaga, Suehiro Shimauchi, Shigeaki Aoki: Derivative Features of Short-Time Holomorphic Fourier Transform. 419-423
- Zining Liang, Hucheng Wang, Yichen Yang, Wen Zhang, Thushara D. Abhayapala: Active Road Noise Control Based on Data-Driven Predictions of Passenger Ear Noise Signal. 424-428
- Or Berebi, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely: Feasibility of iMagLS-BSM - ILD Informed Binaural Signal Matching with Arbitrary Microphone Arrays. 429-433
- Yurii Iotov, Rasmus Elofsson, Sidsel Marie Nørholm, Mads Græsbøll Christensen: Predicting Subjective Satisfaction with Speech Prediction-Based ANC Using Perceptually Relevant Metrics Correlated with Sound Attributes. 434-438
- Inmo Yeon, Jung-Woo Choi: RGI-Net: 3D Room Geometry Inference from Room Impulse Responses with Hidden First-Order Reflections. 439-443
- Filippo Villani, Wai-Yip Chan, Zheng-Hua Tan, Jan Østergaard, Jesper Jensen: Near-End Listening Enhancement Using a Noise-Robust Linear Time-Invariant Filter. 444-448
- Yicheng Hsu, Mingsian R. Bai: A Tunable Binaural Audio Telepresence System Capable of Balancing Immersive and Enhanced Modes. 449-453
- Ayal Schwartz, Sharon Gannot, Shlomo E. Chazan: Magnitude or Phase? A Two-Stage Algorithm for Single-Microphone Speech Dereverberation. 454-458
- Julian Wechsler, Srikanth Raj Chetupalli, Mhd Modar Halimeh, Oliver Thiergart, Emanuël A. P. Habets: Neural Directional Filtering: Far-Field Directivity Control with a Small Microphone Array. 459-463
- Thomas Joubaud, Veronique Zimpfer: Convolutional Neural Network-Based Prediction of a French Modified Rhyme Test Recorded with a Body-Conduction Microphone. 464-468
- Thomas Muller, Stéphane Ragot, Vincent Barriac, Pascal Scalart: Evaluation of Objective Quality Models on Neural Audio Codecs. 469-473
- Amy Bastine, Lachlan Birnie, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Vladimir Tourbabin: Magnitude Least-Squares Based Ambisonics Estimation of Head-Worn Device Microphone Measurements for Binaural Reproduction. 474-478
- Femke B. Gelderblom, Tron V. Tronstad, Iván López-Espejo: Evaluating Speech Enhancement Systems Through Listening Effort. 479-480















