Neural Synthesis of Binaural Speech From Mono Audio

03/05/2021

Neural Synthesis of Binaural Speech From Mono Audio

Alexander Richard, Dejan Markovic, Israel Gebru, Steven Krenn, Gladstone A Butler, Fernando Torre, Yaser Sheikh

Keywords: speech generation, speech processing, binaural speech, neural sound synthesis, sound spatialization, binaural audio

Abstract Paper Similar Papers

Abstract: We present a neural rendering approach for binaural sound synthesis that can produce realistic and spatially accurate binaural sound in realtime. The network takes, as input, a single-channel audio source and synthesizes, as output, two-channel binaural sound, conditioned on the relative position and orientation of the listener with respect to the source. We investigate deficiencies of the l2-loss on raw waveforms in a theoretical analysis and introduce an improved loss that overcomes these limitations. In an empirical evaluation, we establish that our approach is the first to generate spatially accurate waveform outputs (as measured by real recordings) and outperforms existing approaches by a considerable margin, both quantitatively and in a perceptual study. Dataset and code are available online.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/11/2020

Group masked autoencoder based density estimator for audio anomaly detection

Ritwik Giri, Fangzhou Cheng, Karim Helwani and
Srikanth V. Tenneti, Umut Isik, Arvindh Krishnaswamy

Keywords Paper

0

0

0

0

15:43

02/11/2020

On multitask loss function for audio event detection and localization

Huy Phan, Lam Pham, Philipp Koch and
Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins

Keywords Paper

0

0

0

0

15:16

03/05/2021

Score-Based Generative Modeling through Stochastic Differential Equations

Yang Song, Jascha Sohl-Dickstein, Durk Kingma and
Abhishek Kumar, Stefano Ermon, Ben Poole

Keywords Paper

score matching, stochastic differential equations, score-based generative models, diffusion, generative models

0

0

0

0

15:27

02/11/2020

Self-supervised classification for detecting anomalous sounds

Ritwik Giri, Srikanth V. Tenneti, Fangzhou Cheng and
Karim Helwani, Umut Isik, Arvindh Krishnaswamy

Keywords Paper

0

0

0

0

13:28

02/11/2020

Temporal sub-sampling of audio feature sequences for automated audio captioning

Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen

Keywords Paper

0

0

0

0

14:09

30/11/2020

GAN-based Noise Model for Denoising Real Images

Linh Duy Tran, Son Minh Nguyen, Masayuki Arai

Keywords Paper

0

0

0

0

7:13

18/07/2021

SoundDet: Polyphonic Moving Sound Event Detection and Localization from Raw Waveform

Yuhang He, Niki Trigoni, Andrew Markham

Keywords Paper

Applications, Audio and Speech Processing

0

0

0

0

4:34

14/06/2020

Deep Residual Flow for Out of Distribution Detection

Ev Zisselman, Aviv Tamar

Keywords Paper

neural-networks, out-of-distribution detection, flow models, neural generative models, machine learning architectures.

0

0

0

0

0:59

02/11/2020

Sound event localization and detection based on CRNN using rectangular filters and channel rotation data augmentation

Francesca Ronchini, Daniel Arteaga, Andrés Pérez-López

Keywords Paper

0

0

0

0

12:51

06/12/2020

Latent Dynamic Factor Analysis of High-Dimensional Neural Recordings

Heejong Bong, Zongge Liu, Zhao Ren and
Matthew Smith, Valerie Ventura, Rob E Robert

Keywords Paper

0

0

0

0

3:13

03/05/2021

Efficient Inference of Flexible Interaction in Spiking-neuron Networks

Feng Zhou, Yixuan Zhang, Jun Zhu

Keywords Paper

conjugacy, auxiliary latent variable, nonlinear Hawkes process, neural spike train

0

0

0

0

5:39

06/12/2020

A Spectral Energy Distance for Parallel Speech Synthesis

Alexey Gritsenko, Tim Salimans, Rianne van den Berg and
Jasper Snoek, Nal Kalchbrenner

Keywords Paper

0

0

0

0

3:11

26/04/2020

High Fidelity Speech Synthesis with Adversarial Networks

Mikołaj Bińkowski, Jeff Donahue, Sander Dieleman and
Aidan Clark, Erich Elsen, Norman Casagrande, Luis C. Cobo, Karen Simonyan

Keywords Paper

texttospeech, speechsynthesis, audiosynthesis, gans, generativeadversarialnetworks, implicitgenerativemodels

0

0

0

0

15:07

12/07/2020

Predictive Sampling with Forecasting Autoregressive Models

Auke Wiggers, Emiel Hoogeboom

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

12:23

03/08/2020

MaskAAE: Latent space optimization for Adversarial Auto-Encoders

Arnab Mondal, Sankalan Pal Chowdhury, Aravind Jayendran and
Himanshu Asnani, Parag Singla, Prathosh A P

Keywords Paper

0

0

0

0

7:54

06/12/2020

NanoFlow: Scalable Normalizing Flows with Sublinear Parameter Complexity

Sang-gil Lee, Sungwon Kim, Sungroh Yoon

Keywords Paper

0

0

0

0

3:17

06/12/2020

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jongheon Jeong, Jinwoo Shin

Keywords Paper

0

0

0

0

3:16

03/05/2021

Multiscale Score Matching for Out-of-Distribution Detection

Ahsan Mahmood, Junier Oliva, Martin A Styner

Keywords Paper

out-of-distribution detection, deep learning, score matching, outlier detection

0

0

0

0

5:13

06/12/2020

Post-training Iterative Hierarchical Data Augmentation for Deep Networks

Adil Khan, Khadija Fraz

Keywords Paper

Probabilistic Methods -> MCMC, Applications -> Privacy, Anonymity, and Security

0

0

0

0

3:19

26/04/2020

Restricting the Flow: Information Bottlenecks for Attribution

Karl Schulz, Leon Sixt, Federico Tombari, Tim Landgraf

Keywords Paper

Attribution, Informational Bottleneck, Interpretable Machine Learning, Explainable AI

0

0

0

0

12:52

19/08/2021

On Smoother Attributions using Neural Stochastic Differential Equations

Sumit Jha, Rickard Ewetz, Alvaro Velasquez, Susmit Jha

Keywords Paper

AI Ethics, Trust, Fairness, Explainability, Validation and Verification

0

0

0

0

13:39

02/11/2020

Acoustic scene classification with spectrogram processing strategies

Helin Wang, Yuexian Zou, DaDing Chong

Keywords Paper

0

0

0

0

10:20

26/04/2020

From Variational to Deterministic Autoencoders

Partha Ghosh, Mehdi S. M. Sajjadi, Antonio Vergari and
Michael Black, Bernhard Scholkopf

Keywords Paper

Unsupervised learning, Generative Models, Variational Autoencoders, Regularization

0

0

0

0

4:59

06/12/2021

Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception

Joel Dapello, Jenelle Feather, Hang Le and
Tiago Marques, David Cox, Josh McDermott, James J DiCarlo, Sueyeon Chung

Keywords Paper

deep learning, machine learning, robustness, adversarial robustness and security, neuroscience

0

0

0

0

14:19

02/02/2021

SSD-GAN: Measuring the Realness in the Spatial and Spectral Domains

Yuanqi Chen, Ge Li, Cece Jin and
Shan Liu, Thomas Li

Keywords Paper

0

0

0

0

14:27

03/05/2021

An Unsupervised Deep Learning Approach for Real-World Image Denoising

Dihan Zheng, Sia Huat Tan, Xiaowen Zhang and
Zuoqiang Shi, Kaisheng Ma, Chenglong Bao

Keywords Paper

Real-world image denoising, unsupervised image denoising

0

0

0

0

4:31

06/12/2021

Deep inference of latent dynamics with spatio-temporal super-resolution using selective backpropagation through time

Feng Zhu, Andrew Sedler, Harrison A Grier and
Nauman Ahad, Mark Davenport, Matthew Kaufman, Andrea Giovannucci, Chethan Pandarinath

Keywords Paper

deep learning, neuroscience, generative model

0

0

0

0

7:16

02/11/2020

Domain-adversarial training and trainable parallel front-end for the DCASE 2020 task 4 sound event detection challenge

Samuele Cornell, Michel Olvera, Manuel Pariente and
Giovanni Pepe, Emanuele Principi, Leonardo Gabrielli, Stefano Squartini

Keywords Paper

0

0

0

0

9:59

06/12/2020

Variational Bayesian Monte Carlo with Noisy Likelihoods

Luigi Acerbi

Keywords Paper

0

0

0

0

3:13

14/06/2020

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin, Ruihao Gong, Xianglong Liu and
Mingzhu Shen, Ziran Wei, Fengwei Yu, Jingkuan Song

Keywords Paper

model compression, binary neural networks, deep learning, quantization, computer vision

0

0

0

0

1:00

06/12/2021

A universal probabilistic spike count model reveals ongoing modulation of neural variability

David Liu, Mate Lengyel

Keywords Paper

generative model, kernel methods

0

0

0

0

15:06

03/05/2021

Learning Energy-Based Models by Diffusion Recovery Likelihood

Ruiqi Gao, Yang Song, Ben Poole and
Yingnian Wu, Durk Kingma

Keywords Paper

recovery likelihood, EBM, energy-based model, generative model, HMC, Langevin dynamics, MCMC, diffusion process

0

0

0

0

6:03

18/07/2021

Unsupervised Representation Learning via Neural Activation Coding

Yookoon Park, Sangho Lee, Gunhee Kim, David Blei

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

13:50

06/12/2021

Explaining heterogeneity in medial entorhinal cortex with task-driven neural networks

Aran Nayebi, Alexander Attinger, Malcolm Campbell and
Kiah Hardcastle, Isabel Low, Caitlin S Mallory, Gabriel Mel, Ben Sorscher, Alex H Williams, Surya Ganguli, Lisa Giocomo, Dan Yamins

Keywords Paper

deep learning

0

0

0

0

14:10

04/07/2020

Evaluating Robustness to Input Perturbations for Neural Machine Translation

Xing Niu, Prashant Mathur, Georgiana Dinu, Yaser Al-Onaizan

Keywords Paper

Neural Translation, Neural models, subword methods, relative degradation

0

0

0

0

6:55

06/12/2020

Factorized Neural Processes for Neural Processes: K-Shot Prediction of Neural Responses

Ronald (James) Cotton, Fabian Sinz, Andreas Tolias

Keywords Paper

0

0

0

0

3:18

06/12/2020

Path Sample-Analytic Gradient Estimators for Stochastic Binary Networks

Alexander Shekhovtsov, Viktor Yanush, Boris Flach

Keywords Paper

0

0

0

0

3:24

06/12/2021

Impression learning: Online representation learning with synaptic plasticity

Colin Bredenberg, Benjamin Lyo, Eero P Simoncelli, Cristina Savin

Keywords Paper

neuroscience, representation learning

0

0

0

0

14:11

14/06/2020

How Does Noise Help Robustness? Explanation and Exploration under the Neural SDE Framework

Xuanqing Liu, Tesi Xiao, Si Si and
Qin Cao, Sanjiv Kumar, Cho-Jui Hsieh

Keywords Paper

adversarial, defense, neural ode, neural sde

0

0

0

0

4:59

06/12/2021

On the Out-of-distribution Generalization of Probabilistic Image Modelling

Mingtian Zhang, Andi Zhang, Steven McDonagh

Keywords Paper

generative model

0

0

0

0

10:06