11/10/2020

Hierarchical Timbre-Painting and Articulation Generation

Michael M Michelashvili, Lior Wolf

Keywords: Domain knowledge, Machine learning/Artificial intelligence for music, Representations of music, MIR fundamentals and methodology, Music signal processing, MIR tasks, Music synthesis and transformation

Abstract: We present a fast and high-fidelity method for music generation, based on specified f0 and loudness, such that the synthesized audio mimics the timbre and articulation of a target instrument. The generation process consists of learned source-filtering networks, which reconstruct the signal at increasing resolutions. The model optimizes a multi-resolution spectral loss as the reconstruction loss, an adversarial loss to make the audio sound more realistic, and a perceptual f0 loss to align the output to the desired input pitch contour. The proposed architecture enables high-quality fitting of an instrument, given a sample that can be as short as a few minutes, and the method demonstrates state-of-the-art timbre transfer capabilities. Code and audio samples are shared at https://github.com/mosheman5/timbre_painting.
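As a rough illustration of the multi-resolution spectral loss mentioned in the abstract, the sketch below compares log-magnitude STFTs of the generated and target waveforms at several FFT sizes. This is a minimal sketch in PyTorch; the FFT sizes, hop lengths, and the plain L1 distance are illustrative assumptions, not necessarily the exact configuration used in the paper or the released code.

import torch

def multi_resolution_spectral_loss(pred, target, fft_sizes=(2048, 1024, 512, 256)):
    # pred, target: waveform tensors of shape (batch, samples) or (samples,).
    # Accumulate an L1 distance between log-magnitude spectrograms computed at
    # several resolutions, so both the coarse spectral envelope and finer
    # detail of the target audio are matched.
    loss = 0.0
    for n_fft in fft_sizes:
        hop = n_fft // 4  # assumed 75% overlap
        window = torch.hann_window(n_fft, device=pred.device)
        spec_pred = torch.stft(pred, n_fft, hop_length=hop, window=window,
                               return_complex=True).abs()
        spec_target = torch.stft(target, n_fft, hop_length=hop, window=window,
                                 return_complex=True).abs()
        loss = loss + torch.mean(torch.abs(torch.log(spec_pred + 1e-7)
                                           - torch.log(spec_target + 1e-7)))
    return loss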

Embedded video: the talk and the paper are published at the ISMIR 2020 virtual conference.
