Table of Contents

1. Dilated causal convolutions for audio and text generation
  1.1. goal
  1.2. motivation
  1.3. ingredients
  1.4. steps
  1.5. outlook
  1.6. resources

Dilated causal convolutions for audio and text generation

goal

In today’s summary we dive into the architectures of WaveNet and its successor ByteNet, autoregressive generative models that generate audio and character-level text, respectively.

motivation

Both architectures are built from dilated causal convolutional layers, which have recently attracted attention in image generation as well. Modeling sequential data with long-term dependencies, such as audio or text, seems to benefit in particular from dilated convolutions, which enlarge the receptive field.

ingredients

dilation, causal convolution, residual blocks, skip connection, gated activation function

steps

Without […]
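
To make the receptive-field claim from the motivation concrete, here is a minimal NumPy sketch of a single-channel dilated causal convolution. It is an illustration under stated assumptions, not the WaveNet or ByteNet implementation: the function names `receptive_field` and `causal_dilated_conv1d` are hypothetical, and the kernel size of 2 with doubling dilations is chosen only for the example.

```python
import numpy as np


def receptive_field(kernel_size, dilations):
    """Input steps visible to one output of a stack of dilated causal convs."""
    # Each layer adds (kernel_size - 1) * dilation steps of history.
    return 1 + sum((kernel_size - 1) * d for d in dilations)


def causal_dilated_conv1d(x, w, dilation=1):
    """Compute y[t] = sum_k w[k] * x[t - k * dilation].

    Positions before the start of the sequence are treated as zeros,
    so y[t] never depends on x[t + 1], x[t + 2], ... (causality).
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    pad = (len(w) - 1) * dilation                 # left padding only
    x_padded = np.concatenate([np.zeros(pad), x])
    y = np.zeros_like(x)
    for k, w_k in enumerate(w):
        start = pad - k * dilation                # shift back by k * dilation
        y += w_k * x_padded[start:start + len(x)]
    return y


# Assumed example setup: kernel size 2, dilations doubling per layer.
dilations = [1, 2, 4, 8]
print(receptive_field(2, dilations))              # 16

# An impulse at t = 0 spreads over exactly the receptive field.
signal = np.zeros(20)
signal[0] = 1.0
out = signal
for d in dilations:
    out = causal_dilated_conv1d(out, [1.0, 1.0], dilation=d)
print(np.count_nonzero(out))                      # 16
```

With kernel size 2, four layers with dilations 1, 2, 4, 8 cover 16 time steps, whereas four undilated layers would cover only 5. This is the sense in which dilation enlarges the receptive field.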