Learn how to build an audio preprocessing pipeline for AI applications in Python. The pipeline batch preprocess audio files applying Short-Time Fourier Transform, zero-padding, min-max normalization all in one go!
Code:
[ Ссылка ]
Free Spoken Digit Dataset:
[ Ссылка ]
===============================
Interested in hiring me as a consultant/freelancer?
[ Ссылка ]
Join The Sound Of AI Slack community:
[ Ссылка ]
Follow Valerio on Facebook:
[ Ссылка ]
Connect with Valerio on Linkedin:
[ Ссылка ]
Follow Valerio on Twitter:
[ Ссылка ]
===============================
Content:
0:00 Intro
0:48 The Free Spoken Digit Dataset
1:37 Pipeline intuition + design
5:24 Implementating Loader
9:02 Implementing Padder
15:16 Implementing LogSpectrogramExtractor
20:21 Implementing MinMaxNormaliser
25:38 Implementing Preprocessing Pipeline
44:01 Implementing Saver
51:35 Recap of implemented classes
53:04 Prunning the preprocessing pipeline
58:17 Outro
Preprocessing Audio Datasets for Machine Learning
Теги
audio preprocessingaudio preprocessing pipelineprocessing audio dataprocessing audio for AIextracting audio featuresnormalization of audio datanormalizing audio datazero padding audio datasound generationVariational Auto Encoder sound generationaudio VAEaudio AIsound AIAI musicmachine learning preprocessingmin max normaliser audiospeech generation