From the course: Hands-On PyTorch Machine Learning

Unlock the full course today

Join today to access over 24,900 courses taught by industry experts.

Torchaudio for audio understanding

Torchaudio for audio understanding

- Now let's continue with a pyrees use case. We would like to transcribe the audio and video lectures into scripts. We'll go through an example using Pie Torch's Torch Audio Library to transcribe some audio speech. This example is provided by Hiro Moto and can be found at the Pie Torch website will be using pre-trained models from web to back 2.0 through Torch Audio. The high level steps of this pipeline includes extract the acoustic features from audio waveform and then estimate the class of the acoustic features framed by frame and then generate hypothesis from the sequence of the class probabilities. Torch audio provides an easy access to the pre-trained weights and associated information such as the expected sample rate and class labels. They're bundled together and available under the Torch Audio dot pipelines module. First, we import map plot lib. Next, we'll create a Wave two VAC two model that performs the feature…

Contents