
Deepspeech using a GPU
Virtual environment
We will need to create another virtual environment for deepspeech-gpu
conda create -n ds-gpu python=3.8
conda activate ds-gpuInstall deepspeech-gpu
pip install deepspeech-gpuModel and audio files
If you've been following along you can use the same model and audio files from the Deepspeech basics article
If not you can install them like so:
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/deepspeech-0.6.1-models.tar.gz
tar xvf deepspeech-0.6.1-models.tar.gz
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.6.1/audio-0.6.1.tar.gz
tar xvf audio-0.6.1.tar.gzInstalling CUDA and cuDNN
This goes without saying but make sure you have an Nvidia GPU and the proprietary drivers installed
You will need both of these libraries in order to run inference
If you already have CUDA 10 installed then great you can move on
If not I have an easy way to do so using conda
conda install cudatoolkit=10.0.130
conda install cudnnRun inference
We can now transcribe the audio file
deepspeech --model deepspeech-0.6.1-models/output_graph.pbmm --audio audio/2830-3980-0043.wav