Skip to content

Installing wav2letter in 2021 #998

@SleekEag

Description

@SleekEag

Bug Description

Is it still possible to install wav2letter (not wav2letter++) in 2021? A lot of links in the wiki are broken and I've been having problems with installation. My end game is to use it for transcriptions of audio files and possible real-time transcription in Python.

Reproduction Steps

Here is a summary of what I've tried:

What I've Tried

This goes temporally, so I don't recall a lot of the earlier errors/problems:

Using Win11

  • Couldn't get Flashlight, a dependency, to install
  • Tried compiling binary inference model myself but didn't have all the header files

Using Ubuntu 20.04 LTS with x86_64 ISA on VMware Player

Using pre-built Docker images as laid out here

  1. Went well until I tried the git remote set-url https://github.com/facebookresearch/wav2letter.git in the "for Inferences pipeline" above the this header, I got a usage error for set-url because two arguments were expected.

  2. From here, I tried doing git remote set-url origin https://github.com/facebookresearch/wav2letter.git and moving forward, eventually reaching the error:

Your configuration specifies to merge with the ref 'refs/heads/master'
from the remote, but no such ref was fetched.
  1. I tried to build with cmake anyway, which was an apparent success.
  2. Then from the build folder, I tried following the Simple Streaming ASR Example from here. From here, I got this error:
cat: /root/audio/LibriSpeech/dev-clean/777/126732/777-126732-0070.flac.wav: No such file or directory
Started features model file loading ... 
terminate called after throwing an instance of 'std::runtime_error'
  what():  failed to open feature file=/root/model/feature_extractor.bin for reading
Aborted (core dumped)

From here, I shut down and deleted the container

Simple Streaming ASR Example using Docker as laid out here

Here I ran the listed command and received this error:

cat: /root/host/audio/LibriSpeech/dev-clean/777/126732/777-126732-0070.flac.wav: No such file or directory
Started features model file loading ... 
terminate called after throwing an instance of 'std::runtime_error'
  what():  failed to open feature file=/root/host/model/feature_extractor.bin for reading
Aborted (core dumped)

Building the Docker image myself as seen here

Here, cloning went fine, but after that I got this error:

unable to prepare context: unable to evaluate symlinks in Dockerfile path: lstat /home/ryan/wav2letter/Dockerfile-Inference: no such file or directory

Then I ran sudo cmake CMakeLists.txt from the wav2letter directory and got this error:

-- The CXX compiler identification is unknown
-- The C compiler identification is GNU 9.3.0
CMake Error at CMakeLists.txt:3 (project):
  No CMAKE_CXX_COMPILER could be found.

  Tell CMake where to find the compiler by setting either the environment
  variable "CXX" or the CMake cache entry CMAKE_CXX_COMPILER to the full path
  to the compiler, or to the compiler name if it is in the PATH.


-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Configuring incomplete, errors occurred!
See also "/home/ryan/wav2letter/CMakeFiles/CMakeOutput.log".
See also "/home/ryan/wav2letter/CMakeFiles/CMakeError.log".

Not using Docker

Download and build wav2letter-inference as seen here

This led to needing MKL and Flashlight. I could not get Flashlight to install. I tried

  1. as library with vcpkg
  2. from source with vcpkg
  3. building from source which lead to problems downloading Flashlight.

Lastly, I tried following this tutorial

Eventually running into an error, I believe installing Flashlight

Platform and Hardware

  • OS is Win11
  • CPU is i5-11300H
  • x86_64 ISA
  • No GPU
  • 16 GB RAM

Also attempted installation on a VM

  • VMWare Player
  • Ubuntu 20.04
  • x86_64 ISA

Additional Context

[Add any additional information here]

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions