Extending Gentle Aligner

Week 12

Overview:

Trained Russian automated speech recognition model, built customized Russian langauge model and used it to produce time alignments, visualizations.
Built Singularity Container containing my work on time alignment of Russian data
Documented work through blogpost & git repo
Singularity image for the project can be found at gentle-singularity
You can find the built singularity image on google drive link to google drive
The idea was to make Gentle produce time alignments in langauges other than English.
My work accomplishes generation of automated time alignments in Russian, using Gentle’s langauge model generation and decoding graph compilation code. I have extensively used Kaldi’s C libraries for generating the initial langauge files, decoding an audio input and producing conversation-to-time files (ctm) link to code.
Visualization: I also worked on a visualization that can clearly represent the time alignment produced by generating json time alignment data through my code link to viz.

What’s Done?

To Be Done Next!

Getting automated time alignments for German langauge using the customized langauge model
Evaluating the results of customized langauge model and whole langauge model (trained on entire dataset)

References:

Tools: Kaldi, Python, C, Bash Scripting