README.md 1.04 KB
Newer Older
1
2
# MPAI-MMC Automatic Speech Recognition

Carl De Sousa Trias's avatar
Carl De Sousa Trias committed
3
This code refers to the implementation of the MMC-ASR, as described in the [AIM](https://mpai.community/standards/mpai-mmc/v2-2/ai-modules/automatic-speech-recognition/).
4

Mattia Bergagio's avatar
cleanup    
Mattia Bergagio committed
5
Use of this AI Module is for developers who are familiar with Python and downloading models from HuggingFace,
6
7
8
9
10
11
12

A wrapper for the Whisper NN Module:

    1. Manages input files and parameters: Speech Object
    2. Performs Speech Recognition on each Speech Object by executing the Whisper Module.
    3. Outputs Recognised Text.

Mattia Bergagio's avatar
cleanup    
Mattia Bergagio committed
13
The MMC-ASR Reference Software is found at the NNW gitlab site (registration required). It contains:
14
15
16
17

    1. The python code implementing the AIM.
    2. The required libraries are: pytorch and transformers (HuggingFace).

Mattia Bergagio's avatar
cleanup    
Mattia Bergagio committed
18
19
20
21
22
23
24
25
26
Implementation of MMC-ASR as a class in Python.

## Installation
Code was designed and tested on an Ubuntu 20.04 operating system using anaconda 23.7.2 and Python 3.9.
An environment with all the necessary libraries can be created using:
```bash
conda create --name <env> --file requirements.txt
```