Commit 5ea04f0c authored by Carl De Sousa Trias's avatar Carl De Sousa Trias
Browse files

Init with ReadMe

parents
# MPAI-MMC Text To Speech
This code refers to the implementation of the MMC-TTS, as described in the [AIW](https://mpai.community/standards/mpai-mmc/v2-2/ai-workflows/answer-to-multimodal-question/)
## Guide to the TTS code
Use of this AI Module is for developers who are familiar with Python and downloading models from HuggingFace,
A wrapper for the speech5 NN Module
1.Manages input files and parameters: Text Object
2.Executes the BLIP Module to perform the Speech Recognition on each individual pair of Text and Visual Object.
3.Outputs Speech Object as answer.
The MMC-TTS Reference Software is found at the MPAI-NNW gitlab site. It contains:
1.The python code implementing the AIM
2.Required libraries are: pytorch, transformers (HuggingFace), datasets (HuggingFace), and soundfile.
## Installation
Code was designed and tested on an Ubuntu 20.04 operating system using anaconda 23.7.2 and Python 3.9.
An environment with all the necessary libraries can be created using:
```bash
conda create --name <env> --file requirements.txt
```
Supports Markdown
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment