Voxii is a Python project that uses the ChatGPT API, the DeepL API, and VoiceVox to build a voice translation system. It combines speech recognition, translation, and voice synthesis: users record a spoken phrase in one language, the phrase is translated into another language, and VoiceVox converts the translated text into a synthetic character voice. Recording uses a push-to-talk mechanism.
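The three stages described above can be pictured as a simple chain. The following is only a minimal sketch of that flow, not the project's actual code: the `VoicePipeline` class and its `transcribe`, `translate`, and `synthesize` fields are hypothetical names, and the stand-in lambdas replace the real speech recognition, DeepL/ChatGPT, and VoiceVox calls so the example runs without API keys.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the three stages; every stage is a plain callable (hypothetical names)."""

    transcribe: Callable[[bytes], str]   # recorded audio -> source-language text
    translate: Callable[[str], str]      # source text -> target-language text
    synthesize: Callable[[str], bytes]   # target text -> synthesized audio bytes

    def run(self, recording: bytes) -> bytes:
        text = self.transcribe(recording)
        if not text.strip():
            # Nothing was recognized, so there is nothing to translate or speak.
            return b""
        translated = self.translate(text)
        return self.synthesize(translated)


# Stand-in stages so the sketch runs without API keys or a VoiceVox engine.
pipeline = VoicePipeline(
    transcribe=lambda audio: audio.decode("utf-8"),
    translate=lambda text: f"[ja] {text}",
    synthesize=lambda text: text.encode("utf-8"),
)

audio_out = pipeline.run(b"hello")
```

Keeping each stage as an injectable callable is one possible design choice: it would let a translation backend (DeepL or ChatGPT) be swapped without touching the recording or synthesis code.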
-
If you don’t have Python installed, install it from the official Python website (https://www.python.org/downloads/).
-
Clone this repository.
-
Navigate into the project directory:
$ cd Aya_Voxii
-
Create a new virtual environment:
$ python -m venv venv
$ . venv/bin/activate
-
Install the requirements:
$ pip install -r requirements.txt
-
Edit the environment variables file with your API keys.
-
Run the app
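For the environment variables step above, it can help to check that the keys are actually visible to Python before running the app. The sketch below is an assumption-laden illustration, not part of the project: the variable names `OPENAI_API_KEY` and `DEEPL_API_KEY` and the helper `load_api_keys` are hypothetical, so substitute the names used in the project's own environment file.

```python
import os

# Hypothetical variable names; use the names defined in the project's environment file.
REQUIRED_KEYS = ("OPENAI_API_KEY", "DEEPL_API_KEY")


def load_api_keys(env=None, required=REQUIRED_KEYS):
    """Return the required keys as a dict, or raise if any is missing or empty."""
    env = os.environ if env is None else env
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in required}
```

Calling `load_api_keys()` with no arguments reads from the current process environment; passing a plain dict instead makes the check easy to try out without touching real keys.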