This repository contains a FastAPI-based web service for speech transcription and synthesis. The service provides endpoints for converting audio to text (speech-to-text) and text to audio (text-to-speech) using the VocalizeService.
- Speech-to-Text: Convert audio input into text transcriptions.
- Text-to-Speech: Generate audio output from text input.
- URL: /create-transcription
- Method: POST
- Parameters:
  - duration (int): The duration of the audio to be transcribed.
- Response: The transcription of the audio captured for the given duration.
- URL: /create-speech
- Method: POST
- Parameters:
  - text (str): The text to be converted into speech.
- Response: Generated audio from the provided text.
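
As a rough illustration of calling the two endpoints, the sketch below builds a POST request for each using only the Python standard library. It rests on assumptions this README does not confirm: that the server listens on `http://127.0.0.1:8000` (Uvicorn's default) and that parameters are sent as a JSON body rather than as query parameters. The helper names `build_post` and `send` are hypothetical and not part of this repository.

```python
import json
import urllib.request

# Assumption: Uvicorn's default host and port; adjust to your deployment.
BASE_URL = "http://127.0.0.1:8000"


def build_post(path: str, payload: dict) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for an endpoint path."""
    return urllib.request.Request(
        url=BASE_URL + path,
        # Assumption: the service reads its parameters from a JSON body.
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request) -> bytes:
    """Send a prepared request and return the raw response body."""
    with urllib.request.urlopen(request) as response:
        return response.read()


# One request per documented endpoint, using the documented parameter names.
transcription_request = build_post("/create-transcription", {"duration": 5})
speech_request = build_post("/create-speech", {"text": "Hello, world"})
```

Calling `send(speech_request)` with the server running would return the raw response bytes. If the service expects query parameters instead, the payload would need to be encoded into the URL.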
- FastAPI
- Pydantic
- Uvicorn
- Clone the repository:
git clone https://github.com/your-username/speech-service-api.git
cd speech-service-api
- Install the dependencies:
pip install -r requirements.txt
- Run the application:
uvicorn main:app --reload
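
Once the server has started, a quick way to confirm it is reachable is to request the interactive documentation page that FastAPI serves at `/docs` by default. The sketch below assumes Uvicorn's default address (`127.0.0.1:8000`) and that the docs page has not been disabled in `main.py`. The helper names `docs_url` and `is_up` are hypothetical.

```python
import urllib.error
import urllib.request


def docs_url(host: str = "127.0.0.1", port: int = 8000) -> str:
    """Return the URL of FastAPI's default interactive docs page."""
    return f"http://{host}:{port}/docs"


def is_up(url: str, timeout: float = 2.0) -> bool:
    """Return True if the URL answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False
```

For example, `is_up(docs_url())` should return `True` while `uvicorn main:app --reload` is running under these assumptions, and `False` once it is stopped.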