ASR Speech Recognition
Inputs
- input: Raw audio data containing the speech
Outputs
- output: Transcribed text from the input audio.
Configuration
- source: The path to a model or the name of a pre-trained model.
- device: The device to run the model on (e.g. “cpu” or “cuda”).
- chunk_size: The size of each chunk of audio data to process.
- left_context_size: The number of chunks to consider as context for each chunk.
Description
Receives audio data and transcribes it.
Notes
This task uses a pre-trained ASR model from SpeechBrain and transcribes the input audio in real-time. It is recommended to use a high-performance device (e.g. GPU) to run this task efficiently.