AudioStreamSpeechWhisper

$29
Publisher Martin Cvengroš
File size 9.49MB
Number of files 67
Latest version 1
Latest release date 2024-01-03 04:27:42
First release date 2023-05-31 09:55:12
Supported Unity versions 2018.4.2 or higher

OpenAI Whisper is a robust offline speech recognition system which runs locally on user device and provides transcription, translation to English, and language recognition of user speech either using a microphone, or by analysing an audio file.

This asset contains native platforms libraries of Whisper, together with user friendly Editor interface.


This asset also features Voice Activity Detection (VAD) system which is able to adapt to incoming audio and determine when to invoke Whisper recognition automatically - suitable for open mic like scenarios; it's still possible to invoke Whisper manually


Currently WIndows (x86|x64), macOS (both Intel and Apple Silicon), Linux (x64), Android/ChromeOS, and iOS are supported.

It is currently optimized primarily for Apple Silicon (M1 equivalent or better) - but runs on standalones with higher CPU usage.


For demos please see:

Windows x64 | macOS | Linux (x64) | Android/ChromeOS


Latest documentation is also available here

For support please use Unity forums


Top Publisher Assets