Publisher | Martin Cvengroš |
---|---|
File size | 9.49MB |
Number of files | 67 |
Latest version | 1 |
Latest release date | 2024-01-03 04:27:42 |
First release date | 2023-05-31 09:55:12 |
Supported Unity versions | 2018.4.2 or higher |
OpenAI Whisper is a robust offline speech recognition system which runs locally on user device and provides transcription, translation to English, and language recognition of user speech either using a microphone, or by analysing an audio file.
This asset contains native platforms libraries of Whisper, together with user friendly Editor interface.
This asset also features Voice Activity Detection (VAD) system which is able to adapt to incoming audio and determine when to invoke Whisper recognition automatically - suitable for open mic like scenarios; it's still possible to invoke Whisper manually
Currently WIndows (x86|x64), macOS (both Intel and Apple Silicon), Linux (x64), Android/ChromeOS, and iOS are supported.
It is currently optimized primarily for Apple Silicon (M1 equivalent or better) - but runs on standalones with higher CPU usage.
For demos please see:
Windows x64 | macOS | Linux (x64) | Android/ChromeOS
Latest documentation is also available here
For support please use Unity forums