Publisher | AiKodex |
---|---|
File size | 6.42MB |
Number of files | 76 |
Latest version | 3.0.0 |
Latest release date | 2025-05-07 10:36:02 |
First release date | 2025-05-07 09:53:14 |
Supported Unity versions | 2018.4.2 or higher |
FeatureㅤㅤㅤDeepVoiceㅤㅤㅤDeepVoice Pro
------------------------------------------------
Quotaㅤㅤㅤㅤㅤ80,000ㅤㅤㅤㅤㅤ150,000
Char / Genㅤㅤㅤ500ㅤㅤㅤㅤㅤㅤㅤ2,500
HQ Voicesㅤㅤㅤ25ㅤㅤㅤㅤㅤㅤㅤㅤㅤ50
Formatㅤㅤㅤㅤ.mp3 / .wavㅤㅤㅤ.wav (High)
Gen Timeㅤㅤㅤ0.3s/tokㅤㅤㅤㅤㅤ0.1s/tok
Accentㅤㅤㅤㅤㅤ✅ㅤㅤㅤㅤㅤㅤㅤㅤㅤ✅
Pacingㅤㅤㅤㅤㅤ❌ㅤㅤㅤㅤㅤㅤㅤㅤㅤ✅
Toneㅤㅤㅤㅤㅤㅤ❌ㅤㅤㅤㅤㅤㅤㅤㅤㅤ✅
If you already own DeepVoice, you can upgrade to DeepVoice Pro for a 80% discount or more!
No sign-up, No API Keys, no recurring payments, no subscription fees, no additional costs, just one-click easy to use inferences on our voice model.
ABOUT
DeepVoice is an LAM (Large Audio Model) of networks and libraries that are capable of life-like voice generation through text using AI and deep learning made for Unity.
QUOTA
150,000 characters per month of voice over and narration takes with DeepVoice. 150,000 characters translates to 50 pages of 12-point text in Calibri. This quota is issued on the 1st of every month. Process up to 5x characters at a time now.
LINKS
Works in realtime, both in, Edit Mode or Play Mode inside of the Unity Editor. This asset has a one-click, beginner friendly GUI and does not require any coding to use.
Please note: The voices you hear in this description and the videos (Trailer and Getting Started) are AI generated.
Please check out the forum page for the latest developments and discussion related to this asset. We are researching and adding more functionality continuously. Your support is appreciated.
Website and Support | Documentation | Forum Page
Pipelines Supported: Standard, HDRP, URP and SRP. (All)
FEATURES
🗣 Text to Voice Converter: The main function of the asset is to provide you with ready for production voices. Simply enter the text to be voiced out and click on generate.
Narration / Dialogues / Voice over / Dubbing
"In the end, it's not the powers that define us, but the choices we make"
"Sometimes, the greatest adventures are found in the most unexpected places."
"In the darkest of nights, hope shines like a single star, reminding us that heroes are born from adversity."
"Had to be me. Someone else might have gotten it wrong."
"I think it was called Ueno Station, but I'm not sure. I've never been to Tokyo before, so everything is unfamiliar to me."
Pauses
"So I think - I should take this route if I want to reach on time"
Or
"But well... I'm not entirely convinced"
Emotions
Note: The dialogue tag ("he said confused", "he shouted angrily") has been cut out using the audio trimmer within the asset.
"I have had enough!" he shouted angrily.
"I wish you were right, I truly do, but you're not" he said, assertively.
MODELS:
Examples ( [In Order] Japanese, Spanish, German, Korean, French, Hindi, Italian, Portuguese, Polish, Arabic, Indonesian, Dutch, Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian, Czech, Greek, Finnish, Croatian, Malay, Slovak, Danish, Tamil, Ukrainian)
"ひとのために することは けっきょく めぐりめぐって じぶんのためにも なっているものだし" (Japanese)
"El Hombre Que Se Levanta Es Aún Más Grande Que El Que No Ha Caído." (Spanish)
"Wenn man dem Teufel den kleinen Finger gibt, so nimmt er die ganze Hand" (German)
"세상에서 보고싶은 변화가 있다면 당신 스스로 그 변화가 되어라." (Korean)
"L’amour est comme le vent, nous ne savons pas d’où il vient." (French)
"जल्दी सोना और जल्दी उठना इंसान को स्वस्थ ,समृद्ध और बुद्धिमान बनाता है." (Hindi)
"Penso che se io morissi e che con i miei mali finisse il desiderio, un amore così grande si spegnerebbe e il mondo intero rimarrebbe senza amore" (Italian)
"Os limites da minha linguagem são os limites do meu mundo." (Portuguese)
"Proście, a będzie wam dane, szukajcie, a znajdziecie." (Polish)
"الخطر الأعظم على معظمنا" (Arabic)
"Persiapkan hari ini sebaik-baiknya untuk menghadapi hari ésok yang baru." (Indonesian)
"Wees de verandering die je in de wereld wil zien gebeuren." (Dutch)
"Kadınlar beğenince değil, güvenince âşık olur." (Turkish)
"Ang pagiging dukha ay hindi hadlang sa tagumpay." (Filipino)
"För att lyckas i livet behöver du två saker: okunnighet och självförtroende." (Swedish)
"Никога не изневерявайте на специфичния си талант." (Bulgarian)
"Mai bine să ierți pe cel vinovat, decât să pedepsești pe cel nevinovat" (Romainian)
"Mluviti stříbro, mlčeti zlato." (Czech)
"ήρθα, είδα, νίκησα" (Greek)
"Yleensä elämässä on viisasta luottaa siihen, että kaikki menee hyvin." (Finnish)
"Tko svoj može biti, tuđ neka ne bude" (Croatian)
"Menderita sekarang, kesoronokan kemudian." (Malay)
"Horšia dlžoba ako chudoba." (Slovak)
"Af to onde Kaar skal man vaelge det bedste." (Danish)
"ஆனால் வெற்றி நம்பிக்கை உள்ளோரிடம் மட்டுமே வரும்." (Tamil)
"Ложка дьогтю у бочці меду" (Ukrainian)
👅 Language and Accent Support: DeepVoice_Instruct model supports Afrikaans, Arabic, Armenian, Azerbaijani, Belarusian, Bosnian, Bulgarian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, Galician, German, Greek, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Kannada, Kazakh, Korean, Latvian, Lithuanian, Macedonian, Malay, Marathi, Maori, Nepali, Norwegian, Persian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, Slovenian, Spanish, Swahili, Swedish, Tagalog, Tamil, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.
The DeepVoice_Multi model supports different languages such as English, Japanese, German, Hindi, French, Korean, Portuguese, Italian, Spanish, Indonesian, Dutch, Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian, Arabic, Czech, Greek, Finnish, Croatian, Malay, Slovak, Danish, Tamil, Ukrainian and the
🔊 Voice Modulation controls: These controls allow users to adjust parameters such as speech clarity and variability in voices, as well as add emotions through text prompting. By manipulating these parameters, users can customize the generated speech to better suit their needs and preferences.
〰️ Preview waveform: Play sound clips right inside the editor without going into the play mode. Scrub the play head to play any part of the clip. Timestamps and simple graphic of the waveform is shown for better clarity inside the editor.
✂️ Trim audio: A user friendly GUI in the Editor to trim the ends of an audio clip if in case a part of the clip is not required or is empty.
➕ Combine clips: Multiple audio clips can be combined into one using an intuitive user friendly feature in the editor. Simply select clips, rearrange their order with ease and merge them into one.
⚙️ Equalize tracks: Mastering audio clips involves equalization of clips which can easily be done within the editor itself. Simply select the clip, adjust gain, pitch and frequency band sliders. A 6 band equalization is offered in the editor.
📄 Editor Script: The Editor Script displays all the options neatly in one panel. The editor has an in-built preview audio player. Simple design for trimming, combining and equalizing or mastering audio tracks.
EDITOR
Keeping it all in the editor: Keeping all assets in one workspace inside the Editor and having to switch to fewer services can have several benefits, such as:
- Improved Efficiency: When all assets are located in one workspace, it becomes easier to access and manage them. Users do not have to spend time switching between different services or applications, which can be time-consuming and lead to a loss of productivity.
- Streamlined Workflow: Having all assets in one workspace can help create a more streamlined workflow. This is because users can easily move between different assets, such as code files, images, and documents, without having to navigate between different services. This can help to speed up the development process and make it more efficient.
- Reduced Complexity: Using fewer services can help to reduce the complexity of the development process.
In the pack, you will find a demo scene and an editor window which help you to access the TTS models. There are other useful audio settings like trimming, combining and mastering the audio track that can be accessed through the DeepVoice Editor Window.
DEPENDENCIES
This tool requires the Editor Coroutines package from the package manager and an active internet connection.
LIMITATIONS
Since this tool is still under development, there are a few limitations:
- Process up to 2500 character at a single time.
- Audio generation time is ~8-15 seconds per clip. This may increase with an increased number of tokens and user base.
- Character count per month is limited to 150000.