DeepVoice AI - Text To Voice

AiKodex

(2)

$79

GET NOW

Publisher	AiKodex
File size	21.78MB
Number of files	111
Latest version	2.1.3
Latest release date	2024-03-18 10:03:26
First release date	2023-07-11 02:57:12
Supported Unity versions	2018.4.2 or higher

No sign-up, No API Keys, no recurring payments, no subscription fees, no additional costs, just one-click easy to use inferences on our voice model.

ABOUT

DeepVoice is an LAM (Large Audio Model) of networks and libraries that are capable of life-like voice generation through text using AI and deep learning made for Unity.

QUOTA

30,000 characters per month (refreshed every 15 days -> 15,000 characters) of voice over and narration takes with DeepVoice. 15,000 characters translates to 5 pages of 12-point text in Calibri. This quota is issued on the 15th and the 1st of every month.

LINKS

Works in realtime, both in, Edit Mode or Play Mode inside of the Unity Editor. This asset has a one-click, beginner friendly GUI and does not require any coding to use.

Please note: The voices you hear in this description and the videos are AI generated.

Please check out the forum page for the latest developments and discussion related to this asset. We are researching and adding more functionality continuously. Your support is appreciated.

Website and Support | Documentation | Forum Page

Pipelines Supported: Standard, HDRP, URP and SRP. (All)

FEATURES

🗣 Text to Voice Converter: The main function of the asset is to provide you with ready for production voices. Simply enter the text to be voiced out and click on generate.

Examples for prompting:

Narration / Dialogues / Voice over / Dubbing

"In the darkest of nights, hope shines like a single star, reminding us that heroes are born from adversity."

▶︎ Play

"Had to be me. Someone else might have gotten it wrong."

▶︎ Play

"I think it was called Ueno Station, but I'm not sure. I've never been to Tokyo before, so everything is unfamiliar to me."

▶︎ Play

Pauses

"So I think - I should take this route if I want to reach on time"

▶︎ Play

"But well... I'm not entirely convinced"

▶︎ Play

Emotions

Note: The dialogue tag ("he said confused", "he shouted angrily") has been cut out using the audio trimmer within the asset.

"I have had enough!" he shouted angrily.

▶︎ Play

"I wish you were right, I truly do, but you're not" he said, assertively.

▶︎ Play

Famous Personalities

"I don’t hire a lot of number-crunchers, and I don’t trust fancy marketing surveys. I do my own surveys and draw my own conclusions."

▶︎ Play

"Nothing can stand in the way of the power of millions of voices calling for change."

▶︎ Play

More examples are given in the documentation

👅 Language and Accent Support: The DeepVoice_Multi model supports different languages such as English, German, Polish, Spanish, Italian, French, Portuguese, Hindi. The DeepVoice_Standard model supports different English accents such as Irish, Arabic, Mandarin, Danish, Dutch, Spanish, French, Italian, Korean, Russian, Swedish, Turkish, Welsh.

🔊 Voice Modulation controls: These controls allow users to adjust parameters such as speech clarity and variability in voices, as well as add emotions through text prompting. By manipulating these parameters, users can customize the generated speech to better suit their needs and preferences.

〰️ Preview waveform: Play sound clips right inside the editor without going into the play mode. Scrub the play head to play any part of the clip. Timestamps and simple graphic of the waveform is shown for better clarity inside the editor.

✂️ Trim audio: A user friendly GUI in the Editor to trim the ends of an audio clip if in case a part of the clip is not required or is empty.

➕ Combine clips: Multiple audio clips can be combined into one using an intuitive user friendly feature in the editor. Simply select clips, rearrange their order with ease and merge them into one.

⚙️ Equalize tracks: Mastering audio clips involves equalization of clips which can easily be done within the editor itself. Simply select the clip, adjust gain, pitch and frequency band sliders. A 6 band equalization is offered in the editor.

📄 Editor Script: The Editor Script displays all the options neatly in one panel. The editor has an in-built preview audio player. Simple design for trimming, combining and equalizing or mastering audio tracks.

EDITOR

Keeping it all in the editor: Keeping all assets in one workspace inside the Editor and having to switch to fewer services can have several benefits, such as:

- Improved Efficiency: When all assets are located in one workspace, it becomes easier to access and manage them. Users do not have to spend time switching between different services or applications, which can be time-consuming and lead to a loss of productivity.

- Streamlined Workflow: Having all assets in one workspace can help create a more streamlined workflow. This is because users can easily move between different assets, such as code files, images, and documents, without having to navigate between different services. This can help to speed up the development process and make it more efficient.

- Reduced Complexity: Using fewer services can help to reduce the complexity of the development process.

In the pack, you will find a demo scene and an editor window which help you to access the TTS models. There are other useful audio settings like trimming, combining and mastering the audio track that can be accessed through the DeepVoice Editor Window.

DEPENDENCIES

This tool requires the Editor Coroutines package from the package manager and an active internet connection.

LIMITATIONS

Since this tool is still under development, there are a few limitations:

- For now, the text that can be processed is set to a limit of 200 characters or 30 to 50 words or 5 to 6 sentences or one paragraph.

- There are around 80 voices to choose from, out of which Mono/Multi have 15. We are working on adding more.

- Audio generation time is ~8-15 seconds per clip. This may increase with an increased number of tokens and user base.

- Character count per fortnight is limited to 15000. Per month, this translates to a limitation of 30000 characters.

Please check out the documentation for an in-depth explanation and working of the asset.