Below is a directory of helpful software for getting started with UTAU and other vocalsynths! Most of these applications are free to download; any exceptions are marked as such.
This list is continually updated and polished as new editors and tools come out. If you feel that we are lacking a particular application, engine/resampler or other tool, feel free to contact us and we will add it to the list!
UTAU
- RECOMMENDED: OpenUtau – Open source, multiplatform UTAU front end.
- UTAU – Windows only. Requires the system locale to be set to Japanese before download and installation.
DeepVocal
- DeepVocal Editor – Windows only. Original DeepVocal editor.
Resamplers
“Resamplers” are the concatenative rendering engines compatible with UTAU and OpenUtau. Different resamplers process the same recordings differently, so each one gives the voice its own tonal quality.
NOTE: The below resamplers are intended to work on Windows only. While wrapping these in Wine with this method may work on Mac and Linux, rendering quality may vary when paired with the default OpenUtau wavtool.
- resampler – The default resampler.
- moresampler – A resampler with powerful flags to control voice quality. Can also automatically generate otos.
- fresamp11 – An older iteration of fresamp, considered a bit more stable. Has a nasal quality without the “F0” flag.
- fresamp14 – A newer iteration of fresamp. Has a nasal quality without the “F0” flag.
- TIPS – A resampler good for soft, breathy voices.
- bkh01 – A resampler that tends to add a little bit of metallic buzz to sibilant consonants.
- phavoco – “Phase Vocoder”
- vs4u
- world4utau (aka “w4u”) – A very picky resampler, only works on mono recordings.
- EFB-GT – Good for solid voices.
- tn_fnds – Another good resampler for soft, gentle voices.
- utaugrowl – A resampler that has flags to add growl to the voice.
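Since world4utau only works on mono recordings, it can help to check a voicebank's samples before rendering. The following is a minimal sketch, assuming the samples are plain PCM .wav files that Python's standard `wave` module can read; the helper name `is_mono` is made up for illustration and is not part of any of the tools listed here.

```python
import wave


def is_mono(path: str) -> bool:
    """Return True if the .wav file at `path` has exactly one channel.

    Assumes an uncompressed PCM .wav file; the standard `wave` module
    raises wave.Error for formats it cannot read.
    """
    with wave.open(path, "rb") as wav:
        return wav.getnchannels() == 1
```

Running this over every .wav file in a voicebank folder gives a quick list of samples that would need converting to mono first, for example in Audacity.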
Wavtools
“Wavtools” are the postprocessors in UTAU that string the rendered sounds together into the final .wav output.
NOTE: The below wavtools are Windows only and currently cannot be Wine wrapped for use on Mac or Linux.
- wavtool – The default UTAU wavtool.
- wavtool2 – A slightly improved wavtool.
- wavtool4vcv – A wavtool optimized for VCV and other strung-sound voicebanks.
AI Engines
AI engines generate audio from a voice model rather than manipulating raw .wav recordings in a database. There are currently two AI engines compatible with the different UTAU editors.
- NNSVS / ENUNU (ENUNU for OpenUtau) – A 2-part system, be sure to download both NNSVS and ENUNU to produce/use AI voicebanks.
- DiffSinger (for OpenUtau)
UTAU Plugins
NOTE: The following plugins are intended for use in original UTAU, not OpenUtau.
- Resampler Patcher – Makes most common resamplers render much faster.
- Iroiro – An all-in-one plugin that converts between romaji and hiragana, among other things.
- Lyric Diphonizer – A simple plugin that converts CV USTs to VCV.
- autoVCCV 2.0 – Converts a UST from CV to CVVC by inserting the VC samples.
- presamp – Plays a CV UST as CVVC. Can be customized with a “presamp.ini” and includes “wavtoolex”.
- utalis – A .wav-tracing automatic tuning plugin.
Recording & Cleanup
- OREMO – Windows only. Preferred tool for recording UTAU voicebanks.
- Akorin – Multiplatform OREMO alternative. NOTE: Mac version must be built from source, and lacks guideBGM functionality.
- Audacity – General recording & audio editing tool, good for audio cleanup.
- utauwav – Compresses and optimizes UTAU voicebank samples without reducing audio quality.
Configuration & Labeling
- RECOMMENDED: vLabeler – For configuring both UTAU oto.ini and AI data labeling.
- setParam – Windows only. Tool for configuring UTAU oto.ini.
- DeepVocal ToolBox – Windows only. Toolkit for configuring/labeling DeepVocal voicebanks.
- Notepad++ – Windows only. Advanced text editor useful for creating reclists or modifying oto.ini files.
- Voicebank Aliaser – Automatically generates romaji or hiragana aliases for any style of Japanese voicebanks.
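For anyone editing oto.ini by hand in a text editor, it helps to know what each line contains. An oto.ini entry has the form `filename=alias,offset,consonant,cutoff,preutterance,overlap`, with the five numbers given in milliseconds. The sketch below parses one such line; the `OtoEntry` class and `parse_oto_line` function are hypothetical names for illustration, and treating empty numeric fields as 0 is an assumption. Many Japanese voicebanks store oto.ini in Shift-JIS, so the file may need to be opened with that encoding.

```python
from dataclasses import dataclass


@dataclass
class OtoEntry:
    filename: str        # sample file, e.g. "ka.wav"
    alias: str           # name used to call the sample from a UST
    offset: float        # ms skipped at the start of the sample
    consonant: float     # ms of the fixed (unstretched) region
    cutoff: float        # ms cut from the end (negative: measured from offset)
    preutterance: float  # ms of sound placed before the note start
    overlap: float       # ms crossfaded with the previous note


def parse_oto_line(line: str) -> OtoEntry:
    """Parse a single oto.ini line into an OtoEntry."""
    filename, sep, params = line.strip().partition("=")
    if not sep:
        raise ValueError(f"not an oto.ini entry: {line!r}")
    alias, *numbers = params.split(",")
    if len(numbers) != 5:
        raise ValueError(f"expected 5 numeric fields, got {len(numbers)}")
    # Assumption: an empty numeric field is treated as 0.
    values = [float(n) if n.strip() else 0.0 for n in numbers]
    return OtoEntry(filename, alias, *values)
```

For example, `parse_oto_line("ka.wav=ka,100,80,-300,60,20")` returns an entry with alias `ka`, an offset of 100 ms, and a cutoff of -300 ms.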
Free* DAWs
For mixing, making MIDIs, or composing.
- Cockos Reaper – Windows & Mac (*unlimited free trial)
- Frinika – Windows, Mac, Linux
- LMMS – Windows, Mac, Linux
- Cakewalk – Windows
- Audacity – Windows, Mac, Linux. EXTREMELY primitive workflow. Not recommended for complex projects.
- GarageBand – Comes preinstalled on all Apple devices.
Paid DAWs
- FL Studio – Windows & Mac. Extremely commonly used among vocalsynth producers.
- Cubase – Windows & Mac. Cubase LE comes free with some hardware and voicebanks.
- Logic Pro – Mac only. $199, but less with educational bundle license.
Additional Tools
- vocalshifter – Allows for post-render .wav modification of pitch and other parameters, similar to Melodyne.
File Compression
When using UTAU and related programs that often have Japanese filenames, you’ll want a file compression app that can handle multiple compression types and file encoding so your Japanese files don’t become gibberish!
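To illustrate why filenames turn into gibberish, here is a small sketch of one common case: a name stored as Shift-JIS bytes (typical for archives made on Japanese Windows) that gets read back as CP437, the legacy default for ZIP filenames. The pairing of these two encodings is an assumption for the example, since other combinations occur as well, and the function names are hypothetical.

```python
def garble(name: str) -> str:
    """Show what a Japanese filename looks like when its Shift-JIS
    bytes are misread as CP437."""
    return name.encode("shift_jis").decode("cp437")


def repair(garbled: str) -> str:
    """Undo the misreading: recover the original bytes via CP437,
    then decode them as Shift-JIS (assumes that exact mix-up)."""
    return garbled.encode("cp437").decode("shift_jis")


original = "あ.wav"
broken = garble(original)
print(broken)          # no longer readable as Japanese
print(repair(broken))  # the original name again
```

An extraction tool that lets you choose the filename encoding avoids the problem in the first place, so a repair step like this should only be needed for files that were already extracted incorrectly.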
Video & Animation
- MMD – For creating 3D animations.
- LipSync – For creating 2D lip-sync puppets that can import VSQ data for animation.
- AviUtl – Freeware video editor for Windows.
- utauview – AviUtl plugin that imports a UST and renders a smooth side-scrolling view of it that “plays” as it would appear in the UTAU editor.