mbrola - multilingual software speech synthesizer
Contents
Description
mbrola is a speech synthesizer based on the concatenation of diphones. It takes a list of phonemes as
input, together with prosodic information (duration of phonemes and a piecewise linear description of
pitch), and produces speech samples on 16 bits (linear), at the sampling frequency of the diphone
database.
It is therefore NOT a Text-To-Speech (TTS) synthesizer, since it does not accept raw text as input. In
order to obtain a full TTS system, you need to use this synthesizer in combination with a text
processing system that produces phonetic and prosodic commands. For instance, you may use package
freephone to obtain complete English TTS.
A `-' instead of pho_file or output_file means stdin or stdout. Extension of output_file ( raw, au, wav,
aiff ) tells the wanted audio format.
Name
mbrola - multilingual software speech synthesizer
Options
mbrola understands following command line options.
-h
Show summary of options.
-i
Display the database information if any.
-e
Ignore fatal errors on unknown diphone.
-ccomment_char
Set COMMENT char (escape sequence in pho files).
-Fflush_command
Set FLUSH command name.
-vvolume
Volume ratio. Float ratio applied to output samples.
-ffreq_ratio
Frequency ratio. Float ratio applied to pitch points.
-ttime_ratio
Time ratio. Float ratio applied to phone durations.
-lvoice_freq
Voice frequency. Target frequency for voice quality in Hz.
-Rrename_list
Phoneme rename list of the form: `a A b B ...'
-Cclone_list
Phoneme clone list of the form: `a A b B ...'
-Ifile
Initialization file containing one command per line. CLONE, RENAME, VOICE, TIME, FREQ, VOLUME,
FLUSH, COMMENT, and IGNORE are available.
See Also
freephone(1), sox(1). See /usr/share/doc/mbrola/readme.txt.gz for more info.
Synopsis
mbrola [options]<voice_database><pho_file>...<output_file>
