hts_engine — HMM-based speech synthesis engine
Description
This manual page briefly documents the hts_engine command.
This manual page was written for the Debian distribution because the original program does not have a
manual page. Instead, it has documentation in the GNU Info format; see below.
hts_engine is a program that synthesizes a speech waveform from HMMs trained by the HMM-based speech
synthesis system (HTS).
Example
If you have installed hts-voice-nitech-jp-atr503-m001 in the current directory, the following command
synthesizes a voice file (output.wav) from input.lab:
% hts_engine -s 48000 -p 240 -a 0.55 \
-m nitech_jp_atr503_m001.htsvoice \
-ow output.wav \
input.lab
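The -p option is given in points (samples), so its duration depends on the sampling frequency passed with -s. The short Python sketch below works out the frame period of the example command in milliseconds. The function name frame_shift_ms is made up for illustration and is not part of hts_engine.

```python
def frame_shift_ms(sampling_rate_hz: int, frame_period_points: int) -> float:
    """Convert a frame period given in samples to milliseconds."""
    return 1000.0 * frame_period_points / sampling_rate_hz


# Values from the example command: -s 48000 -p 240
print(frame_shift_ms(48000, 240))  # 5.0
```

With the values above, each frame covers 5 ms of audio.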
Name
hts_engine — HMM-based speech synthesis engine
Options
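A summary of options is included below. An option argument is shown as s (string), i (integer), or
f (floating-point number). Where two pairs of brackets follow a description, the first gives the
default value and the second gives the valid range (minimum--maximum).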
-m  htsvoice    HTS voice files
-od s           filename of output label with duration
-om s           filename of output spectrum
-of s           filename of output log F0
-ol s           filename of output low-pass filter
-or s           filename of output raw audio (generated speech)
-ow s           filename of output wav audio (generated speech)
-ot s           filename of output trace information
-vp             use phoneme alignment for duration
-i  i f1 .. fi  enable interpolation & specify number (i), coefficient (f)
-s  i           sampling frequency [auto][1-- ]
-p  i           frame period (point) [auto][1-- ]
-a  f           all-pass constant [auto][0.0--1.0]
-b  f           postfiltering coefficient [0.0][0.0--1.0]
-r  f           speech speed rate [1.0][0.0-- ]
-fm f           add half-tone [0.0][ -- ]
-u  f           voiced/unvoiced threshold [0.5][0.0--1.0]
-jm f           weight of GV for spectrum [1.0][0.0-- ]
-jf f           weight of GV for log F0 [1.0][0.0-- ]
-z  i           audio buffer size (if i==0, turn off) [0][0-- ]
infile          label file
Generated spectrum, log F0, and low-pass filter coefficient sequences are saved in natural-endian binary
(float) format.
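As an illustration of that output format, the following Python sketch reads such a file back into a list of numbers. It rests on two assumptions not stated on this page: that "float" means a 4-byte C float, and that the file is a headerless sequence of values in the byte order of the machine that wrote it. The function name read_float_sequence and the file name output.lf0 are hypothetical.

```python
from array import array


def read_float_sequence(path):
    """Read a headerless file of native-endian 4-byte floats (assumed layout)."""
    values = array("f")  # "f" is a C float in the machine's own byte order
    with open(path, "rb") as f:
        data = f.read()
    # Ignore a trailing partial value, if any, so frombytes() does not raise.
    usable = len(data) - len(data) % values.itemsize
    values.frombytes(data[:usable])
    return values.tolist()


# Hypothetical usage, assuming hts_engine was run with "-of output.lf0":
# lf0 = read_float_sequence("output.lf0")
```

If the file was produced on a machine with a different byte order, calling byteswap() on the array before converting it should give the intended values.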
Synopsis
hts_engine [options] [infile]
