
hts_engine — HMM-based speech synthesis engine

Author

       This manual page was written by Koichi Akabe <vbkaisetsu@gmail.com> for the Debian system (and may be used
       by others).  Permission is granted to copy, distribute and/or modify this document under the terms of the
       GNU General Public License, Version 2 or any later version published by the Free Software Foundation.

       On Debian systems, the complete text of the GNU General Public License can be found in /usr/share/common-
       licenses/GPL.

                                                                                                    htsengine(1)

Description

       This manual page briefly documents the hts_engine command.

       This manual page was written for the Debian distribution because the original program does not have a
       manual page.  Instead, it has documentation in the GNU Info format; see below.

       hts_engine is a program that synthesizes a speech waveform from HMMs trained by the HMM-based speech
       synthesis system (HTS).

Example

       If you installed hts-voice-nitech-jp-atr503-m001 in the current directory, the following command lets you
       make a voice file from input.lab:

              % hts_engine -s 48000 -p 240 -a 0.55 \
              -m nitech_jp_atr503_m001.htsvoice \
              -ow output.wav \
              input.lab
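
       The same call can be wrapped in a small shell function when several label files need to be processed.
       The sketch below is only an illustration: the function name synthesize, the DRY_RUN switch, and the
       output naming scheme are assumptions, not part of hts_engine. It uses only options listed on this page
       (-m, -r, -ow, -ot).

```shell
#!/bin/sh
# Sketch: synthesize one label file to WAV and also save trace information.
# The function name, DRY_RUN switch, and file naming are hypothetical.

synthesize() {
    voice=$1   # path to a .htsvoice file
    label=$2   # input label file
    out=$3     # output prefix; ".wav" and ".trace" are appended

    # Build the command line from options documented in this manual page.
    set -- hts_engine -m "$voice" -r 1.0 -ow "$out.wav" -ot "$out.trace" "$label"

    # With DRY_RUN=1, print the command instead of running it.
    if [ "${DRY_RUN:-0}" = 1 ]; then
        echo "$*"
        return 0
    fi

    # Fail early with a clear message if the program is not installed.
    if ! command -v hts_engine >/dev/null 2>&1; then
        echo "hts_engine not found in PATH" >&2
        return 127
    fi

    "$@"
}

# Example call (commented out so that sourcing this file has no side effects):
# synthesize nitech_jp_atr503_m001.htsvoice input.lab output
```

       Setting DRY_RUN=1 before the call prints the assembled command line, which is a convenient way to check
       the options before running the synthesis.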

Name

       hts_engine — HMM-based speech synthesis engine

Options

       A summary of options is included below.  An argument shown as s is a string (such as a file name), i is
       an integer, and f is a floating-point number.  Where two bracketed fields follow a description, the first
       gives the default value and the second gives the allowed range.

       -m htsvoice
                 HTS voice files

       -od s     filename of output label with duration

       -om s     filename of output spectrum

       -of s     filename of output Log F0

       -ol s     filename of output low-pass filter

       -or s     filename of output raw audio (generated speech)

       -ow s     filename of output wav audio (generated speech)

       -ot s     filename of output trace information

       -vp       use phoneme alignment for duration

       -i i f1 .. fi
                 enable interpolation & specify number(i),coefficient(f)

       -s i      sampling frequency [auto][  1--   ]

       -p i      frame period (point) [auto][  1--   ]

       -a f      all-pass constant [auto][0.0--1.0]

       -b f      postfiltering coefficient [0.0][0.0--1.0]

       -r f      speech speed rate [1.0][0.0--   ]

       -fm f     add half-tone [0.0][   --   ]

       -u f      voiced/unvoiced threshold [0.5][0.0--1.0]

       -jm f     weight of GV for spectrum [1.0][0.0--   ]

       -jf f     weight of GV for Log F0 [1.0][0.0--   ]

       -z i      audio buffer size (if i==0, turn off) [  0][  0--   ]

       infile    label file

       Generated spectrum, log F0, and low-pass filter coefficient sequences are saved in natural endian, binary
       (float) format.
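
       Because these files are headerless sequences of floats, they can be inspected with standard tools.  The
       sketch below uses od to print the first few values of such a file.  It assumes that a float occupies
       4 bytes and that the file is read on a machine with the same byte order as the one that wrote it; the
       function name dump_floats is hypothetical.

```shell
#!/bin/sh
# Sketch: print the first N values of a raw float file, for example one
# written with -of or -om. Assumes 4-byte floats in the host's byte order.

dump_floats() {
    file=$1
    count=${2:-8}    # number of values to show (default: 8)

    # -A n  : suppress the address column
    # -t f4 : interpret the input as 4-byte floating-point values
    # -v    : do not collapse repeated lines into "*"
    # -N    : read only the first count*4 bytes
    od -A n -t f4 -v -N "$((count * 4))" "$file"
}

# Example call (commented out so that sourcing this file has no side effects):
# dump_floats output.lf0 16
```

       The file name output.lf0 is only a placeholder for whatever name was passed to -of.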

Synopsis

       hts_engine [options] [infile]

See Also