logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

g2p-sk - phonetic transcription for Slovak

Author

       Jozef Ivanecky (dodo (at) kanoistika.sk)

Description

       The phonetic transcription is essential for some linguistic or speech recognition applications. Depending
       on the language either rule based or statistical approach is being used. g2p-sk implements the rule based
       approach but in the future it may be replaced by statistical one.

       Each  input  word  consisting of the sequence of graphemes is transcribed in to the sequence of phones in
       the SAMPA coding. If no input file is specified, the standard input is expected. If input  file  is  used
       then  the  output  is  written  in to the file as well. The filename is input filename with the extension
       "_trans.txt".

       The input output code page is ISO 8859-2. To use it with different CP use some CP  converter  and  pipes.
       For example to have input and output in UTF-8 use (for interactive use): filtermUTF8-iso2iso2-UTF8g2p-sk or (for batch processing) iconv-fUTF-8-tISO_8859-2|g2p-sk|iconv-fISO_8859-2-tUTF-8

       Performance  of the phonetic transcription depend on the morphematic segmentation. To improve the quality
       of the morphematic segmentation is possible to replace  the  small  version  of  the  simple  morphematic
       dictionary   in   the  /usr/share/g2p_sk/Exceptions/morfemy.ddat  with  the  better  one.   The  syllabic
       segmentation is as important as morphematic one. The  syllabic  segmentation  is  provided  by  sylseg-sk
       package.

       The  design  of the g2p-sk is language dependent. To use it for another language the all rules need to be
       rewritten.

Examples

       Use standard input and debug level 3:
              g2p-sk --dl 3

       Process all the from file aaa.txt:
              g2p-sk aaa.txt

Exit Status

       g2p-sk returns a zero if it succeeds to process all the input words

Name

       g2p-sk - phonetic transcription for Slovak

Options

       --color
              Enable color output.

       --dl 1..5
              Set the debug level. Control the amount of  displayed  information  The  debug  level  0  displays
              nothing. The maximum level 5 displays full debugging report. The default debug level is 1.

       --help Display a short help text

       --ofile <file_name>
              Write output also in to given file.

       --stats
              Count and display statistic for each phone

See Also

sylseg-sk(1), filterm(1), iconv(1), konwert(1)

version 0.4                                       May 17, 2009                                         g2p-sk(1)

Synopsis

g2p-sk [--color] [--dl debug level] [--help] [--stats] [--ofile <file_name>] [<input file>]

See Also