logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

set_unicharset_properties - set properties about the unichars

Author

       The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995)
       and Google (2006-2018).

                                                   01/19/2025                            SET_UNICHARSET_PROPE(1)

Copying

       Copyright (C) 2012 Google, Inc. Licensed under the Apache License, Version 2.0

Description

set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it
       with properties about the unichars it contains and writes the result back to another unicharset file.

History

set_unicharset_properties(1) was first made available for tesseract version 3.03.

Name

       set_unicharset_properties - set properties about the unichars

Options

--script_dir/path/to/langdata
           (Input) Specify the location of directory for universal script unicharsets and font xheights
           (type:string default:)

       --Uunicharsetfile
           (Input) Specify the location of the unicharset to load as input.

       --Ounicharsetfile
           (Output) Specify the location of the unicharset to be written with updated properties.

Resources

       Main web site: https://github.com/tesseract-ocr Information on training:
       https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html

See Also

tesseract(1)

Synopsis

set_unicharset_properties --U input_unicharsetfile --script_dir /path/to/langdata --O
       output_unicharsetfile

See Also