man page merge_unicharsets section 1

merge_unicharsets(1)                                      merge_unicharsets(1)

NAME

       merge_unicharsets - Simple tool to merge two or more unicharsets.

SYNOPSIS

       merge_unicharsets unicharset-in-1 ... unicharset-in-n unicharset-out

DESCRIPTION

       merge_unicharsets(1) is a simple tool to merge two or more unicharsets.
       It could be used to create a combined unicharset for a script-level
       engine, like the new Latin or Devanagari.

IN/OUT ARGUMENTS

       unicharset-in-1
           (Input) The name of the first unicharset file to be merged.

       unicharset-in-n
           (Input) The name of the nth unicharset file to be merged.

       unicharset-out
           (Output) The name of the merged unicharset file.

HISTORY

       merge_unicharsets(1) was first made available for
       tesseract4.00.00alpha.

RESOURCES

       Main web site: https://github.com/tesseract-ocr Information on training
       tesseract LSTM:
       https://tesseract-ocr.github.io/tessdoc/TrainingTesseract-4.00.html

COPYING

       Copyright (C) 2012 Google, Inc. Licensed under the Apache License,
       Version 2.0

AUTHOR

       The Tesseract OCR engine was written by Ray Smith and his research
       groups at Hewlett Packard (1985-1995) and Google (2006-2018).

                                  08/31/2024              merge_unicharsets(1)

tesseract 5.4.1 - Generated Thu Oct 3 16:28:29 CDT 2024

NAME

SYNOPSIS

DESCRIPTION

IN/OUT ARGUMENTS

HISTORY

RESOURCES

SEE ALSO

COPYING

AUTHOR