Cherokee OCR in C#

126 More Languages

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Cherokee.

It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Cherokee

This package contains 120 OCR languages for .NET:

  • CherokeeAlphabet
  • CherokeeAlphabetBest
  • CherokeeAlphabetFast
  • Cherokee
  • CherokeeBest
  • CherokeeFast

Download

Cherokee Language Pack [ᏣᎳᎩ ᎦᏬᏂᎯᏍᏗ]

Installation

The first thing we have to do is install our Cherokee OCR package to your .NET project.

PM> Install-Package IronOCR.Languages.Cherokee
PM> Install-Package IronOCR.Languages.Cherokee
SHELL

Code Example

This C# code example reads Cherokee text from an image or PDF document.

// Import the IronOcr namespace
using IronOcr;

class Program
{
    static void Main()
    {
        // Create an instance of the IronTesseract class
        var Ocr = new IronTesseract();

        // Set the language for OCR to Cherokee
        Ocr.Language = OcrLanguage.Cherokee;

        // Use OcrInput to specify the image or PDF to be read
        using (var Input = new OcrInput(@"images\Cherokee.png"))
        {
            // Perform OCR on the input document
            var Result = Ocr.Read(Input);

            // Extract all recognized text
            var AllText = Result.Text;

            // Display the recognized text
            Console.WriteLine(AllText);
        }
    }
}
// Import the IronOcr namespace
using IronOcr;

class Program
{
    static void Main()
    {
        // Create an instance of the IronTesseract class
        var Ocr = new IronTesseract();

        // Set the language for OCR to Cherokee
        Ocr.Language = OcrLanguage.Cherokee;

        // Use OcrInput to specify the image or PDF to be read
        using (var Input = new OcrInput(@"images\Cherokee.png"))
        {
            // Perform OCR on the input document
            var Result = Ocr.Read(Input);

            // Extract all recognized text
            var AllText = Result.Text;

            // Display the recognized text
            Console.WriteLine(AllText);
        }
    }
}
$vbLabelText   $csharpLabel

Here's a short description of how the code works:

  • IronTesseract: An instance of the IronTesseract class is created to handle OCR operations.
  • OcrLanguage.Cherokee: The language is set to Cherokee using the OcrLanguage.Cherokee property, which tells the OCR engine to recognize text in Cherokee.
  • OcrInput: An input path is provided to the OcrInput class where the image or PDF document resides.
  • Ocr.Read: The Read method is called on the OCR object, passing in the input. It performs the OCR process.
  • Result.Text: Extracts the recognized text from the result and stores it in the AllText variable for further use.
  • Console output: The recognized text is printed out to the console.