Assamese OCR in C# and .NET

126 More Languages

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Assamese.

It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Assamese

This package contains 49 OCR languages for .NET:

  • Assamese
  • AssameseBest
  • AssameseFast

Download

Assamese Language Pack [অসমীয়া]

Installation

The first thing we have to do is install our Assamese OCR package to your .NET project.

PM> Install-Package IronOCR.Languages.Assamese
PM> Install-Package IronOCR.Languages.Assamese
SHELL

Code Example

This C# code example reads Assamese text from an Image or PDF document.

// Make sure to install the necessary package:
// PM> Install-Package IronOcr.Languages.Assamese

using IronOcr;

class OCRExample
{
    public void ReadAssameseText()
    {
        // Create an instance of IronTesseract OCR engine
        var Ocr = new IronTesseract();

        // Set the language to Assamese
        Ocr.Language = OcrLanguage.Assamese;

        // Create an OCR input object with the specified image or PDF file
        using (var Input = new OcrInput(@"images\Assamese.png"))
        {
            // Read the text from the input file
            var Result = Ocr.Read(Input);

            // Retrieve the text from the OCR result
            var AllText = Result.Text;

            // Output the recognized text to the console
            Console.WriteLine(AllText);
        }
    }
}
// Make sure to install the necessary package:
// PM> Install-Package IronOcr.Languages.Assamese

using IronOcr;

class OCRExample
{
    public void ReadAssameseText()
    {
        // Create an instance of IronTesseract OCR engine
        var Ocr = new IronTesseract();

        // Set the language to Assamese
        Ocr.Language = OcrLanguage.Assamese;

        // Create an OCR input object with the specified image or PDF file
        using (var Input = new OcrInput(@"images\Assamese.png"))
        {
            // Read the text from the input file
            var Result = Ocr.Read(Input);

            // Retrieve the text from the OCR result
            var AllText = Result.Text;

            // Output the recognized text to the console
            Console.WriteLine(AllText);
        }
    }
}
$vbLabelText   $csharpLabel
  • IronTesseract: This is the main class responsible for OCR operations.
  • OcrLanguage.Assamese: This specifies the language for OCR. In this case, it's set to Assamese.
  • OcrInput: This class is used to load images or PDFs from which you want to extract text.
  • Result.Text: Contains the complete text extracted from the image or PDF.