Assamese OCR in C# and .NET
IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Assamese.
It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.
Contents of IronOcr.Languages.Assamese
This package contains 49 OCR languages for .NET:
- Assamese
- AssameseBest
- AssameseFast
Download
Assamese Language Pack [অসমীয়া]
Installation
The first thing we have to do is install our Assamese OCR package to your .NET project.
PM> Install-Package IronOCR.Languages.Assamese
PM> Install-Package IronOCR.Languages.Assamese
Code Example
This C# code example reads Assamese text from an Image or PDF document.
// Make sure to install the necessary package:
// PM> Install-Package IronOcr.Languages.Assamese
using IronOcr;
class OCRExample
{
public void ReadAssameseText()
{
// Create an instance of IronTesseract OCR engine
var Ocr = new IronTesseract();
// Set the language to Assamese
Ocr.Language = OcrLanguage.Assamese;
// Create an OCR input object with the specified image or PDF file
using (var Input = new OcrInput(@"images\Assamese.png"))
{
// Read the text from the input file
var Result = Ocr.Read(Input);
// Retrieve the text from the OCR result
var AllText = Result.Text;
// Output the recognized text to the console
Console.WriteLine(AllText);
}
}
}
// Make sure to install the necessary package:
// PM> Install-Package IronOcr.Languages.Assamese
using IronOcr;
class OCRExample
{
public void ReadAssameseText()
{
// Create an instance of IronTesseract OCR engine
var Ocr = new IronTesseract();
// Set the language to Assamese
Ocr.Language = OcrLanguage.Assamese;
// Create an OCR input object with the specified image or PDF file
using (var Input = new OcrInput(@"images\Assamese.png"))
{
// Read the text from the input file
var Result = Ocr.Read(Input);
// Retrieve the text from the OCR result
var AllText = Result.Text;
// Output the recognized text to the console
Console.WriteLine(AllText);
}
}
}
- IronTesseract: This is the main class responsible for OCR operations.
- OcrLanguage.Assamese: This specifies the language for OCR. In this case, it's set to Assamese.
- OcrInput: This class is used to load images or PDFs from which you want to extract text.
- Result.Text: Contains the complete text extracted from the image or PDF.