Telugu OCR in C# and .NET
Other versions of this document:
IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Telugu.
It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.
Contents of IronOcr.Languages.Telugu
This package contains several OCR language models for .NET related to Telugu:
- Telugu
- TeluguBest
- TeluguFast
- TeluguAlphabet
- TeluguAlphabetBest
- TeluguAlphabetFast
Download
Telugu Language Pack [తెలుగు]
Installation
The first step is to install the Telugu OCR package into your .NET project.
Install-Package IronOCR.Languages.Telugu
Code Example
This is a C# code example that reads Telugu text from an image or PDF document.
// Ensure that you have installed the IronOCR.Languages.Telugu package before running this code.
using IronOcr;
public class TeluguOcrExample
{
public static void Main()
{
// Create a new IronTesseract instance
var Ocr = new IronTesseract();
// Specify the OCR language as Telugu
Ocr.Language = OcrLanguage.Telugu;
// Create a new OcrInput and specify the path to the image or PDF
using (var Input = new OcrInput(@"images\Telugu.png"))
{
// Perform OCR on the input file
var Result = Ocr.Read(Input);
// Extract and store the recognized text
var AllText = Result.Text;
// Output the recognized text to the console (optional)
Console.WriteLine(AllText);
}
}
}
// Ensure that you have installed the IronOCR.Languages.Telugu package before running this code.
using IronOcr;
public class TeluguOcrExample
{
public static void Main()
{
// Create a new IronTesseract instance
var Ocr = new IronTesseract();
// Specify the OCR language as Telugu
Ocr.Language = OcrLanguage.Telugu;
// Create a new OcrInput and specify the path to the image or PDF
using (var Input = new OcrInput(@"images\Telugu.png"))
{
// Perform OCR on the input file
var Result = Ocr.Read(Input);
// Extract and store the recognized text
var AllText = Result.Text;
// Output the recognized text to the console (optional)
Console.WriteLine(AllText);
}
}
}
' Ensure that you have installed the IronOCR.Languages.Telugu package before running this code.
Imports IronOcr
Public Class TeluguOcrExample
Public Shared Sub Main()
' Create a new IronTesseract instance
Dim Ocr = New IronTesseract()
' Specify the OCR language as Telugu
Ocr.Language = OcrLanguage.Telugu
' Create a new OcrInput and specify the path to the image or PDF
Using Input = New OcrInput("images\Telugu.png")
' Perform OCR on the input file
Dim Result = Ocr.Read(Input)
' Extract and store the recognized text
Dim AllText = Result.Text
' Output the recognized text to the console (optional)
Console.WriteLine(AllText)
End Using
End Sub
End Class
This code snippet initializes an OCR engine using the IronOCR package, sets the Telugu language for OCR processing, and reads text from an input image file specified by the user.