Multi-Language OCR Application

A React-based Optical Character Recognition (OCR) application that supports both English and Bangla text extraction from images. Built using Tesseract.js, this application can process images containing text in English, Bangla, or both languages simultaneously.

Features

Support for English and Bangla text recognition
Bilingual mode for mixed language content
Real-time processing progress indicator
Confidence score display
Image preview before processing
Clean and intuitive user interface

Sample Results

Sample 1: Passport Data

Sample passport image showing bilingual text recognition capabilities

Sample 2: Bangla Text

Sample Bangla text document demonstrating native language processing

Note: All sample images used in this documentation are collected from Google Images and are used for demonstration purposes only.

Accuracy Notes

Tesseract OCR's accuracy varies depending on several factors:

Image quality (resolution, contrast, lighting)
Text clarity and font type
Language complexity

In testing, Tesseract.js typically achieves:

English text: 85-95% accuracy with clear images
Bangla text: 75-85% accuracy with clear images
Mixed language: 70-80% accuracy

For best results:

Use clear, well-lit images
Ensure text has good contrast with background
Avoid skewed or rotated text
Use high-resolution images

Technologies Used

React + Vite
Tesseract.js
CSS3 for styling

Getting Started

Clone the repository
Install dependencies:

npm install

npm run dev

## Author
en-arnob

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
public		public
samples		samples
src		src
.gitignore		.gitignore
README.md		README.md
eslint.config.js		eslint.config.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
vite.config.js		vite.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Multi-Language OCR Application

Features

Sample Results

Sample 1: Passport Data

Sample 2: Bangla Text

Accuracy Notes

Technologies Used

Getting Started

About

Uh oh!

Releases

Packages

Uh oh!

Languages

en-arnob/ocr-react

Folders and files

Latest commit

History

Repository files navigation

Multi-Language OCR Application

Features

Sample Results

Sample 1: Passport Data

Sample 2: Bangla Text

Accuracy Notes

Technologies Used

Getting Started

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages