Skip to page content

An NISC LogoCompany

IMC's Foreign Language Optical Character Recognition (OCR)

IMC's Foreign Language Optical Character Recognition automatically recognizes as accurately as possible characters in primarily Arabic foreign language paper documents and renders electronic duplicates that comprise digital documents. For retrieval purposes, it produces a full-text index of the documents scanned to accompany the electronic document images in your archive.

With pattern recognition built-in, IMC's Foreign Language OCR is especially useful for documents in various Arabic languages like Farsi whose cursive-like characters that overlap and may be broken or accompanied by various diacritic markings that denote different meanings. It can also duplicate fonts, underlining, bold and other formatting.

Our Foreign Language OCR operates in a Windows environment with most scanners and achieves recognition rates of up to 800 characters per second. Also, its Artificial Intelligence improves recognition rates the more documents that you scan because it "learns" as it accrues a larger data set from which to infer the most plausible choices of characters.