16.2.6+103

📅 2025-05-16

New features

N/A

Improvements

Text direction hint for OCR

The possibility of specifying a text direction for OCR of Asian languages has been reintroduced to improve recognition accuracy in specific scenarios.

As a consequence, the value TextDirection.Any has also been added to the relevant enum to represent automatic detection, which remains the default behavior.

Default vs. Explicit behavior

  • Default behavior: The OCR engine automatically detects the text direction (TextDirection.Any). This will give the appropriate output in the vast majority of the use cases.

  • Explicit behavior: In certain scenarios — especially in the context of zonal OCR — automatic detection may misinterpret the direction due to limited context. In these cases, explicitly setting the text direction when already known will improve accuracy.

The following snippet demonstrates how to configure OCR with an explicit text direction:

  COcrContext objContext = COcrContext::Create(evLanguage);
  objContext.SetTextDirection(TextDirection::TopToBottom);

  CTextRecognition objTextRecognition = CTextRecognition::Create(objIdrs);
  objTextRecognition.SetOcrParams(COcrPageParams::Create(objContext));

  objTextRecognition.RecognizeText(objImage);

Greyscale detection during PDF loading

A new property, CImageLoadOptionsPdf.GreyscaleDetection, has been introduced in order to enable/disable the rasterization of PDF pages as greyscale CImage objects.

This property is enabled by default, maintaining behavior consistent with previous iDRS releases.

Disabling greyscale detection can offer performance improvements in terms of speed, but at the cost of increased memory usage, as pages will be rasterized in color rather than greyscale.

Added/removed resources

N/A

Fixed bugs

Internal ID Description Service desk IDs

IDRSRD-9934

PDF/UA generated by the iDRS doesn’t succeed compliance validation when document contains complex tables

IDRSRD-9927

the iDRS encounters a crash when rotating a specific image

IDRSRD-9920

The iDRS requires extra unexpected OCR resources to run auto-orientation only

IDRSRD-9908

The iDRS should expose a flag to enable or disable greyscale image detection during loading of a PDF page

IDRSRD-9903

Confidence values of language and orientation detection feature are unusable with 16.2.6+82

ISD-36788

IDRSRD-9901

The default character set is missing some supported characters for Japanese language

IDRSRD-9900

.NET samples cannot be compiled on Linux

IDRSRD-9884

Graphic shapes detected by iDRS are incorrectly scaled on output document if input image resolution is different than 300 dpi

IDRSRD-9883

The OCR engine library is linked with WS2_32.dll for no reason

ISD-36751

IDRSRD-9882

API reference main pages are different between C++, .NET and C APIs

IDRSRD-9856

The iDRS can return OCR results outside of input zones when running zonal OCR

IDRSRD-9839

The iDRS merges lines from different text columns on a specific image

IDRSRD-9796

Orientation detection gives unexpected answer on a border-case scenario

IDRSRD-9795

An integrator should be able to hint the OCR engine for the text direction to detect, when processing Asian documents

IDRSRD-9754

The iDRS is not compatible with VirtualBox VMs running on Windows Hosts

ISD-36479

IDRSRD-9703

The new segmentation filters isolated punctuations or characters

IDRSRD-5619

DOCX output created by the iDRS is poor for specific images