Optical Character Recognition(OCR)

Optical Character Recognition (OCR) involves the identification, categorization, and labeling of relevant information (referred to as entities) in the document. Tensoract Studio offers a range of features and settings to significantly enhance the effectiveness of OCR tasks.

To get started, refer below for a quick overview of the OCR project creation process.

_images/OCR_PROJECT_Updated.gif

For details about the project creation process, refer to Getting Started.

Labels and Relationships

Pre-Labeling Model

To utilize a custom pre-labeling model in a project, it must be registered within the Tensoract Studio prior to usage. Refer to Model Integration for more details.

  1. Registered models are visible in the Pre-Labeling Model and Auto Labeling Modesl drop-down menus.

  2. Select one or more custom model(s) from the Pre-Labeling drop-down.

  3. Once the model is selected, label classes can be mapped from model to the project. This mapping ensures that the corresponding annotations will be available within the tasks.

Table & Key/Value Detection

  1. If a pre-labeling model is not selected, the options of Table Detection Model and Key/Value Detection Model are available for selection.

  2. For Table Detection, use AWS Textract Table Detection Model.

  3. For Key/Value Detection, use AWS Textract Key/Value Detection Model.

Upon selecting Table and Key/Value Detection models, following options become available for Textract processing:

  1. Enable key value pairs dictionary is used to enable dictionary mapping to labeling classes.

  2. Dictionary:match keys only match the dictionary patterns with the extracted key. The default patterns matches the key and value pairs.

  3. Dictionary:ignore non matching key-value pairs ignores non matching key value pairs.

Auto Labeling Models

  1. You can also optionally select one or more auto labeling models to enable assistive labeling in the task.

  2. Once the model is selected, label classes can be mapped from the model to the project. This mapping ensures that the appropriate annotations are available within the task.

Labels

You can map a list of label classes for the Pre-Labeling and Auto Labeling models selected above.

Advanced Settings

The Advanced Settings section sets the behavior of the task workflow in the current project.

Project Attributes

The following Project Attribute settings are available for configuration:

Allow Text Mode -

  • Enable/disable icon to highlight text in a document.

  • If Text option is checked, then default text mode is enabled in the task.

  • If Labels is checked, then default labels are displayed in the task.

Allow Table Mode -

  • Enable/disable toggle table mode icon in the task.

Alternative text
  • To create a table, click on the table icon in the top menu bar and select the area where the table needs to be created.

Alternative text
  • Edit table by right-clicking on cells.

  • Select multiple cells with a right-click.

  • Upon right-click, a pop-up menu shows up with following options.

Alternative text

Allow Bounding Box Rotation

Enables ability to rotate the bounding box to match text orientation.

Disable Automatic Key-pair linking

It disables automatic linking of key value pair but users can manually link the key value pair.

Annotator View

Shortcut Keys

The following shortcut keys serve as helpful references for annotators and reviewers during the task annotation process.

Keys

Use

Ctrl + F or F3

Find Text

Ctrl + G

Find Next

Ctrl + ]

Move to next entity

Ctrl + [

Move to previous entity

Ctrl + }

Move to next entity of same type

Ctrl + {

Move to previous entity of same type

Ctrl + Arrow Right

Move to next page

Ctrl + Arrow Left

Move to previous page

Ctrl +

Zoom In

Ctrl -

Zoom Out

Ctrl + 0

Zoom Reset

Alt

Hide Annotations, Label popup, Relationship

Shift + Text Select

Add words to selected text

Ctrl + Text Select

Unselect words from current annotation

Alt + Entity Select

Enable Entity Comment

Delete

Delete selected entity/relationship/box

Once logged into the studio, the annotator is presented with the task queue, which includes a list of tasks available for them to work on.

_images/OCR-Annotation-View.gif

Reviewer View

The reviewers have the authority to approve, reject, or request adjustments from the annotator for a given task. Watch the following video to get a better understanding of the QC/Reviewer view.

_images/OCR-Review-View.gif