Options Properties

The TextConverter Options pane is located in the lower-right part of the user interface (UI). You can hide the Options pane by clicking the top part of its left splitter.

The Options properties provide general control over TextConverter behavior. TextConverter works in one of three modes: “Auto”, “Manual”, and “Delimited”. The first item in the Options properties list, “Mode”, selects the operational mode for the current project.

Auto Mode (default) - provides visual means that enable users with no programming skills to solve a wide range of data extraction tasks. It also allows experienced users to take advantage of the extensive capabilities of the Target development platform.

Manual Mode - provides ultimate flexibility for professionals with a programming background to solve data extraction and processing tasks of virtually any complexity. In this mode, projects are configured using VBScript, taking advantage of the vast VBScript resources combined with the powerful built-in TextConverter libraries. The platform libraries cover a wide range of functionality, including text parsing, data transformation and processing, as well as working with the file system, email objects, and internet resources. Scripting can also be used in combination with the other operational modes.

Delimited Mode - supports working with structured input documents such as MS Excel and delimited text files. It is mostly used for data mapping, transformation, and loading into relational databases or flat-file data stores. While TextConverter provides a convenient interface for importing structured data files, SiMX also offers professional users DbConverter, a fully featured component for more advanced mapping of structured data sources.
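
To make the delimiter-related settings of Delimited mode more concrete (Record Delimiter, Field Delimiter, Text Qualifier, and Get Field Names from Line, all listed in the Common Options Properties table), here is a minimal Python sketch of how such settings typically drive the splitting of a delimited text file. It is an illustration only, not TextConverter's implementation: the function name `parse_delimited` and its parameters are hypothetical, the field delimiter and text qualifier are assumed to be single characters, and the record delimiter is assumed never to occur inside qualified text.

```python
import csv


def parse_delimited(text, record_delimiter="\n", field_delimiter=",",
                    text_qualifier='"', field_names_line=1):
    """Split delimited text into a list of dict records.

    Hypothetical helper for illustration. field_names_line is 1-based and
    counts non-empty records only (an assumption of this sketch).
    """
    # Split the input into records and drop empty ones.
    raw_records = [r for r in text.split(record_delimiter) if r.strip()]

    # Split each record into fields, honoring the text qualifier.
    rows = [
        next(csv.reader([r], delimiter=field_delimiter, quotechar=text_qualifier))
        for r in raw_records
    ]

    # Take the field names from the requested line; data starts right after it.
    header = rows[field_names_line - 1]
    return [dict(zip(header, row)) for row in rows[field_names_line:]]


sample = 'id;name\n1;"Smith; John"\n2;Doe\n'
for record in parse_delimited(sample, field_delimiter=";"):
    print(record)
```

In the sample, the text qualifier keeps `Smith; John` together as one field even though it contains the field delimiter.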

The Options properties differ for each operational mode and input format. Consequently, there are multiple combinations of Options properties, all of which are presented in the two tables below: Common Options Properties and Context Dependent Options Properties.
 
Common Options Properties

| Property | Sub-Property | Value | Comment |
|---|---|---|---|
| Mode | Auto | | Use this mode for configuring data extraction projects visually, with minimal or no scripting. |
| | Manual | | Use this mode for implementing pure script-based parsing projects. |
| | Delimited | | Use this mode for extracting data from delimited data sources, such as Excel and any delimited text files. |
| Record Delimiter | | | Character string used as the record separator. |
| Field Delimiter | | | Character string used as the field separator. |
| Text Qualifier | | | Character string used as the text qualifier. |
| Get Field Names from Line | | | Defines the number of the line from which the field names are taken. |
| Freeze the input dictionary | | | If checked, prevents the input dictionary from changing when a new data source is loaded. |
| Scanned Image | | True/False | If checked, ignores the fonts. |
| PDF Mode | | As is, Zoom 1, Zoom 2, Zoom 3, Zoom spread | In these modes, TextConverter presents the input document’s text at the ‘points’ resolution, reproducing the original document’s layout as accurately as possible and taking into consideration the original font properties (type, size, color). Suitable for most tasks, but especially useful for PDF documents with significantly different fonts. |
| | | Uniform | In this mode, TextConverter presents the input text as ASCII characters, ignoring the original font properties and using a fixed-size font instead. In some cases, it might deliver better results and could be more convenient for implementing script customizations. |
| First Page | | Numeric | Defines the first page from which the data extraction starts. |
| Last Page | | Numeric | Defines the last page of the data extraction processing range. |
| Pages for Preview | | Numeric | Number of pages to show in the dynamic output preview. |
| Ignore Left Padding | | True/False | If checked, ignores the left page padding. |
| Visible Sheets Only (Excel) | | True/False | If checked, ignores the invisible Excel sheets. |
| Lines for Preview | | Numeric | Number of lines to show in the dynamic output preview (text input). |
| Skip Lines | None | | Does not skip any input lines. |
| | Skip | String | Regular expression defining the lines to skip. |
| | Range – Skip from | | Regular expression defining the first line in the line range to skip. |
| | Range – Skip to | | Regular expression defining the last line in the line range to skip. |
| Auto Preview | | True/False | If checked, automatically updates the dynamic output preview. |
| Suppress Empty Lines | | True/False | If checked, skips empty input lines. |
| Case Sensitive | | True/False | If checked, enforces case sensitivity in the Script Editor. |
| Append to existing table | | True/False | If checked, appends the output records to the existing table without recreating it on run. |
| Suppress Duplicates | Unique Fields | String | Comma-separated list of key fields that identify record uniqueness. |
| | Check Input Only | True/False | If checked, enforces record uniqueness within the input document only. |
| Show Missing Lines | | True/False | If checked, saves the input lines that were not used in the data extraction process into an Excel file, accessible via an extra toolbar button. Used for configuration debugging purposes. |
| Batch Processing | Batch Folder | | If checked, runs the configured TextConverter project on multiple files found in the specified folder. |
| | Include subfolders | True/False | If checked, scans all subfolders found in the specified folder. |
| | Include Excel sheets | True/False | If checked, processes Excel sheets as separate data sources. |
| | Include Files | String | Wildcard for filtering the files to be included in the processing. |
| | Exclude Files | String | Wildcard for filtering the files to be excluded from the processing. |
| Cleanup Input Files | Leave | | Does nothing. |
| | Move – To Folder | String | Pathname of the folder to which the processed files are moved. |
| | Move – Add time stamp | True/False | If checked, the moved files are renamed by adding time stamps. |
| | Delete | | The processed files are deleted. |
| Run post processing | | True/False | If checked, processes the configured built-in components (DbConverter, Target Reports and Free Form Reports) after the data extraction process is finished. |
| No DS Check | | True/False | If checked, does not automatically check the data sources on run. Used during automated processing to avoid opening the ‘Replace Data Source’ dialog. |
| No file output | | True/False | If checked, does not produce the output file on run. Used in cases when the output is generated by script. |
| Log Path | | String | Pathname of the folder for storing the processing log file. |
| Configuration Files Path | | String | Pathname of the folder for storing the configuration files. |
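
The Skip Lines property accepts either a single regular expression (Skip) or a pair of regular expressions marking the first and last line of a range (Range). The following Python sketch illustrates one plausible reading of these two variants. It is an assumption-laden illustration rather than TextConverter's actual logic: the function names are hypothetical, the range is assumed to be inclusive at both ends, and Python's `re` syntax may differ from the regular expression dialect TextConverter uses.

```python
import re


def skip_matching(lines, pattern):
    """Drop every line that matches the given regular expression ("Skip")."""
    rx = re.compile(pattern)
    return [ln for ln in lines if not rx.search(ln)]


def skip_range(lines, skip_from, skip_to):
    """Drop each block of lines from a 'skip from' match through the next
    'skip to' match, both inclusive ("Range")."""
    start, end = re.compile(skip_from), re.compile(skip_to)
    kept, skipping = [], False
    for ln in lines:
        if not skipping and start.search(ln):
            skipping = True           # this line opens a range to skip
        if skipping:
            if end.search(ln):
                skipping = False      # this line closes the range
            continue                  # lines inside the range are dropped
        kept.append(ln)
    return kept


lines = ["Report header", "--- Page 1 ---", "A 1",
         "BEGIN NOTES", "note", "END NOTES", "B 2"]
print(skip_matching(lines, r"^--- Page \d+ ---$"))
print(skip_range(lines, r"^BEGIN NOTES", r"^END NOTES"))
```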

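The Unique Fields setting of Suppress Duplicates is described as a comma-separated list of key fields. As a rough illustration of what that implies, the Python sketch below builds a key from the listed fields and keeps only the first record for each key. The function name `suppress_duplicates` is hypothetical, and keeping the first occurrence is an assumption; the documentation above does not say which duplicate TextConverter retains.

```python
def suppress_duplicates(records, unique_fields):
    """Keep the first record for each combination of key field values.

    records       -- list of dicts, one per extracted record
    unique_fields -- comma-separated key field names, e.g. "id, date"
    """
    keys = [name.strip() for name in unique_fields.split(",") if name.strip()]
    seen = set()
    unique = []
    for record in records:
        key = tuple(record.get(name) for name in keys)
        if key in seen:
            continue  # a record with the same key values was already kept
        seen.add(key)
        unique.append(record)
    return unique


records = [
    {"id": "1", "name": "A"},
    {"id": "1", "name": "A2"},
    {"id": "2", "name": "B"},
]
print(suppress_duplicates(records, "id"))
print(suppress_duplicates(records, "id, name"))
```

With `"id"` as the only key field the second record counts as a duplicate of the first; with `"id, name"` all three records are distinct.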
 


Context Dependent Options Properties

| Property | Mode | Input (Txt, Csv, Html, Pdf, Doc, Excel) |
|---|---|---|
| Visible Sheets Only | Auto, Manual, Delimited | |
| Record Delimiter | Delimited | |
| Field Delimiter | Delimited | |
| Text Qualifier | Delimited | |
| Get Field Names From Line | Delimited | |
| Freeze the Input Dictionary | Delimited | |
| Replace Tabs With | Auto, Manual | |
| Scanned Image, etc. | Auto, Manual | |
| First Line | Auto, Manual, Delimited | |
| Open As | Auto, Manual, Delimited | |
| Conversion to text | Manual, Delimited | |
| Include Empty Fields | Manual, Delimited | |
| Skip Lines | Auto, Manual, Delimited | |
| Suppress Empty Lines | Auto, Manual, Delimited | |
| Auto Preview | Auto, Manual, Delimited | |
| Append to existing table | Auto, Manual, Delimited | |
| Batch Processing | Auto, Manual, Delimited | |
| Case Sensitive | Auto, Manual, Delimited | |
| Suppress Duplicates | Auto, Manual, Delimited | |
| Show Missing Lines | Auto | |
| Cleanup Input Files | Auto, Manual, Delimited | |
| Run post processing | Auto, Manual, Delimited | |
| No DS Check | Auto, Manual, Delimited | |
| No file output | Auto, Manual, Delimited | |
| Log Path | Auto, Manual, Delimited | |
| Configuration Files Path | Auto, Manual, Delimited | |
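
Batch Processing is available in all three modes, and its Include Files, Exclude Files, and Include subfolders settings together determine which files in the batch folder are processed. The Python sketch below shows one way such wildcard filtering can work. It is illustrative only: the function names are hypothetical, the wildcards are assumed to follow common shell-style rules (`*`, `?`) and to apply to file names rather than full paths, and exclusion is assumed to take precedence over inclusion.

```python
from fnmatch import fnmatch
from pathlib import Path


def is_selected(file_name, include="*", exclude=""):
    """Return True if the name matches the include wildcard and does not
    match the exclude wildcard (an empty exclude pattern excludes nothing)."""
    if not fnmatch(file_name, include):
        return False
    if exclude and fnmatch(file_name, exclude):
        return False
    return True


def select_batch_files(folder, include="*", exclude="", include_subfolders=False):
    """List the files in the batch folder that pass the wildcard filters."""
    root = Path(folder)
    candidates = root.rglob("*") if include_subfolders else root.glob("*")
    return sorted(
        path for path in candidates
        if path.is_file() and is_selected(path.name, include, exclude)
    )


print(is_selected("report.pdf", "*.pdf"))
print(is_selected("draft_report.pdf", "*.pdf", "draft_*"))
```

A call such as `select_batch_files("C:/Inbox", "*.pdf", "draft_*", include_subfolders=True)` would then return the PDF files under the hypothetical `C:/Inbox` folder whose names do not start with `draft_`.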


