SiMX Help‎ > ‎1. TextConverter‎ > ‎2. User Manual‎ > ‎A1. User Interface‎ > ‎

Options Properties



Options properties provide generic control over the TextConverter behavior. TextConverter can work in one of three modes: “Auto”, “Manual” and “Delimited”. The first item in the Options properties list “Mode” allows selecting the operational mode for the current project.
  1. Auto Mode (default) provides visual means that enable users with no programming skills to solve a wide range of data extraction tasks. It also allows experienced users to take advantage of the extensive capabilities of the Target development platform.

  2. Manual Mode provides ultimate flexibility for professionals with programming background to solve data extraction and processing tasks of virtually any complexity. In this mode, projects are configured using VBScript taking advantage of the vast VBScript resources combined with powerful built-in TextConverter libraries. The platform libraries cover a wide range of functionality including text parsing, data transformation and processing as well as working with the file system, email objects, and internet resources. Scripting also can be used in combination with all other operational modes.

  3. Delimited Mode supports working with structured input documents such as MS Excel and delimited text files. It mostly used for data mapping, transformation and loading into relational databases or flat file data storages. While TextConverter provides convenient interface for importing structured data files, SiMX also offers professional users DbConverter, a fully featured component for more advanced mapping of structured data sources.

    The Options properties differ for each operational mode and input format. Consequently, there are multiple combinations of options properties, which all are presented in the two tables below: Common Options Properties and Context Dependent Options Properties.

Common Options Properties

Property

Sub-Property

Value

Comment

Mode

Auto

 

Use this mode for configuring data extraction projects visually - with minimal or no scripting.

Manual

 

Use this mode for implementing pure script-based parsing projects.

Delimited

 

Use this mode for extracting data from delimited data sources, such as Excel and any delimited text files.

Record Delimiter

Character string used as record separator.

Field Delimiter

Character string used as field separator.

Text Qualifier

Character string used as text qualifier.

Get Field Names 

 from Line

Defines the line number, from which the field names will be taken from.

 Freeze the input 

 dictionary

If checked, prevents the input dictionary from changing on loading a new data source.

Scanned Image

 

 True/False

If checked, ignores the fonts.

PDF Mode

 As is

In these modes, TextConverter presents the input document’s text at the ‘points’ resolution, reproducing the original document’s layout as accurately as possible, taking in consideration the original font properties (type, size, color). Suitable for most of the tasks, but especially useful for PDF documents with significantly different fonts.

 Zoom 1

 Zoom 2

 Zoom 3

 Zoom spread

 Uniform

In this mode, TextConverter presents the input text as ASCII code characters, ignoring the original font properties and using fixed size font instead. In some cases, it might deliver better results and could be more convenient for implementing script customizations.

First Page

 Numeric

Defines the first page, from which the data extraction should start.

Last Page

 Numeric

Defines the last page of the data extraction processing range.

Pages for Preview

 Numeric

Number of pages to show in the dynamic output preview.

Ignore Left Padding

 True/False

If checked, ignores the left page padding.

Visible Sheets Only (Excel)

 

 True/False

If checked, ignores the invisible Excel sheets.

Lines for Preview

 

Numeric

Number of lines to show in the dynamic output preview (text input).

Skip Lines

None

Does not skip any input lines.

 

Skip

String

Regular expression defining the lines to skip.

 

Range

Skip from

Regular expression defining the first line in the line range to skip.

Skip to

Regular expression defining the last line in the line range to skip.

Auto Preview

True/False

If checked, automatically updates the dynamic output preview.

Suppress Empty Lines

True/False

If checked, skips the empty input lines.

Case Sensitive

True/False

If checked, enforces the case sensitivity in the Script Editor.

Append to existing table

True/False

If checked, appends the output records to the existing table without recreating it on run.

Suppress Duplicates               

Unique Fields

String

Coma separated list of key fields identifying the record uniqueness.

 

Check Input Only

True/False

If checked, enforces the input document’s record uniqueness only.

Show Missing Lines

True/False

If checked, shows missing lines.

Batch Processing

Batch Folder

 

If checked,

 

Include subfolders

True/False

If checked, scans subfolders.

Include Excel sheets

True/False

If checked, processes Excel sheets as separate data sources.

Include Files

String

The wildcard for filtering file to be included into the processing.

Exclude Files

String

The wildcard for filtering file to be excluded from the processing.

Cleanup Input Files

Leave

 

Does nothing.

Move – To Folder

String

Pathname of the folder that the processed files will be moved to.

Move – Add time stamp

True/False

If checked, the moved files will be renamed by adding time stamps.

Delete

 

The processed files will be deleted.

Run post processing

True/False

If checked, processes the configured built-in components (DbConverter, Target Reports and Free Form Reports) after the data extraction process is finished.

No DS Check

True/False

If checked, does not check the input data sources file.

No file output

True/False

If checked, does not produces the output file on run.

Log Path

String

Pathname of the folder for storing the processing log file.

Configuration Files Path

String

Pathname of the folder for storing the configuration file.


Context Dependent Options Properties

#

Property

Mode

Input

Txt

Csv

Html

Pdf

Doc

Excel

1.      

Visible Sheets Only

Auto

 

 

 

 

 

Manual

 

 

 

 

 

Delimited

 

 

 

 

 

2.      

Record Delimiter

Delimited

3.      

Field Delimiter

Delimited

 

4.      

Text Qualifier

Delimited

5.      

Get Field Names From Line:

Delimited

6.      

Freeze the Input Dictionary

Delimited

7.      

Replace Tabs With

Auto

 

 

 

 

 

Manual

 

 

 

 

 

8.      

Scanned Image, etc.

Auto

 

 

 

 

Manual

 

 

 

 

9.      

First Line

Auto

Manual

Delimited

10.   

Open As

Auto

 

 

 

 

Manual

 

 

 

 

Delimited

 

 

 

 

 

11.   

Conversion to text

Manual

 

 

 

Delimited

 

 

 

12.   

Include Empty Fields

Manual

 

 

 

Delimited

 

 

 

13.   

Skip Lines

Auto

Manual

Delimited

14.   

Suppress Empty Lines

Auto

Manual

Delimited

15.   

Auto Preview

 

Auto

Manual

Delimited

16.   

Append to existing table

Auto

Manual

Delimited

17.   

Batch Processing

Auto

Manual

Delimited

18.   

Case Sensitive                               

Auto

Manual

Delimited

19.   

Suppress Duplicates      

Auto

Manual

Delimited

20.   

Show Missing Lines

Auto

21.   

Cleanup Input Files

Auto

Manual

Delimited

22.   

Run post processing

Auto

Manual

Delimited

23.   

No DS Check

Auto

Manual

Delimited

24.   

No file output

Auto

Manual

Delimited

25.   

Log Path

Auto

Manual

Delimited

26.   

Configuration Files Path

Auto

Manual

Delimited


Back to Topics