Editorial Reviews:
Synopsis
Have you faced the task of extracting data from unstructured sources (such as PDF, Word, HTML, Excel documents and/or various text files)? You probably had to choose between expensive and cumbersome manual programming and even more expensive enterprise BI tools.
Well, SiMX TextConverter 3 is the most efficient answer to this problem. It will let you set up your automated data extraction, transformation and database loading processing for a fraction of cost. The solution will be durable, easily to maintain, portable and scalable and, will be fully supported by SiMX's service bureau.
The goal is achieved by employing an artificial intelligence (AI) engine that is capable of recognizing reoccurring patterns of data layout, formatting, and content thus automatically identifying records and fields to be extracted.
In addition to these automatic extraction templates, TextConverter 3 also provides a convenient environment for manual parsing using VBScript. It offers rich libraries of high level objects and the ability to easily access an unlimited number of external data sources.
Product Features:
Input formats: PDF, DOC, RTF, XLS, HTML, CSV, and Txt files
Automatically detects patterns that represent record and field structure
Accesses multiple data sources
VB Script support with proprietary objects and methods for data transformation
Preview the output as you work
Output: any OLE DB or ODBC compliant database - ORACLE, SQL Server, MySQL, DB2, Access, or generate files for Excel, FoxPro, and more
Access TextConverter's COM and .NET automation objects through the API
Automate and schedule tasks through windows scheduler
Detailed examples are provided for Java, C#, C++, VBScript, and VB.NET