FilterPack

FilterPack is an add-on module that lets users add more than 500 file types to their server TextBases. Based on Oracle's Outside In Technology, it gives access to the content of unstructured file formats ranging from the latest office suites, such as Microsoft Office 2007, to specialty formats and legacy files. FilterPack extracts text and metadata from native files (including Microsoft Office and PDF documents) and automatically converts the text and properties from multiple possible encodings into a single encoding. It is optimized for performance: it supplies data to MultiTrans in memory while the input file is still being processed, which speeds up converting and importing data by up to ten times.
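
To illustrate what converting text "from multiple possible encodings into a single encoding" can look like, here is a minimal Python sketch. It is not FilterPack's or Outside In's code. The function name `to_utf8`, the choice of UTF-8 as the target, and the short list of candidate encodings are all assumptions made for illustration; a production filter would use far more robust detection.

```python
import codecs

# Hypothetical fallback order, chosen for illustration only.
FALLBACK_ENCODINGS = ("utf-8", "cp1252")


def to_utf8(raw: bytes) -> bytes:
    """Decode bytes of unknown encoding and re-encode them as UTF-8."""
    # A byte order mark, when present, is the most reliable hint.
    if raw.startswith(codecs.BOM_UTF8):
        return raw[len(codecs.BOM_UTF8):]
    if raw.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        # The "utf-16" codec reads the BOM to pick the byte order.
        return raw.decode("utf-16").encode("utf-8")

    # Otherwise try each candidate until one decodes cleanly.
    for encoding in FALLBACK_ENCODINGS:
        try:
            return raw.decode(encoding).encode("utf-8")
        except UnicodeDecodeError:
            continue

    # latin-1 maps every byte to a code point, so this never fails.
    return raw.decode("latin-1").encode("utf-8")
```

Whatever the input encoding, the caller always receives UTF-8 bytes, so downstream indexing only has to handle one encoding.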

  • Clean Content—Extracts, analyzes, and scrubs text, metadata, and hidden information from Word, Excel, PowerPoint and PDF; bursts and reassembles PowerPoint presentations
  • Content Access—Extracts text and metadata from more than 500 file types
  • File ID—Quickly and accurately identifies file types, without using unreliable file extensions or MIME types
  • Extracts text and metadata information from diverse file types via a common interface
  • Converts the text and properties from multiple possible encodings into a single encoding
  • Optimized for performance in high-throughput server environments
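
The File ID point above, identifying a file by its content rather than its extension or MIME type, can be sketched in a few lines of Python. This is a simplified illustration, not the method FilterPack or Outside In actually uses. The names `SIGNATURES`, `identify` and `identify_file` are hypothetical, and only four well-known file signatures are covered.

```python
# Well-known leading byte signatures ("magic numbers") for a few formats.
SIGNATURES = [
    (b"%PDF-", "pdf"),
    (b"PK\x03\x04", "zip"),  # ZIP container, also used by .docx/.xlsx/.pptx
    (b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1", "ole2"),  # legacy .doc/.xls/.ppt
    (b"{\\rtf", "rtf"),
]


def identify(header: bytes) -> str:
    """Return a format label based on the first bytes of a file."""
    for magic, label in SIGNATURES:
        if header.startswith(magic):
            return label
    return "unknown"


def identify_file(path: str) -> str:
    """Read only the first few bytes of a file and identify its format."""
    with open(path, "rb") as handle:
        return identify(handle.read(8))
```

Because the check looks only at file content, a PDF renamed to `report.txt` is still reported as `pdf`. Telling a .docx from a .xlsx would require inspecting the entries inside the ZIP container, which this sketch does not attempt.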

Licensing:

  • Server license

For more information, please contact us.