Text Extraction with Content Parsers

Knowledge AI uses content parsers to extract text from uploaded files in different formats. The content parsers are categorized by their complexity and the techniques they use.

You can use the following content parsers:

  • Basic Parser
  • Advanced Parser (Default)
  • Azure AI Document Intelligence

Content Parsers Overview

The table provides a comparative overview of the content parsers.

| Details | Basic Parser | Advanced Parser (Default) | Azure AI Document Intelligence |
| --- | --- | --- | --- |
| Functionality | Fundamental text extraction. | Advanced text extraction. | Advanced, cloud-based text extraction. |
| Use Cases | Extracting text from simple documents. | Extracting text from complex documents that include tables, images, footers, and more. | Extracting text from complex documents that include tables, images, footers, and more. |
| Advantages | Fast processing; low resource consumption. | Better accuracy; handles various formats; includes OCR. | Better accuracy; handles various formats; includes OCR; customizable settings on the Azure AI Document Intelligence side. |
| Limitations | Limited handling of complex documents; less accuracy; limited file format support; lack of customization capability. | Lack of customization capability. | Dependency on a third-party service; payment for using the service. |
| Price | Free of charge for all environments. Included in the Knowledge AI license. | Free of charge for shared and dedicated SaaS installations. Included in the Knowledge AI license. For on-premises installations, payment proceeds through Azure AI Document Intelligence in accordance with their pricing policy. | For all installations, payment proceeds through Azure AI Document Intelligence in accordance with their pricing policy. |

Automatic Fallback to Basic Parsing

If the Advanced Parser or Azure AI Document Intelligence fails, Knowledge AI automatically falls back to the Basic Parser.
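
The fallback behavior can be pictured with a minimal Python sketch. This is an illustration of the general pattern only, not Knowledge AI's actual implementation: the function names `extract_with_fallback`, `basic_parser`, and `failing_advanced_parser` are hypothetical, and the sketch assumes that any exception raised by the preferred parser counts as a failure.

```python
from typing import Callable, Tuple

# Hypothetical parser type: takes raw file bytes and returns extracted text.
Parser = Callable[[bytes], str]


def extract_with_fallback(data: bytes, preferred: Parser, basic: Parser) -> Tuple[str, str]:
    """Try the preferred parser; if it raises, use the basic parser instead.

    Returns a pair (parser_used, extracted_text).
    """
    try:
        return "preferred", preferred(data)
    except Exception:
        # Assumption: every error from the preferred parser triggers the fallback.
        return "basic", basic(data)


def basic_parser(data: bytes) -> str:
    # Stand-in for fundamental text extraction: plain UTF-8 decoding.
    return data.decode("utf-8", errors="replace")


def failing_advanced_parser(data: bytes) -> str:
    # Stand-in for an advanced or cloud-based parser that fails.
    raise RuntimeError("advanced parsing failed")
```

Calling `extract_with_fallback(b"hello", failing_advanced_parser, basic_parser)` returns the text produced by the basic stand-in together with the label `"basic"`, so the caller can tell which parser produced the result.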