Skip to content

Web Page

Version badge

Knowledge AI supports using web pages to create Knowledge Sources.

Restrictions

  • The web page must be hosted on a publicly accessible website.
  • When you create a Knowledge Source from a web page, Knowledge AI processes all visible text on the web page, including potentially unwanted text, such as cookie notices.
  • Knowledge AI doesn't process web page content with anti-crawling measures.
  • Knowledge AI doesn't support images or Optical Character Recognition (OCR) capabilities.

Chunking Process

When processing a web page, Knowledge AI:

  1. Visits the URL as a page in a browser session.
  2. Scrolls to the bottom of the web page.
  3. Accesses lazy-loaded1 content by checking for any text changes until the web page is stable and no longer loads additional text.
  4. Generates Knowledge Source content based on the visible text result.

Knowledge AI imports the web page content into a Knowledge Source once. If the web page you used as a source is updated and you want to include the new content in the web page, create a new Knowledge Source based on the updated web page.

More Information


  1. Lazy loading is a web development technique that delays loading non-critical or non-visible content until it is necessary. This technique improves web page loading times and user experience.