Help > Collections and Crawlers > Perceptive Content

Crawler: Perceptive Content

When configuring the system to crawl Perceptive Content, you must provide the full URL to a Perceptive Content Integration Server, along with the credentials of a user who has access to all the content you which to search.

If the Perceptive Content implementation has multiple Drawers, one index per Drawer will be created, using the Drawer name as the index name. You may limit the Drawers indexed using Crawler Rules.

The crawler will use the ‘SysDocumentsAll’ view when scanning for documents, this can be changed by setting the ‘View’ property in Crawler Options.

Each page of each document will be indexed as a single item, and will contain the metadata of both the page and the document. In the case of documents without pages, a metadata-only record will be indexed.

Requirements:

Crawler Options:

Perceptive Content Integration Server

The full URL to the Integration Server, including protocol

User Name

The username to log into the Integration Server

Password

The password of the above user

View

The view to use when enumerating content, defaults to SysDocumentsAll

Drawers

The drawers to scan. If empty, all drawers are scanned

Proxy Server

The proxy server to use to connect to the Integration Server (if needed)

Bypass List

A list of bypass exceptions for the proxy server (i.e. direct connections when a proxy server is used)

Meta Folder

Include folder information as metadata

Meta Workflow

Include workflow information as metadata

Include Digital Signatures as metadata

Include digital signatures as metadata

Crawler Metadata:

The crawler will contain the following metadata: