When configuring the system to crawl Perceptive Content, you must provide the full URL to a Perceptive Content Integration Server, along with the credentials of a user who has access to all the content you wish to search.
If the Perceptive Content implementation has multiple Drawers, one index per Drawer will be created, using the Drawer name as the index name. You may limit the Drawers indexed using Crawler Rules.
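The Drawer-to-index mapping described above can be sketched in a few lines of Python. This is only an illustration of the documented behavior, not the crawler's actual code: the function name `plan_indexes` and the sample Drawer names are hypothetical, and the optional selection stands in for whatever Drawer restriction is configured.

```python
def plan_indexes(all_drawers, selected_drawers=()):
    """Return a {drawer name: index name} mapping for the drawers to crawl.

    Hypothetical helper: models "one index per Drawer, named after the Drawer".
    """
    selected = set(selected_drawers)
    # An empty selection means no restriction, so every drawer is kept.
    drawers = [d for d in all_drawers if not selected or d in selected]
    # Each index takes the name of the drawer it is built from.
    return {drawer: drawer for drawer in drawers}


# Sample drawer names are made up for illustration.
print(plan_indexes(["Invoices", "HR", "Contracts"], ["HR", "Contracts"]))
```

With no selection, the same call would return one entry for each of the three Drawers.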
The crawler uses the ‘SysDocumentsAll’ view when scanning for documents. This can be changed by setting the ‘View’ property in Crawler Options.
Each page of each document will be indexed as a single item, and will contain the metadata of both the page and the document. In the case of documents without pages, a metadata-only record will be indexed.
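As a rough sketch of this page-level indexing, the Python below flattens one document into index items. The dictionary shape (`metadata`, `pages`, `text`) and the `document.` / `page.` key prefixes are assumptions made for the example; they are not the crawler's real data model or field names.

```python
def build_index_items(document):
    """Flatten one document into index items, one per page.

    Hypothetical helper: the input shape and key prefixes are illustrative.
    """
    # Prefix document fields so they cannot collide with page fields.
    doc_meta = {f"document.{k}": v for k, v in document.get("metadata", {}).items()}
    pages = document.get("pages", [])
    if not pages:
        # A document without pages yields a single metadata-only record.
        return [{"content": None, "metadata": doc_meta}]
    items = []
    for page in pages:
        # Every page item carries both the document and the page metadata.
        metadata = dict(doc_meta)
        metadata.update({f"page.{k}": v for k, v in page.get("metadata", {}).items()})
        items.append({"content": page.get("text"), "metadata": metadata})
    return items


# Sample data is made up for illustration.
sample = {
    "metadata": {"name": "Invoice 1001"},
    "pages": [
        {"text": "Page one text", "metadata": {"number": 1}},
        {"text": "Page two text", "metadata": {"number": 2}},
    ],
}
for item in build_index_items(sample):
    print(item)
```

A two-page document therefore produces two items, each repeating the document-level fields, while a document with no pages produces exactly one item with no content.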
Perceptive Content Integration Server | The full URL to the Integration Server, including protocol |
User Name | The username used to log in to the Integration Server |
Password | The password of the above user |
View | The view to use when enumerating content. Defaults to SysDocumentsAll |
Drawers | The drawers to scan. If empty, all drawers are scanned |
Proxy Server | The proxy server to use to connect to the Integration Server (if needed) |
Bypass List | A list of addresses that bypass the proxy server (i.e. they are connected to directly even when a proxy server is used) |
Meta Folder | Include folder information as metadata |
Meta Workflow | Include workflow information as metadata |
Include Digital Signatures as metadata | Include digital signatures as metadata |
The crawler will capture the following metadata: