Help > Using the results > Advanced options > Deduplicating the result list

Deduplicating the result list

In the world of corporate networks, there will be scenarios where the same document will be inadvertently stored in multiple locations. While these duplicates may not be easily detectable due to the sheer volume of data across a network, Perceptive Search has the ability to de-duplicate them to ensure that only a single copy is returned in the search results. Perceptive Search achieves this by storing the text of each document in a checksum which is used at query time to determine which documents have been duplicated. This prevents the need for users to waste their time filtering through the clutter of duplicate documents and more time working with relevant results.

Enabling Document De-duplication

Document de-duplication is enabled by default for all indexes that were originally created in version 9, however, if you are upgrading your indexes from a previous version (8 or below) you will need to enable it via the index options.

  1. In Perceptive Enterprise Search - Local Administration Console, ensure that the index that you wish to de-duplicate is the active index and go to Index > Index Options.
  2. Select the 'Indexing' heading.
  3. Tick the checkbox to 'De-duplicate documents in index'. Click 'OK'.
  4. You will need to perform a Reindex for the changes to take effect, so go to Index > Reindex. Click 'OK' at the prompt to start indexing.