Google Drive
The Google Drive Crawler indexes the data on a single user’s Google Drive. It requires pre-configuration in your own Google Developers Console (see
https://developers.google.com/console/help/new/).
- Log in to your Google Developers Console
- Create a new Project
- Enable the Drive API
- From the Home screen of your Project, click Enable and manage APIs
- Click on Drive API, then Enable API
- Add API credentials
- From the listed items on the left side of the console, expand APIs & auth
- Click Credentials
- Click Add Credentials and select OAuth 2.0 client ID
- Click Configure consent screen
- On the consent screen configure a suitable Product name (for example, Company Google Drive Crawler) and set any other options to your liking
- Click Save when done
- Back in the Add Credentials screen, choose Application type Other
- Change the suggested name of the Credential entry if desired, then click Create
- A pop up window shows you a client ID and a client secret. You will copy these values into the Perceptive index wizard later. Note that you can review the client id and client secret from the console at any time. Keep the client secret secure
- Click Save on the credential page when done. This completes the configuration of the Google Developers Console.
- Copy the client ID and client secret from the developers console into the Perceptive Enterprise Search index wizard for the Google Drive Crawler.
- The wizard will pop up a web browser which shows a secure consent page from Google requesting Offline Access to a user account. Enter the name and password of the user account of the Google Drive you wish to crawl.
- Click on Allow and then copy the code from the browser into the index wizard. You may then close the browser window that contains the code.
- Note: If you later decide to revoke access from the crawler to the user account, log in to the user’s Google account at https://myaccount.google.com, click Connected Apps & sites, then Manage Apps. You will see the Product name you chose earlier. From there you can disconnect the crawler app. (Also note that Google Drive is a Google product, unrelated to the Crawler, that you probably intend to continue to have access to your account).
- Complete the index wizard.
Crawler Options:
client_id | The client ID obtained from your Google Developers Console. |
client_secret | The client secret obtained from your Google Developers Console. |
refresh_token | The refresh_token is provided by Google’s authentication service after providing a valid client_id, client_secret, and authorization code. |
Crawler Metadata:
- Original Filename
- Owners
- URI