Google Drive in Pentaho Data Integration

One of the new features in Pentaho Data Integration 8.1 is the ability to directly connect to Google Drive. PDI uses the Virtual File System (VFS) which allows you to connect to a variety of file systems in a transparent way.

To do this, you first have to enable the Google Drive api in and create a service account (with OAuth). Download and place the JSON key under <data-integration>/plugins/pentaho-googledrive-vfs/credentials (the name of the key has to be client_secret.json).

After restarting Spoon go to File -> Open URL, the Google Drive option should be visible.
When selecting google drive, Google will ask you to authenticate 


Screenshot from 2018-05-31 15-46-38


After authenticating, an extra file will be saved under the 'credentials' folder, allowing you to connect without authenticating. You will be able to browse your Google Drive locations


Screenshot from 2018-05-31 15-48-35

Now you can directly open and save files under Google Drive, or use the googledrive:// prefix in input and output steps to directly access and store files.

  Talk to an expert!


You may also like

These blogs about pentaho data integration

A look at what's new in Pentaho 8.0

3 reasons to move your ETL to the web, cloud ETL development heavily relies on the desktop with files, database and network connections that require the…

PCM17 - Technical Room Read our overview of the Keynotes  Read our overview of the talks in the Business room Data Pipelines - Running PDI on AWS Lambda - Dan…