googledrive initial release

  tidyverse, googledrive

  Jenny Bryan

We are tickled pink to announce the initial CRAN release of the googledrive package. This is a collaboration between Jenny Bryan and tidyverse intern, Lucy D’Agostino McGowan (blog post on that coming soon!).

googledrive wraps the Drive REST API v3. The most common file operations are implemented in high-level functions designed for ease of use. You can find, list, create, trash, delete, rename, move, copy, browse, download, share and publish Drive files, including those on Team Drives.

Install googledrive with:

install.packages("googledrive")

The auth flow should “just work” for most people, especially for early and interactive use. For more advanced usage, functions are available that give the user much more control, i.e. to support non-interactive and remote usage. A low-level interface, used internally, is also exposed for those who wish to program around the Drive API.

First use

You can see and filter your files, similar to the experience at https://drive.google.com/, with drive_find(). Expect to be taken to the browser here, in order to authorize googledrive to act on your behalf. We limit drive_find() to 10 files and just a few file types for brevity.

library(googledrive)
(x <- drive_find(n_max = 10, type = c("spreadsheet", "jpg", "pdf")))
#> Auto-refreshing stale OAuth token.
#> # A tibble: 10 x 3
#>                             name
#>  *                         <chr>
#>  1                   chicken.jpg
#>  2            README-mirrors.csv
#>  3            README-mirrors.csv
#>  4                      logo.jpg
#>  5                     sheet-xyz
#>  6                      logo.jpg
#>  7                         test2
#>  8    foo_pdf-TEST-drive-publish
#>  9  foo_sheet-TEST-drive-publish
#> 10 export-mime-type-defaults.csv
#> # ... with 2 more variables: id <chr>, drive_resource <list>

googledrive holds Drive file info in a dribble, a “Drive tibble”. This allows us to provide human-readable file names, packaged with API-friendly file ids and other useful metadata in the drive_resource list-column. Each row represents a Drive file.

drive_reveal() adds extra information to the dribble by excavating interesting metadata out of drive_resource or by making additional API calls.

(x <- x %>%
   drive_reveal("permissions") %>% 
   drive_reveal("mime_type"))
#> # A tibble: 10 x 6
#>                             name                               mime_type
#>                            <chr>                                   <chr>
#>  1                   chicken.jpg                              image/jpeg
#>  2            README-mirrors.csv application/vnd.google-apps.spreadsheet
#>  3            README-mirrors.csv application/vnd.google-apps.spreadsheet
#>  4                      logo.jpg                              image/jpeg
#>  5                     sheet-xyz application/vnd.google-apps.spreadsheet
#>  6                      logo.jpg                              image/jpeg
#>  7                         test2 application/vnd.google-apps.spreadsheet
#>  8    foo_pdf-TEST-drive-publish                         application/pdf
#>  9  foo_sheet-TEST-drive-publish application/vnd.google-apps.spreadsheet
#> 10 export-mime-type-defaults.csv application/vnd.google-apps.spreadsheet
#> # ... with 4 more variables: shared <lgl>, id <chr>,
#> #   drive_resource <list>, permissions_resource <list>

Since files are stored in a specialized tibble, you can use your usual tidyverse workflows to manipulate this data frame.

x %>% 
  filter(mime_type == "image/jpeg") %>% 
  select(name, id, drive_resource)
#> # A tibble: 3 x 3
#>          name                           id drive_resource
#> *       <chr>                        <chr>         <list>
#> 1 chicken.jpg 0B0Gh-SuuA2nTbEhtYnIzcFNfX3M    <list [39]>
#> 2    logo.jpg 0B0Gh-SuuA2nTLVE0VGZQVU40Z0U    <list [38]>
#> 3    logo.jpg 0B0Gh-SuuA2nTZjlpQmlVTm9BcVU    <list [38]>

Test drive and cleanup

We’ll quickly show off a few basic file operations and then clean up. This example also demonstrates the file conversion features available when importing into and exporting out of native Google file types.

First, we make a new folder. Then we make a new Drive file (a Google Sheet) by uploading a local file (csv). A few example files ship with googledrive and drive_example() is just a convenience function to get the local path on any system.

folder <- drive_mkdir("blog-folder")
#> Folder created:
#>   * blog-folder

chicken_sheet <- drive_example("chicken.csv") %>% 
  drive_upload(path = "blog-folder/blog-chicken-sheet", type = "spreadsheet")
#> Local file:
#>   * /Users/jenny/resources/R/library/googledrive/extdata/chicken.csv
#> uploaded into Drive file:
#>   * blog-chicken-sheet: 1qGE2yLpe4qN-wZCMCl1t5ml-0acGW-xEPZ2l6PMHd7o
#> with MIME type:
#>   * application/vnd.google-apps.spreadsheet

Let’s list the contents of the folder and reveal file path to confirm the new file exists and has been placed in the target folder.

drive_ls(folder) %>%
  drive_reveal("path")
#> # A tibble: 1 x 4
#>                 name                             path
#> *              <chr>                            <chr>
#> 1 blog-chicken-sheet ~/blog-folder/blog-chicken-sheet
#> # ... with 2 more variables: id <chr>, drive_resource <list>

Here we rename the file and download it to a local file. Since we are downloading from a Google Sheet, we must convert to a more conventional file type, such as csv or, the default, an Excel workbook.

chicken_sheet %>% 
  drive_rename("blog-famous-chickens") %>% 
  drive_download()
#> File renamed:
#>   * blog-chicken-sheet -> blog-famous-chickens
#> File downloaded:
#>   * blog-famous-chickens
#> Saved locally as:
#>   * blog-famous-chickens.xlsx

Finally, we clean up by deleting the local spreadsheet, as well as the folder we created on Drive.

file.remove("blog-famous-chickens.xlsx")
#> [1] TRUE
drive_rm(folder)
#> Files deleted:
#>   * blog-folder: 0B0Gh-SuuA2nTMTl0cjIwZThQRzg

We look forward to hearing how this package is useful to you. Feel free to chime in on GitHub issues to discuss how googledrive can best support your workflows.

Learn more about googledrive at: