R case study: public datasets in Google BigQuery using dbplyr
Building on earlier Rfun workshops, use your knowledge of familiar Tidyverse syntax to query remote databases from RStudio. Attendees will be introduced to the dbplyr package as an alternative to writing SQL queries by hand. Following a review of dplyr and an overview of Google BigQuery public datasets, attendees will practice querying Google BigQuery public data.
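As a rough illustration of what "an alternative to SQL" means here, the sketch below builds a dplyr pipeline against a simulated table and asks dbplyr to show the SQL it would send. The table and its columns (`carrier`, `dep_delay`) are made up for the example, and `simulate_dbi()` stands in for a real connection, so nothing is sent to BigQuery.

```r
library(dplyr)
library(dbplyr)

# Hypothetical toy table. lazy_frame() creates a lazy table that is not
# backed by any real database; simulate_dbi() is a generic stand-in backend.
flights <- lazy_frame(
  carrier   = c("AA", "UA", "AA"),
  dep_delay = c(5, 20, 12),
  con       = simulate_dbi()
)

# Ordinary dplyr verbs; nothing is computed at this point.
query <- flights %>%
  filter(dep_delay > 10) %>%
  group_by(carrier) %>%
  summarise(mean_delay = mean(dep_delay, na.rm = TRUE))

# Print the SQL that dbplyr generates from the pipeline.
show_query(query)

# Keep the generated SQL as a plain string for inspection.
sql_text <- as.character(sql_render(query))
```

The same verbs work against a real remote table; only the connection object changes, and the exact SQL text depends on the backend.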
Prerequisites:
- Introductory familiarity with R and the Tidyverse (e.g. quickStart with R)
- Use Google BigQuery public datasets
- Install R and RStudio on your computer
- tidyverse, DBI, dbplyr, and bigrquery installed in your R environment
install.packages(c("tidyverse", "DBI", "dbplyr", "bigrquery"))
- Databases using R
- library(DBI)
- library(bigrquery)
- library(dbplyr)
- RStudio Conf 2019 (15 min. video)
Get a Google Cloud account
- BigQuery sandbox getting started
- Google Cloud Platform free tier
- BigQuery public datasets
- BigQuery
- GCP-BigQuery Console
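Once a Google Cloud project exists, connecting from R might look something like the sketch below. It is an assumption-laden outline, not the workshop's own script: `connect_public_samples()` is a hypothetical helper name, the billing project ID must be your own, and the `samples` dataset in `bigquery-public-data` is used only as a familiar example. Calling the helper triggers Google authentication and may count against your query quota.

```r
# Hypothetical helper: open a DBI connection to the BigQuery public
# "samples" dataset, charging any query costs to your own project.
connect_public_samples <- function(billing_project) {
  DBI::dbConnect(
    bigrquery::bigquery(),
    project = "bigquery-public-data",  # project that hosts the public data
    dataset = "samples",               # example dataset (assumed for this sketch)
    billing = billing_project          # your own Google Cloud project ID
  )
}

# Example use (not run here; requires a Google account and a project):
# con <- connect_public_samples("my-project-id")   # placeholder ID
# natality <- dplyr::tbl(con, "natality")
# natality |>
#   dplyr::count(year) |>
#   dplyr::collect()
# DBI::dbDisconnect(con)
```

Wrapping the connection in a function keeps the billing project out of the script body, so the same code can be shared without exposing a project ID.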
John Little
https://JohnLittle.info
https://Rfun.library.duke.edu
https://library.duke.edu/data
CC BY-NC
Creative Commons Attribution-NonCommercial
https://creativecommons.org/licenses/by-nc/4.0