dbt project to make it easier to work with BigQuery INFORMATION_SCHEMA views. Treats these views as data sources, and builds them into a Distributed Acyclic Graph (a.k.a. DAG) via dbt, that can be easily browsed using the dbt docs
command, to see which views logically connect, and to give some examples of how to use them in your BigQuery project
Planning on making this more user friendly at some point, but for now it works
- In dbt_bigquery_info_schema/profiles.yml, Edit
project:
&dataset:
- In dbt_bigquery_info_schema/dbt_project.yml, Edit
vars
to fit your use case - Run the setup script:
source $(git rev-parse --show-toplevel)/dbt_project_setup.sh
...here's what it does:- set working directory to top level of repo
- set up virtual environment and activate it <-- note, the venv persists after that script runs b/c we're using 'source'
- change working directory to dbt project
- Add venv* to .gitignore file
- test via
dbt debug
- After running that script, you should have a working dbt project (w/ successful debug test), and the script cd'ed you into it
- Go do dbt stuff!
For master list of GBQ stuff and PM approach, see Issue #1 - and otherwise, see Issues
For now, please fork this repo, make your changes there, and submit a PR. If you want less hassle and plan on working on this a bunch, contact @Brent Brewington
in dbt Slack
Try to follow GitHub Flow - basically, small set of changes on fork/branch that shouldn't live too long before getting merged (Merge conflicts, while manageable, add chaos)
I'm going to use ChatGPT to create clickbait commit messages. You can too: Tweet by @shaundai