GitHub - Rae-Zou/dbt-core: A repo for my dbt-core projects along with duckdb.

Data pipeline: the integration of dbt with DuckDB

dbt (data build tool) is a SQL-based framework for transforming analytical data. It focuses on the transformation (T) step of ETL/ELT (extract, transform, load) pipelines.

What is DuckDB?

DuckDB is an embeddable relational DBMS that focuses on analytical query workloads (OLAP). Like SQLite, it prioritizes simplicity and ease of integration by having no external dependencies at compile time or run time.

Why DuckDB?

DuckDB is designed to be embedded within applications or used as a serverless database: it runs inside the host process, so you can integrate it directly into your data pipeline without installing or configuring a separate server.

Dependencies

dbt core
duckdb
DBeaver (optional)

Set up the project

Create an isolated virtual environment for dbt-core
```
conda create --name dbtenv python=3.11
```
Activate the environment
```
conda activate dbtenv
```
Install the DuckDB adapter (this also pulls in dbt-core as a dependency)
```
pip install dbt-duckdb
```
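Before running the project, it can help to confirm that the install landed in the active environment. A minimal sketch, assuming the `dbtenv` environment from the steps above is activated; the `dbt_status` variable is only an illustrative helper, not part of dbt:

```shell
# Check whether the dbt CLI is on PATH in the current shell.
if command -v dbt >/dev/null 2>&1; then
  # Prints the installed dbt-core version and the installed adapter plugins;
  # "duckdb" should appear in the plugin list if dbt-duckdb was installed.
  dbt --version
  dbt_status="found"
else
  echo "dbt not found; check that the dbtenv environment is activated" >&2
  dbt_status="missing"
fi
```

If `dbt` is reported as missing, re-run `conda activate dbtenv` and repeat the install step.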

Run the project

dbt seed
dbt run
dbt test
dbt docs generate
dbt docs serve

Data source reference: https://www.kaggle.com/c/acquire-valued-shoppers-challenge/data?select=offers.csv.gz

(Optional) Verify the data using DBeaver IDE

Connect DuckDB to DBeaver

The Path should be the same as the one you defined in profiles.yml; alternatively, choose Open to browse to the database file.
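The path that DBeaver needs comes from the `path` key of the DuckDB output in profiles.yml. The sketch below writes a minimal example profile into a scratch directory so that nothing existing is overwritten. The profile name `dbt_duckdb_demo` (borrowed from the folder name) and the database file `dev.duckdb` are assumptions for illustration; the actual values in this project may differ.

```shell
# Create a scratch directory so an existing profiles.yml is never clobbered.
profile_dir="$(mktemp -d)"

# Write a minimal dbt-duckdb profile.
# "dbt_duckdb_demo" and "dev.duckdb" are placeholder names (assumptions).
cat > "$profile_dir/profiles.yml" <<'EOF'
dbt_duckdb_demo:
  target: dev
  outputs:
    dev:
      type: duckdb       # selects the dbt-duckdb adapter
      path: dev.duckdb   # database file; use this same file in DBeaver
      threads: 1
EOF

# Show where the example profile was written.
echo "Example profile written to $profile_dir/profiles.yml"
```

Whatever value `path` has in your own profiles.yml is the file to select in the DBeaver connection dialog. To check a profile like this one with dbt, `dbt debug --profiles-dir "$profile_dir"` can be run from the project directory, provided the `profile` name in dbt_project.yml matches.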

Resources:

Learn more about dbt in the docs
Learn more about DuckDB in the docs
Check out the blog for the latest news on dbt's development and best practices
