Versatile Data Kit 0.6 #950
antoniivanov
announced in
Announcements
Summary
Major features include:
Configuration auto-wiring improvement: detect non vdk_ prefixed environment variables
Previously, a configuration option had to be prefixed with "vdk_" when set as an environment variable in order to be recognized.
This was very error-prone, since the options are documented without the prefix.
Now they can be set without a prefix as well.
The prefixed and non-prefixed forms of a variable are equivalent.
If both are set, the prefixed variable has higher priority.
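The precedence rule above can be sketched in a few lines of Python. This is only an illustration of the described behavior, not VDK's actual implementation: the function name `resolve_option`, the option name `my_option`, and the upper-casing of variable names are all assumptions made for the example.

```python
from typing import Mapping, Optional

# The prefix comes from the release notes; upper-casing it is an assumption.
PREFIX = "VDK_"


def resolve_option(name: str, env: Mapping[str, str]) -> Optional[str]:
    """Look up a config option in an environment mapping (hypothetical helper).

    The prefixed variable wins when both forms are set; otherwise the
    non-prefixed variable is used. Returns None when neither is set.
    """
    key = name.upper()
    prefixed_value = env.get(PREFIX + key)
    if prefixed_value is not None:
        return prefixed_value
    return env.get(key)
```

Passing `os.environ` as `env` would apply the same lookup to the real process environment.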
New plugin/library: vdk-lineage-model
The VDK Lineage Model plugin abstracts how lineage data is emitted from VDK data jobs, so that different lineage loggers can be configured at run time in any plugin that supports emitting lineage data.
Check out more at the plugin page.
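To make the idea of swappable lineage loggers concrete, here is a minimal sketch. All names in it (`LineageRecord`, `LineageLogger`, `InMemoryLineageLogger`, `run_query_with_lineage`) are hypothetical and chosen for illustration; the real interfaces are documented on the plugin page.

```python
from dataclasses import dataclass
from typing import List, Protocol, Sequence, Tuple


@dataclass(frozen=True)
class LineageRecord:
    """Hypothetical lineage payload: a query and the tables it touches."""
    query: str
    input_tables: Tuple[str, ...]
    output_table: str


class LineageLogger(Protocol):
    """Anything with a matching send() method can act as a lineage logger."""

    def send(self, record: LineageRecord) -> None: ...


class InMemoryLineageLogger:
    """Toy logger that keeps records in a list, e.g. for tests."""

    def __init__(self) -> None:
        self.records: List[LineageRecord] = []

    def send(self, record: LineageRecord) -> None:
        self.records.append(record)


def run_query_with_lineage(
    query: str,
    input_tables: Sequence[str],
    output_table: str,
    logger: LineageLogger,
) -> None:
    # A database plugin would execute the query here; this sketch only
    # emits the lineage record to whichever logger was configured.
    logger.send(LineageRecord(query, tuple(input_tables), output_table))
```

The emitting code depends only on the `send` method, so the logger can be replaced without touching the plugin that produces the lineage data.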
New export-csv command
Alongside vdk ingest-csv, which lets users import (or ingest) CSV data into a table, users can now export the result of a SQL query to CSV with a single command, vdk export-csv.
Check out more at the plugin page.
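Conceptually, exporting a query to CSV means running the query and writing the column names followed by the rows. The sketch below shows that idea with Python's standard library against an in-memory SQLite database. It is not the plugin's implementation, and the function names, table, and data are made up for the example.

```python
import csv
import io
import sqlite3
from typing import TextIO


def export_query_to_csv(connection: sqlite3.Connection, query: str, out: TextIO) -> None:
    """Run a query and write a header row plus all result rows as CSV."""
    cursor = connection.execute(query)
    writer = csv.writer(out, lineterminator="\n")
    # cursor.description holds one tuple per column; item 0 is the column name.
    writer.writerow([column[0] for column in cursor.description])
    writer.writerows(cursor)


def demo() -> str:
    """Build a tiny hypothetical table and return its CSV export as text."""
    connection = sqlite3.connect(":memory:")
    connection.execute("CREATE TABLE test_table (id INTEGER, name TEXT)")
    connection.executemany(
        "INSERT INTO test_table VALUES (?, ?)", [(1, "a"), (2, "b")]
    )
    buffer = io.StringIO()
    export_query_to_csv(
        connection, "SELECT id, name FROM test_table ORDER BY id", buffer
    )
    connection.close()
    return buffer.getvalue()
```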
In-memory properties client
Until now, properties required the Control Service in order to work. For prototyping and testing, however, you often do not need to connect to external services.
The in-memory properties client can be enabled in a specific job's config file (config.ini) or as an environment variable.
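As a rough picture of what an in-memory properties backend does, the sketch below stores properties in a plain dictionary, so nothing is sent to an external service and nothing survives after the process exits. The class `InMemoryProperties` and its method names are assumptions for illustration, not VDK's actual client.

```python
from typing import Any, Dict, Optional


class InMemoryProperties:
    """Hypothetical dict-backed property store for prototyping and tests."""

    def __init__(self) -> None:
        self._store: Dict[str, Any] = {}

    def get_property(self, name: str, default: Optional[Any] = None) -> Any:
        """Return one property, or the default when it is not set."""
        return self._store.get(name, default)

    def get_all_properties(self) -> Dict[str, Any]:
        """Return a copy so callers cannot mutate the store by accident."""
        return dict(self._store)

    def set_all_properties(self, properties: Dict[str, Any]) -> None:
        """Replace the whole property set, as a remote store would."""
        self._store = dict(properties)
```

Because the store lives only in process memory, this kind of client suits local experiments rather than state that must persist between job runs.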
New example: Ingest and anonymize
An example of how to anonymize data being ingested with VDK by using a plugin.
Check out more at the example page.
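The core of such a plugin is a step that rewrites sensitive fields before the payload is sent on. A minimal sketch of that step follows, assuming SHA-256 hashing and a payload shaped as a list of dictionaries. The function names and the choice of hash are illustrative and not taken from the example itself.

```python
import hashlib
from typing import Any, Dict, Iterable, List, Set


def anonymize_value(value: Any) -> str:
    """Replace a value with the SHA-256 hex digest of its string form."""
    return hashlib.sha256(str(value).encode("utf-8")).hexdigest()


def anonymize_payload(
    payload: Iterable[Dict[str, Any]], sensitive_fields: Set[str]
) -> List[Dict[str, Any]]:
    """Return a new payload with the sensitive fields hashed.

    Other fields pass through unchanged and the input rows are not mutated.
    """
    return [
        {
            key: anonymize_value(value) if key in sensitive_fields else value
            for key, value in row.items()
        }
        for row in payload
    ]
```

Unsalted hashing keeps equal inputs mapped to equal outputs, which preserves joins but is closer to pseudonymization than to strong anonymization. A production setup would likely need a salt or keyed hash.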
New example: Airflow integration
An example of how to create dependencies between data jobs in Airflow.
Check out more at the example page.
Package versions
See installation instructions here.
The versions of VDK components released under VDK 0.6 are:
Main components
control-service 1.5.620438292
vdk-core==0.3.620677184
Plugins
airflow-provider-vdk==0.0.602273476
vdk-lineage-model==0.0.581430542
vdk-kerberos-auth==0.3.584577337
vdk-ingest-http==0.2.616713987
vdk-impala==0.4.613570906
vdk-lineage==0.3.604201902
vdk-trino==0.4.605101952
Full Changelog: v0.5...v0.6
This discussion was created from the release Versatile Data Kit 0.6.