Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(datasets): Cleanup of dependency #822

Merged
merged 4 commits into from
Sep 9, 2024

Conversation

DimedS
Copy link
Contributor

@DimedS DimedS commented Sep 3, 2024

Description

This pull request addresses issue #747.

Modifications:

  1. Added dask-csvdataset.
  2. Removed polars-genericdataset as it currently doesn't exist.
  3. Added spark-sparkstreamingdataset.
  4. Restructured the polars and spark groups for consistency with other groups.
  5. Removed hdfs-base and s3fs-base from spark-sparkjdbcdataset as they are not required. They were retained in other groups as they may be necessary depending on the data storage.
  6. Updated Jinja2 to <3.2.0 for consistency with documentation requirements.
  7. Removed the special line for tables on Windows, as it was relevant only for Python 3.8 and is no longer necessary.

Checklist

  • Opened this PR as a 'Draft Pull Request' if it is work-in-progress
  • Updated the documentation to reflect the code changes
  • Added a description of this change in the relevant RELEASE.md file
  • Added tests to cover my changes

Signed-off-by: Dmitry Sorokin <129520297+DmitrySorokinQB@users.noreply.github.com>
@DimedS DimedS marked this pull request as ready for review September 3, 2024 11:05
@@ -140,8 +144,15 @@ snowflake = ["kedro-datasets[snowflake-snowparktabledataset]"]
spark-deltatabledataset = ["kedro-datasets[spark-base,hdfs-base,s3fs-base,delta-base]"]
spark-sparkdataset = ["kedro-datasets[spark-base,hdfs-base,s3fs-base]"]
spark-sparkhivedataset = ["kedro-datasets[spark-base,hdfs-base,s3fs-base]"]
spark-sparkjdbcdataset = ["kedro-datasets[spark-base,hdfs-base,s3fs-base]"]
spark = ["kedro-datasets[spark-deltatabledataset]"]
spark-sparkjdbcdataset = ["kedro-datasets[spark-base]"]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

does jbdbcdataset not need the hdfs-base or s3fs-base anymore?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe JDBC is intended for relational databases and cannot be used with HDFS or S3FS?

kedro-datasets/pyproject.toml Outdated Show resolved Hide resolved
DmitrySorokinQB and others added 2 commits September 6, 2024 09:25
Signed-off-by: Dmitry Sorokin <129520297+DmitrySorokinQB@users.noreply.github.com>
Copy link
Contributor

@ankatiyar ankatiyar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Copy link
Member

@astrojuanlu astrojuanlu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚀

@DimedS DimedS merged commit 3da39f7 into main Sep 9, 2024
14 checks passed
@DimedS DimedS deleted the 747-kedro-datasets-cleanup-of-dependency-groups branch September 9, 2024 19:54
harm-matthias-harms pushed a commit to harm-matthias-harms/kedro-plugins that referenced this pull request Oct 1, 2024
* Update dependencies
---------

Signed-off-by: Dmitry Sorokin <129520297+DmitrySorokinQB@users.noreply.github.com>
Co-authored-by: Dmitry Sorokin <129520297+DmitrySorokinQB@users.noreply.github.com>
Signed-off-by: Harm Matthias Harms <matthias.harms@quis.de>
MinuraPunchihewa pushed a commit to MinuraPunchihewa/kedro-plugins that referenced this pull request Oct 1, 2024
* Update dependencies
---------

Signed-off-by: Dmitry Sorokin <129520297+DmitrySorokinQB@users.noreply.github.com>
Co-authored-by: Dmitry Sorokin <129520297+DmitrySorokinQB@users.noreply.github.com>
Signed-off-by: Minura Punchihewa <minurapunchihewa17@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

kedro-datasets: Cleanup of dependency (groups are incomplete, old dependencies should be deleted)
4 participants