Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docs(glossary) Add Flower Datasets glossary entry #4235

Merged
merged 21 commits into from
Oct 1, 2024
Merged
Changes from all commits
Commits
Show all changes
21 commits
Select commit Hold shift + click to select a range
d5a93f6
Add Flower Datasets glossary entry
adam-narozniak Sep 19, 2024
a4515ee
Merge branch 'main' into add-flower-datasets-glossary-entry
yan-gao-GY Sep 19, 2024
45ef6ef
Merge branch 'main' into add-flower-datasets-glossary-entry
tanertopal Sep 20, 2024
432173b
Merge branch 'main' into add-flower-datasets-glossary-entry
adam-narozniak Sep 26, 2024
4bb0128
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
111c0a6
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
c96691f
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
a963070
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
b308c1c
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
46896f5
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
70784ae
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
33b2c13
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
d61a506
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 26, 2024
889a217
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 27, 2024
46af1fa
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 27, 2024
23ea568
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 27, 2024
69920f2
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 27, 2024
079e7ff
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 30, 2024
c2023e0
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 30, 2024
1c42076
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Sep 30, 2024
a22e2c9
Merge branch 'main' into add-flower-datasets-glossary-entry
flwrmachine Oct 1, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
27 changes: 27 additions & 0 deletions glossary/flower-datasets.mdx
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
---
title: "Flower Datasets"
description: "Flower Datasets is a library that enables the creation of datasets for federated learning by partitioning centralized datasets to exhibit heterogeneity or using naturally partitioned datasets."
date: "2024-05-24"
author:
name: "Adam Narożniak"
position: "ML Engineer at Flower Labs"
website: "https://discuss.flower.ai/u/adam.narozniak/summary"
related:
- text: "Flower Datasets documentation"
link: "https://flower.ai/docs/datasets/"
- text: "Flower Datasets GitHub page"
link: "https://github.com/adap/flower/tree/main/datasets"
---

Flower Datasets is a library that enables the creation of datasets for federated learning/analytics/evaluation by partitioning centralized datasets to exhibit heterogeneity or using naturally partitioned datasets. It was created by the Flower Labs team, which also created Flower - a Friendly Federated Learning Framework.

The key features include:
* downloading datasets (HuggingFace `datasets` are used under the hood),
* partitioning (simulate different levels of heterogeneity by using one of the implemented partitioning schemes or create your own),
* creating centralized datasets (easily utilize centralized versions of the datasets),
* reproducibility (repeat the experiments with the same results),
* visualization (display the created partitions),
* ML agnostic (easy integration with all popular ML frameworks).


It is a supplementary library to Flower, with which it integrates easily.