-
Notifications
You must be signed in to change notification settings - Fork 47
Issues: Lightning-AI/litdata
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
CI error: Something isn't working
help wanted
Extra attention is needed
All chunks should've been deleted
keeps coming back
bug
#437
opened Dec 20, 2024 by
deependujha
Restart training with new data, mid-epoch
enhancement
New feature or request
#436
opened Dec 17, 2024 by
schopra8
Advanced Batching Logic with CombinedStreamingDataset
enhancement
New feature or request
#434
opened Dec 13, 2024 by
schopra8
The progress bar for Something isn't working
help wanted
Extra attention is needed
merge_datasets
does not accurately represent the status of the job.
bug
#433
opened Dec 5, 2024 by
abysmalocean
Question: Is there a list for publicly available s3 links of datasets of New feature or request
litdata.StreamingDataset
format?
enhancement
#430
opened Dec 2, 2024 by
2catycm
Question: Is litdata faster when loading local dataset or network storage s3 dataset?
enhancement
New feature or request
#428
opened Nov 30, 2024 by
2catycm
Add 'New contributors' section to main README
enhancement
New feature or request
#426
opened Nov 28, 2024 by
robmarkcole
Link on readme redirects to generic page
bug
Something isn't working
help wanted
Extra attention is needed
#422
opened Nov 27, 2024 by
robmarkcole
Add support for multi node for Optimize & StreamingDataset
bug
Something isn't working
help wanted
Extra attention is needed
#419
opened Nov 22, 2024 by
tchaton
Clear Examples of use with different dataset types and code changes.
enhancement
New feature or request
#409
opened Nov 4, 2024 by
Woodr7
training hangs with lightning ddp and cloud dir?
bug
Something isn't working
help wanted
Extra attention is needed
#408
opened Nov 1, 2024 by
rxqy
incorrect dataloader length when Something isn't working
help wanted
Extra attention is needed
drop_last=False
bug
#402
opened Oct 28, 2024 by
grez72
Improve CombinedStreamingDataset to handle multiple subdatasets efficiently
enhancement
New feature or request
#386
opened Oct 2, 2024 by
bhimrazy
The config isn't consistent between chunks
bug
Something isn't working
help wanted
Extra attention is needed
#370
opened Sep 17, 2024 by
AugustDev
How can I shut down automatically distributing data when using StreamingDataset?
enhancement
New feature or request
question
Further information is requested
#368
opened Sep 12, 2024 by
ygtxr1997
Failed to Resume Training w/ CombinedStreamingDataset
bug
Something isn't working
duplicate
This issue or pull request already exists
help wanted
Extra attention is needed
#363
opened Sep 5, 2024 by
schopra8
StreamingDataset causes NCCL timeout when using multiple nodes
bug
Something isn't working
help wanted
Extra attention is needed
#340
opened Aug 26, 2024 by
hubenjm
Lazyload subsamples if subsample=1.0
enhancement
New feature or request
question
Further information is requested
#339
opened Aug 21, 2024 by
deependujha
StreamingDataset intermittently fails due to lack of index.json
bug
Something isn't working
help wanted
Extra attention is needed
#337
opened Aug 20, 2024 by
plra
Bug: Inconsistent Behavior with StreamingDataloader loading states (specific to CombinedStreamingDataset)
bug
Something isn't working
help wanted
Extra attention is needed
#331
opened Aug 14, 2024 by
bhimrazy
Use different batch sizes in CombinedStreamingDataset
enhancement
New feature or request
help wanted
Extra attention is needed
#327
opened Aug 10, 2024 by
schopra8
Add support for multi sample item in optimize and yielding from the _getitem_ of the StreamingDataset
enhancement
New feature or request
help wanted
Extra attention is needed
#317
opened Aug 8, 2024 by
tchaton
Explore about integrating homomorphic encryption
enhancement
New feature or request
help wanted
Extra attention is needed
#313
opened Aug 7, 2024 by
bhimrazy
Investigate keeping the content of the downloaded chunks in RAM instead of writing it to file.
enhancement
New feature or request
help wanted
Extra attention is needed
#291
opened Aug 1, 2024 by
tchaton
Add training mode compression for zstd
enhancement
New feature or request
help wanted
Extra attention is needed
#283
opened Jul 31, 2024 by
tchaton
Previous Next
ProTip!
Adding no:label will show everything without a label.