-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: huggingface/datasets
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add return_file_name support to Parquet packaged builder
#8020
opened Feb 23, 2026 by
dhruvildarji
Loading…
feat: add return_file_name support to CSV packaged builder
#8019
opened Feb 23, 2026 by
dhruvildarji
Loading…
3 tasks
Improve error message for deprecated dataset scripts with migration guidance
#8017
opened Feb 22, 2026 by
suryanshbt211
Loading…
Speed up local 'get_data_patterns' by avoiding repeated recursive scans
#8014
opened Feb 21, 2026 by
AsymptotaX
Loading…
fix: prevent duplicate keywords in load_dataset_builder (#4910)
#8008
opened Feb 16, 2026 by
DhyeyTeraiya
Loading…
fix save_to_disk/load_from_disk with pathlib.Path input
#8004
opened Feb 13, 2026 by
Mr-Neutr0n
Loading…
Fix Dataset.map writer initialization when early examples return None
#7996
opened Feb 8, 2026 by
veeceey
Loading…
✨ Add 'SparseCsv' builder and 'sparse_collate_fn' for efficient high-dimensional sparse data loading
#7993
opened Feb 4, 2026 by
Ebraheem1
Loading…
Fix index out of bound error with original_shard_lengths.
#7987
opened Feb 4, 2026 by
jonathanasdf
Loading…
Fix unstable tokenizer fingerprinting (enables map cache reuse)
#7982
opened Feb 2, 2026 by
KOKOSde
Loading…
feat: implement iter_arrow for skip, take and step iterables
#7972
opened Jan 30, 2026 by
Edge-Explorer
Loading…
Issue 7756 Fix - multiprocessing hang issue with start method check
#7967
opened Jan 28, 2026 by
vedanta777
Loading…
Use Sequence instead of list in Dataset.from_parquet type hints
#7962
opened Jan 26, 2026 by
Mukundtimbadiya20
Loading…
#5354: replace list with Sequence in from_parquet type hints
#7953
opened Jan 19, 2026 by
ashmi8
Loading…
feat: Add GenBank file format support for biological sequence data
#7951
opened Jan 19, 2026 by
behroozazarkhalili
Loading…
2 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.