Croc-Prog-HF
's Collections
MultiLang-Texts HQ Datasets
updated
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
•
3.5B
•
289k
•
939
Viewer
•
Updated
•
7.18B
•
40.1k
•
584
Viewer
•
Updated
•
165M
•
5.98k
•
14
BramVanroy/CommonCrawl-CreativeCommons-strict
Viewer
•
Updated
•
32.8M
•
704
•
1
BramVanroy/CommonCrawl-CreativeCommons-fine
Viewer
•
Updated
•
75.1M
•
173
•
2
Viewer
•
Updated
•
1.28B
•
496
•
56
Viewer
•
Updated
•
88.8k
•
10.8k
•
1.48k
Viewer
•
Updated
•
470M
•
33.1k
•
339
Viewer
•
Updated
•
206k
•
3.97k
•
334
Viewer
•
Updated
•
32.5M
•
1.4k
•
5
OpenLLM-France/Claire-Dialogue-French-0.1
Viewer
•
Updated
•
37k
•
169
•
50
Viewer
•
Updated
•
4.48B
•
102k
•
745
Viewer
•
Updated
•
476M
•
39.9k
•
816
HuggingFaceH4/Multilingual-Thinking
Viewer
•
Updated
•
1k
•
12.8k
•
109