Hugging Face
Datasets
180,614
Sort: Trending
HuggingFaceTB/smollm-corpus • Viewer • Updated 3 days ago • 237M • 677 • 70
HuggingFaceM4/Docmatix • Viewer • Updated 1 day ago • 1.27M • 27 • 66
proj-persona/PersonaHub • Viewer • Updated 8 days ago • 375k • 3.73k • 323
Salesforce/xlam-function-calling-60k • Viewer • Updated about 5 hours ago • 60k • 2.68k • 270
amphion/Emilia • Preview • Updated 11 days ago • 12 • 62
mlfoundations/dclm-baseline-1.0 • Preview • Updated about 8 hours ago • 107 • 55
HuggingFaceFW/fineweb • Viewer • Updated 3 days ago • 46B • 134k • 1.58k
fka/awesome-chatgpt-prompts • Viewer • Updated Mar 7, 2023 • 153 • 7.02k • 5.04k
BAAI/Infinity-Instruct • Viewer • Updated 5 days ago • 2.97M • 3.06k • 298
OpenFace-CQUPT/FaceCaption-15M • Viewer • Updated 2 days ago • 13.4M • 101 • 37
roneneldan/TinyStories • Viewer • Updated Dec 4, 2023 • 2.14M • 158k • 466
lmms-lab/M4-Instruct-Data • Updated 5 days ago • 7 • 29
PawanKrd/math-gpt-4o-200k • Viewer • Updated 20 days ago • 200k • 268 • 33
KBlueLeaf/danbooru2023-florence2-caption • Viewer • Updated 6 days ago • 13.3M • 30 • 49
Anthropic/hh-rlhf • Viewer • Updated May 26, 2023 • 169k • 176k • 1.11k
Vikhrmodels/GrandMaster-PRO-MAX • Viewer • Updated about 6 hours ago • 142k • 14
google/spiqa • Viewer • Updated about 16 hours ago • 666 • 102 • 22
SkunkworksAI/reasoning-0.01 • Viewer • Updated 14 days ago • 29.9k • 385 • 55
HuggingFaceFW/fineweb-edu • Viewer • Updated Jun 12 • 3B • 246k • 394
wikimedia/wikipedia • Viewer • Updated Jan 9 • 61.6M • 45.5k • 443
OpenCo7/UpVoteWeb • Viewer • Updated 2 days ago • 557M • 100 • 81
BAAI/IndustryCorpus • Preview • Updated 4 days ago • 6 • 12
BAAI/DenseFusion-1M • Viewer • Updated 8 days ago • 1.18M • 8 • 13
futurehouse/lab-bench • Viewer • Updated 3 days ago • 1.97k • 16 • 10
allenai/c4 • Viewer • Updated Jan 9 • 10.4B • 167k • 235
OleehyO/latex-formulas • Viewer • Updated May 9 • 1.56M • 933 • 41
Wenetspeech4TTS/WenetSpeech4TTS • Updated 11 days ago • 1.34k • 57
zwq2018/Multi-modal-Self-instruct • Viewer • Updated about 10 hours ago • 76k • 100 • 13
mii-llm/pinocchio • Viewer • Updated 3 days ago • 137k • 36 • 9
mlfoundations/dclm-baseline-1.0-parquet • Viewer • Updated about 8 hours ago • 2.73B • 48 • 9