Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
mkurman
's Collections
GLM-4.7-Flash-SynthLabs
NeuroBLAST v3
Medical Pre-Training Datasets
Medical QA Datasets
Medical Pre-Training Datasets
updated
Aug 23, 2025
A collection of medical datasets suitable for LLMs pretraining
Upvote
2
openmed-community/TheBlueScrubs-v1-fixed
Viewer
•
Updated
Aug 29, 2025
•
11.1M
•
977
•
13
mkurman/hindawi-journals-2007-2023
Viewer
•
Updated
Jun 9, 2025
•
298k
•
593
•
5
epfl-llm/guidelines
Viewer
•
Updated
Mar 7, 2024
•
38k
•
1.38k
•
152
ncbi/Open-Patients
Viewer
•
Updated
May 11, 2025
•
180k
•
208
•
28
AGBonnet/augmented-clinical-notes
Viewer
•
Updated
Jan 24, 2024
•
30k
•
1.17k
•
73
harishnair04/mtsamples
Viewer
•
Updated
Nov 7, 2024
•
5k
•
258
•
3
Tonic/Health-Bench-Eval-OSS-2025-07
Viewer
•
Updated
May 17, 2025
•
9.67k
•
335
•
4
zeroshot/arxiv-biology
Viewer
•
Updated
Jan 5, 2023
•
1.28k
•
153
•
14
Upvote
2
Share collection
View history
Collection guide
Browse collections