Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mayongqiang 's Collections
PDF 解析
R1_data
SCINER
ACL papers
SciPapersData
sci_critic_reader_v2_model
SurveyTableocContentGeneration
sci_critic_reader
gpts_tagging
arxiv_cs_2024

SciPapersData

updated Mar 2, 2025
Upvote
-

  • armanc/scientific_papers

    Updated Jan 18, 2024 • 12.8k • 173

    Note Scientific papers datasets contains two sets of long and structured documents. The datasets are obtained from ArXiv and PubMed OpenAccess repositories. Both "arxiv" and "pubmed" have two features: - article: the body of the document, paragraphs separated by "/n". - abstract: the abstract of the document, paragraphs separated by "/n". - section_names: titles of sections, separated by "/n".


  • neuralwork/arxiver

    Viewer • Updated Nov 1, 2024 • 63.4k • 577 • 365

  • SciPhi/AgentSearch-V1

    Viewer • Updated Jan 14, 2024 • 70k • 2.23k • 90

  • laion/medrXiv-pdf

    Viewer • Updated Oct 17, 2024 • 57.6k • 65 • 5

  • laion/biorXiv-pdf

    Viewer • Updated Oct 18, 2024 • 1.5k • 1.01k • 4

  • NeuML/txtai-arxiv

    Sentence Similarity • Updated Nov 17, 2025 • 34 • 20

  • laion/biorXiv_metadata

    Viewer • Updated Nov 10, 2024 • 354k • 93 • 3
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs