Improve model card: Add OBR paper details, usage, and tags

#1
by nielsr HF Staff - opened

This PR significantly improves the model card for the Qwen2.5-7B-FlatQuant-OBR-GPTQ-W4A4KV4S50 model (as implied by the context of FlatQuant and Qwen2.5).

Key changes include:

  • Replacing the boilerplate content with a detailed description, including the paper title and abstract.
  • Adding a link to the official Hugging Face paper page: Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs.
  • Adding a link to the GitHub repository: https://github.com/csguoh/OBR.
  • Setting the pipeline_tag to text-generation to improve discoverability.
  • Updating the license to mit, aligning with the underlying FlatQuant method used for this specific model.
  • Adding relevant tags: quantization, sparsity, llm, qwen2.
  • Including a clear "How to Get Started" section with a functional code snippet for transformers inference, adapted from best practices and the context provided in the GitHub repository.
  • Adding the BibTeX citation.
  • Populating various "Model Details" fields and removing empty boilerplate sections.

These changes make the model card much more informative and user-friendly, aligning it with Hugging Face's best practices.

HangGuo changed pull request status to merged

Sign up or log in to comment