Improve model card: Add OBR paper details, usage, and tags
#1
by nielsr HF Staff - opened
This PR significantly improves the model card for the Qwen2.5-7B-FlatQuant-OBR-GPTQ-W4A4KV4S50 model (as implied by the context of FlatQuant and Qwen2.5).
Key changes include:
- Replacing the boilerplate content with a detailed description, including the paper title and abstract.
- Adding a link to the official Hugging Face paper page: Optimal Brain Restoration for Joint Quantization and Sparsification of LLMs.
- Adding a link to the GitHub repository: https://github.com/csguoh/OBR.
- Setting the
pipeline_tagtotext-generationto improve discoverability. - Updating the
licensetomit, aligning with the underlying FlatQuant method used for this specific model. - Adding relevant tags:
quantization,sparsity,llm,qwen2. - Including a clear "How to Get Started" section with a functional code snippet for
transformersinference, adapted from best practices and the context provided in the GitHub repository. - Adding the BibTeX citation.
- Populating various "Model Details" fields and removing empty boilerplate sections.
These changes make the model card much more informative and user-friendly, aligning it with Hugging Face's best practices.
HangGuo changed pull request status to merged