Submitted by akhaliq 53 Seed-Music: A Unified Framework for High Quality and Controlled Music Generation · 38 authors 4
Submitted by iofu728 43 RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval · 14 authors 2
Submitted by emanuelevivoli 24 One missing piece in Vision and Language: A Survey on Comics Understanding Vision, Language and Reading 133 2
Submitted by ZCODE0 15 Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models · 5 authors 19 2
Submitted by Sreyan88 13 ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds · 6 authors 33 2
Submitted by amanchadha 8 Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types · 3 authors 0 2
Submitted by Swtheking 8 Policy Filtration in RLHF to Fine-Tune LLM for Code Generation · 2 authors 34 3
Submitted by dek924 5 Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records · 4 authors 9 2
Submitted by beeformer 4 beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems · 3 authors 106 2
Submitted by IAMJB 3 LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study · 3 authors 1