5 19 5

Shuo Xing

shuoxing

https://shuoxing98.github.io/

ShuoXing98

AI & ML interests

MLLMs, LLMs

Recent Activity

published a model about 1 hour ago

shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8

updated a model 1 day ago

shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8

published a model 1 day ago

shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8

View all activity

Organizations

published a model about 1 hour ago

shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8

Updated about 1 hour ago

updated a model 1 day ago

shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8

Text Generation • 266k • Updated 1 day ago • 20

published a model 1 day ago

shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce-bs8

Text Generation • 266k • Updated 1 day ago • 20

updated a model 1 day ago

shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce

Text Generation • 8B • Updated 1 day ago • 91

published a model 2 days ago

shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-reproduce

Text Generation • 8B • Updated 1 day ago • 91

updated a collection 6 days ago

MLLM Reasoning, Rewarding, and Understanding

Collection

Papers on the reasoning, rewarding, and understanding of the MLLMs and LLMs • 28 items • Updated 6 days ago • 1

updated a model 13 days ago

shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 37

published a model 13 days ago

shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 37

updated a model 13 days ago

shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 41

published a model 13 days ago

shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 41

updated a model 13 days ago

shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 39

published a model 13 days ago

shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 39

updated a model 13 days ago

shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 32

published a model 13 days ago

shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 266k • Updated 13 days ago • 32

updated a model 13 days ago

shuoxing/qwen-0_5b-full-pretrain-control-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 0.5B • Updated 13 days ago • 14

published a model 13 days ago

shuoxing/qwen-0_5b-full-pretrain-control-tweet-1m-en-no-packing-new-sft-bs128

Text Generation • 0.5B • Updated 13 days ago • 14

Shuo Xing

AI & ML interests

Recent Activity

Organizations

shuoxing's activity