RL-Project rgtjf/ppo-LunarLander-v2 Reinforcement Learning • Updated Oct 15, 2024 • 3 rgtjf/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning • Updated Oct 16, 2024 rgtjf/q-Taxi-v3 Reinforcement Learning • Updated Oct 16, 2024
UtK rgtjf/Qwen2-UtK-72B-128K 73B • Updated Oct 17, 2024 • 8 rgtjf/Qwen2-UtK-7B-128K 8B • Updated Oct 17, 2024 • 11 rgtjf/Qwen2-UtK-ChatQA2-72B-128K 73B • Updated Oct 17, 2024 • 10 rgtjf/Qwen2-UtK-ChatQA2-7B-128K 8B • Updated Oct 17, 2024 • 7
RL-Project rgtjf/ppo-LunarLander-v2 Reinforcement Learning • Updated Oct 15, 2024 • 3 rgtjf/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning • Updated Oct 16, 2024 rgtjf/q-Taxi-v3 Reinforcement Learning • Updated Oct 16, 2024
UtK rgtjf/Qwen2-UtK-72B-128K 73B • Updated Oct 17, 2024 • 8 rgtjf/Qwen2-UtK-7B-128K 8B • Updated Oct 17, 2024 • 11 rgtjf/Qwen2-UtK-ChatQA2-72B-128K 73B • Updated Oct 17, 2024 • 10 rgtjf/Qwen2-UtK-ChatQA2-7B-128K 8B • Updated Oct 17, 2024 • 7