RL-Project rgtjf/ppo-LunarLander-v2 Reinforcement Learning • Updated Oct 15, 2024 • 2 rgtjf/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning • Updated Oct 16, 2024 rgtjf/q-Taxi-v3 Reinforcement Learning • Updated Oct 16, 2024
UtK rgtjf/Qwen2-UtK-72B-128K 73B • Updated Oct 17, 2024 • 9 rgtjf/Qwen2-UtK-7B-128K 8B • Updated Oct 17, 2024 • 7 rgtjf/Qwen2-UtK-ChatQA2-72B-128K 73B • Updated Oct 17, 2024 • 6 rgtjf/Qwen2-UtK-ChatQA2-7B-128K 8B • Updated Oct 17, 2024 • 5
RL-Project rgtjf/ppo-LunarLander-v2 Reinforcement Learning • Updated Oct 15, 2024 • 2 rgtjf/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning • Updated Oct 16, 2024 rgtjf/q-Taxi-v3 Reinforcement Learning • Updated Oct 16, 2024
UtK rgtjf/Qwen2-UtK-72B-128K 73B • Updated Oct 17, 2024 • 9 rgtjf/Qwen2-UtK-7B-128K 8B • Updated Oct 17, 2024 • 7 rgtjf/Qwen2-UtK-ChatQA2-72B-128K 73B • Updated Oct 17, 2024 • 6 rgtjf/Qwen2-UtK-ChatQA2-7B-128K 8B • Updated Oct 17, 2024 • 5