ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 Reinforcement Learning • 8B • Updated Mar 28 • 4.95k • 187
AXONVERTEX-AI-RESEARCH/Orchestrator-8B-Q8_0-GGUF Reinforcement Learning • 8B • Updated 8 days ago • 469 • 7
emiliodavola/french-solitaire-dqn-single-solution Reinforcement Learning • Updated 25 days ago • 25 • 2
0xgr3y/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tall_tame_panther Text Generation • 0.5B • Updated 18 days ago • 2.13k • 1