AI model: VerlTool/torl-fsdp-qwen_qwen2.5-coder-1.5b-grpo-n16-b128-t1.0-lr1e-6new-no-toolusepenalty-430-step
AI model: nathomas/tool-use-sql-test-debug
AI model: AndreasX1206/tool_use
AI model: VerlTool/torl-fsdp_agent-qwen_qwen2.5-coder-1.5b-grpo-n16-b128-t1.0-lr1e-6new-no-toolusepenalty-430-step
AI model: nathomas/tool-use-sql-debug
AI model: 1nstaller/qwen3_8b_sft_tool_use
AI model: 1nstaller/qwen3_8b_tool_use_final
AI model: ronaldhandiwinata/gemma-3-1b-tool-use-merged
AI model: ankitdhiman/nemotron-hinglish-4b-thinking-tool-use
AI model: tomhao/yi_6b_chat_tool_use
AI model: mattiaferrarini/Apertus-8B-Instruct-2509-tool-use
AI model: TymofiiNas/MamayLM-tool-use
AI model: morimae/generate_data_tool_use
AI model: morimae/generate_data_tool_use__bank
AI model: CL19/sft-step2900-userquery-tooluse
AI model: CL19/base-userquery-tooluse
AI model: CL19/sft-step100-userquery-tooluse
AI model: CL19/sft-step2000-userquery-tooluse
AI model: CL19/sft-step1000-userquery-tooluse
AI model: CL19/fullinstruct-bashsft-step270-userquery-tooluse