Search-R1: A Tool for Reinforcement Learning to Train Large Models for Search and Reasoning
General Introduction Search-R1 is an open source project, developed by PeterGriffinJin on GitHub, built on the veRL framework. It trains Large Language Models (LLMs) through Reinforcement Learning (RL) techniques, allowing the models to autonomously learn...