Tülu 3 405B, the latest in the Tülu 3 series, outperforms DeepSeek-V3, rivals GPT-4o & other open-weight post-trained models like Llama 3.1. Leveraging Reinforcement Learning from Verifiable Rewards (RVLR), it scales to 405B parameters, setting new benchmarks.