The Shanghai-based AI lab said on Monday its latest Step 3.5 Flash model was designed to deliver advanced reasoning and agentic capabilities while maintaining efficiency.
Despite its relatively modest size of about 196 billion parameters – far smaller than Moonshot AI’s Kimi K2.5 with 1 trillion parameters or DeepSeek V3.2 with 671 billion parameters – Step 3.5 Flash outperformed its larger rivals across several benchmark tests measuring agentic, reasoning and coding capabilities, according to the company’s self-reported results.
Advertisement
Parameters are the variables that encode an AI system’s “intelligence”, with a larger number usually indicating stronger performance.
Step 3.5 Flash topped four reasoning benchmarks, including AIME 2025 and IMOAnswerBench, outperforming leading systems from DeepSeek, Moonshot AI, Zhipu AI and MiniMax, and trailing only Microsoft-backed OpenAI in certain tests.
