Meituan Open Sources LongCat 2.0, a 1.6 Trillion Parameter Coder Trained Entirely on Chinese Chips
AI & ML

Meituan Open Sources LongCat 2.0, a 1.6 Trillion Parameter Coder Trained Entirely on Chinese Chips

A Chinese delivery giant just shipped a near frontier, agentic coding model trained without a single Nvidia GPU, and posted the weights for anyone to download. The decoupling of advanced AI from American silicon is no longer theoretical.

PublishedJuly 3, 2026
Read time5 min read
Share

A delivery company ships a frontier coder

The most consequential AI release of the week did not come from a research lab or a hyperscaler. It came from Meituan, the Chinese food delivery and local services giant, which open-sourced LongCat-2.0, a 1.6 trillion parameter agentic coding model, on June 30. That a delivery company is now shipping near frontier models is itself a marker of how far the capability has diffused. The barrier to building at this scale is no longer the exclusive province of a handful of specialist firms.

The number that draws attention is the parameter count, but the number that should draw attention is the trajectory. LongCat-2.0 nearly triples the size of its predecessor, LongCat-Flash, which shipped at 560 billion parameters in September 2025. That is a roughly threefold increase in under nine months, a cadence that reflects both ample compute and organizational urgency. Companies do not move this fast on side projects, and Meituan is clearly treating frontier AI as core to its future rather than a laboratory curiosity.

The architecture behind the numbers

LongCat-2.0 is a Mixture-of-Experts model, which is the technical reason a 1.6 trillion parameter system can be practical to serve. Rather than firing every parameter for every token, the model dynamically activates a subset, roughly 33 to 56 billion parameters at a time. That keeps inference cost far below what the headline size would imply, while retaining the capacity that a very large parameter count provides. It is the standard trick for making enormous models economically viable, executed here at serious scale.

The design intent is agentic coding, and the specification reflects it. A one million token context window means the model can hold entire codebases, long logs, and multi step task histories in view at once, which is what agentic workflows demand. This is not a general chat model repurposed for engineering. It is built for the workloads where an assistant plans, edits, runs, and iterates across a large repository, the exact pattern that has come to define the most valuable applications of coding models.

Trained without American silicon

The detail that gives this release geopolitical weight is the training hardware. Meituan reports that LongCat-2.0 was trained on a 50,000 card cluster of domestic Chinese chips, with no Nvidia A100 or H100 accelerators and no AMD MI300X parts in the loop. The use of Huawei's Collective Communication Library for training points to Huawei silicon underneath. The entire pipeline, by the company's account, relied on Chinese manufactured hardware.

For years, the working assumption behind export controls has been that restricting access to leading edge American accelerators would meaningfully slow Chinese frontier development. LongCat-2.0 is a direct, public challenge to that assumption. As the reporting summarized it, the model demonstrates that Chinese companies can produce competitive frontier models without American chips. That does not mean the domestic hardware is at parity, efficiency and yield gaps almost certainly remain, but it does mean the ceiling imposed by export controls is higher and softer than policymakers may have hoped.

Benchmarks and real world traction

The performance figures are respectable enough to matter. LongCat-2.0 scored 59.5 on SWE-bench Pro, a demanding test of real software engineering tasks, and 70.8 on Terminal-Bench, which measures a model's ability to operate in a command line environment. These are not toy metrics, they are among the harder public evaluations for coding agents, and scores at this level place the model in serious contention rather than novelty status.

More telling than any benchmark is adoption. The model briefly led usage on OpenRouter, the popular routing service that aggregates demand across many models, which means developers were not just noting the release, they were routing real traffic to it. Usage is the benchmark that resists gaming, because it reflects thousands of independent decisions about which model actually gets the job done. That an open weight Chinese model reached the top of that chart, however briefly, is a signal the market took it seriously.

What open weights mean for enterprise engineering teams

The weights are freely available on Hugging Face under the Meituan organization, and that availability changes the calculus for engineering leaders. An open weight model at this capability level can be self hosted, inspected, fine tuned on proprietary code, and run inside a controlled environment where source never leaves the building. For organizations with strict data governance or a preference against sending code to a third party API, that combination is genuinely attractive.

The considerations are the familiar ones for any open weight adoption, sharpened by provenance. Teams must weigh the operational burden of hosting a trillion parameter class model against the control it buys, and they must think carefully about supply chain assurance, licensing terms, and whether their governance framework is comfortable with a model of this origin. None of that is disqualifying, but it is a real evaluation rather than a default. What LongCat-2.0 establishes is that the open weight frontier is now a serious tier that enterprise architects have to include in their build versus buy analysis, not a fringe option they can wave away.

Tagged#news#engineering#ai-ml#llm#open-source#agents