LongCat-2.0 is a 1.6 trillion-parameter mixture-of-experts language model with approximately 48 billion activated parameters per token, open-sourced by Meituan on June 30, 2026. It uses LongCat Sparse Attention for 1M-context efficiency, N-gram Embedding for parameter scaling, and post-training expert groups for agent, reasoning, and interaction tasks. Official page: longcat.chat/blog/longcat-2.0.

Who made LongCat-2.0?

LongCat-2.0 comes from Meituan — the Chinese technology company known for food delivery and local services. This is a different product from the earlier LongCat Video Avatar open-source model (talking avatars). LongCat-2.0 is a frontier-scale text/code/agent LLM.

What are LongCat-2.0's benchmark scores?

Meituan reports in-house scores (via Claude Code harness unless marked external): Terminal-Bench 2.1 at 70.8, SWE-bench Pro at 59.5, SWE-bench Multilingual at 77.3, FORTE office-agent benchmark at 73.2, BrowseComp at 79.9, GPQA-diamond at 88.9. External reported Opus 4.8 leads Terminal-Bench at 78.9 and SWE-bench Pro at 69.2 on Meituan's comparison table.

Can I run LongCat-2.0 locally?

Weights and inference code are live as of July 5, 2026, under the MIT license — meituan-longcat/LongCat-2.0 on Hugging Face has full safetensors (BF16/F32, an FP8 variant, and community quantizations). "Locally" still means datacenter-class hardware: Meituan recommends 16x H20 GPUs with tensor + expert parallelism via a dedicated SGLang PR, plus a separate NPU path through SGLang-FluentLLM. At 1.6T total parameters even 2-bit quantization implies 400GB+ of weight storage — this is not a consumer-GPU model.

Does LongCat-2.0 work with Claude Code and OpenClaw?

Yes, per Meituan's announcement. LongCat-2.0 is integrated with mainstream agent harnesses including Claude Code, OpenClaw, and Hermes. Terminal-Bench and SWE-bench Pro numbers in the launch post were measured via Claude Code with 4c8g–8c16g sandboxes.

How is LongCat-2.0 different from Kimi K2.7-Code?

LongCat-2.0 is larger (1.6T vs 1T total, ~48B vs 32B active), trained on AI ASIC superpods rather than Nvidia-centric stacks, and publishes stronger in-house Terminal-Bench (70.8) and SWE-bench Pro (59.5) numbers than Kimi's vendor suites. Kimi K2.7-Code weights are already on Hugging Face under Modified MIT; LongCat weights are pending. Both are open-weight MoE coders relevant during the Fable 5 export ban.

LongCat-2.0: 1.6T MoE Open Model — ASIC Training | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

LongCat-2.0: 1.6T MoE Open Model — ASIC Training | explainx.ai Blog | explainx.ai

Component	What it does
Streaming-aware Indexing (SI)	Reshapes token selection for coalesced HBM access — contiguous reads instead of fragmented scatter
Cross-Layer Indexing (CLI)	One indexing pass serves multiple consecutive layers at inference — saliency stable across adjacent layers
Hierarchical Indexing (HI)	Coarse-to-fine scoring — block-level recall, then fine token selection inside candidates

Parameter	LongCat-2.0	Kimi K2.7-Code (comparison)
Total params	1.6T	1T
Active per token	~48B	32B
Context training	Up to 1M	256K
Attention	LSA (sparse)	MLA

Model	Terminal-Bench 2.1	SWE-bench Pro	SWE-bench Multilingual
LongCat-2.0	70.8	59.5	77.3
Gemini 3.1 Pro	70.7*	54.2*	76.9*
GPT-5.5	73.8*	58.6*	—
Opus 4.6	—	57.3*	77.8*
Opus 4.7	71.7*	64.3*	80.5*
Opus 4.8	78.9*	69.2*	84.8*

Model	FORTE †	BrowseComp	RWSearch
LongCat-2.0	73.2	79.9	78.8
Gemini 3.1 Pro	70.3	85.9*	76.3
GPT-5.5	77.8	84.4*	85.3
Opus 4.8	77.2	84.3*	77.3

Model	IFEval	Writing Bench	IMO-AnswerBench	GPQA-diamond
LongCat-2.0	90.0	83.8	81.8	88.9
Opus 4.8	86.0	85.2	75.3	92.4

Expert group	Focus
Agent Experts	Code, work, search — tool invocation, parameter parsing, self-correction vs infinite loops
Reasoning Experts	Math, STEM, multi-hop — adaptive compute by difficulty
Interaction Experts	Instruction following, hallucination suppression, safety bounds

Channel	Status
Try it	Web demo and chat at longcat.ai
API	LongCat API Platform — international payment support still "under active development" per Meituan
GitHub	meituan-longcat/LongCat-2.0 — MIT license
Hugging Face	meituan-longcat/LongCat-2.0 — weights live, BF16/F32 safetensors + FP8 + community quantizations
ModelScope	meituan-longcat/LongCat-2.0
GPU inference	SGLang PR #30042 — Meituan recommends 16x H20 with tensor + expert parallelism (`--tp 16 --ep 16`); hierarchical indexing not supported in this path for simplicity
NPU inference	SGLang-FluentLLM (NPU branch)
Claude Code / OpenClaw / Hermes	Harness integration claimed at launch, unchanged
Community	Discord for support and feedback

Model	Org	Total / Active	Weights status	Standout claim
LongCat-2.0	Meituan	1.6T / ~48B	Available (MIT, since July 5)	Terminal-Bench 70.8, ASIC training, 1M context
Kimi K2.7-Code	Moonshot	1T / 32B	Available (Modified MIT)	MCP Mark 81.1 vs Opus 76.4, $0.95/M API
GLM-5.2	Zhipu	—	Available (MIT)	BridgeBench reasoning, security parity narrative
Opus 4.8	Anthropic	Closed	API	Official Fable 5 fallback; leads LongCat table on SWE-bench Pro

Released	June 30, 2026 (official blog)
Organization	Meituan (food delivery / local services giant)
Architecture	MoE — 1.6T total, ~48B active per token
Context	Trained on 1M-context data (hundreds of billions of tokens)
Attention	LongCat Sparse Attention (LSA) — evolution of DeepSeek Sparse Attention
Training	50K+ AI ASIC superpods, 35T+ tokens, deterministic ops
Harnesses	Claude Code, OpenClaw, Hermes
Weights	Live since July 5, 2026 — Hugging Face, MIT licensed, no restrictions
Deployment	GPU (16x H20, SGLang) and NPU (SGLang-FluentLLM) — both documented
HN signal	43 points — community focused on ASIC training story and weight availability

LongCat-2.0: Meituan's 1.6T MoE Open Model Trained on AI ASIC Superpods

TL;DR — LongCat-2.0 at a glance

Related posts

Kimi K2.7-Code: Moonshot AI's 1T-Parameter Open Coding Powerhouse

Cohere North Mini Code: Open-Source Agentic Coding Model (Apache 2.0)

Codeberg Bans Vibe-Coded Projects: What the New ToU Actually Says

Why LongCat-2.0 matters beyond the leaderboard

1. Frontier training without Nvidia as the hero

2. Open weights in the Fable 5 vacuum

3. Harness-native integration

Architecture — LSA, N-gram Embedding, and MoE at 1.6T

LongCat Sparse Attention (LSA)

N-gram Embedding (135B parameters)

Scale summary

Training infrastructure — 50K ASICs and deterministic ops

Benchmarks — code, agents, and foundations

Code agent

General agent

Foundational

Post-training — Agent, Reasoning, and Interaction experts

Demos — what Meituan claims in production

How to access LongCat-2.0 today

Community reaction — Hacker News, June 30

Reaction after the July 5 weight release

LongCat-2.0 vs Kimi K2.7 vs GLM-5.2 — June 2026 open coder map

Evaluation checklist

Bottom line