Agents-A1 is a 35.11B-parameter mixture-of-experts agentic language model from InternScience, released June 30, 2026 on Hugging Face and ModelScope under Apache 2.0. It targets long-horizon search, engineering tasks, scientific research, instruction following, and tool calling with 256K native context. The technical report is titled "Scaling the Horizon, Not the Parameters: Reaching Trillion-Parameter Performance with a 35B Agent" (arxiv:2606.30616).

How does Agents-A1 compare to Qwen 3.6 35B A3B?

On InternScience's published table, Agents-A1 beats Qwen3.6-35B-A3B on BrowseComp (75.51 vs 67.93), GAIA (96.04 vs 78.64), Seal-0 (56.36 vs 38.74), IFBench (80.61 vs 64.4), and IFEval (94.82 vs 91.3). SciCode is closer — 44.33 vs 35.8, both well below GPT-5.5's 56.1. For local MoE coding quality vs dense trade-offs, see explainx.ai's Qwen 3.6 27B hands-on guide.

Can I run Agents-A1 locally?

Weights are Hugging Face Transformers safetensors — not GGUF for llama.cpp at launch. Serve with vLLM or SGLang on a GPU with enough VRAM for a 35B MoE at 262K context (InternScience documents single-GPU configs with --max-model-len 262144). Consumer laptops should use API hosting or quant ports when community GGUF drops. See the vLLM/SGLang blocks in this post.

Does Agents-A1 support tool calling?

Yes. The model card documents vLLM and SGLang launches with --enable-auto-tool-choice and --tool-call-parser qwen3_coder (vLLM) or --tool-call-parser qwen3_coder (SGLang). Recommended sampling includes temperature 0.85, top_p 0.95, top_k 20, presence_penalty 1.1.

Is Agents-A1 related to Qwen-AgentWorld?

The Hugging Face model card lists architecture tag qwen3_5_moe, and X commenters note lineage overlap with Qwen's agent stack — InternScience does not foreground AgentWorld in the launch copy. Functionally, Agents-A1 is a policy/agent model trained for multi-step tool use; Qwen-AgentWorld is a language world model that simulates environment observations. They address different sides of the agent loop — see explainx.ai's AgentWorld guide for the distinction.

What benchmarks did Agents-A1 claim SOTA on?

Overall SOTA (per model card, June 30 2026): Seal-0 (56.4), HiPhO (46.4), FrontierScience-Olympiad (79.0), FrontierScience-Research (40.0), IFBench (80.6), IFEval (94.8). Best among ~35B-class models on BrowseComp (75.5), XBench-DS-2510 (86.0), GAIA (96.0), SciCode (44.3), HLE with tools (47.6), and MolBench-bind (56.8). Frontier-scale models still lead some rows — e.g. GPT-5.5 on BrowseComp (84.4), Kimi-K2.6 on τ2-Bench (82.2).

Agents-A1 — 35B MoE Agent Model (June 2026) | explainx.ai Blog

explainx.ainewsletter3.5k

workshops ↗

Agents-A1 — 35B MoE Agent Model (June 2026) | explainx.ai Blog | explainx.ai

June 30, 2026 — 4:19 PM: ModelScope announced Agents-A1 on X — a 35B MoE agentic model from InternScience built for long-horizon search, engineering, scientific research, instruction following, and tool calling. Weights landed on Hugging Face the same day under Apache 2.0, with a technical report claiming trillion-parameter-class agent performance without trillion-parameter weights.

The launch sits in a crowded week: LongCat-2.0 from Meituan, ongoing Qwen 3.6 local-dev hype, and Fable 5 still offline. Agents-A1's pitch is different — not raw coding SWE scores alone, but heterogeneous agent horizons: search loops, science tools, instruction evals, and function-calling at 256K context.

TL;DR — what people asked on X

Question	Answer
What is it?	35.11B MoE agent model, architecture, server context

qwen3_5_moe

Benchmark	Agents-A1	Qwen3.6-35B-A3B	Kimi-K2.6	GPT-5.5 (xhigh)
BrowseComp	🟢 75.51	67.93	83.2	🥇 84.4
XBench-DS-2510	🟢 86.0	71.0	🥇 90.0	84.0
Seal-0	🥇 56.36	38.74	50.45	42.34
GAIA	🟢 96.04	78.64	80.58	87.38

Benchmark	Agents-A1	Qwen3.6-35B-A3B	DeepSeek-V4-pro
FrontierScience-Olympiad	🥇 79.0	60.3	76.0
FrontierScience-Research	🥇 40.0	2.9	13.3
HiPhO	🥇 46.4	37.7	38.7
HLE w/ tools	🟢 47.6	36.2	48.2

Benchmark	Agents-A1	Qwen3.6-35B-A3B	GPT-5.5
IFBench	🥇 80.61	64.4	75.9
IFEval	🥇 94.82	91.3	93.35
LongBench-v2	🟢 60.2	57.7	—

Benchmark	Agents-A1	Qwen3.6-35B-A3B	Kimi-K2.6	GPT-5.5
SciCode	🟢 44.33	35.8	53.5	🥇 56.1
MLE-Lite	🟢 43.94	34.85	62.12	🥇 72.73

Field	Value
Parameters	35.11B (MoE)
Format	Safetensors, BF16
Architecture tag	`qwen3_5_moe`
Context	256K native; servers document 262144 max
Modalities	Text + vision encoder (text-only mode skips vision to free KV cache)
License	Apache 2.0

	Qwen-AgentWorld	Agents-A1
Role	Predict environment observations	Execute agent policy (plan, call tools)
Training focus	World-model / simulation RL	Multi-teacher agent distillation
Open weights	35B-A3B MoE	35B MoE

bash

uv venv --python 3.12 --seed --managed-python
source .venv/bin/activate
uv pip install sglang

SGLANG_USE_MODELSCOPE=true python -m sglang.launch_server \
  --model-path InternScience/Agents-A1 \
  --port 8000 \
  --tp-size 1 \
  --mem-fraction-static 0.8 \
  --context-length 262144 \
  --reasoning-parser qwen3

bash

SGLANG_USE_MODELSCOPE=true python -m sglang.launch_server \
  --model-path InternScience/Agents-A1 \
  --port 8000 \
  --tp-size 1 \
  --mem-fraction-static 0.8 \
  --context-length 262144 \
  --reasoning-parser qwen3 \
  --tool-call-parser qwen3_coder

bash

uv venv --python 3.12 --seed --managed-python
source .venv/bin/activate
uv pip install vllm --torch-backend=auto

VLLM_USE_MODELSCOPE=true vllm serve InternScience/Agents-A1 \
  --port 8000 \
  --tensor-parallel-size 1 \
  --max-model-len 262144 \
  --reasoning-parser qwen3

bash

VLLM_USE_MODELSCOPE=true vllm serve InternScience/Agents-A1 \
  --port 8000 \
  --tensor-parallel-size 1 \
  --max-model-len 262144 \
  --reasoning-parser qwen3 \
  --enable-auto-tool-choice \
  --tool-call-parser qwen3_coder

Parameter	Value
`temperature`	0.85
`top_p`	0.95
`top_k`	20
`min_p`	0.0
`presence_penalty`	1.1
`repetition_penalty`	1.0

text

Coding-first open MoE     →  LongCat-2.0, Kimi K2.7-Code
Local daily driver (dense)  →  Qwen 3.6 27B + llama.cpp
World-model / sim RL        →  Qwen-AgentWorld
Heterogeneous long-horizon  →  Agents-A1  ← this launch
Closed frontier (if allowed)→  GPT-5.5, Fable/Mythos (offline)

Agents-A1: InternScience 35B MoE Agent Model — Long-Horizon Search, GAIA 96, and vLLM Setup

TL;DR — what people asked on X

Related posts

Apodex 1.0-mini: 35B Open Model Tops FutureX — Beats Sonnet 4.6 and GPT-5.5

LM Studio Bionic: Open-Model Agent for Code and Work Projects

Hermes WebUI: The Self-Hosted AI Agent Interface That Remembers Everything (2026 Complete Guide)

What InternScience claims

Benchmark table — where 35B punches up

Long-horizon search

Scientific research

Instruction following

Engineering / coding (the skeptical row)

Architecture and lineage

How to run Agents-A1 (vLLM and SGLang)

SGLang — standard server

SGLang — tool calling

vLLM — standard server

vLLM — tool calling

Text-only (save KV cache)

Recommended sampling (from model card)

Wire to agent harnesses

What X got right (and what to verify)

Where Agents-A1 fits in the 2026 open-agent ladder

Reproduction checklist

TL;DR — what people asked on X

Related posts

Apodex 1.0-mini: 35B Open Model Tops FutureX — Beats Sonnet 4.6 and GPT-5.5

LM Studio Bionic: Open-Model Agent for Code and Work Projects

Hermes WebUI: The Self-Hosted AI Agent Interface That Remembers Everything (2026 Complete Guide)

What InternScience claims

Benchmark table — where 35B punches up

Long-horizon search

Scientific research

Instruction following

Engineering / coding (the skeptical row)

Architecture and lineage

How to run Agents-A1 (vLLM and SGLang)

SGLang — standard server

SGLang — tool calling

vLLM — standard server

vLLM — tool calling

Text-only (save KV cache)

Recommended sampling (from model card)

Wire to agent harnesses

What X got right (and what to verify)

Where Agents-A1 fits in the 2026 open-agent ladder

Reproduction checklist

Related on explainx.ai