What is new in Matt Pocock Skills v1.0?

Version 1.0 introduces progressive disclosure: short skill summaries load into context first, and full guides load only when a skill is invoked. Matt Pocock reports a 63% reduction in token cost. New skills include /codebase-design (deep modules), /domain-modeling (project glossaries), /grilling (tough questioning loops), and /ask-matt (routes users to the right skill).

How do I install Matt Pocock Skills v1.0?

Install via npx into Claude Code, Cursor, or compatible agents following Anthropic's Agent Skills spec: npx skills add mattpocock/skills (or the specific skill path). Skills are portable markdown packages with SKILL.md frontmatter—see our Agent Skills guide for the full ecosystem.

What is progressive disclosure in agent skills?

Instead of injecting full skill documents into every session, the agent sees compact descriptions first. When a skill triggers, it loads detailed instructions on demand—reducing baseline context burn while keeping depth available.

What is Matt Pocock's skill-invocable tier proposal?

Pocock wants three tiers: user-invocable (slash commands), skill-invocable (callable by users or parent skills, never auto-suggested to the model, with no standing description in context), and model-invocable (description always in context). Today, extracting /run-automated-checks from /review forces model-invocable status and wastes tokens, a spec gap he flagged publicly in June 2026.

How does v1.0 relate to Matt Pocock's earlier skills repo?

Our April 2026 deep dive covered 20+ production skills (/tdd, /to-prd, /improve-codebase-architecture). v1.0 is an architectural release: same engineering philosophy, better token economics, and new architecture-focused skills for systematic codebase work.

Matt Pocock (@mattpocockuk) is the creator of Total TypeScript and AI Hero, a former Vercel engineer, and one of the most visible advocates for production-grade AI coding rather than vibe coding. His skills repo has more than 135,000 GitHub stars.

Matt Pocock Skills v1.0: Progressive Disclosure Guide | explainx.ai Blog

Matt Pocock shipped mattpocock/skills v1.0 in June 2026—the first semver major since his repo became the most-starred Agent Skills collection on GitHub (135,000+ stars, 11,700+ forks).

The headline number: 63% lower token costs via progressive disclosure—load short summaries first, pull full skill bodies only when needed.

For teams burning context on bloated SKILL.md files in every Claude Code session, v1.0 is a pattern worth copying even if you never install a single slash command. Full skill inventory and four failure modes: Matt Pocock agent skills deep dive.

TL;DR

Item	Detail
Repo	github.com/mattpocock/skills
Version	1.0.1 (June 17, 2026)
Stars	135,000+
Token savings	~63% (progressive disclosure)
Taxonomy	User-invoked (orchestrate) vs Model-invoked (discipline)
New skills	`/codebase-design`, `/domain-modeling`, `/grilling`, `/ask-matt`
Setup	`/setup-matt-pocock-skills` — tracker, labels, docs
Install	`npx skills@latest add mattpocock/skills`
Open debate	User- vs skill- vs model-invocable tiers

Progressive Disclosure: Why v1.0 Matters

Early agent skills dumped entire workflows into context at session start—planning guides, TDD checklists, architecture essays—whether or not the task needed them.

Pocock's v1.0 pattern:

Session start → compact skill index (names + one-line triggers)
       │
       User or model invokes /tdd (or matched intent)
       │
       ▼
Full TDD skill body loads → agent executes with complete rules

Result: Baseline context stays lean; depth arrives on demand. Pocock reports 63% token reduction across typical sessions—meaning cheaper runs, longer conversations, and less Headroom-style compression pressure.

This mirrors web performance (lazy loading) and Karpathy's LLM Wiki index-first navigation—don't load what you have not asked for yet. The Kaggle Agent Skills whitepaper formalizes the same three-level model: metadata → body → bundled resources.
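
The index-first pattern described above can be sketched in a few lines of TypeScript. This is a minimal illustration, not the skills CLI's or any agent's actual implementation: the `Skill` interface, the `SkillIndex` class, the placeholder skill bodies, and the four-characters-per-token estimate are all assumptions made for the example.

```typescript
// Hypothetical shape: a cheap one-line description plus a lazily loaded body.
interface Skill {
  name: string;
  description: string; // always kept in context
  loadBody: () => string; // full instructions, fetched only on invocation
}

// Crude estimate (about 4 characters per token); an assumption for illustration only.
const estimateTokens = (text: string): number => Math.ceil(text.length / 4);

class SkillIndex {
  private skills: Skill[];
  private loaded = new Map<string, string>();

  constructor(skills: Skill[]) {
    this.skills = skills;
  }

  // What the agent sees at session start: names and one-line triggers only.
  baselineContext(): string {
    return this.skills.map((s) => `/${s.name}: ${s.description}`).join("\n");
  }

  // Load a skill body the first time it is invoked, then reuse the cached copy.
  invoke(name: string): string {
    const cached = this.loaded.get(name);
    if (cached !== undefined) return cached;
    const skill = this.skills.find((s) => s.name === name);
    if (!skill) throw new Error(`Unknown skill: ${name}`);
    const body = skill.loadBody();
    this.loaded.set(name, body);
    return body;
  }

  // Tokens currently occupied: the index plus any bodies loaded so far.
  contextTokens(): number {
    let total = estimateTokens(this.baselineContext());
    this.loaded.forEach((body) => {
      total += estimateTokens(body);
    });
    return total;
  }
}

// Placeholder bodies stand in for real SKILL.md content.
const demoSkills: Skill[] = [
  {
    name: "tdd",
    description: "Red-green-refactor vertical slices",
    loadBody: () => "PLACEHOLDER full TDD instructions, checklists, and examples",
  },
  {
    name: "to-prd",
    description: "Conversation to PRD to tracker",
    loadBody: () => "PLACEHOLDER full PRD workflow instructions",
  },
];

const demoIndex = new SkillIndex(demoSkills);
const tokensAtStart = demoIndex.contextTokens();
demoIndex.invoke("tdd");
const tokensAfterTdd = demoIndex.contextTokens();
console.log({ tokensAtStart, tokensAfterTdd });
```

The point of the sketch is that the session pays for a body only after the skill is invoked; how much that saves in practice depends on how large the real skill bodies are and how many a session actually uses.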

v1.0 Architecture: Shared Design Skills

The 1.0.0 changelog is the real story—not just token savings, but composition:

Change	Impact
`codebase-design` (new)	Deep-module vocabulary (interface, depth, seam, adapter)—generalized from `improve-codebase-architecture/LANGUAGE.md`
`domain-modeling` (new)	Active glossary + `CONTEXT.md` + ADR maintenance
`improve-codebase-architecture`	Now delegates to both shared skills
`tdd`	Removed inline `deep-modules.md`; uses `codebase-design`
`grill-with-docs`	Builds domain model via `domain-modeling`
`grilling` (exposed)	Model-invoked loop shared by `grill-me` and `grill-with-docs`
`ask-matt` (new)	User-invoked router over all user-invoked skills
`writing-great-skills`	Replaces `write-a-skill` + `GLOSSARY.md`
`diagnosing-bugs`	Renamed from `diagnose`
`resolving-merge-conflicts`	New standalone merge/rebase loop
Removed	`caveman` and `zoom-out` dropped; `design-an-interface` folded into `prototype`

Breaking: Install codebase-design and domain-modeling whenever you install architecture or TDD skills—they are dependencies, not optional extras.
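
To make the dependency point concrete, here is a small sketch that expands a requested skill list into everything it needs. The edges are taken from the changelog table above; the `dependsOn` map and `resolveInstallSet` function are hypothetical names for illustration and are not part of the `skills` CLI.

```typescript
// Dependency edges as described in the changelog table; the data shape is an assumption.
const dependsOn: Record<string, string[]> = {
  "improve-codebase-architecture": ["codebase-design", "domain-modeling"],
  tdd: ["codebase-design"],
  "grill-with-docs": ["grilling", "domain-modeling"],
  "grill-me": ["grilling"],
};

// Returns the requested skills plus everything they transitively depend on, sorted by name.
function resolveInstallSet(requested: string[]): string[] {
  const seen = new Set<string>();
  const stack = [...requested];
  while (stack.length > 0) {
    const name = stack.pop() as string;
    if (seen.has(name)) continue;
    seen.add(name);
    for (const dep of dependsOn[name] ?? []) stack.push(dep);
  }
  return Array.from(seen).sort();
}

console.log(resolveInstallSet(["tdd"]));
// ["codebase-design", "tdd"]
```

If you install skills individually rather than the whole repo, a check like this is one way to notice that a shared skill is missing before an orchestrator tries to delegate to it.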

User-Invoked vs Model-Invoked (Official Taxonomy)

v1.0 renamed the old Commands / Skills split to user-invoked / model-invoked, matching the vocabulary of Pocock's spec proposal:

YOU type /grill-me  (user-invoked, orchestrates)
        │
        ├──► grilling  (model-invoked, discipline loop)
        └──► domain-modeling  (model-invoked, if grill-with-docs)

Rules from docs/invocation.md:

User-invoked skills orchestrate; model-invoked skills hold reusable discipline
A user-invoked skill may call model-invoked skills
A user-invoked skill must never call another user-invoked skill
Model-invoked skills can also fire automatically when the task fits
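
The rules above lend themselves to a mechanical check. The sketch below flags any user-invoked skill that calls another user-invoked skill. It assumes a hypothetical `SkillDef` shape with an explicit `calls` list; real skills express delegation in prose inside SKILL.md, so treat this as an illustration of the rule rather than a tool that ships with the repo. The `my-release-flow` entry is invented to show a violation.

```typescript
type Invocation = "user" | "model";

// Hypothetical description of a skill and the skills it delegates to.
interface SkillDef {
  name: string;
  invocation: Invocation;
  calls: string[];
}

// Lists every "caller -> callee" pair where a user-invoked skill calls another user-invoked skill.
function findViolations(defs: SkillDef[]): string[] {
  const kind = new Map<string, Invocation>();
  defs.forEach((d) => kind.set(d.name, d.invocation));

  const violations: string[] = [];
  for (const d of defs) {
    if (d.invocation !== "user") continue;
    for (const callee of d.calls) {
      if (kind.get(callee) === "user") violations.push(`${d.name} -> ${callee}`);
    }
  }
  return violations;
}

const defs: SkillDef[] = [
  { name: "grill-me", invocation: "user", calls: ["grilling"] },
  { name: "grill-with-docs", invocation: "user", calls: ["grilling", "domain-modeling"] },
  { name: "grilling", invocation: "model", calls: [] },
  { name: "domain-modeling", invocation: "model", calls: [] },
  // Invented example of a rule break: an orchestrator calling another orchestrator.
  { name: "my-release-flow", invocation: "user", calls: ["grill-me"] },
];

console.log(findViolations(defs));
// ["my-release-flow -> grill-me"]
```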

This is the shipping, two-tier answer to the three-tier debate covered below: today, "skill-invocable" is approximated by making helper skills model-invoked and accepting the token cost.

Four Failure Modes → Skill Mapping

The README frames the entire library around four problems:

#	Failure	Root cause	Primary skills
1	Agent didn't do what I want	Misalignment	`/grill-me`, `/grill-with-docs`
2	Agent too verbose	No shared language	`/grill-with-docs`, `domain-modeling`, `CONTEXT.md`
3	Code doesn't work	Weak feedback loops	`/tdd`, `diagnosing-bugs`
4	Ball of mud	Design neglected	`/to-prd`, `/improve-codebase-architecture`, `codebase-design`

Pocock positions these against GSD, BMAD, and Spec-Kit—frameworks that own the whole process. His skills stay small and composable so you keep control when the process breaks.

Ubiquitous language example (from README):

Before: "when a lesson inside a section of a course is made 'real'..."
After: "problem with the materialization cascade"

That one term replaces a paragraph in every future session.

New Skills in v1.0

Skill	Purpose
`/codebase-design`	Deep modules, boundaries, dependency direction—architecture you can maintain
`/domain-modeling`	Project glossaries, ubiquitous language, bounded contexts
`/grilling`	Adversarial questioning loops—stress-test plans before code
`/ask-matt`	Meta-router: sends you to the right skill for your question

These extend the production set we covered in Matt Pocock's agent skills for real engineers—/tdd, /to-prd, /improve-codebase-architecture, /git-guardrails-claude-code—with more systematic design tooling.

v1.0.1 patch (June 17)

teach is now reuse-first: lessons build from components in ./assets/ (stylesheets, quiz widgets, simulators). The agent reads ./assets/ before authoring and extracts new reusable pieces instead of inlining one-offs.

Full Skill Inventory (June 2026)

Engineering (user-invoked)

Skill	Purpose
ask-matt	Router — which skill or flow fits?
grill-with-docs	Grilling + domain model + ADRs
triage	Issue state machine (GitHub, Linear, local, PRs)
improve-codebase-architecture	HTML deepening report → grill
setup-matt-pocock-skills	One-time repo config
to-issues	Vertical-slice issue breakdown
to-prd	Conversation → PRD → tracker
prototype	Throwaway terminal or multi-UI prototypes

Engineering (model-invoked)

Skill	Purpose
diagnosing-bugs	reproduce → minimise → hypothesise → instrument → fix
tdd	Red-green-refactor vertical slices
domain-modeling	Glossary, `CONTEXT.md`, ADRs
codebase-design	Deep modules, seams, adapters

Productivity

Skill	Invocation	Purpose
grill-me	User	Plan/design interview
handoff	User	Conversation → handoff doc
teach	User	Multi-session teaching workspace
writing-great-skills	User	Skill authoring reference
grilling	Model	Shared interview loop

Community reactions:

@geren8te asked how "writing great skills" was authored—preferences vs research-backed
@johnlindquist noted /review behaves like a script or subagent, not a lightweight skill
@iocapon praised /ask-matt as a neat routing pattern
@ShivamS1123 and educators use /teach for instruction workflows

Video walkthrough: Pocock pointed readers to aihero.dev first for a walkthrough of the v1.0 changes.

The Spec Gap: Three Tiers of Invocable

Pocock's most technical v1.0 thread is not a skill—it is a spec proposal for Anthropic's Agent Skills format.

Today (simplified):

Tier	Behavior
User-invocable	Slash command; description hidden from model (saves tokens)
Model-invocable	Description always in context; model can auto-trigger

Problem: Composite workflows. Example /review:

1. Run automated checks
2. Check against spec
3. Check coding standards

Step 1 wants to be /run-automated-checks—reused by /review and callable alone. But extracting it forces model-invocable status → its description burns tokens forever, even when you only want it inside /review.

Pocock's ask: add a skill-invocable tier that is invocable by users or parent skills only, never auto-suggested to the model, and carries no standing description in context.

Workaround today: write descriptions like "never invoke except from other skills"—spec-legal, wasteful, honest about the hack.
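
The cost difference between the workaround and the proposal can be shown with a small model. This is a sketch under stated assumptions: the `Tier` type, `TieredSkill` shape, and `standingDescriptions` function are hypothetical, the `"skill"` tier does not exist in the spec today, and the description strings are invented for the example.

```typescript
// "skill" is the proposed tier; only "user" and "model" exist today.
type Tier = "user" | "skill" | "model";

interface TieredSkill {
  name: string;
  tier: Tier;
  description: string;
}

// Only model-invocable skills keep their description in context for the whole session.
function standingDescriptions(skills: TieredSkill[]): string[] {
  return skills.filter((s) => s.tier === "model").map((s) => s.name);
}

// Today: the extracted helper has to be model-invocable to be callable from /review.
const today: TieredSkill[] = [
  { name: "review", tier: "user", description: "Review a change end to end" },
  {
    name: "run-automated-checks",
    tier: "model",
    description: "Never invoke except from other skills",
  },
];

// Proposed: the same helper moves to the skill-invocable tier.
const proposed: TieredSkill[] = today.map((s) =>
  s.name === "run-automated-checks" ? { ...s, tier: "skill" as Tier } : s
);

console.log(standingDescriptions(today)); // ["run-automated-checks"]
console.log(standingDescriptions(proposed)); // []
```

With one helper the saving is small; the argument is that it compounds as more orchestrators are split into reusable steps.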

@dexhorthy (HumanLayer) connected this to post-IDE tooling—skills architecture is load-bearing for the next generation of dev environments.

Install and Use

npx skills@latest add mattpocock/skills

Works across agents implementing the Agent Skills spec—Claude Code, Cursor with skills support, Codex, and compatible harnesses via skills.sh.

Required first run:

/setup-matt-pocock-skills

Configures issue tracker (GitHub / Linear / local), triage labels, and docs directory.

Suggested starter path:

/setup-matt-pocock-skills — once per repo
/tdd + codebase-design — red-green-refactor with interface discipline
/grill-me or /grill-with-docs — alignment before code
/git-guardrails-claude-code — safety rails

v1.0 additions for larger codebases:

domain-modeling — shared vocabulary in CONTEXT.md
/improve-codebase-architecture — periodic deepening scans
/ask-matt — when you do not know which skill fits

v1.0 vs Vibe Coding

Pocock's positioning remains consistent with his AI Coding for Real Engineers workshop:

Skills encode constraints (TDD, planning, architecture)
Agents get procedures, not vibes
Token efficiency (63%) makes discipline affordable at scale

Progressive disclosure is the missing economic layer—without it, rigorous skills price themselves out of long sessions.

Summary

Matt Pocock Skills v1.0.1 is three releases in one:

Shared design skills — codebase-design + domain-modeling as composable primitives
Progressive disclosure — ~63% token reduction
Formal taxonomy — user-invoked orchestrators vs model-invoked discipline

The skill-invocable tier debate matters for composite skills—/review calling /run-automated-checks should not tax every session. If Anthropic adopts Pocock's three-tier model, the whole ecosystem gets cheaper overnight.

Repository: github.com/mattpocock/skills · 135K+ stars · MIT license

v1.0.0 / v1.0.1 from CHANGELOG.md. Token claims from @mattpocockuk (June 2026).

Related posts

Matt Pocock's agent skills for real engineers: TDD, planning, and production-grade workflows

Claude 5 Context Engineering: Stop Over-Constraining the Model

Hallmark by Nutlope: Anti-AI-Slop Design Skill for Claude Code, Cursor, and Codex
