Featured image of post The Claude Code Skills Explosion — What Five Repos in One Day Are Telling Us

The Claude Code Skills Explosion — What Five Repos in One Day Are Telling Us

A deep dive into five Claude Code skill and agent collection repos that surfaced together on 2026-05-10 and what it says about skills becoming the primary primitive for agent engineering

Overview

Five Claude Code skill and agent collection repos surfaced around the same time on 2026-05-10. One is Andrej Karpathy’s own autonomous research agent. One is Matt Pocock’s engineering-grade skill set. One is a full meta-framework called SuperClaude. This is not a coincidence — it is a sign that “skill” has crystallized into the primary primitive for agent engineering.

Why Skills Are Crystallizing

A skill is the pattern Anthropic formalized in fall 2025. The format is dead simple — a folder, a SKILL.md, optional helper scripts. Claude Code looks at the user’s task context and decides which skill to invoke itself.

That simplicity is the reason for the explosion.

  • Version-controllable — it’s just text. Review with git diff, accept PRs against it.
  • Composable — one skill can call another. /grill-me/to-prd/to-issues/tdd becomes a natural pipeline.
  • Model-agnostic in spirit — Claude Code is the first mover, but the format is markdown, so it ports trivially. SuperGemini and SuperQwen forks already exist.
  • Shareable — pull an entire repo into your agent with /plugin marketplace add.

These five repos are five facets of that pattern crystallizing.

1. karpathy/autoresearch — Skill as a Research Agent’s program.md

karpathy/autoresearch sits at 80,223 stars. Created 2026-03-06, “AI agents running research on single-GPU nanochat training automatically.”

The idea is simple. Hand an AI agent a small but real LLM training setup and let it experiment overnight. Modify code → train 5 min → compare → keep or discard → repeat. You wake up to a log of experiments and (hopefully) a better model.

The structure is what matters.

prepare.py    constants, data prep (do not modify)
train.py      model/optimizer/training loop (agent edits this)
program.md    agent instructions (human edits this)

Karpathy himself states it in the README:

The program.md file is essentially a super lightweight “skill”.

That’s the line. Karpathy chose the word “skill.” Not a 10,000-line framework wrapping autonomous research orchestration on top of nanochat training code — one markdown file. The human evolves program.md. The agent evolves train.py. Two meta-evolution loops, cleanly separated.

Why this matters — Karpathy is the last person you’d expect to outsource a training setup. If he ends at one markdown file, everyone else has license to simplify harder.

2. forrestchang/andrej-karpathy-skills — Skills as Behavioral Correction

forrestchang/andrej-karpathy-skills has 123,691 stars. “A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy’s observations on LLM coding pitfalls.”

It distills four principles from Karpathy’s X post on LLM coding pitfalls.

PrincipleAddresses
Think Before CodingWrong assumptions, hidden confusion, missing tradeoffs
Simplicity FirstOvercomplication, bloated abstractions
Surgical ChangesTouching unrelated code, “improving” things you shouldn’t
Goal-Driven ExecutionLoop until verifiable success criteria

Installation is two paths — /plugin marketplace add forrestchang/andrej-karpathy-skills for Claude Code, or curl the CLAUDE.md into your project. The same ruleset is committed as .cursor/rules/karpathy-guidelines.mdc for Cursor.

The thesis quote:

“LLMs are exceptionally good at looping until they meet specific goals… Don’t tell it what to do, give it success criteria and watch it go.” — Karpathy

This is skills used as a ruleset that corrects model behavior. Not adding capabilities — subtracting failure modes.

3. mattpocock/skills — Skills For Real Engineers

mattpocock/skills sits at 69,128 stars, MIT, last pushed 2026-05-10. “Skills for Real Engineers. Straight from my .claude directory.”

This repo stakes out an explicit position against full-process frameworks like GSD, BMAD, and Spec-Kit. The README is blunt:

Approaches like GSD, BMAD, and Spec-Kit try to help by owning the process. But while doing so, they take away your control and make bugs in the process hard to resolve.

These skills are designed to be small, easy to adapt, and composable. They work with any model.

Matt’s four failure modes and their skills:

Failure modeSkill
#1 The Agent Didn’t Do What I Want/grill-me, /grill-with-docs
#2 The Agent Is Way Too VerboseCONTEXT.md shared language (built into grill-with-docs)
#3 The Code Doesn’t Work/tdd, /diagnose
#4 We Built A Ball Of Mud/to-prd, /zoom-out, /improve-codebase-architecture

Installation goes through the skills.sh installer:

npx skills@latest add mattpocock/skills

After install, /setup-matt-pocock-skills configures your issue tracker (GitHub / Linear / local files), your triage label vocabulary, and your doc storage path. From there, to-issues, to-prd, triage, diagnose, tdd, improve-codebase-architecture, and zoom-out all wire together against the same convention.

Pocock’s reading list is itself a signal — Pragmatic Programmer, Domain-Driven Design, Extreme Programming Explained, A Philosophy of Software Design. The argument: skills are not a new paradigm, they are an LLM-shaped interface to 30 years of software engineering practice.

4. SuperClaude_Framework — A Meta-Programming Layer On Top of Skills

SuperClaude-Org/SuperClaude_Framework has 22,726 stars, MIT, homepage superclaude.netlify.app. Created 2025-06-22.

Opposite pole from skill minimalism.

MetricCount
Slash Commands30
Specialized AI Agents20
Behavioral Modes7
MCP Servers8

Self-described as “a meta-programming configuration framework that transforms Claude Code into a structured development platform through behavioral instruction injection and component orchestration.”

Install via PyPI:

pipx install superclaude
superclaude install

Headline commands — /sc:research (deep research, Tavily MCP), /sc:brainstorm, /sc:implement, /sc:test, /sc:pm. Optional MCP servers — Serena (2-3x faster code understanding), Sequential (30-50% fewer tokens), Tavily, Context7 — all routed through airis-mcp-gateway.

v5.0 is in development, with a TypeScript plugin system tracked in issue #419. Once shipped, install drops to /plugin marketplace add SuperClaude-Org/superclaude-plugin-marketplace.

What SuperClaude proves — skills are stable enough that a meta-framework can rest on top of them without collapsing. And the fact that the same format ports to Gemini and Qwen is empirical evidence of model-neutrality.

5. hesreallyhim/awesome-claude-code — The Curation Layer

hesreallyhim/awesome-claude-code has 43,273 stars, created 2025-04-19 — the oldest of this set. “A curated list of awesome skills, hooks, slash-commands, agent orchestrators, applications, and plugins for Claude Code by Anthropic.”

It follows the awesome-list convention. The repo’s topic tags are revealing — agentic-coding, agent-skills, ai-workflow-optimization, coding-agents. The README currently notes “the previous Table of Contents was no longer fit for purpose” and is mid-reorganization — which is itself the message. The Claude Code ecosystem has outgrown what one awesome-list can hold.

Why this repo belongs in the set: the other four provide new skills. This repo solves where to find them. Curation is itself a meta-skill.

Insights

1. Skill is now the consensus primitive. Five different people, five different angles, all settling on the same word. Karpathy’s program.md, Matt Pocock’s SKILL.md, SuperClaude’s slash commands — all framed as “skills.” The prior generation of terms (“prompt template”, “agent rules”, “system message”) has collapsed into a single noun.

2. Full-process frameworks vs. micro-skills is the live fault line. SuperClaude (30 commands) and Matt Pocock (small, composable) surfacing the same day is coincidence, but the split is real. Both survive. The interesting move is Pocock explicitly naming GSD/BMAD/Spec-Kit as the opposition.

3. Skills are used to subtract failure modes, not just add capabilities. Forrest Chang’s Karpathy guidelines give the model no new abilities. They prevent behaviors. What Anthropic does at the model level with Constitutional AI, users now do at the workflow level with skills.

4. Skills are the substrate of model neutrality — Claude Code is just the first surface. SuperClaude maintains SuperGemini and SuperQwen forks. Forrest Chang ships a Cursor .mdc in the same repo. Matt Pocock writes “They work with any model” as a top-line selling point. As the format standardizes, IDE/model lock-in weakens.

5. The program.md pattern has reached training code. In autoresearch, the human-edited file and the agent-edited file are physically separated. If that generalizes, every automated codebase trends toward a human.md + agent-modifiable/ shape.

6. What comes next — skill marketplaces, skill SDKs, skill evals. /plugin marketplace exists. SuperClaude is listed on Smithery. skills.sh emerged as a separate installer. The missing pieces are quality evaluation (which skills actually improve model output) and a skill SDK (build/test skills as if they were code).

7. Curation itself becomes a skill. awesome-claude-code earning 43K stars is the symptom of “there are too many skills to triage manually.” That’s the cue for a meta layer.

References

Source repos

Background

Related

Built with Hugo
Theme Stack designed by Jimmy