PodcastsHear the voice. See the shape of the thought.

ClaudeClaude Code subagents2 months ago

Designing effective subagents

This tutorial from Anthropic's Claude Code series covers four concrete patterns that separate reliable subagents from ones that drift, stall, or stomp on files they shouldn't touch. The narrator walks through each pattern with code-reviewer and web-search subagents as running examples, showing exactly which config knobs to turn and why. ## [00:03] Controlling sub-agent behavior with name and description Every message sent to the main context-window agent includes the name and description of each registered subagent in the system prompt. That means the description does double duty: it tells the orchestrator *when* to launch the subagent and provides the template it uses when writing the input prompt. The tutorial demonstrates with a code-reviewer subagent. In the original config, the orchestrator writes a generic prompt telling the subagent to call `git diff` itself. Updating the description to say "you must tell the agent precisely which files you want it to review" shifts ownership of file selection to the orchestrator — the next run produces a noticeably more specific input prompt. The same lever works for web-search subagents: adding "return sources that can be cited" to the description causes the main thread to include that instruction automatically when it delegates. > *"If you want to better control when the main agent launches a sub agent automatically, you should modify the name and description."* ## [01:41] Defining output formats The narrator identifies output format as the single most impactful improvement available. Without one, a subagent has no clear signal for when it has done enough — it keeps running, accumulating context, and burning tokens. A structured output format creates a natural stopping point. The subagent knows it is finished when the required fields are filled. Practically, this means adding an explicit schema — a summary block, a findings list, a status field — directly to the subagent's system prompt. > *"Without a defined output format, sub agents struggle to decide when enough research has been done and they tend to run much much longer than sub agents that are given an output format."* ## [02:04] Reporting obstacles in the summary When a subagent solves a problem — a dependency conflict, a command that needs unexpected flags, an environment quirk — the main thread needs that information or it will hit the same wall on the next step. The solution is to require obstacle reporting in the output format itself. The narrator lists the categories that should always surface: obstacles encountered, setup issues, workarounds discovered, commands that needed special flags or configuration, and imports or dependencies that caused problems. Baking these into the required output schema ensures the main thread inherits the subagent's hard-won discoveries rather than rediscovering them from scratch. > *"Otherwise, the main thread has to rediscover the same solutions, obstacles encountered, any setup issues, workarounds discovered or environment quirks, commands that needed special flags or configuration, dependencies or imports that cause problems."* ## [02:42] Limiting tool access by role Tool access is not just a safety control — it is a clarity tool. A readonly subagent with only `glob`, `grep`, and `read` cannot accidentally modify files, which makes its role unambiguous to anyone reading the config. The narrator maps three access tiers to three subagent roles: a research subagent gets read-only access because exploring the codebase never requires writes; a reviewer gets `bash` for `git diff` but still no file-editing tools; only subagents explicitly tasked with changing code — like one applying CSS updates — get `edit` and `write`. With several subagents in play, the tool list becomes a machine-readable summary of what each one is supposed to do. > *"Only give edit and write to sub agents that should actually change your code, like a styling agent applying CSS updates."* ## [03:27] Four patterns for effective sub-agents The tutorial closes with a one-sentence recap of all four patterns: structured output, obstacle reporting, specific descriptions, and restricted tool access. Each pattern reinforces the others — precise descriptions reduce ambiguity in input prompts, output formats create stopping points, obstacle reporting preserves context across agent boundaries, and minimal tool access prevents side effects that would compound any remaining ambiguity. > *"So effective sub agents use structured output report obstacles have specific descriptions and limit tool access."* ## Entities - **Anthropic Tutorial Narrator** (Person): host of the Claude Code subagents tutorial series, presenting on behalf of Anthropic - **Claude Code** (Software): Anthropic's agentic coding tool that orchestrates subagents to complete multi-step engineering tasks - **Subagent** (Concept): a specialized Claude instance launched by an orchestrator agent, given its own system prompt, tool access, and input prompt - **Output format** (Concept): a required schema defined in a subagent's system prompt that creates a stopping condition and structures information returned to the main thread - **Obstacle reporting** (Concept): a pattern of requiring subagents to surface workarounds, dependency issues, and environment quirks in their output so the orchestrator does not need to rediscover them - **Tool access scoping** (Concept): restricting each subagent to only the tools its role requires — read-only for research, bash for review, edit/write only for agents that must change files - **Anthropic** (Organization): creator of Claude and the Claude Code agentic coding platform

#claude-code#subagents#ai-agents

2:48

ClaudeClaude Code subagents2 months ago

What are subagents?

Sub-agents are specialized assistants that Claude Code can delegate tasks to — each one runs in its own isolated context window, does its work autonomously, then hands back a focused summary while the full intermediate trace gets discarded. This two-minute tutorial from Anthropic walks through why that isolation matters for keeping the main context window usable, shows a concrete code-exploration scenario that illustrates the tradeoff, and lists the built-in sub-agents that Claude Code ships with today. ## [00:03] What sub-agents are A sub-agent runs in its own separate conversation context window, initialized with a custom system prompt you define. The parent agent (Claude Code in the main thread) hands the sub-agent a task description based on what you asked for. The sub-agent works through it autonomously, then returns a summary to the main thread — while all the intermediate work stays isolated. > *"Sub-agents are specialized assistants that Claude can delegate tasks to."* The critical design point: once the sub-agent finishes, its entire conversation thread gets completely discarded. Only the returned summary survives back into the main conversation. ## [00:24] Managing the context window Every tool call Claude makes in the main thread — file reads, searches, function traces — accumulates in the main context window. Over a long session, that fills up fast. Sub-agents exist specifically to offload discrete research or action tasks so the cost doesn't land in the main window. > *"Each sub-agent runs in its own conversation contacts window with a custom system prompt that you define."* The tradeoff is explicit: the main window gains a clean context, but it loses visibility into how the sub-agent reached its conclusions and what it discovered along the way. You get the answer, not the reasoning trace. ## [01:13] A concrete example: the payment system Say you're using Claude Code to figure out which service handles refunds in an unfamiliar codebase. Without a sub-agent, Claude might read 15 files, run several searches, and trace through multiple function calls — and all of that fills the main context window even though you only needed one fact. > *"With a sub-agent, you get the answer without the journey."* The sub-agent explores the codebase, discovers the answer, and returns a focused summary — keeping your main context clean. The lost visibility is the cost: you won't see which files it read or which traces it followed to get there. ## [02:00] Claude Code's built-in sub-agents Claude Code ships with three built-in sub-agents ready to use immediately: - **General-purpose sub-agent** — for multi-step tasks that require both exploration and action. - **Explore sub-agent** — fast searching of codebases without the overhead of a full task loop. - **Plan sub-agent** — runs during plan mode to research and analyze the codebase before presenting a plan to you. > *"And you can also create your own sub-agents with custom system prompts and tool access."* Beyond these three, you can define custom sub-agents with their own system prompts and tool access lists, tailored to specific workflows. ## [02:30] When to use sub-agents Sub-agents pay off when you have a discrete, self-contained question or task that would otherwise dump a lot of intermediate context into your main window. > *"Sub-agents like Claude Code break work into focused pieces, keep your main context window clean, and bring back just what you need, whether you're using the built-in ones or creating your own."* They're most valuable in longer Claude Code sessions where context window pressure accumulates — offloading a sub-task to a sub-agent rather than letting it sprawl through the main thread directly extends how long a session stays effective. ## Entities - **Anthropic Tutorial Narrator** (Person): narrator of the "Claude Code subagents" tutorial series produced by Anthropic - **Claude Code** (Software): Anthropic's agentic coding assistant; the host environment in which sub-agents operate - **Claude** (Software): the underlying AI model powering Claude Code and its sub-agents - **Sub-agent** (Concept): a specialized assistant Claude Code delegates tasks to, running in an isolated context window with its own system prompt - **Context window** (Concept): the finite token buffer holding all conversation history, tool calls, and results; sub-agents prevent it from filling with intermediate work - **General-purpose sub-agent** (Software): built-in Claude Code sub-agent for multi-step exploration-and-action tasks - **Explore sub-agent** (Software): built-in Claude Code sub-agent optimized for fast codebase searching - **Plan sub-agent** (Software): built-in Claude Code sub-agent that researches the codebase during plan mode before presenting a plan - **Anthropic** (Organization): creator of Claude and Claude Code; producer of this tutorial series

#claude-code#subagents#context-window

Terence Tao – How the world's top mathematician uses AI

1:23:44

Dwarkesh Patel2 months ago

Terence Tao – How the world's top mathematician uses AI

Tao and Dwarkesh use Kepler's discovery of planetary motion as a lens for what AI is actually changing in science. Tao argues hypothesis generation is now nearly free, so the bottleneck moves to evaluation, peer review, and the test of time. Today's AIs win on breadth (try every standard technique on every problem) while humans win on depth (build cumulatively on partial progress) — so hybrid configurations will dominate mathematics for at least another decade. ## [00:00] Kepler was a high temperature LLM Tao retells how Kepler got to the three laws of planetary motion. Kepler started from a wrong-but-beautiful theory — Platonic solids inscribed between the planets' orbits — and only abandoned it after grinding Tycho Brahe's stolen naked-eye observations for years. The ellipses, equal-areas, and cube-square law came out of decade-long data analysis, with Newton's explanation a century later. Dwarkesh's framing: Kepler resembles a high-temperature LLM cycling through random relationships against a verifiable dataset. Tao agrees on the mechanics but pushes back on the bottleneck. Idea generation was already cheap — Kepler had no shortage of theories. What he needed was Brahe's order-of-magnitude better data and the patience to discard ideas the data killed. > *But as you say, it has to be matched by an equal amount of verification, otherwise it's slop.* ## [11:44] How would we know if there's a new unifying concept within heaps of AI slop? Tao: if AI has driven idea generation to near-zero cost, peer review and the test of time become the new constraint. Journals are already drowning in AI-generated submissions. The standing of any idea depends on what later science does with it — Copernicus was less accurate than Ptolemy until Kepler completed the picture — so the assessment is hard to automate from inside the moment. Dwarkesh asks how science would identify a Bell-Labs-style unifying concept (Shannon's bit, the transformer) buried in millions of mediocre papers. Tao's answer points at the part that may stay human: scientists don't just produce theories, they tell stories that convince other scientists to invest years in following up. Darwin's prose did the work that Newton's Latin equations didn't. > *AI has driven the cost of idea generation down to almost zero, in a very similar way to how the internet drove the cost of communication down to almost zero.* ## [26:10] The deductive overhang Tao on the under-explored signal in existing data. Astronomy has been the discipline that extracts maximum information from minimal data for centuries — which is also why quant hedge funds preferentially hire astronomy PhDs. He gives one favorite example: researchers measured how often scientists actually read the papers they cite by tracking which typos propagated through citation chains. He suggests the same sociology-of-science treatment for AI progress itself — mining citation patterns, conference mentions, and other footprints to detect whether a result actually constituted progress, rather than waiting for the test of time to do it slowly. > *One takeaway was that the deductive overhang in many fields could be so much bigger than people realize.* ## [30:31] Selection bias in reported AI discoveries AI has solved roughly 50 of ~1,100 Erdős problems, then plateaued. Tao explains the selection effect: those 50 had near-zero literature — one obscure technique plus one known result was enough, and AI tools are excellent at "try every standard combination." When the problem has 80% of the work done by existing methods, AI clears it. When it needs a genuinely new technique, the tools stall, and the per-problem success rate from systematic sweeps is 1-2%. Tao's metaphor: AI tools are jumping robots loose in a mountain range, in the dark. They can clear short walls humans can't reach, but they can't grab a handhold, stay there, and pull up from partial progress. The bullish reading — once AIs reach a given level, you can run a million parallel copies on a million problems, which no human community can do — is also the structural reason science needs new paradigms that actually exploit breadth. > *They excel at breadth, and humans excel at depth, human experts at least.* ## [46:43] AI makes papers richer and broader, but not deeper Tao on his own working pattern: papers now carry more code, more figures, deeper literature surveys, because the auxiliary tasks got roughly 5x cheaper. The actual core — solving the hardest part of a problem — still happens on pen and paper. He'd be reluctant to call himself "2x more productive" because the metric isn't one-dimensional; what changed is the type of paper he writes, not the rate at which he answers the question he started with. The cleverness-vs-intelligence distinction lands in the same place. When two humans collaborate on a math problem, each failed prototype becomes a foothold for the next. With current AIs, a new session forgets what the last one figured out. The cumulative pull-up step is missing — only brute trial-and-error and (eventually) absorption into the next training run. > *It's made the papers richer and broader, but not necessarily deeper.* ## [53:00] If AI solves a problem, can humans get understanding out of it? Could an AI prove the Riemann hypothesis in Lean and leave us none the wiser? Tao isn't worried. Lean has the property that any proof can be decomposed atomically — each lemma can be inspected, ablated, and tested in isolation. So even a 3,000-line generated proof becomes raw material: other AIs can refactor for elegance, other humans can extract the conceptual content, and the artifact is still useful even if the original derivation was opaque. He predicts an entire profession of mathematicians whose job is to take giant Lean-generated proofs apart and find the ideas inside them — a kind of proof archaeology, with both human judgment and AI ablation tools. > *You'll get a lot more mileage out of the interplay of humans collaborating with these tools.* ## [59:20] We need a semi-formal language for the way that scientists actually talk to each other Dwarkesh asks what a semi-formal language for mathematical strategies (as opposed to mathematical proofs) would look like. Tao traces the question through Gauss's prime number theorem — the first major statistical conjecture in math, derived from raw data before any proof existed — and through the twin prime conjecture, which mathematicians believe because the random model of the primes predicts it. Math has both rigorous proofs and rigorous heuristics; only the proof side has been formalized into something Lean can check. The reason the heuristic side hasn't been formalized: any RL-checkable grader becomes a target for exploitation, and the subjective part of "this argument is convincing" doesn't admit a hackable framework yet. Tao would love a way to benchmark conjecture-generation and strategy-selection at scale, possibly by running small AIs in toy mathematical universes and watching what strategies emerge. > *There's some subjective aspect of science that we don't know how to capture in a way that we can insert AI into it in any useful way.* ## [69:48] How Terry uses his time Tao on how he absorbs new subfields. He places himself as a fox in Berlin's sense — a little about everything, occasionally a hedgehog when needed. The driver is a completionist obsession: if another mathematician can prove a result with a technique he doesn't know, he has to chase down what their trick was. (He had to quit video games for the same reason.) Collaboration with other mathematicians is the main vehicle, and writing things down on his blog is the memory aid he developed after repeatedly losing arguments six months after deriving them. On his calendar, Tao deliberately leaves serendipity room. He'd hate to optimize his time so tightly that he never sits in a meeting outside his comfort zone. The year he spent at the Institute for Advanced Study confirmed the trap — two weeks of pure research were great, then he ran out of inspiration. The accidental discovery on the next library shelf, the casual hallway chat, and the meeting he reluctantly attended were doing more work than they looked. > *Those serendipitous interactions may not seem optimal, but they are actually really important.* ## [77:05] Human-AI hybrids will dominate math for a lot longer When will AI just do mathematics? Tao reframes — AI already does math humans can't, since calculators, just on a different frontier. Within roughly a decade he expects much of what graduate students currently do — applying standard techniques, grinding literature — will move to AI, but the field will move up a level the way it did when computer algebra systems absorbed symbolic integration. Genetics didn't end when sequencing got cheap; it scaled up to ecosystems. Math will do the same. His advice to students entering math now: assume change, but get your credentials the old-fashioned way — for now there's still no substitute for working through math the traditional path. At the same time, stay adaptable enough that you can use entirely new modes of research as they appear, including ones that don't exist yet. The unusual fact is that with AI tools and Lean, a high schooler can contribute to real math research today, which wasn't true five years ago. > *I guess I do believe that hybrid human plus AIs will dominate mathematics for a lot longer.* ## Entities - **Terence Tao** (Person): Fields medalist (2006), UCLA mathematician, writes regularly on AI's role in mathematical research. - **Dwarkesh Patel** (Person): Host of the Dwarkesh Podcast; long-form interviews on AI, science, and technology. - **Johannes Kepler** (Person): Astronomer (1571-1630) who derived the three laws of planetary motion from Tycho Brahe's observations. - **Tycho Brahe** (Person): Danish naked-eye astronomer whose decades of planetary observations were the dataset Kepler needed. - **Lean** (Software): Proof assistant in which mathematical proofs are formalized and can be checked, decomposed, and ablated atomically. - **Erdős problems** (Concept): The roughly 1,100 open problems posed by Paul Erdős; AI has solved ~50, almost all with near-zero prior literature. - **The deductive overhang** (Concept): The idea that existing data already encodes far more derivable knowledge than has been extracted, with astronomy as the model. - **Riemann hypothesis** (Concept): Unsolved conjecture on prime distribution; the test case for whether an AI proof would advance human mathematical understanding.

#ai-for-math#terence-tao#kepler

2:54

#claude-code#ai-tools#developer-productivity

What are skills?

Claude Code skills are reusable markdown files that encode specialized knowledge once — Claude then activates them automatically whenever a request matches, without the user needing to repeat instructions or type a slash command. This three-minute tutorial covers what skills are, where they live, how they differ from CLAUDE.md files, and the signal that tells you it is time to write one. ## [00:03] The repetition problem skills solve Every time you explain your team's coding standards, re-describe how you want PR feedback structured, or remind Claude of your preferred commit message format, you are repeating yourself. The narrator opens with three back-to-back examples to name the exact friction point skills address. > *"Every time you explain your team's coding standards to Claude, you're repeating yourself."* ## [00:20] What a skill is and how Claude picks one A skill is a markdown file that teaches Claude how to do something once. Claude stores the instruction, then applies it automatically whenever the situation calls for it. In Claude Code, that file is SKILL.md. The description field inside that file is the key mechanism: when you ask Claude to review this PR, it compares your request against every available skill description and activates the matching one. > *"Claude reads your request, compares it to all available skill descriptions, and activates the ones that match."* ## [01:05] Where to store skills: personal vs project Skills live in two places depending on who needs them. Personal skills go in ~/.claude/skills and follow you across every project: commit message style, documentation format, how you like code explained. Project skills go in .claude/skills inside the repository root; anyone who clones the repo gets them automatically. That second location is where team standards live: brand guidelines, preferred fonts and colors for web design. > *"Anyone who clones the repository gets these skills automatically."* ## [01:42] Skills vs CLAUDE.md: automatic and context-efficient Claude Code offers several customization layers, and skills occupy a distinct niche. CLAUDE.md files load into every conversation unconditionally, right for rules like always use TypeScript strict mode. Skills load on demand, only when they match the current request, and only the name and description enter context at that point. The full skill body loads only when triggered. That keeps the PR review checklist out of the context window while you are debugging, and pulls it in only when you actually ask for a review. Slash commands require you to type them; skills do not. > *"Skills are unique because they're automatic and task-specific."* ## [02:27] When to write a skill Skills work best for specialized knowledge tied to specific tasks: code review standards your team follows, commit message formats, brand guidelines. The closing rule is blunt and practical: if you find yourself explaining the same thing to Claude repeatedly, that is a skill waiting to be written. > *"If you find yourself explaining the same thing to Claude repeatedly, well, that's a skill waiting to be written."* ## Entities - **Anthropic Tutorial Narrator** (Person): narrator and host of the Claude Code skills tutorial series - **Claude Code** (Software): Anthropic's AI coding assistant; the runtime where skills are discovered and applied - **SKILL.md** (Concept): the markdown file that defines a skill — contains a name, description, and instructions for Claude - **CLAUDE.md** (Concept): project-level or global instruction file that loads into every Claude Code conversation unconditionally, contrasted with skills - **Anthropic** (Organization): creator of Claude and Claude Code

3:53

#claude-code#skills#developer-tools

Sharing skills

A PR review skill used by one engineer is useful; the same skill deployed across a team standardizes code review and creates a consistent experience throughout the organization. This tutorial walks through four concrete distribution methods — repository commits, plugins, enterprise managed settings, and custom sub-agents — and explains exactly when each one applies. The sub-agent section carries a non-obvious caveat: sub-agents don't inherit skills automatically, and built-in agents can't access them at all. ## [00:01] Why sharing multiplies skill value A skill kept to one developer does its job. The same skill pushed to a team locks in standards, eliminates per-person drift, and means every review looks and feels the same. The narrator opens with a direct contrast between individual and team-scale use before listing the four sharing mechanisms. > *"A PR review skill that only you use is helpful. The same skill shared across your team standardizes code review and provides a consistent experience amongst your organization which is much better."* ## [00:18] Committing skills to your repository The lowest-friction method: place skills in `.claude/skills` inside the project repo. Anyone who clones the repository gets those skills immediately — no install step, no extra tooling. Updates push through the normal `git pull` cycle. This path fits team coding standards, project-specific workflows, and skills that reference the codebase's own structure. > *"Anyone who clones the repository gets these skills automatically. No extra installation, it's just what you're doing already."* ## [00:45] Distributing skills through plugins Plugins extend Claude Code with custom functionality designed to travel beyond a single project. Inside the plugin project, a `skills/` directory mirrors the structure of `.claude/` — skill name, `SKILL.md`. Once published to a marketplace, any Claude Code user can download and activate the plugin. This channel is best for skills general enough to serve the broader community rather than one team's conventions. > *"Think of plugins as ways to extend Claude Code with custom functionality, but designed to be shared across teams and projects."* ## [01:26] Enterprise-wide deployment via managed settings Administrators can push skills to every developer in an organization through managed settings. Enterprise skills take the highest priority: they override personal, project, and plugin skills that share the same name. The intended use is mandatory standards — security requirements, compliance workflows, coding practices that must be uniform. The narrator stresses the word "must" explicitly: these aren't suggestions. > *"This is for mandatory standards, security requirements, compliance workflows, or coding practices that must be consistent across the organization."* ## [01:52] Custom sub-agents and explicit skill loading Sub-agents don't inherit the main conversation's skills. Built-in agents (explorer, planner, verify) can't access skills at all. Only custom sub-agents — defined by an `agent.md` file in `.claude/agents` — can use skills, and only the ones explicitly listed in the `skills:` field of that file. Skills load when the sub-agent starts, not on demand, so the list should stay tight: only skills always relevant to the agent's purpose. The tutorial demonstrates creating a sub-agent with the Claude Code sub-agent creator and attaching skills to an existing `agent.md`. > *"Built-in agents like the explorer, planner, and verify can't access skills at all. Only custom sub-agents you define can use them, and only when you explicitly list them."* ## [03:18] Recap: choosing the right distribution method The closing beats map each method to its scenario: project directories for team access, plugins for cross-repository sharing, enterprise deployment for org-wide mandatory standards, and explicit sub-agent skill lists for isolated task delegation. The sub-agent reminder lands again — list only the skills always relevant to a given agent's job, because they load at startup, not lazily. > *"Share skills through project directories for team access, plugins for cross-repository distribution, or enterprise deployment for organization-wide standards."* ## Entities - **Anthropic Tutorial Narrator** (Person): single presenter for the Claude Code skills tutorial series - **Claude Code** (Software): Anthropic's AI coding assistant; the runtime environment where skills are authored and deployed - **Skills** (Concept): reusable instruction sets placed in `.claude/skills` that extend Claude Code's behavior - **Plugins** (Concept): distributable packages that bundle skills for sharing across teams and marketplace users - **Managed settings** (Concept): enterprise administrator mechanism to deploy skills org-wide with highest priority override - **Sub-agents** (Concept): custom Claude Code agents defined via `agent.md` in `.claude/agents`; the only agent type that can load skills, and only when explicitly listed - **Anthropic** (Organization): company that built Claude Code and produces the Claude Code skills tutorial series

4:04

#claude-code#skills#configuration

Configuration and multi-file skills

A four-minute tutorial from the Claude Code skills series covering the advanced configuration fields that turn a basic skill into a reliable, context-efficient tool. The narrator walks through the full agentskills.io field set — `name`, `description`, `allowed_tools`, `model` — then explains how to structure larger skills using progressive disclosure so that reference material and scripts load only when needed, not on every invocation. ## [00:02] Overview of advanced skill fields The agentskills.io open standard defines several fields beyond the mandatory `name` and `description`. `name` must be lowercase with hyphens, capped at 64 characters, and must match the directory name. `description` tops out at 1,024 characters and is the primary signal Claude uses for skill matching. Two optional fields round out the configuration: `allowed_tools`, which constrains which tools the skill can invoke, and `model`, which pins the skill to a specific Claude version. > *"A basic skill works with just a name and description, but here are some other advanced tips that can make your skills really effective in Claude Code."* ## [00:39] Writing effective descriptions A vague description — "help with dogs" — leaves Claude guessing at scope and triggers. A good description answers exactly two questions: what does this skill do, and when should Claude use it? Matching keywords to the natural phrasing of user requests is the lever for fixing skills that fail to trigger. > *"A good description answers two questions. What does this skill do? And when should Claude use it?"* ## [01:20] Restricting tools with allowed_tools `allowed_tools` is the mechanism for locking a skill to a defined surface — read-only access for security-sensitive workflows, for instance. When the field is set, Claude can only call those tools without asking for permission; no editing, writing, or Bash. Omitting the field entirely leaves Claude's normal permission model intact. > *"When this skill is active, Claude can only use those tools without asking permission. No editing, no writing, no bash commands."* ## [01:49] Progressive disclosure for multi-file skills Skills share Claude's context window with the live conversation. Stuffing everything into a single 20,000-line SKILL.md bloats context on every invocation and makes the file painful to maintain. The solution: put essential instructions in `SKILL.md` and move reference material into separate files Claude reads only when the user's request actually requires them. The standard suggests three supporting directories — `scripts/` for executable code, `references/` for documentation, and `assets/` for images and templates. A link in `SKILL.md` acts like a table of contents entry; if the topic never comes up, the file never loads. Scripts in the skill directory run without their source loading into context at all — only their output consumes tokens. The narrator recommends keeping `SKILL.md` under 500 lines; exceeding that is a signal the skill should be split. > *"It's like having a table of contents in the context window rather than fitting the whole entire document in there."* ## [03:18] Recap: skill metadata and best practices The tutorial closes by restating the full configuration surface: `name` and `description` are required; `allowed_tools` restricts the tool surface; `model` pins the Claude version. Descriptions need concrete action verbs and trigger phrases to match reliably. For larger skills, progressive disclosure keeps `SKILL.md` under 500 lines and defers supporting files until they're actually needed. Scripts execute without loading source, keeping context lean. > *"Scripts can execute without loading their contents, keeping context efficient."* ## Entities - **Anthropic Tutorial Narrator** (Person): single host for this tutorial series, presenting Claude Code skill configuration. - **Claude Code** (Software): Anthropic's CLI tool that loads and executes skills from the agentskills.io standard. - **agentskills.io** (Organization): open standard defining the skill manifest schema, including `name`, `description`, `allowed_tools`, `model`, and directory conventions. - **SKILL.md** (Concept): the primary manifest file for a Claude Code skill; should stay under 500 lines with links to supporting files. - **allowed_tools** (Concept): optional skill field that whitelists specific Claude tools, enabling read-only or sandboxed skill modes. - **Progressive disclosure** (Concept): structuring a multi-file skill so that reference files and scripts load into context only when the active request requires them. - **Context window** (Concept): shared token budget between the conversation and any skill files Claude loads; the key resource progressive disclosure is designed to conserve.

3:47

#claude-code#skills#developer-tools

Creating your first skill

This 3-minute tutorial walks through building a personal Claude Code skill from first principles: create a directory with a SKILL.md file, verify the skill loads at startup, and watch Claude apply it against a real request. The second half explains exactly how Claude's skill-loading pipeline works—from the four scan locations and the name-only startup pass, to the confirmation gate and the four-tier priority order that resolves naming conflicts. ## [00:03] What This Tutorial Builds The narrator opens with the concrete goal: a skill that teaches Claude to explain code using visual diagrams and analogies. After building it, the tutorial will also trace what happens internally when Claude picks up and executes a skill. > *"This skill will teach Claude how we would like it to explain code using visual diagrams and analogies."* ## [00:18] Creating the Skill File Personal skills live under the home directory (not inside a project), so the first step is making a new directory named after the skill inside `~/.claude/skills/`. Inside that directory goes a single `SKILL.md` file. Three sections matter: `name` (the identifier Claude stores at startup), `description` (the matching criteria Claude uses to decide whether to invoke the skill), and everything after the second `---` delimiter (the actual instructions Claude follows when the skill fires). > *"Take into consideration that we're creating a directory with the skill name inside of the skills directory."* ## [00:52] Loading and Testing Your Skill Claude Code scans for skills at startup, not on demand, so a session restart is required after creating the file. Running `/skills` should list "PR description" (or whichever skill was just created). To test, make a branch with some changes and send the plain-English request "Write a PR description for my changes." Claude surfaces that it's invoking the PR description skill, then reads the diff and writes a description that matches the template—same format every single time. > *"Claude will then show you that it's using the PR description skill."* ## [01:25] How Claude Loads Skills Under the Hood At startup, Claude Code scans four locations: enterprise managed settings, personal `~/.claude/skills/`, the project's `.claude/` directory, and installed plugins. It loads only the `name` and `description`—not the full content. When a request arrives, Claude compares it against those stored descriptions; "explain what this function does" overlaps with "explain code with visual diagrams" so the skill matches. Claude then asks for confirmation before reading the complete SKILL.md. That confirmation gate keeps the user aware of which context is being injected. > *"It loads only the name and description of each skill, not the full content. This is important later."* ## [02:02] Priority Rules and Naming Conflicts Cloning a repository that ships its own skills can create name collisions. Claude resolves them with a fixed priority ladder: enterprise (highest) → personal → project → plugins (lowest). An enterprise `code-review` skill beats a personal `code-review` skill every time. The practical fix is descriptive naming: `security-review` or `frontend-pr-review` instead of the generic `review`, so conflicts never arise in the first place. > *"If your company has an enterprise code review skill and you create a personal code review skill, the enterprise version of that takes precedence."* ## [02:52] Updating and Removing Skills Updating a skill is a direct file edit: open the SKILL.md, change what needs changing, save. Removing a skill means deleting the directory. Both operations require restarting Claude Code for the change to take effect—the skill list is built once at session startup, not watched for live changes. > *"Edit the skill.md file to update a skill and restart Claude Code for changes to take effect."* ## Entities - **Anthropic Tutorial Narrator** (Person): single host walking through the skill creation tutorial for the Claude Code skills series - **Claude Code** (Software): Anthropic's CLI for Claude; scans for skills at startup and applies them when user requests match skill descriptions - **SKILL.md** (Concept): the single file that defines a skill—contains YAML frontmatter (name, description) and freeform instruction text after the second `---` delimiter - **Skills** (Concept): reusable, named instruction sets that teach Claude a consistent behavior pattern; stored as directories containing a SKILL.md file - **Enterprise Skills** (Concept): organization-managed skills that sit at the top of the four-tier priority order, overriding personal, project, and plugin skills - **Anthropic** (Organization): creator of Claude and Claude Code; produces this tutorial series at claude.com/resources/courses

How skills compare to other Claude Code features

3:01