
Skills

Skills package modular know-how for the LLM: tool usage, workflows, best practices, guardrails, and more. Compared to stuffing everything into the prompt, Skills distill durable know-how and load on demand instead of on every turn.

ESP-Claw uses Skills to manage context dynamically and extend the Agent—including driving hardware through documented tool flows.

Skills live on disk under /fatfs/skills by default. Use the Web config file manager to add or remove them.

  • fatfs/
    • skills/
      • skills_list.json: stores the Skills list and metadata
      • cap_im_qq.md: Skill for QQ IM usage
      • cap_lua_run.md: Skill for running Lua scripts
      • cap_scheduler.md: Skill for scheduler usage
      • cap_router_mgr.md: Skill for Event Router rule management
      • lua_module_gpio.md: Skill for the Lua GPIO module

Unlike Claude Agent Skills, ESP-Claw does not read Skill metadata from Markdown YAML front matter. All metadata belongs in skills_list.json.

skills_list.json format
{
  "skills": <{"id": "string", "file": "string", "summary": "string", "cap_groups"?: "string[]"}[]>
}
Example:

{
  "skills": [
    {
      "id": "cap_lua_run",          // <- unique Skill id
      "file": "cap_lua_run.md",     // <- Skill file name
      "summary": "Describes how to correctly use Lua scripts.", // <- Skill summary
      "cap_groups": ["cap_lua"]     // <- optional: capability groups opened when this Skill activates
    },
    ...
  ]
}
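The schema above can be enforced with a small validator. The sketch below is hypothetical (not part of ESP-Claw); it only encodes the rules stated here: `id`, `file`, and `summary` are required non-empty strings, `cap_groups` is an optional list of strings, and ids should be unique.

```python
def validate_skills_list(doc):
    """Check a parsed skills_list.json dict against the schema above.

    Returns a list of error strings; an empty list means valid.
    """
    errors = []
    skills = doc.get("skills")
    if not isinstance(skills, list):
        return ["'skills' must be a list"]
    seen_ids = set()
    for i, entry in enumerate(skills):
        # Required string fields: id, file, summary
        for key in ("id", "file", "summary"):
            value = entry.get(key)
            if not isinstance(value, str) or not value:
                errors.append(f"skills[{i}].{key} must be a non-empty string")
        # Optional cap_groups: list of strings
        groups = entry.get("cap_groups")
        if groups is not None and (
            not isinstance(groups, list)
            or not all(isinstance(g, str) for g in groups)
        ):
            errors.append(f"skills[{i}].cap_groups must be a list of strings")
        # Skill ids must be unique across the catalog
        sid = entry.get("id")
        if sid in seen_ids:
            errors.append(f"skills[{i}].id '{sid}' is duplicated")
        seen_ids.add(sid)
    return errors
```

Running this at build time (or before uploading via the file manager) catches malformed entries before the Agent ever reads the catalog.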

`id` and `summary` map to the Skill ID and summary shown to the model in the Skills catalog.

`cap_groups` is optional: a list of capability group ids. When the Skill activates, those groups become visible to the LLM (their tools are exposed); when the Skill deactivates, they are removed from that Skill's visibility set. This ties "how to use the tools" and "having the tools" to the same lifecycle, so the model never gets a tool without its guide.
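One subtlety of per-Skill visibility sets: a group should stay exposed while any active Skill still binds it. A minimal sketch of that bookkeeping, assuming (hypothetically) that tool visibility is the union of groups opened by all currently active Skills:

```python
class SkillVisibility:
    """Tracks which capability groups are exposed to the model.

    A group remains visible as long as at least one active Skill binds
    it, so two Skills sharing a cap_group do not hide each other's tools
    when one of them deactivates.
    """

    def __init__(self, skill_groups):
        # skill_groups: {skill_id: [cap_group_id, ...]}
        self._skill_groups = skill_groups
        self._active = set()

    def activate(self, skill_id):
        self._active.add(skill_id)

    def deactivate(self, skill_id):
        self._active.discard(skill_id)

    def visible_groups(self):
        # Union of the groups bound by every active Skill
        groups = set()
        for sid in self._active:
            groups.update(self._skill_groups.get(sid, []))
        return groups
```

For example, if `my_feature_basic` and `my_feature_advanced` both bind `cap_my_feature`, deactivating one leaves the group visible until the other is also deactivated.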

Skill files are Markdown. Spell out the skill—usage, workflow, best practices, rules—clearly. Example:

# Weather

Get current weather and forecasts through `cap_cli`. Supports weather, forecast, and temperature queries.

## When to use
When the user asks about weather, temperature, forecasts.

## How to use
1. Use `cap_cli` to run `time --now` and learn the current date
2. Use `cap_cli` to run `web_search --query "weather in [city] today"`
3. Extract temperature, conditions, and forecast from results
4. Present in a concise, friendly format

## Example
User: "What's the weather in Tokyo?"
-> cap_cli: `time --now`
-> cap_cli: `web_search --query "weather Tokyo today February 2026"`
-> "Tokyo: 8C, partly cloudy. High 12C, low 4C. Light wind from the north."

After activation, the Skill .md is injected into the chat, so write for the LLM, not for human developers skimming a repo.

Strong Skill docs include:

| Element | Purpose |
| --- | --- |
| Scenario framing | States when to activate so the model can self-select |
| Invocation rules | Parameter constraints, ordering, rate limits |
| JSON examples | Copy-ready tool payloads, not prose paraphrases |
| Error playbook | Frequent errors and how to recover |
| Cross-tool notes | e.g. "Call X first to discover paths, then call this tool" |

Here is a full template for a capability-backed Skill:

# My Feature

One sentence describing when this Skill should activate.

## Usage rules

- When calling `my_action`, `param` must be non-empty
- On failure, validate `param`, then retry once
- Never issue more than three calls in a row; wait ≥ 500 ms between attempts

## Example calls

Basic invocation:
```json
{"param": "hello"}
```

With optional fields:
```json
{"param": "hello", "mode": "fast"}
```

## Error handling

- `Error: param is required` — add `param`, then retry
- `Error: invalid state` — device not ready; tell the user to try later

Unlike cap_* docs, which describe JSON tool calls, lua_module_* Skills target LLM-authored Lua code. Provide precise API references instead of generic prose.

Lua module Skill docs should cover:

| Element | Purpose |
| --- | --- |
| Hardware summary | What it drives, which pins or buses |
| Init story | Whether `require` has side effects, extra setup steps |
| Full API table | Parameters, returns, short Lua snippets per function |
| Usage rules | Rate limits, dependencies ("initialize board_manager first") |
| Failure modes | Cases that raise Lua errors |

Example:
# myled

Controls a single LED connected to GPIO 2.

## Setup

No explicit init required. The module configures GPIO 2 on first `require`.

## API

### `myled.set(on)`

Turns the LED on or off.

- `on`: boolean

```lua
local myled = require("myled")
myled.set(true)   -- on
myled.set(false)  -- off
```

### `myled.get()` → boolean

Returns the current LED state.

## Rules

- The module only controls GPIO 2; there is no parameter to change the pin.
- Do not call `myled.set` more than once per 50 ms.

The lua_module_display Skill is a good model: it spells out each drawing helper, optional table fields, return values, and constraints such as “text is ASCII-only”—all mandatory context for code-generating models.

When scenarios differ, split a complex capability into multiple Skills that bind the same cap_groups:

{
  "skills": [
    {
      "id": "my_feature_basic",
      "file": "my_feature_basic.md",
      "summary": "Basic flows: simple actions",
      "cap_groups": ["cap_my_feature"]
    },
    {
      "id": "my_feature_advanced",
      "file": "my_feature_advanced.md",
      "summary": "Advanced flows: batching and recovery",
      "cap_groups": ["cap_my_feature"]
    }
  ]
}

Simple jobs activate only the Basic Skill to keep context lean; harder jobs pull in Advanced guidance on demand.

Skills usually capture repeatable work or guidance, so after a task you can ask the Agent to summarize and evolve them.

ask "search today's weather in Tokyo for me"
ask "turn the earlier city-weather lookup into a reusable skill"

Context can carry tools, requirements, and extra rules. But repeating every tool’s full manual every turn bloats context, hurts quality, and burns tokens.

Skills progressively disclose capabilities. Initially the model only sees the catalog entries with summaries. When a Skill is needed, the model activates it and receives the full document.

If `cap_groups` is set, activation also opens the matching capability groups so the LLM can call tools immediately, with no separate manual enable step. After the task, `deactivate_skill` drops the Skill and its tool visibility to keep context lean.
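The catalog-then-activate flow above can be illustrated with a toy context builder. This is a hypothetical sketch of progressive disclosure, not ESP-Claw's actual prompt layout: inactive Skills contribute one summary line each, while activated Skills inject their full Markdown document.

```python
def render_context(catalog, active, docs):
    """Build the Skills portion of the prompt.

    catalog: list of {"id", "summary"} entries (from skills_list.json)
    active:  set of activated Skill ids
    docs:    {skill_id: full Markdown text of the Skill file}
    """
    lines = ["Available skills:"]
    # Every Skill always appears as a one-line catalog entry
    for entry in catalog:
        lines.append(f"- {entry['id']}: {entry['summary']}")
    # Only activated Skills get their full document injected
    for sid in sorted(active):
        lines.append(f"\n--- skill: {sid} ---\n{docs[sid]}")
    return "\n".join(lines)
```

With nothing active, the model pays only for the summaries; activating a Skill swaps a few tokens of catalog entry for the full guide, and deactivating it returns the context to the lean baseline.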