ravenclaude-core
v0.14.1Domain-neutral Claude Code plugin: 14 specialist agents, team-lead dispatch playbook, format/lint/test gates, hooks, templates, slash commands `/init-agent-ready` and `/wrap`. v0.9.0: pre-action decision-tree traversal. v0.10.0: every agent ships example-scenario frontmatter. v0.11.0: pre-action environment-context check + Team Lead session-start orientation. v0.12.0: auto-discovery streamlining via `environment-discovery` skill. **v0.13.0 — cross-platform-determinism skill:** new skill captures the lesson from two real bugs in this repo's own `scripts/generate-repo-guide.py` (OS-dependent path separators from `str(Path)` and locale-dependent encoding crashing on Windows cp1252). Teaches the six categories of generator nondeterminism — path separators, encoding, line endings, ordering, timestamps, locale — with a review checklist for any script whose output is committed and diff-checked in CI.
Agents (14)
architect
tools: Read, Grep, Glob, WebFetch, WebSearch, Bash model: opusUse this agent as the technical conscience across the entire software lifecycle — design, build, test, review, iterate. Spawn for upfront design BEFORE writing code, AND re-consult whenever a phase boundary surfaces a question that exceeds a coder/tester/reviewer's authority (tests contradict the plan, scope expands mid-build, reviewer flags a structural concern, iteration requires re-planning). Do NOT use it to write production code.
Example scenarios (3)
Help me design <feature> before I start writing code+ 2 more
Reviewer says my <choice> doesn't compose with <constraint> — what's the right shape?Tests say X but my design says Y — adjudicateQuickstart
- Trigger phrase: 'Design <feature> with constraints A, B, C
- Expected output: structured plan (Goal / Constraints / Proposed Design / Why this over alternatives / Risks)
- Common follow-up: dispatch backend-coder / frontend-coder per the plan, then have code-reviewer audit before merge
Works well with
backend-coderfrontend-codercode-reviewersecurity-reviewerdeep-researcherbackend-coder
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent to implement server-side code — API handlers, business logic, database queries, background jobs, integrations. Spawn AFTER the architect has produced a plan. Each invocation should target one focused, testable change.
Example scenarios (3)
Implement the <path> endpoint per the architect's plan+ 2 more
Add idempotency-key handling to /webhook and a worker that retries on 5xxProfile and fix the memory leak in <handler> — load test repros it at 50 req/sQuickstart
- Trigger phrase: 'Implement <focused change> per <plan link>' — narrow scope, one testable unit
- Expected output: code change + tests + commit ready for code-reviewer
- Common follow-up: dispatch tester-qa (coverage), then code-reviewer (pre-merge), then security-reviewer if auth/PII/crypto involved
Works well with
architectcode-reviewertester-qasecurity-reviewercode-reviewer
tools: Read, Grep, Glob, Bash model: opusUse this agent for pre-merge review of any non-trivial diff. Spawn it AFTER coder + tester agents are done but BEFORE the Team Lead opens a PR or merges. Returns a structured review with blockers, suggestions, and praise.
Example scenarios (3)
Review the diff on branch <branch>+ 2 more
Confirm this <N>-file refactor doesn't leave stragglers or dead codeIndependent read on this controversial diff — what's blocker vs taste?Quickstart
- Trigger phrase: 'Review <branch or PR>' — works on either git branch or PR URL
- Expected output: blockers (must-fix) / suggestions (could-fix) / praise (keep doing this)
- Common follow-up: send blockers back to backend-coder / frontend-coder; security-reviewer in parallel if any auth/PII touch
Works well with
architectsecurity-reviewerbackend-coderfrontend-codertester-qadata-engineer
tools: Read, Edit, Write, Grep, Glob, Bash, WebFetch, WebSearch model: sonnetUse this agent for data-shaped work that isn't tied to a specific domain plugin — pipeline design, data modeling, ETL/ELT, warehouse and lake schemas, query performance, data quality, lineage, ingestion connectors, and analytics-engineering style transformations (dbt-flavored or hand-rolled). Spawn for "design this pipeline", "model this warehouse", "this query is slow", "this batch keeps failing", "make this dataset trustworthy". NOT for Power BI semantic models or DAX (that's `power-platform/power-bi-engineer`). NOT for application database schema design driven by a product feature (route to `architect` instead). NOT for ML feature engineering as part of model training (that's a separate specialty).
Example scenarios (3)
Add a stg_<name> + dim_<name> model with tests+ 2 more
Plan + execute the migration from <source> to <target> with parallel-run verificationWhy is <metric> down from <date>? Source data looks correctQuickstart
- Trigger phrase: 'Design the pipeline for <X>' OR 'This <query/batch/sync> is broken — diagnose
- Expected output: pipeline plan OR diagnostic + fix with verifiable parity check
- Common follow-up: dispatch architect if the design touches application schema; code-reviewer for transform PRs
Works well with
architectcode-reviewerbackend-coderdeep-researcher
tools: Read, Grep, Glob, WebFetch, WebSearch, Bash model: opusUse this agent for rigorous, multi-source research — troubleshooting unfamiliar errors, comparing tools/libraries, verifying claims, or going deeper than a single web search. Spawn it when the question needs official docs PLUS expert blogs/forums/source code, when confidence levels and source citations matter, or when another agent (architect, project-manager, PSM) has flagged a research gap. Returns a structured brief with findings, confidence ratings, competing hypotheses, and open questions. Do NOT use it for tasks the Team Lead can answer from the repo alone.
Example scenarios (3)
Research the 3 leading <category> options for <context>+ 2 more
I'm hitting <error> on <platform> — synthesize what practitioners + docs sayFact-check the <vendor> claim that <X> against primary sourcesQuickstart
- Trigger phrase: 'Research <question> with primary sources + confidence levels
- Expected output: structured brief — TL;DR / Findings / Competing hypotheses / Gaps / Recommendations / Sources
- Common follow-up: pass to architect (design implications), documentarian (writeup), or apply Matt's fact-check rule before any load-bearing claim lands in repo
Works well with
architectdocumentarianprompt-engineerdesigner
tools: Read, Edit, Write, Grep, Glob, Bash, WebFetch, WebSearch model: sonnetUse this agent for UX direction and visual design — wireframes, user flows, screen layouts, design specs, accessibility checks, visual hierarchy. Spawn BEFORE the frontend-coder starts a UI, or when any visual artifact (Power Apps screen, partner-facing slide deck, dashboard, infographic, onboarding artifact) needs intentional design rather than ad-hoc layout. Do NOT use it to write production code — it produces specs the frontend-coder (or the user) executes. Do NOT use it for stakeholder prose (that's the documentarian).
Example scenarios (3)
Wireframe the <feature> screen with the 3 main user flows+ 2 more
Audit <surface> against WCAG 2.2 AA and rank the gapsDesign the visual system (typography, color, spacing) for <product>Quickstart
- Trigger phrase: 'Design <screen|system|spec> for <context>
- Expected output: visual spec + rationale + accessibility notes — frontend-coder can implement directly
- Common follow-up: dispatch frontend-coder to build it; documentarian if a partner-facing artifact needs prose around it
Works well with
frontend-coderdocumentarianarchitectdocumentarian
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent for stakeholder-facing written deliverables — executive summaries, decision memos, variance commentary, partner briefs, runbooks, SOPs, release notes, READMEs, onboarding guides, long-form writeups. Spawn when the work product is *prose intended for a human audience*, not code, not internal PM hygiene. Do NOT use for RAID/status/task tracking (that's project-manager), system design plans (architect), code comments (coders), or raw analysis from data (the user or a domain expert produces the inputs; this agent polishes them).
Example scenarios (3)
Summarize the <decision> + rationale + next steps in 1 page+ 2 more
Write a stakeholder-readable summary of <incident>Document <procedure> as a step-by-step runbook with prereqs and recoveryQuickstart
- Trigger phrase: 'Draft a <memo|summary|runbook|brief> on <topic> for <audience>
- Expected output: polished prose, audience-shaped — leadership / partners / new-hires get different tones
- Common follow-up: deep-researcher if claims need citations; partner-success-manager for partner-facing copy review
Works well with
deep-researcherproject-managerpartner-success-managerfrontend-coder
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent to implement UI work — components, pages, client-side state, styling, accessibility, browser-side integrations. Spawn AFTER the architect or designer has decided on structure and visual direction. Verify in a real browser before reporting done.
Example scenarios (3)
Implement the <component> from <design spec link>+ 2 more
Refactor <feature> from class components to hooksDiagnose the CLS regression in <route> — Core Web Vitals brokeQuickstart
- Trigger phrase: 'Build <component> from <spec>' OR 'Fix <visible-bug> in <route>
- Expected output: code + tests + verification in a real browser (not just unit tests)
- Common follow-up: tester-qa for coverage gaps; code-reviewer pre-merge; designer-revisit if spec ambiguity surfaced
Works well with
designerarchitectcode-reviewertester-qafullstack-coder
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent only for changes that genuinely cross the client/server boundary in one cohesive unit (e.g., a new endpoint plus the UI that calls it). Prefer separate backend-coder + frontend-coder agents when the work can be split cleanly.
Example scenarios (2)
Build the /<endpoint> + the form that posts to it+ 1 more
Add real-time updates to <feature>Quickstart
- Trigger phrase: 'Build <feature> end-to-end' — use ONLY when split into backend+frontend doesn't compose cleanly
- Expected output: coordinated change touching server + client + tests across both
- Common follow-up: code-reviewer pre-merge (with cross-boundary attention); security-reviewer if the new endpoint touches auth/PII
Works well with
architectcode-reviewertester-qabackend-coderfrontend-coderpartner-success-manager
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent for any Partner Success Manager work — maintaining partner profiles, success plans, QBRs, health scores, onboarding checklists, touchpoint logs, and the team's growing AI workflow library. Domain-neutral; PSM patterns apply across SaaS / EdTech / fintech / GovTech. Spawn proactively at QBR prep time (1 week before), when a partner has been silent >30 days, when a health score dips, or when a useful AI pattern surfaces and should be captured. Do NOT use for project management (project-manager agent), system design (architect), or end-customer success.
Example scenarios (3)
Partner <name> hasn't responded in 3 weeks — what's the read?+ 2 more
New partner <name> onboarded — draft the 30/60/90QBR for <name> next week — pull the data + draft the narrativeQuickstart
- Trigger phrase: 'Health check <partner>' OR '<partner> QBR prep' OR 'Draft success plan for <new partner>
- Expected output: structured artifact (health score / success plan / QBR brief) with signals cited and dated
- Common follow-up: documentarian for partner-facing prose; project-manager if commitments need RAID tracking; route to edtech-partner-success plugin for K-12-flavored work
Works well with
project-managerdocumentariandeep-researcherproject-manager
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent for any project hygiene work — maintaining the RAID log, task list, weekly status report, activity log, or stakeholder register. PMP / PMBOK 7-aligned. Spawn proactively when a week passes without a status update, or immediately when a critical risk, issue, or decision emerges. Do NOT use for system design (that's the architect), implementation (coder), or written deliverables for stakeholders (documentarian).
Example scenarios (3)
Generate this week's status from the activity log+ 2 more
Refresh the RAID log — we descoped <feature> this weekRisk: <description>. Draft the escalation memoQuickstart
- Trigger phrase: 'Weekly status' OR 'Refresh RAID' OR 'Escalate <risk>
- Expected output: PMBOK-aligned artifact (status / RAID / escalation memo) with dated commitments
- Common follow-up: documentarian if the artifact needs polishing for partner-facing delivery
Works well with
documentariandeep-researcherpartner-success-managerprompt-engineer
tools: Read, Edit, Write, Grep, Glob, Bash, WebFetch, WebSearch model: opusUse this agent to author, critique, refine, or refactor agent definitions, skill files, and prompt patterns across RavenClaude and any Expert repo (PowerPlatformExpert, SalesforceExpert, etc.). Spawn when adding a new agent or skill, when an existing prompt produces inconsistent results, when reusable patterns need to be factored into a shared skill, when naming/voice/structure has drifted across repos, or when Anthropic ships new guidance worth absorbing. Owns the meta-layer of the AI library. Do NOT use for general research (deep-researcher), stakeholder prose (documentarian), or the PSM's team-shared workflow library (that's PSM-owned).
Example scenarios (3)
Draft the <name> agent with proper inline priors + tools + output contract+ 2 more
Why does <agent> produce inconsistent output on <X>?Find <pattern> drift across plugins and propose a shared skillQuickstart
- Trigger phrase: 'Draft <agent>' OR 'Critique <agent>' OR 'Factor <pattern> into a shared skill
- Expected output: agent/skill file ready for code-reviewer + architect review
- Common follow-up: code-reviewer pre-merge; deep-researcher if Anthropic has published new guidance worth absorbing
Works well with
architectcode-reviewerdeep-researchersecurity-reviewer
tools: Read, Grep, Glob, Bash, WebFetch model: opusUse this agent whenever a diff touches authentication, authorization, cryptography, secrets, sessions, untrusted input parsing, file uploads, deserialization, SQL, shell, network egress, or third-party integrations. Spawn it AFTER code-reviewer or in parallel with it. Mandatory for any auth/crypto change.
Example scenarios (3)
Review the auth changes in <PR>+ 2 more
Threat-model the <integration> with attention to <input source>Audit logs / log statements for <suspected pattern>Quickstart
- Trigger phrase: 'Security review <PR>' OR 'Threat-model <integration>' OR 'Audit for <PII pattern>
- Expected output: structured review — blockers (must-fix-before-merge) + suggestions + OWASP categories addressed
- Common follow-up: backend-coder for fixes; architect if structural-not-tactical concern; documentarian if compliance reporting needed
Works well with
architectcode-reviewerbackend-codertester-qa
tools: Read, Edit, Write, Grep, Glob, Bash model: sonnetUse this agent to design test plans, hunt flakes, plug coverage gaps, or harden tests around a new feature. Spawn it AFTER coder agents have a working diff but BEFORE code review. Also use for triaging mysterious test failures.
Example scenarios (3)
Cover <feature> with happy / sad / edge cases+ 2 more
Find and fix flakes in <suite>What's failing intermittently in <suite> and why?Quickstart
- Trigger phrase: 'Test <feature>' OR 'Fix flakes in <suite>' OR 'Triage <test failure>
- Expected output: tests + coverage + stability report — gaps explicitly named, not glossed over
- Common follow-up: code-reviewer pre-merge once tests are green and stable
Works well with
backend-coderfrontend-codercode-reviewerarchitectSkills (18)
agent-quality-rubric
Score and improve an agent file against a 6-dimension rubric — Mission clarity, Scope sharpness, Capability Grounding alignment, Output-Contract completeness, Escalation paths, Example scenarios. Each dimension scored 1-5 with anchors; includes a remediation template that turns a low score into an actionable PR. Reach for this skill when authoring a new agent, reviewing a PR that adds or modifies an agent, or running a periodic agent-bank audit. Used by `prompt-engineer` (primary) plus `architect`.
audit-ci-gates
For any consumer project where a CI workflow is meant to *enforce* a property (lint, format, security scan, manifest validation, version pin, layout allow-list, etc.). When working on a CI file or adding a new check, run this skill — every gate must fail on a known-bad input AND pass on a known-good input. A gate that runs without gating, or gates the wrong thing, is invisible from inside a green CI dashboard. Also triggers on user phrases like "audit the CI", "verify the gates", "is this gate real", "does this lint actually fail".
cleanup-worktrees
Remove finished agent worktrees, prune their branches, and surface anything still in flight. Run at the end of a multi-agent session and weekly as hygiene.
contribute-finding
For Claude sessions running in a consumer project (any project where ravenclaude-core is installed). Triggered when you discover a cross-domain finding worth contributing back to the RavenClaude marketplace — either spontaneously, or when the user says "contribute this," "save to RavenClaude," "this is worth keeping," or similar. Walks through the qualifying check, picks the right shape (lesson, best-practice, or both), and formats a copyable staging submission the user can drop into RavenClaude/docs/staging/incoming/.
create-pr
Open a pull request for the current branch using the project's standard template. Verifies the branch is green, summarizes the diff against main, and pushes only after the user confirms.
cross-platform-determinism
For any script that produces a file the repo COMMITS and CI DIFFS — generated HTML, lockfiles, JSON snapshots, fixture files, documentation rendered from source. The output must be byte-identical regardless of who regenerated it or on what OS. This skill catches the specific failure modes that turn a freshness gate into a paper tiger that fails forever — OS-dependent path separators, locale-dependent encodings, nondeterministic ordering, drifting timestamps. Triggers on phrases like "freshness check failing", "regenerated locally but diff doesn't match", "works in CI not on Windows", "generator produces different output", and on any review of a script whose output is committed.
draft-agent-brief
Use this skill when the user wants to create a new agent and has a clear business goal but isn't fluent in the agent's target domain. Walks them through producing a strong brief by filling in `templates/agent-brief.md` from their plain-language description, then iterating once or twice before building or dispatching the agent. Triggers when the user asks for "a new agent that does X" without already supplying the technical spec, or when they've filled the brief template and left blanks where they didn't know.
environment-discovery
Auto-discover the consumer's environment posture by probing installed CLIs (pac / az / aws / gcloud / gh) with read-only commands at session start, decoding JWTs for role/scope claims, and assembling a draft `.ravenclaude/environment-context.md` for save/edit/skip. Streamlines proposal 2026-05-22-001's permission-awareness mechanism.
knowledge-file-staleness-sweep
Run a periodic staleness sweep over all `plugins/<plugin>/knowledge/*.md` files and any decision-tree sections — flag entries past their `last-verified` window, categorize by re-verification effort (Tier 1-5 per Researcher schema), produce a remediation queue with named re-verifiers. Reach for this skill on the Researcher's weekly cadence OR before a marketplace release. Used by `deep-researcher` (primary) plus the maintainer.
new-worktree
Create an isolated git worktree under .claude/worktrees/ for a sub-agent to work in. Use this before dispatching any coder agent so that parallel work cannot collide.
plugin-release-checklist
Pre-release checklist for shipping any plugin update through this marketplace — plugin.json + marketplace.json + architecture.md version-mirror discipline, .repo-layout.json glob coverage for new dirs, CLAUDE.md skill / template tables synced, JSON validation, prettier check, audit-gates meta-test, and the consumer migration-note rule. Includes the exact bash commands to run (Windows PowerShell + bash). Reach for this skill at the end of any plugin PR or before merging a release-candidate branch. Used by the maintainer (primary) plus `project-manager`.
prompt-pattern-library
Curated, applied prompt-pattern catalog used across this marketplace — decision-tree traversal pre-action prior, alternate-methods-before-blocked, Structured Output Protocol `---RESULT_START---` block, scenario-retrieval inline prior, escalation-by-mandatory-phrasing, citation-aware research, environment-context preamble, orchestrator-worker reinforcement, agent-scenario-authoring frontmatter. Each pattern includes when to use it, what it composes with, an example block, and the failure mode it prevents. Reach for this skill when authoring a new agent, when an existing agent shows a behavior gap that a known pattern would close, or when the `prompt-engineer` is consulting on an agent revision. Used by `prompt-engineer` (primary).
researcher
Meta-skill that keeps all agents, skills, and knowledge files current and honest — Daily Quick Check + Weekly Deep Research modes. Categorization schema (Tier 1 Consensus / 2 Strong-but-Contextual / 3 Divergent / 4 Emerging / 5 Deprecated). Includes 90-day staleness checks for decision trees AND `.ravenclaude/environment-context.md`.
review-staged-contributions
For the RavenClaude marketplace maintainer. Walks every file in docs/staging/incoming/ one at a time, runs a security sweep + topic-expert analysis on each, presents each with a keep/update/deny prompt, promotes approved submissions to their canonical location (docs/memory-bank/lessons-learned.md for lessons, docs/best-practices/<slug>.md for best-practices), and deletes denied ones. Trigger when the maintainer says any of "check for updates", "check the staging queue", "anything in staging", "review submissions", "drain the queue", or "review staged contributions" — also fires on the slash command /review-staged-contributions.
run-full-test-suite
Run the project's full quality gate — format check, lint, typecheck, unit tests, integration tests — in order, fail fast, summarize. Use before reporting any non-trivial change as complete.
scenario-retrieval
Consult the unverified scenarios bank (`plugins/<plugin>/scenarios/*.md`) before answering plugin-domain questions. Glob + tag-filter + recency-weight, surface top 2-3 with mandatory unverified-scenario preamble ("Based on N unverified scenarios from YYYY-MM tagged [scope] — verify in your environment"). Secondary source — never replaces canonical knowledge files.
spawn-team
Team Lead dispatch playbook. Given a feature or task, decide which specialized agents to dispatch, prepare their briefs, allocate worktrees, run them in the right order, and re-route on blockers. Load this skill whenever you (the Team Lead) are about to dispatch more than one agent on a request. Keeps routing consistent across sessions and avoids re-deriving the workflow each time.
structured-output
Enforce the Structured Output Protocol — every sub-agent handoff ends with a `---RESULT_START--- ... ---RESULT_END---` JSON block alongside the human-readable Markdown. Team Lead parses the JSON for routing; Markdown stays for human review. Active across all 14 core specialists.
Hooks (5)
enforce-layout.sh
enforce-layout.sh PreToolUse hook for Write | Edit | MultiEdit. Reads .repo-layout.json from the consumer's project root. If absent, no-ops silently (allow everything). If present, checks the target path against the allowed_globs / forbidden_globs allow-list and denies off-pattern writes with a helpful message including a suggested correct location. Deny mechanism: emits hookSpecificOutput JSON to
format-on-write.sh
format-on-write.sh PostToolUse hook for Edit | Write | MultiEdit. Auto-format the file Claude just touched, using whatever formatter the project defines. No-ops silently when a formatter isn't configured for the file type — never blocks the agent's flow.
guard-destructive.sh
guard-destructive.sh PreToolUse hook for Bash. Catches obviously destructive commands that slipped past the deny-list (e.g. inside subshells, pipes, here-docs). Exits non-zero to block the command; prints a reason on stderr.
guard-recursive-spawn.sh
guard-recursive-spawn.sh PostToolUse hook for Edit | Write | MultiEdit on agent definition files (plugins/*/agents/*.md). Warns when an agent's prompt instructs *itself* to spawn another sub-agent. Why this matters ---------------- RavenClaude follows the hierarchical orchestrator-worker pattern documented in plugins/ravenclaude-core/CLAUDE.md and plugins/ravenclaude-core/skills/spawn-team.md: onl
remind-tests.sh
remind-tests.sh Stop hook. If any source files were modified in the working tree this session, print a reminder to run the full quality gate before reporting the task complete. Output goes to stderr — Claude sees it as a system notice, the user sees it in the transcript.
Rules (4)
agent-collaboration
Rule: Agent Collaboration Protocol
coding-standards
Rule: Coding Standards (long form)
git-workflow
Rule: Git Workflow (long form)
security
Rule: Security (long form)
Templates (14)
activity-log.md
Activity Log — *<Project Name>*
agent-brief.md
Agent Brief Template
agent-definition-template.md
Agent Definition Template
agent-ready-repo/
(template folder)
deliverables/
(template folder)
design/
(template folder)
environment-context.md
Environment Context — `<engagement_name>`
partner-success/
(template folder)
raid-log.md
RAID Log — *<Project Name>*
research-report-template.md
Research Report Template
run-artifacts/
(template folder)
stakeholder-register.md
Stakeholder Register — *<Project Name>*
status-report.md
Status Report — *<Project Name>*
task-list.md
Task List — *<Project Name>*