Julian/sar

Files

julian 17c08e6392 chore: initial monorepo scaffold + WDS Phase 1+2 artifacts

- Nx 22.7 monorepo (pnpm 11.1, TypeScript 5.9, Node 24)
- apps/api: NestJS 11 (CJS conforme CODING-RULES.md PGD-DB-004)
- apps/web: React 19 + Vite 8 (ESM)
- libs/shared/api-interface: Zod contract base
- Docker Compose dev: Postgres 18, Valkey 8, MinIO, Mailpit
- WDS artifacts:
  - design-artifacts/A-Product-Brief/ (5 docs canônicos + 16 dialogs)
  - design-artifacts/B-Trigger-Map/ (hub + 4 personas + feature impact)
- Stack canon: STACK.md v2.2 + CODING-RULES.md v2.0 + brand.md
- AGENTS.md + README.md como entrada para devs/agentes

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-27 14:34:20 +00:00

5.9 KiB

Raw Permalink Blame History

Quality Dimensions — Quick Reference

Eight dimensions to keep in mind when building agent skills, plus a ninth (Sanctum Architecture) specific to memory agents. The quality scanners check these automatically during quality analysis — this is a mental checklist for the build phase.

1. Outcome-Driven Design

Describe what each capability achieves, not how to do it step by step. The agent's persona context (identity, communication style, principles) informs HOW — capability prompts just need the WHAT.

The test: Would removing this instruction cause the agent to produce a worse outcome? If the agent would do it anyway given its persona and the desired outcome, the instruction is noise.
Pruning: If a capability prompt teaches the LLM something it already knows — or repeats guidance already in the agent's identity/style — cut it.
When procedure IS value: Exact script invocations, specific file paths, API calls, security-critical operations. These need low freedom.

2. Informed Autonomy

The executing agent needs enough context to make judgment calls when situations don't match the script. The Overview section establishes this: domain framing, theory of mind, design rationale.

Simple agents with 1-2 capabilities need minimal context
Agents with memory, autonomous mode, or complex capabilities need domain understanding, user perspective, and rationale for non-obvious choices
When in doubt, explain why — an agent that understands the mission improvises better than one following blind steps

3. Intelligence Placement

Scripts handle plumbing (fetch, transform, validate). Prompts handle judgment (interpret, classify, decide).

Test: If a script contains an if that decides what content means, intelligence has leaked.

Reverse test: If a prompt validates structure, counts items, parses known formats, compares against schemas, or checks file existence — determinism has leaked into the LLM. That work belongs in a script.

4. Progressive Disclosure

SKILL.md stays focused. Detail goes where it belongs.

Capability instructions → ./references/
Reference data, schemas, large tables → ./references/
Templates, starter files → ./assets/
Memory discipline → ./references/memory-system.md
Multi-capability SKILL.md under ~250 lines: fine as-is
Single-purpose up to ~500 lines: acceptable if focused

5. Description Format

Two parts: [5-8 word summary]. [Use when user says 'X' or 'Y'.]

Default to conservative triggering. See ./references/standard-fields.md for full format.

6. Path Construction

Use {project-root} for any project-scope path. Use ./ for skill-internal paths. Config variables used directly — they already contain {project-root}.

See ./references/standard-fields.md for correct/incorrect patterns.

7. Token Efficiency

Remove genuine waste (repetition, defensive padding, meta-explanation). Preserve context that enables judgment (persona voice, domain framing, theory of mind, design rationale). These are different things — never trade effectiveness for efficiency. A capability that works correctly but uses extra tokens is always better than one that's lean but fails edge cases.

8. Customization Surface

Every agent ships customize.toml (metadata block is the install-time roster contract). The override surface beyond metadata is opt-in and archetype-sensitive.

Metadata validity (all archetypes): [agent] must include code, title, icon, description, agent_type. name is optional (empty string is valid); memory and autonomous agents whose name is learned during First Breath should leave it empty at build time. SKILL.md must agree with customize.toml on identity fields.
Stateless opportunity test: Does the agent load templates, write to paths, or have lifecycle points users will reasonably want to vary? Lift those to named scalars (*_template, *_output_path, on_<event>).
Stateless abuse test: Boolean toggles, opaque scalar names (style_config), more than two hooks, or arrays-of-tables without code/id keys are usually design smells.
Memory/autonomous rule: The sanctum is the primary customization surface. An override surface that duplicates PERSONA/CREED/BOND concepts (identity, communication_style, principles) is abuse. Default to metadata-only; opt in to the override surface only for narrow org-level needs (e.g. pre-sanctum compliance gate).
Autonomous rule: PULSE.md owns autonomous behavior. Do not put PULSE-shaped fields in customize.toml.

See Customization for Authors for the decision framework.

9. Sanctum Architecture (memory agents only)

Memory agents have additional quality dimensions beyond the general seven:

Bootloader weight: SKILL.md should be ~30 lines of content. If it's heavier, content belongs in sanctum templates instead.
Template seed quality: All 6 standard sanctum templates (INDEX, PERSONA, CREED, BOND, MEMORY, CAPABILITIES) must exist. CREED, BOND, and PERSONA should have meaningful seed values, not empty placeholders. MEMORY starts empty (correct).
First Breath completeness: first-breath.md must exist with all universal mechanics (for calibration: pacing, mirroring, hypotheses, silence-as-signal, save-as-you-go; for configuration: discovery questions, urgency detection). Must have domain-specific territories beyond universal ones. Birthday ceremony must be present.
Standing orders: CREED template must include surprise-and-delight and self-improvement, domain-adapted with concrete examples.
Init script validity: init-sanctum.py must exist, SKILL_NAME must match the skill name, TEMPLATE_FILES must match actual templates in ./assets/.
Self-containment: After init script runs, the sanctum must be fully self-contained. The agent should not depend on the skill bundle for normal operation (only for First Breath and init).

5.9 KiB Raw Permalink Blame History