Live test results โ€” not marketing

What Prompt Modules actually does

We ran a full automated test suite: 3 domains, 9 real bugs fixed, 4 module versions. Here's every number.

2026-04-14
Test run
41.8s
Duration
3
Modules tested
9/9
Fixes applied
100%
Success rate

The @reuse effect

The first time you define a module, PM builds a rich PML structure (~450 tokens). Every subsequent use is just @reuse[module_name] โ€” 5 tokens.
That's where the real saving lives.

Without PM (full context every time)
With PM (@reuse after first definition)
010k25k50k1 use5 uses10 uses25 uses50 uses100 usesTokens used
5
tokens per @reuse
vs ~500 without PM
98%
token reduction at 100 uses
49,067 tokens saved
$0.74
saved per module at 100 uses
at Claude API pricing

Module quality: B โ†’ A after 3 fixes

Every fix Claude applies gets versioned into the module. After 3 real bugs fixed, all 3 modules jumped from Grade B (72%) to Grade A (82%) โ€” production-ready.

user_managementauthGrade A
v4 ยท 3 fixes applied
82%
+10% improvement
Before fixes: 72%After fixes: 82%
Bugs fixed and versioned:
โœ“Google OAuth callback fails on mobile Safari โ†’ v2
โœ“JWT tokens not invalidated on logout โ†’ v3
โœ“Role check missing on /api/admin routes โ†’ v4
@attention rules (AI guardrails):
never_store_passwords_plaintextalways_validate_jwt_server_sidemust_check_rls_on_all_tablesnever_expose_service_role_key
payment_escrowpaymentsGrade A
v4 ยท 3 fixes applied
82%
+10% improvement
Before fixes: 72%After fixes: 82%
Bugs fixed and versioned:
โœ“Duplicate webhook processing causing double upgrades โ†’ v2
โœ“Trial expiry not enforced after payment fails โ†’ v3
โœ“Subscription cancel not reflected immediately โ†’ v4
@attention rules (AI guardrails):
never_process_payments_without_idempotency_keyalways_verify_webhook_signaturemust_enforce_trial_expiry_server_sidenever_store_card_data
rag_systemaiGrade A
v4 ยท 3 fixes applied
82%
+10% improvement
Before fixes: 72%After fixes: 82%
Bugs fixed and versioned:
โœ“Chunk overlap missing โ€” RAG misses cross-boundary context โ†’ v2
โœ“Embeddings not normalized โ€” wrong cosine similarity โ†’ v3
โœ“No re-ranking step โ€” top-k not always most relevant โ†’ v4
@attention rules (AI guardrails):
never_mix_user_document_spacesalways_normalize_embeddings_before_storagemust_include_source_citations_in_every_answernever_retrieve_from_other_users_documents

The fix system: Claude learns, permanently

When a bug is found, pm fix records the issue + solution and creates a new module version. Next session, Claude already knows the fix.

Module evolution โ€” user_management
v1CreatedAuto-analyzed from raw prompt ยท @core + @attention generated
v2Fix #1OAuth callback mobile Safari โ†’ redirect_uri must use https://
v3Fix #2JWT blacklist added โ†’ tokens now invalidated on logout
v4Fix #3requireRole("admin") added โ†’ server-side admin check enforced
9/9
fixes applied successfully
3.0
avg fixes per module
+10%
quality score per 3 fixes

Cross-prompt reuse: 7 modules auto-detected

When we ran a complex B2B SaaS prompt against the registry, PM found 7 matching modules automatically โ€” no manual tagging needed.

Claude Code โ€” auto-detect output

# Complex prompt: B2B SaaS with auth, payments, RAG, admin, team workspaces

# Registry scan โ†’ 7 matches found:

โ†บ @reuse[user_management]โ†’ injected

โ†บ @reuse[payment_escrow]โ†’ injected

โ†บ @reuse[rag_system]โ†’ injected

โ†บ @reuse[ai_engine]โ†’ injected

โ†บ @reuse[file_storage]โ†’ injected

โ†บ @reuse[admin_panel]โ†’ injected

โ†บ @reuse[api_gateway]โ†’ injected

# Result: 126 tokens โ†’ 5 tokens per @reuse tag ร— 7 = 35 tokens total context

What makes a module production-ready

We score every module on 10 dimensions. Here's what each point means for Claude's output quality.

@module declaration
5 pts
@core blocks (ร—5)
15 pts
@attention rules
10 pts
Token efficiency
10 pts
@version marker
5 pts
Explicit constraints
10 pts
Fix iterations
15 pts
Naming (snake_case)
5 pts
Metadata / tags
5 pts
Structure depth
5 pts
A
80โ€“100%
Production ready
B
65โ€“79%
Good, needs polish
C
50โ€“64%
Functional
D
0โ€“49%
Needs work

Cumulative savings over time

One module, one domain. Tokens and dollars saved as usage grows. (Assumes 500 tokens of context without PM, 5 tokens with @reuse after first definition)

UsesWithout PMWith @reuseTokens saved$ savedSavings %
1ร—500438+62$0.000912%
5ร—2,500458+2,042$0.030682%
10ร—5,000483+4,517$0.067890%
25ร—12,500558+11,942$0.179196%
50ร—25,000683+24,317$0.364897%
100ร—50,000933+49,067$0.736098%
Start saving tokens today

Your modules. Your registry. Your savings.

Free plan includes 5 modules. No credit card. The test suite code is open โ€” run it yourself.

Test run: 2026-04-14 ยท 41.8s ยท 100% automated ยท no cherry-picking