About the author: I'm Charles Sieg, a cloud architect and platform engineer who builds apps, services, and infrastructure for Fortune 1000 clients through Vantalect. If your organization is rethinking its software strategy in the age of AI-assisted engineering, let's talk.
Twenty-one tasks. April 15 was the biggest day yet by task count: two distinct themes ran in parallel throughout the day. The first was client application work, specifically porting a large backlog of legacy pages and wiring them to real backend data stores. The second was platform infrastructure, covering a billing system rebuild, a comprehensive health monitoring tool, a shared diagnostics library rolled out to 15 services, and a rollout of service-to-service authentication tokens. The weighted average leverage factor was 37.1x with a supervisory leverage of 361.6x, representing 783.5 human-equivalent hours of work.
The overall leverage factor is pulled down substantially by a single 420-minute autopilot simulation task (4.57x), which required iterative simulation cycles to validate a monotonic readiness climb across 13 simulated days. Strip that one task out and the weighted average for the remaining 20 tasks climbs to roughly 53x. The client app porting tasks at 200x and 171.4x represent the upper bound for this class of work: well-understood UI patterns applied at scale against clear specifications.
## Task Log
| # | Task | Human Est. | Claude | Sup. | Factor | Sup. Factor |
|---|---|---|---|---|---|---|
| 1 | Port 17 legacy UI pages to current framework with theme bridge and dark-mode fixes; 130/130 tests | 100h | 30m | 2m | 200.0x | 3000.0x |
| 2 | Full parity pass: engine REST client, 8 new pages, site key gate, version checker, telemetry batcher, 95 unit + 200 end-to-end tests | 200h | 70m | 3m | 171.4x | 4000.0x |
| 3 | Purchase service Stripe rebuild: 6-plan catalog, beta coupons, comp entitlements, tokenized invite flow, comp email, 785+ tests across 4 services and 1 admin UI | 90h | 60m | 12m | 90.0x | 450.0x |
| 4 | Launch plan recalibration to two-phase beta; rewrote 3 planning docs; 9 blog posts, 6 LinkedIn posts, 9 Twitter threads, 9 Reddit posts; 105-asset artwork list; 7 scheduled campaigns, 2 new landing pages | 40h | 30m | 5m | 80.0x | 480.0x |
| 5 | Health monitoring tool site scanner: backend crawler, 5 validators, 10 API endpoints, 8 MCP tools, 11 frontend components, 244 tests, ARM64 buildspec fix, crawled 59-page site | 80h | 80m | 5m | 60.0x | 960.0x |
| 6 | Shared diagnostics library 0.2.0: 110 tests, 11 new check categories, rolled out to all 15 tools with API key auth and tool-specific metrics | 56h | 80m | 4m | 42.0x | 840.0x |
| 7 | Health monitor add/delete site UI + fleet diagnostics viewer: 15-tool grid, per-tool drilldown, 17 MCP tools | 24h | 35m | 3m | 41.1x | 480.0x |
| 8 | Comp entitlements admin UI: API client, comps page with filters and pagination, add/revoke/extend modals, billing tab rebuild, invite modal | 24h | 35m | 8m | 41.1x | 180.0x |
| 9 | Wire console simulator via iframe multi-page entry; drive dashboard from real enrollment and autopilot stores | 6h | 10m | 1m | 36.0x | 360.0x |
| 10 | Email client complete feature backlog: P0 web push/rule actions/refresh/image blocking, P1 budget/contacts/split-thread/density, P2 IMAP IDLE/phishing/bayesian spam/vcard import | 60h | 120m | 8m | 30.0x | 450.0x |
| 11 | Tokenized accept-invite flow in auth service: invitations table, migration, service layer, admin endpoint, GET/POST API, email template, 34 tests | 16h | 35m | 8m | 27.4x | 120.0x |
| 12 | Fix blank-page SPA navigation; real lab manifest (2,048 labs) in labs page with correct routing | 5h | 12m | 1m | 25.0x | 300.0x |
| 13 | Full module list and real lab list visible to unenrolled users on course detail; dark-mode contrast fix | 3h | 8m | 1m | 22.5x | 180.0x |
| 14 | MCP test suite v1: manifest, 6 YAML case files, fixtures, 2 slash commands | 8h | 22m | 4m | 21.8x | 120.0x |
| 15 | Service-token MCP servers: auth, notification, purchase services; svc_ Bearer tokens; SSM provisioning; 5 Stripe webhook handlers | 20h | 90m | 10m | 13.3x | 120.0x |
| 16 | Entitlement-based enrollment gating in auth service: purchase client, domain catalog, gating logic, 403 upgrade response, 12 tests | 6h | 28m | 10m | 12.9x | 36.0x |
| 17 | Diagnose wrong Content-Type on resume PDFs; fix static site generator MIME detection; sweep 10 production buckets fixing 27 objects | 4h | 22m | 3m | 10.9x | 80.0x |
| 18 | EC2 OOM diagnosis (17 containers on 4 GB), stop/start recovery, resize instance, verify email delivery, sync new MCP servers | 3h | 22m | 4m | 8.2x | 45.0x |
| 19 | Auto-enroll on browse for comp users in enrollment service | 3h | 22m | 5m | 8.2x | 36.0x |
| 20 | Fix email client backend service-token auth chain; fix Alembic collision on shared DB via per-service version tables; deploy and verify | 3.5h | 35m | 3m | 6.0x | 70.0x |
| 21 | Autopilot Phases 2-7: P_pass fixes, entity persistence, 12 readiness-tracking iterations; validated monotonic readiness climb 0.376 to 0.559 across 13 simulation days | 32h | 420m | 30m | 4.6x | 64.0x |
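For anyone checking the math on the Factor columns, they fall out of a simple ratio: the human estimate (converted to minutes) divided by Claude time for Factor, and by supervisory time for Sup. Factor. A minimal sketch (the function name is illustrative, not from any tooling):

```python
def leverage(human_est_hours: float, claude_min: float, sup_min: float):
    """Return (leverage factor, supervisory leverage factor) for one task."""
    human_min = human_est_hours * 60
    return human_min / claude_min, human_min / sup_min

# Task 1: 100h estimate, 30 Claude minutes, 2 supervisory minutes
factor_1, sup_1 = leverage(100, 30, 2)    # → 200.0x, 3000.0x

# Task 21: 32h estimate, 420 Claude minutes, 30 supervisory minutes
factor_21, sup_21 = leverage(32, 420, 30)  # → ≈4.57x, 64.0x
```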
## Aggregate Statistics
| Metric | Value |
|---|---|
| Total tasks | 21 |
| Total human-equivalent hours | 783.5 |
| Total Claude minutes | 1,266 |
| Total supervisory minutes | 130 |
| Total tokens | 7,615,000 |
| Weighted average leverage factor | 37.1x |
| Weighted average supervisory leverage factor | 361.6x |
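The two weighted averages can be reproduced directly from the totals above; a small sketch, assuming each weighted average is the ratio of total human-equivalent minutes to total Claude (or supervisory) minutes, rather than a simple mean of the per-task factors:

```python
total_human_hours = 783.5
total_claude_min = 1266
total_sup_min = 130

total_human_min = total_human_hours * 60  # 47,010 human-equivalent minutes

weighted_leverage = total_human_min / total_claude_min  # ≈ 37.1x
supervisory_leverage = total_human_min / total_sup_min  # ≈ 361.6x
```

Weighting by totals, rather than averaging the Factor column, keeps one long low-leverage task (like the 420-minute simulation run) from being drowned out by a crowd of short high-leverage ones.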
## Analysis
The two highest-leverage tasks were both client UI work: porting 17 legacy pages (200x) and a full parity pass with 295 tests across unit and end-to-end suites (171.4x). These scores reflect a structural advantage in UI migration work: the destination framework is known, the source behavior is documented in existing code, and the test suite provides an unambiguous acceptance gate. When all three conditions are met, AI can execute migration work at a rate that makes the "hours" column feel like science fiction.
The purchase service Stripe rebuild (90x) deserves mention because it touched four services and one admin UI simultaneously, produced 785 passing tests, and coordinated a coherent billing model across comp entitlements, tokenized invite flows, and email templates. A human engineer would need to context-switch between codebases sequentially; the AI holds all five service contexts at once.
The autopilot simulation task (4.57x) is the outlier in every sense. At 420 Claude minutes, it consumed one-third of the day's total AI time. The work involved iterative simulation cycles where each run validated whether readiness metrics climbed monotonically. This is the category of task that compresses least under AI leverage: the bottleneck is not code generation but simulation runtime and state-dependent iteration. The task was ultimately tabled pending a UI rebuild, meaning the 420 minutes produced validation data rather than a shipped feature.
The supervisory leverage numbers on the top UI tasks are striking: 3,000x and 4,000x. A 2-minute and 3-minute prompt respectively produced the equivalent of months of senior engineering output. This is where the supervisory metric is most useful: it captures how little human decision time was consumed per unit of output, which is the actual cost to the person running the session.
The day's total of 783.5 human-equivalent hours amounts to roughly 20 weeks of a senior engineer's output. That figure is dominated by the three largest estimates (the 200-hour parity pass, the 100-hour page port, and the 90-hour billing rebuild), but even the lower-leverage infrastructure work (diagnostics rollout, service-token auth, enrollment gating) would have occupied multiple engineers for weeks.
## Let's Build Something!
I help teams ship cloud infrastructure that actually works at scale. Whether you're modernizing a legacy platform, designing a multi-region architecture from scratch, or figuring out how AI fits into your engineering workflow, I've seen your problem before. Let me help.
Currently taking on select consulting engagements through Vantalect.
