Skip to main content

Leverage Record: April 23, 2026

AITime Record

About the author: I'm Charles Sieg, a cloud architect and platform engineer who builds apps, services, and infrastructure for Fortune 1000 clients through Vantalect. If your organization is rethinking its software strategy in the age of AI-assisted engineering, let's talk.

Forty-three tasks. April 23 had three overlapping themes: wiring deep engine subsystems, hardening the entire legal and compliance surface across the platform, and shipping user-facing features. The day opened before 6 AM with a pair-selection benchmark harness and a CORS fix, then accelerated into a continuous stream of engine work: a 15-table persistence schema with dual-write repositories across multiple subsystem stores, NLI inference wiring, cross-domain intelligence calibration, autopilot ranker integration, and several phases of a large engine wiring plan covering replay mining, hardware drift detection, multi-agent safety, actor-critic policy training, and fragment embedding injection. In parallel, a greenfield local RAG and MCP server tool was built from scratch -- docs, backend, vector store, git-aware incremental indexer, React frontend, 6 MCP tools, and 21 passing tests -- in 19 minutes. A legal single-source-of-truth architecture was established across 3 brand canonicals with a propagation script, drift checking, and 10 consumers regenerated across 6 repositories. The admin dashboard completed a full REST-to-WebSocket migration with 87 new tests. The weighted average leverage factor was 20.7x with a supervisory leverage of 174.3x, representing 435.75 human-equivalent hours.

The 20.7x weighted average is higher than April 22 (16.1x / 143.6x) for a clear structural reason: April 23 had a much larger share of greenfield and large-scope refactor work mixed in with the remediation and bug-fixing tasks. The top leverage tasks were genuinely new systems being constructed from a specification -- a local RAG tool at 151.6x, a legal propagation architecture at 53.3x, compliance finding remediation at 48.0x -- rather than the diagnostic and repair work that dominated lower-leverage days. The engine wiring tasks, while large in absolute AI time (43, 50, and 67 minutes respectively), were executing against a detailed multi-phase plan, which kept the per-minute decision density low and the leverage factors in the 28-56x range. The supervisory leverage improvement from 143.6x to 174.3x reflects the same dynamic: several of April 23's largest tasks required only 1-2 supervisory minutes because the specifications were already written. Telling an AI to implement phases 6, 7, and 9 of an existing wiring plan costs 1 supervisory minute and yields 32 human-equivalent hours of output.

About These Records
These time records capture personal project work done with Claude Code (Anthropic) only. They do not include work done with ChatGPT (OpenAI), Gemini (Google), Grok (xAI), or other models, all of which I use extensively. Client work is also excluded, despite being primarily Claude Code. The actual total AI-assisted output for any given day is substantially higher than what appears here.

Task Log

#TaskHuman Est.ClaudeSup.FactorSup. Factor
1Local-only RAG and MCP server over a large monorepo: docs, backend (FastAPI + vector store + embedding models + LLM integration), git-aware incremental indexer, React frontend, 6-tool MCP server, 21 passing tests, end-to-end smoke test48h19m8m151.6x360.0x
2Learning engine wiring plan Phase 4 + Phase 8: catalog-projections client UI, self-report and seed-from-mastery endpoints, mapper wiring, prior cap lifted, study-hours fallback fix, ELO forwarding via session factory40h43m2m55.8x1200.0x
3Legal doc single-source architecture: 3 brand canonical documents (versioned), propagation script with drift-check, 10 consumers regenerated across 6 repositories, 4 legacy duplicates deleted, CI drift checks wired16h18m3m53.3x320.0x
4Compliance LOW findings remediation: 3 gap analysis documents, terms and privacy date updates, 18 deferred documentation audit findings resolved (12 design docs expanded, 3 Mermaid diagrams added, 3 structural doc issues fixed across 3 repositories)12h15m1m48.0x720.0x
5Compliance MEDIUM findings: add AI API subprocessors to Privacy Policy; add data processing action tracker to vendor register2h4m1m30.0x120.0x
6Engine persistence foundation: 15-table schema, write-through migration, session manager dual-store, portfolio fragment loader and repository, cross-domain cluster delegation, catalog projections, content rewriter LLM wiring, NLI entailment in equivalence class engine, Phase 0 tests, cascade re-embedding, contradiction detection, health and plan endpoints, synthesis seeder centroid fidelity60h120m2m30.0x1800.0x
7Engine wiring plan Phase 3.5: cluster delegation, cross-domain-transfer rewrite, catalog-projections endpoint, content rewriter LLM wiring, cascade re-embedding, contradiction detection endpoint, portfolio health endpoints, synthesis planning optimizer endpoint; 8 tasks across engine, gateway, and persistence with 30 new unit tests24h50m3m28.8x480.0x
8Engine wiring plan Phases 6-7-9: replay miner, hardware drift detection, simulation-real bridge, safety subsystem, multi-agent coordination, operations health, delta replication, actor-critic policy trainer, two-step tier opt-in, capabilities endpoint, fragment embedder injection32h67m1m28.7x1920.0x
9Web client user profile: profile, subscription, and account tabs; testimonial backend; avatar in navigation; sign-out relocation; terms of service testimonial license12h32m6m22.5x120.0x
10Activity-to-domain-group master mapping: relabeled matrix to 9 spec-directory groups by 22 activities, JSON projection, sync script, typed resolver, cross-system spec in architecture-client, fixed stale activity count across documentation, tests, and index header8h22m6m21.8x80.0x
11Autopilot Phase 8: engine-side catalog reconciliation -- activities catalog loader with path-search and graceful degradation, autopilot ranker category key threading and category applicability scoring, startup parity assertion with allow-listed divergence, catalog-aware maintenance backstop, AutopilotGoal category key and compute-next-actions plumbing, 15-test catalog parity suite, 8 ranker gating tests, 4 per-group journey tests10h28m3m21.4x200.0x
12Activity library docs overhaul: reconciled design doc as authoritative (20 sections, all 22 activities, hooks, primitives, providers, exam sim, demo, data flow, offline, accessibility), rewrote requirements as FR/NFR superset with shipped/partial/planned status markers, rewrote testing strategy as ideal-state test surface with gap inventory and 8-phase remediation plan, trimmed README to overview and activity index, deleted 22 superseded docs16h45m6m21.3x160.0x
13Fix autopilot 422 error (missing session ID) and add info button to autopilot card; diagnose resume upload 4042h6m3m20.0x40.0x
14Convert help page to full help center with FAQ, documentation, and legal tabs6h18m4m20.0x90.0x
15Engine Phase 4: self-report and seed-from-mastery endpoints, ELO and entity ID forwarding via session factory, catalog projections client UI, strategy readiness expansion with per-domain panel data, autopilot Postgres persistence with entity-domain uniqueness, drift fresh-user guard, entity-delete endpoint for account reset and compliance15h45m2m20.0x450.0x
16Merge rogue design document into canonical docs directory in desktop client repository; delete the root copy1.25h4m2m18.75x37.5x
17Vendor resume parser to unblock container build (fixes resume upload 404); practice exam: skip intro modal, disable next button until answered, color-coded question navigator (answered, skipped, pending)4h14m4m17.1x60.0x
18Reconcile activity metadata across three sources into a single canonical catalog: extended catalog to v2 with runtime source per activity, fixed field names, rewrote expression for miscellaneous categories, retired parallel mapping in activity library, added catalog-backed TypeScript reader and sync script with invariant check, rewrote library documentation, architecture-client activity domain mapping, and autopilot activity coverage docs7h25m8m16.8x52.5x
19Admin dashboard Phase 5 zero-REST WebSocket migration: multiplexed WebSocket replaces direct REST, JWT and JWKS auth handling for kid-less tokens, comps table names-first with UUID leak fix, markdown preview email, Redis dependencies for container builds, test suite revival (45 backend + 42 frontend tests), second-pass audit reconciliation, documentation sync25h90m2m16.7x750.0x
20Update architecture documentation (data model, system architecture, testing strategy) to reflect 15-table Postgres persistence schema2h8m5m15.0x24.0x
21Fix practice exam intro modal skip, answer persistence (refetch race condition), radio auto-select, navigator colors3h12m3m15.0x60.0x
22Admin dashboard docs audit pass 2: dashboard reality status (wired, partial, placeholder, not built) inline in requirements, plus path and import audit across design, requirements, testing strategy, and project instructions4.5h18m2m15.0x135.0x
23Wire real NLI model into equivalence class engine: new NLI gate module, dependency injection into engine, replaced similarity placeholder with batched NLI inference, 30 new unit tests4h18m5m13.3x48.0x
24Patent portfolio browser: design system adoption and Mermaid rendering hardening; design system providers with private registry Docker build, package resolution from registry (not local path), theme token resolution to concrete CSS values before Mermaid render, HSL value wrapping for SVG paint, node fill vs. container distinction, client-side Mermaid fallback for figures without pre-generated SVG10h45m2m13.3x300.0x
25Replace synthesis seeder fidelity magic constant with centroid cosine similarity; implement real threshold calibration with F1 grid-sweep in cross-domain intelligence engine4h20m5m12.0x48.0x
26Auth service hardening: canonical legal document refactor, testimonial model and CRUD API, auth methods in profile, avatar object storage wiring for both deployments, Redis dependencies and user lifecycle event publishing, write-through internal enrollment progress endpoint, admin bulk user lookup for display joins, LLM subprocessor privacy disclosure, terms and privacy effective dates, enrollment caching in Postgres, markdown welcome email with preview endpoint, frontend dependency regen, privacy and terms build restore12h60m3m12.0x240.0x
27Write 85 unit tests for 9 dual-write learner repositories covering entity embeddings, node stats, pair stats, entity ELO, learner fingerprint, hardware drift, adversarial fingerprint, credit schedule, and autopilot ranker; 98% coverage on learner repository module4h22m5m10.9x48.0x
28Legal SSOT refactor coordinated across the full fleet: establish single-source-of-truth architecture for all legal documents in planning repository, switch static site generator, auth service, and marketing sites to canonical legal documents, DPA action tracker in vendor register, compliance finding marked structurally resolved8h45m2m10.7x240.0x
29Pair selector benchmark harness: synthetic learner against real pair selection logic, 7 tuning arms by 50 seeds, identified goal-level recency as the specific fix for hot-goal streaks (max streak reduced, coverage improved)3h18m2m10.0x90.0x
30Admin dashboard docs audit: fix 16 discrepancies -- React version, delete all references to retired REST clients, rename WebSocket client classes throughout, rewrite directory tree and file inventory to match reality, appendix switched from REST endpoints to RPC methods, ASCII diagrams converted to Mermaid3h18m2m10.0x90.0x
31Revive admin dashboard test suite: bump test framework, fix accessibility test after REST retirement, add 45 backend tests (WebSocket protocol, connection manager, event forwarder, RPC dispatcher) and 42 frontend tests (WebSocket client, hooks, stores, anomaly panel); rewrite testing strategy; convert architecture diagram to Mermaid5h33m3m9.1x100.0x
32Merge student profile career editor into Profile tab; fix navigation avatar to route to profile; add redirect; wire bio field to backend1.5h10m2m9.0x45.0x
33Admin dashboard Phase 5 WebSocket migration completion: migrate account, billing, system health, and notifications pages off REST to multiplexed WebSocket RPC; add health current, history, and subscription RPC methods; broadcast health updates on poll cycles; consolidate notification types; delete 6 retired REST client modules; TypeScript and build clean6h42m4m8.6x90.0x
34Lesson system architecture doc (storage, personalization, delivery, rendering, 8 extensibility subsections); global Mermaid-first diagram standard established; converted all ASCII diagrams across architecture documentation to Mermaid (multiple protocol, training pipeline, recalibration pipeline, and learner flow diagrams); replaced ASCII pseudo-diagram with proper tables3h22m3m8.2x60.0x
35Wire Postgres and cache dual-write persistence for 5 subsystem stores (learner fingerprint store, hardware drift detector, adversarial detector, credit schedule store, autopilot ranker repository)6h45m5m8.0x72.0x
36Fix onboarding service JWKS auth and resume upload CORS chain2h15m6m8.0x20.0x
37Create object storage user-content bucket and wire avatar upload; fix latent legal SSOT build break and package registry lockfile poisoning3h25m3m7.2x60.0x
38Add comprehensive adaptive-lessons FAQ (14 Q&As and viewer mechanics) to help documentation; commit and deploy2h18m3m6.7x40.0x
39Fix 7 pre-existing TypeScript strict-mode errors in web client (button variant, unchecked indexed access fallbacks, ambient module declarations, JSX key spread, unused import, null guards); commit and deploy1h9m2m6.7x30.0x
40Continue-studying button and explainer modal, drop adaptive tile, breadcrumb correction, study-time fractional minutes with flush-on-unmount and cap, onboarding service CORS via parameter store with in-place container restart and build config persistence2h20m3m6.0x40.0x
41Deploy profile tabs across auth service variants and web client; diagnose and manually run database migrations after build pipeline skipped them2h20m2m6.0x60.0x
42Onboarding resume upload fix chain: JWKS auth, missing package data, UX spinner, friendly error messages, auto-bug report4h45m10m5.3x24.0x
43Commit, push, and verify CI/CD deployment of help center and practice exam changes0.5h8m1m3.75x30.0x

Aggregate Statistics

MetricValue
Total tasks43
Total human-equivalent hours435.75
Total Claude minutes1,261
Total supervisory minutes150
Total tokens6,695,500
Weighted average leverage factor20.7x
Weighted average supervisory leverage factor174.3x

Analysis

The highest-leverage task of the day also had the shortest runtime. A local-only RAG server with a 6-tool MCP surface, a git-aware incremental indexer, a vector store backend, a React frontend, and 21 passing tests was built in 19 minutes at 151.6x leverage. That translates to 48 human-equivalent hours of system construction in under 20 minutes of AI time. The architectural choices made during that session -- local-only operation, incremental indexing keyed to git state, MCP as the query interface -- reflect a coherent design that a human architect would spend time deliberating over before writing a line of code. Here the deliberation happened inside the AI's context window, invisibly, during the 19 minutes. The end-to-end smoke test passing on first run is the tell: the design was worked out before implementation started.

The legal and compliance surface work, distributed across several tasks (tasks 3, 4, 5, 28), is worth reading as a single coherent effort. The core problem was that legal documents existed as duplicates across many repositories, each copy drifting independently. The solution was a single-source-of-truth architecture: three brand canonical documents with version numbers, a propagation script with drift detection, CI checks to prevent future drift, and 10 consumer files regenerated from the canonical source across 6 repositories. Four legacy duplicates were deleted. On the compliance side, gap analysis documents were written, subprocessor disclosures were added to the privacy policy, and a DPA action tracker was established in the vendor register. The aggregate AI time for this body of work was roughly 82 minutes. The human-equivalent estimate across those tasks is 38 hours -- legal document architecture decisions, cross-repository coordination, compliance framework mapping, and the actual writing of privacy policy sections are not tasks that accelerate easily. They require careful reading and careful writing. Here the AI compressed them into a morning.

The engine persistence work (tasks 6, 7, 8, 15) was the day's largest single body of work by AI time: 275 minutes spread across 4 tasks representing 131 human-equivalent hours. Task 6 alone was a 15-table schema, a write-through migration, dual-store session management, fragment loading, cross-domain cluster delegation, catalog projections, a content rewriter LLM integration, NLI entailment inside the equivalence class engine, Phase 0 test suite, cascade re-embedding, contradiction detection, portfolio health endpoints, and synthesis planning optimization -- all in 120 minutes at 30x leverage. The 30x factor on a 120-minute task is correct: a senior engineer implementing that surface end-to-end, including the database schema design, the migration, the dual-write repository pattern, and the NLI plumbing, would take the better part of two weeks. The AI got it done in two hours. Tasks 7 and 8 continued the same plan across additional phases, adding 30 more unit tests, replay mining, hardware drift detection, multi-agent safety subsystems, and actor-critic policy training. Two supervisory minutes each.

The admin dashboard REST-to-WebSocket migration (tasks 19, 33) is a useful case study in what "refactor" looks like at high leverage. The problem was a large frontend that communicated with multiple backend services via individual REST clients. The solution was a single multiplexed WebSocket connection with an RPC dispatcher, JWT handling hardened for production edge cases, and 87 new tests. Across both tasks, the AI spent 132 minutes on it against a human estimate of 31 hours. The 87 tests are not a trivial artifact -- they cover WebSocket protocol behavior, connection management, event forwarding, and RPC dispatch on the backend, and WebSocket client lifecycle, hooks, stores, and UI components on the frontend. Writing tests for async, bidirectional network communication requires thinking carefully about timing, reconnection, and message ordering. At 9.1x for the test revival task, the factor correctly reflects that test writing is more constrained work than greenfield implementation, but 5 hours of test writing compressed into 33 minutes is still worth noting.

The day's long tail (tasks 29-43) is a mix of benchmark harnesses, documentation audits, deployment plumbing, and bug fixes. These tasks cluster in the 6-10x range, which is the expected floor for work where the human decision content per AI-minute is highest: diagnosing a resume upload CORS chain across multiple services, running database migrations manually after a pipeline skip, fixing TypeScript strict-mode errors one by one. The pair selector benchmark (task 29) is the most interesting in this group: 7 tuning arms each running 50 random seeds against real pair selection logic, with the outcome being the specific identification of goal-level recency as the fix for hot-goal streaks. That is a proper ablation study, and it ran in 18 minutes. The supervisory leverage of 174.3x for the full day reflects that most of these tasks were specified in 1-3 sentences -- the AI carries the implementation context, not the human.

Let's Build Something!

I help teams ship cloud infrastructure that actually works at scale. Whether you're modernizing a legacy platform, designing a multi-region architecture from scratch, or figuring out how AI fits into your engineering workflow, I've seen your problem before. Let me help.

Currently taking on select consulting engagements through Vantalect.