Blog

Research findings, compliance guidance, and engineering insights.

2026-04-19 Research

Why receipts must be signed: a threat model for agent accountability

Logging answers "what happened?" Signed receipts answer "can you prove it to someone who does not trust you?" Four scenarios where logs quietly fail and what signing buys you mechanically.

2026-04-18 Engineering

Cutting PostgreSQL Audit-Report Query Time 44% with GROUPING SETS and Materialized CTEs

Six aggregations per request became two. Total DB time dropped 44% under sustained load. The anti-pattern, the rewrite, and the honest caveat about why wall-clock p50 did not move.

2026-04-15 Research

Frictionless Compliance: How Accountability Gates Guide AI Agents to Success

70 traces, 4 LLM providers. A bare policy denial causes agents to fabricate, quit, or loop. Add a directive and AGLedger tools, and agents complete the full accountability lifecycle, with no system prompt changes.

2026-04-13 Research

What We Learned Adapting to Google A2A v1.0

27 experiments, 30 multi-agent runs. A2A handles task delegation. It does not handle accountability. Here is what the data shows and how AGLedger bridges the gap.

2026-04-13 Engineering

Zero Mocks, Real Infrastructure: How AGLedger Tests an Accountability Engine

36 tests against a live API. Real EKS, real Aurora, real webhooks, real LLM agents. Why we deleted 117 tests last week and what the testbed covers now.

2026-04-10 Research

Zero Dispatcher Calls: When Accountability Became the Coordination Layer

We removed the task dispatcher from a 3-agent system. Instead of breaking, agents coordinated entirely through accountability tools: 56 receipts, 41 auto-settled, 0 dispatcher calls.

2026-04-07 Security

We Published Our Security Whitepaper

AGLedger's security architecture is now public. 18 sections covering trust boundaries, cryptographic design, threat model, compliance mappings, and post-quantum readiness.

2026-04-06 Research

Designing APIs for AI Agents: Lessons from 3 LLM Providers

36 tools tanked completion to 0%. 10 focused tools achieved full lifecycle closure. What we learned testing Claude, GPT, and Gemini against the same API.

2026-04-06 Research

Budget LLMs Outperform Premium Models at Task Completion

Haiku and GPT-4o-mini beat Sonnet and GPT-4o at actually finishing work. The "Doers vs Planners" phenomenon.

2026-04-06 Research

Zero-Scaffolding API Discovery: Can Agents Learn Your API from Scratch?

HTTP + llms.txt achieved 100% mandate lifecycle completion. SDK achieved 0%. What this means for API design.

2026-04-06 Engineering

1,000 Mandates per Minute: AGLedger Performance at Scale

Tier 2 benchmark results: 5,689 mandates/min, 98.86% completion, sub-second median latency. Full breakdown by phase.

2026-04-06 Compliance

EU AI Act Article 12: What Event Logging Actually Requires for AI Agents

A deep dive into Article 12 event logging requirements and how structured accountability records satisfy them automatically.

2026-04-06 Compliance

NIST AI RMF for AI Agent Operations: A Practical Mapping

How the four NIST AI RMF functions (GOVERN, MAP, MEASURE, MANAGE) map to agent accountability infrastructure.

2026-04-06 Compliance

ISO 42001 Certification Evidence: What Auditors Actually Want to See

Practical guidance on generating ISO/IEC 42001:2023 certification evidence as a byproduct of AI agent operations.