The old perimeter (network boundary).
for thirty years, the perimeter meant a network drawing. firewall on the outside, VPC inside, IAM controlling who could cross. the assumption · an attacker comes in over the wire, and the job is to keep them out. inside trusted, outside hostile.
that model held because human operators were the only thing inside that could take consequential actions. humans have HR files, badge logs, manager approvals. the perimeter could afford to be a network drawing because the actors inside it were accountable through other systems.
The new perimeter (logging boundary).
agents broke the model. an agent that an enterprise authorized last quarter is, by construction, inside the trust boundary. it has credentials. it can call internal APIs. it can read nonpublic data. network controls cannot stop an authorized agent from doing what it has been authorized to do, and IAM cannot tell you whether what it did was within purpose.
so the question security teams now have to answer is not did anything cross the wire but what did the agent do, under what authorization, at what timestamp, and can we prove it after the fact. that is a logging question, not a network question. the boundary that matters is the boundary between actions that produce a durable record, independently verifiable without contacting Warrant, and actions that do not.
the New York Department of Financial Services put the regulator-side version of this in writing on 16 October 2024. the Industry Letter on AI cybersecurity opens with a sentence the rest of the document depends on:
read that twice. NYDFS did not write a new AI rule. it confirmed the existing rule, in force since 2017 and last amended November 2023, already attaches to AI systems. the audit trail provision inside that rule, 23 NYCRR § 500.6(a)(2), is what does the work:
a sitting US financial regulator, on record, applying an operation-shaped audit trail rule to AI systems handling nonpublic information. the perimeter has moved.
Three implications for security teams in 2026.
- application logs are necessary but not sufficient. gateway logs and APM traces tell you a tool was called and an LLM was invoked. they do not tell you whether the agent had purpose-aligned authorization to take that action against that subject at that moment. the trail § 500.6(a)(2) requires is operation-shaped. APM traces are traffic-shaped. necessary for SRE. not sufficient for the regulator.
- the operative perimeter is the audit trail. who can prove it, controls it. access control lists tell you who is allowed to do what. they do not tell you what was done. when an agent has standing authorization to read a customer record, the ACL has already approved every read it will ever make. the only remaining control is a record of each read mapped to a specific obligation, bound to the specific authorization context and independently verifiable without contacting Warrant.
-
regulators have already crossed this line.
NYDFS issued its AI Industry Letter on 16 October 2024, applying 23 NYCRR Part 500 (in force since 2017) to AI systems. The letter is explicit that no new statutory text was needed:
"This Guidance does not impose any new requirements beyond obligations that are in DFS's cybersecurity regulation codified at 23 NYCRR Part 500." NYDFS Industry Letter · 16 October 2024 · paragraph 2EU AI Act Article 12 requires automatic recording of events over the lifetime of high-risk AI systems, application 2 August 2026 in the AI Act as enacted, subject to the May 2026 Omnibus provisional deferral to 2 December 2027 pending OJEU. SEBI's algo-ID mandate took full effect 1 April 2026. the agent-perimeter shift is codified across jurisdictions on a fixed timeline that does not turn on operator readiness.
What this changes operationally.
three shifts for SOC and compliance teams in the next twelve months.
one. the SOC's agent-incident playbook now needs a forensic-evidence step, not just a containment step. when an agent does something it should not have done, the question regulators will ask is not whether you stopped it but whether you can prove what authorization context was active when it acted. that proof must be producible in days, not in a six-month log-mining exercise.
two. compliance and security have to stop treating audit trails as a logging-team problem. the trail is now the primary control. the team that owns it owns the perimeter. in most enterprises that team does not yet exist as a named function. it will, by the end of 2026.
three. evidence portability matters. a trail that lives only inside one vendor's system is not evidence in the legal sense, because the vendor can mutate it. the trails that survive enforcement scrutiny are the ones a regulator can verify without contacting the operator. that is a record mapped to a specific obligation and independently verifiable, not a query result.
Three perimeters that are now logging boundaries.
the framing is not abstract. three sectors have already had their perimeter redrawn by enforcement-shaped rule-making. each one is an example of the same shift · the boundary that used to be the network is now the model-output, the prompt-and-response, or the recommendation API. each one is in force today.
read across the three rows · the sectoral statute does not name AI in any of them. SR 11-7 was written in 2011, HIPAA's audit-control provision in 2003, NYDFS Part 500 in 2017, SEBI's algo rules pre-date the current LLM moment. what changed is not the rule. what changed is where the rule attaches now that the regulated output sits at a model boundary.
What was already known but underweighted.
none of what follows is new to anyone who has built audit infrastructure for a regulated industry. five things security architects knew five years ago. the AI-agent moment makes them load-bearing in a way that sectoral compliance teams had not yet had to act on.
- logging completeness is harder than logging correctness.you can fake a log line, redact a log line, or add a log line after the fact. you cannot fake a complete record. completeness is what the regulator examines · what was not logged is what the operator cannot defend. the engineering question shifts from "is this log line accurate" to "can the operator demonstrate, at write time, that no qualifying action escaped the trail." the answer is structural · the log emission has to live inside the call boundary, not outside it.
- tamper-evidence beats tamper-proofness.a record that is independently verifiable without contacting the operator is what regulators recognise. tamper-proof storage is a marketing claim · tamper-evident is a court-admissible one. the difference matters because no storage is actually proof against the operator with admin credentials, but a record mapped to a specific obligation can be checked by a counterparty and shows as altered if it has been altered. the standard is "could a counterparty independently detect a change" · not "could anyone in the world prevent a change."
- distributed tracing semantics become regulatory.the W3C Trace Context spec was written for SRE observability · trace IDs, span IDs, parent-child causality. the same primitives are now what a regulator needs to follow an action across a multi-system agent run. when an agent calls a tool, which calls an LLM, which calls another tool, the regulator wants the same parent-child reconstruction the SRE team uses for latency analysis. the trace context becomes part of the audit record.
- the audit boundary moves with the API boundary.every time an agent calls a new external API, the audit boundary moves with it. compliance that was scoped to the operator's data centre now follows the request out to model providers, vector stores, retrieval endpoints, and tool-call surfaces. the operator does not get to decide where the boundary is · the regulator follows the data and the decision-shape, not the operator's network diagram.
- retention is sectoral and the floor is set by the most demanding sector.EU AI Act Article 19 sets a floor of six months for high-risk system logs. SR 11-7 implies seven years for model-related records at a US bank. HIPAA pulls healthcare records to ten years and beyond. MiFID II runs five to seven years for trading-related records. the operator's retention policy is a maximum, not a minimum · the floor is set by the most demanding regime that any single action in the trace touches. an agent that crosses banking and healthcare in one run inherits the longer of the two retention windows for that entire trace.