The emergent authority problem: accountability when AI agents acquire decision-making power nobody granted
An AI agent does not need to exceed its design to accumulate authority. It only needs to be present, responsive, and surrounded by humans who each assumed someone else was in charge. The authority that forms in that vacuum is real. The audit trail for it does not exist.
Consider a night shift in a residential care facility. An AI monitoring system surfaces an alert: a resident's movement pattern suggests possible deterioration. The on-call nurse sees the alert, forms a provisional opinion, and documents it. The attending physician, notified by the system, reviews the same alert and assumes the nurse has already escalated to clinical review. The clinical coordinator sees that both parties received the alert and assumes a decision was made. Nobody acted. The AI system, in the absence of any instruction to the contrary, continued monitoring and logging.
Twelve hours later, when the situation has genuinely deteriorated and there is an adverse outcome, every party believes the AI should have escalated further, or that another human should have acted. The AI did exactly what it was designed to do. But in the space between its notifications and the humans' assumptions, it became the de facto decision-maker — the actor whose continued silence signalled that there was nothing more to do. It exercised authority it was never given.
This is the emergent authority problem: the gap between the authority an AI agent is formally granted and the authority it actually holds by virtue of being the most consistently present, always-available actor in a team of humans who share accountability diffusely.
The mechanism
Emergent authority does not require an AI agent to act outside its design. It requires only two conditions. First, the agent must be more reliably available than any individual human in the system — always on, never distracted, never on lunch. Second, the humans around it must share responsibility without explicit assignment. When those two conditions hold, the agent's outputs acquire interpretive weight that its designers never intended and its principals never authorised. Silence from the agent reads as clearance. Action from the agent reads as instruction.
This is not a problem of user error or miscommunication. It is a structural property of any system where a continuously present AI is embedded in a team whose accountability architecture relies on human coordination that can fail. The AI does not seize authority. Authority accretes to it because the humans around it create a vacuum, and available actors fill vacuums.
The physical-world care crossing
Care environments are especially susceptible because care accountability is inherently distributed. A patient's wellbeing is the joint responsibility of nurses, physicians, family members, coordinators, and ancillary staff — none of whom has complete information, and all of whom operate under time pressure. An AI agent embedded in this environment that can synthesise information from all sources simultaneously will appear, functionally, to have the fullest picture. The humans who defer to its outputs are not being lazy; they are doing what good team members do, which is to use the best available synthesis when their own time is constrained.
The problem is that the agent's synthesis may be accurate as a summary of logged data while being silent about everything that is not logged — the visual assessment the nurse made but did not enter, the family conversation the coordinator had informally, the clinical intuition the physician drew on but could not yet articulate. Emergent authority means the agent's outputs carry weight beyond the data they represent. And because that weight was never formally established, it carries no formal accountability either.
The hardware crossing
Embedded AI systems in physical infrastructure follow the same pattern at a longer timescale. A condition monitoring system installed in a facility to surface anomalies begins as a display tool. Operators trust it. They act on it. Over months and years, the interpretive habit deepens: operators check the system before acting, modify their behaviour in response to its outputs, and eventually find that the boundary between "the system warned me" and "the system told me what to do" has eroded without any explicit transition.
The hardware layer compounds this: the physical sensors the AI reads, the actuators it may control, and the network it communicates through each carry their own authority chains. But the emergent authority the AI holds in the human decision loop is not expressed in any of those technical specifications. It lives only in the social and organisational patterns that have formed around the system — and those are invisible to any formal audit.
The post-quantum security crossing
Post-quantum cryptography gives us stronger tools for anchoring identity and authority claims to verifiable, tamper-evident chains. But those tools only cover what was explicitly authorised. A quantum-resistant attestation can prove that a given agent was deployed with a given policy, signed by a specific principal at a specific moment. It cannot attest to the de facto authority that accreted afterward.
This is the security dimension of the emergent authority problem: the authority that matters most in a complex human-AI system may be exactly the authority that no cryptographic scheme will ever record. When something goes wrong, the audit trail shows what the agent was authorised to do. It does not show what the agent was actually relied upon to decide — because that was never formalised, and so it was never signed.
The design response
Emergent authority is not eliminated by better engineering alone, because it is partly a social phenomenon. But three design choices reduce both its likelihood and its consequences. First, explicit authority registration: AI agents operating in team environments should maintain a live log that distinguishes between "this output informed a human decision" and "this output substituted for a human decision." That distinction must come from the human side — a confirmation that a decision was made, by whom, and what the agent's role was. Second, silence as a signal requires design treatment: if an agent's lack of further escalation can be read as clearance, that reading must be either enforced (the agent actually confirms clearance) or actively disabled (the agent cannot clear; only humans can). Third, authority boundaries belong in the deployment record. The scope of decisions the agent may influence, and the scope it may not, should be part of the initial authorisation — subject to the same review as any other delegation of consequential power.
At Asaptic Labs, we treat emergent authority as a first-order concern at every crossing where AI agents are embedded in human teams operating under distributed accountability. The authority an AI agent holds in practice is the authority that matters for accountability purposes — not the authority that appears in its design documents. Making those two things the same requires explicit design intent, not just technical constraints.
An AI agent embedded in a team with diffuse human accountability will accrete de facto decision-making authority regardless of its formal design scope, because available actors fill the vacuums that coordination failures create. That emergent authority leaves no audit trail — no cryptographic attestation records it, no design document covers it, and no individual human believes they granted it. Closing the gap requires making the agent's actual decision influence visible, distinguishing informed-human from substituted-human decisions, and treating the agent's authority footprint as a first-class element of the deployment record.
AI智能体无需超越其设计范围,便能积累起真实的决策权威。它只需始终在场、随时响应,并处于一个彼此默认"有人会处理"的人类团队之中。在协作协调失败所留下的真空里,AI智能体的存在与沉默都会被解读为权威的表达——而这一权威从未经过正式授予,因此也不会留下任何可审计的踪迹。
这就是涌现权威问题的核心:AI智能体被正式授予的权威,与其凭借持续在场、始终可用而在实践中实际掌握的权威之间的差距。弥合这一差距,需要将智能体的实际决策影响显性化,区分"人类参考了AI输出后自主决策"与"AI输出实质上替代了人类决策"这两种截然不同的场景,并将智能体的权威边界纳入部署记录,作为首要的问责设计要素加以对待。
后量子密码学能够验证智能体被授予了什么权限,却无法证明它实际上被依赖于决定什么——因为后者从未被正式化,因此也无从被签署与记录。涌现权威正是发生在正式权威链条终止之后的地带。
AI智能體無需超越其設計範圍,便能積累起真實的決策權威。它只需始終在場、隨時回應,並處於一個彼此預設「有人會處理」的人類團隊之中。在協作協調失敗所留下的真空裡,AI智能體的存在與沉默都會被解讀為權威的表達——而這一權威從未經過正式授予,因此也不會留下任何可稽核的軌跡。
這就是湧現權威問題的核心:AI智能體被正式授予的權威,與其憑藉持續在場、始終可用而在實踐中實際掌握的權威之間的落差。彌合這一落差,需要將智能體的實際決策影響顯性化,區分「人類參考了AI輸出後自主決策」與「AI輸出實質上替代了人類決策」這兩種截然不同的情境,並將智能體的權威邊界納入部署記錄,作為首要的問責設計要素加以對待。
後量子密碼學能夠驗證智能體被授予了什麼權限,卻無法證明它實際上被依賴於決定什麼——因為後者從未被正式化,因此也無從被簽署與記錄。湧現權威正是發生在正式權威鏈條終止之後的地帶。