The organizational forgetting problem: accountability when AI deployment erodes the institutional knowledge required to evaluate AI decisions
Oversight assumes that the people responsible for approving AI decisions can tell when those decisions are wrong. The organizational forgetting problem is what happens when the act of deploying AI gradually destroys that capacity — leaving institutions formally responsible for outcomes they can no longer meaningfully evaluate.
Every AI accountability framework rests on a quiet assumption: that the humans positioned as oversight are capable of recognizing when an AI system is wrong. This assumption is rarely stated because it seems obviously true at the moment of deployment. The engineers who built the system understand it. The domain experts who approved its use can spot an implausible output. The operations team has seen enough raw data to recognize when a model is producing anomalies. Oversight is not empty at the point of launch.
What changes is what happens afterward.
As AI agents take over a task — flagging anomalies, drafting care assessments, approving access requests, identifying hardware defects — the humans who were performing that task stop doing it. They stop doing it not because they have been removed from the loop, but because the loop no longer requires them in the way it once did. They review outputs. They approve recommendations. They remain formally responsible. But the hours of daily practice that built their ability to independently evaluate those outputs are no longer accumulating. The expertise that made oversight meaningful is slowly replaced by familiarity with the interface through which oversight is performed.
Over months, and then years, the institution loses the capacity its accountability architecture was designed to rely on. Not dramatically, through a sudden failure. Gradually, through a slow substitution of genuine judgment for reviewed output. By the time the AI system produces a category of error that the oversight process was supposed to catch, the oversight process no longer contains people who could catch it.
Why this is structurally different from skill atrophy
The individual version of this problem — a surgeon who loses manual skill by relying on robotic assistance, a pilot who loses hand-flying ability through automation — is well documented. The organizational version is structurally different and more dangerous.
Individual skill atrophy affects one person's capacity and can be addressed through individual retraining. When an organization loses the institutional knowledge required to evaluate an AI system, several things are simultaneously true. The knowledge is distributed: no single person holds it, and no single person's atrophy explains its absence. The loss is invisible: the organization continues to perform oversight, and its metrics — review rates, approval times, escalation counts — show no degradation. The knowledge is not easily reconstructed: it exists in the accumulated judgment of people who no longer practice the underlying task, and rebuilding it requires reestablishing the pipeline of practice that its absence has closed. And the loss is self-reinforcing: the less institutional capacity exists to evaluate AI outputs critically, the more those outputs are taken as authoritative, which further reduces the demand for independent evaluation, which further erodes the capacity.
The accountability gap this creates is not a gap in the formal structure of oversight. It is a gap between the formal structure and its actual content. The institution has an oversight process. That process is being executed. The people executing it are formally qualified. But the knowledge that would allow them to recognize a systematic error has degraded to the point where they cannot catch what the process was designed to catch.
The post-quantum security crossing
In security contexts, the organizational forgetting problem has a specific and severe form. Cryptographic systems require institutional knowledge to evaluate correctly: the ability to assess whether an implementation is secure, whether a key ceremony was conducted with appropriate rigor, whether an attestation chain is valid, whether a migration to quantum-resistant algorithms has actually eliminated the vulnerabilities it was designed to address. This knowledge lives in a small number of people who have built it through years of exposure to both correct and flawed implementations.
When an organization deploys AI tooling to automate cryptographic audits, certificate validation, or security posture assessment, the cryptographers who would otherwise be performing those functions gradually redirect their attention. The AI produces outputs; they review outputs. If the AI's outputs are plausible — consistent with past results, free of obvious anomalies, formatted correctly — the review is brief. The ability to independently verify that the AI's assessment is correct, rather than merely consistent with prior outputs, requires exactly the practice that the AI deployment has made unnecessary.
The consequence is an organization that is formally conducting cryptographic oversight but practically incapable of detecting a systematic error in its own security posture — including an error introduced by a transition to post-quantum algorithms that was nominally completed but cryptographically inadequate. The AI assessed the transition as complete. The reviewers found the assessment plausible. No one in the organization had the capacity to verify that the assessment was correct.
The hardware crossing
Hardware verification is one of the domains most susceptible to organizational forgetting. The ability to evaluate whether a chip design is secure, whether a firmware attestation is valid, whether a hardware root of trust has been correctly implemented — these capacities require sustained exposure to both the design process and the failure modes that arise when it goes wrong. AI systems that automate design verification, security analysis, and supply-chain attestation reduce the demand for human judgment at exactly the points where human judgment is most difficult to maintain.
The hardware lifecycle creates a specific timing risk. An organization that deploys AI verification tooling this year will find, in five years, that the engineers who remember what manual verification looked like are approaching retirement or have moved on. Their institutional knowledge — which errors look like which, which specifications are commonly misread, which suppliers introduce which anomalies — does not transfer to the AI system's audit trail in a form that can be recovered. It lives in the people who held it, and when they leave, it is gone. The AI continues to produce verification outputs. The organization continues to review them. The capacity to recognize that the review has become meaningless has already been lost.
The physical-world care crossing
In care environments, the organizational forgetting problem acquires an additional dimension: the people whose welfare depends on the accuracy of AI assessments are often least able to identify and report when those assessments are wrong.
A care organization that deploys AI to assess resident wellbeing, flag deterioration, and generate care plans gradually shifts the work of clinical judgment from trained care staff to the task of reviewing AI outputs. The knowledge that makes clinical judgment possible — learned through years of observing the subtlety of how decline presents, the difference between medication-induced lethargy and infection-onset lethargy, the patterns that experienced nurses carry in their practice — is not recorded in the AI's input features. It exists in the people who built it through exposure to the full range of clinical presentation.
As AI outputs become the primary medium through which clinical decisions are structured, the hours of direct observation that build that judgment decrease. The staff who would have developed it are reviewing outputs rather than practicing the underlying skill. When the AI makes a systematic error — misclassifying a class of deterioration signals that it has not been trained to recognize — the institution may no longer contain the clinical expertise to recognize that the classification is wrong. The residents cannot report the error themselves. The staff review outputs they lack the expertise to challenge. The oversight structure remains formally intact and substantively hollow.
The accountability design response
The organizational forgetting problem does not have a simple solution, but it has a tractable structural response. Accountability architecture for AI deployments in consequential domains should treat institutional evaluation capacity as a resource that requires active maintenance, not a fixed property of the workforce.
This means several things in practice. First, the oversight process for high-stakes AI decisions should include periodic exercises in which designated reviewers evaluate a sample of AI decisions by performing the underlying judgment independently — without the AI output — and comparing results. This is not an audit of the AI system. It is an audit of the evaluation capacity of the people responsible for oversight. If the gap between their independent judgment and the AI's output is widening, and they can no longer identify which side of the gap is correct, the oversight architecture has begun to fail.
Second, the institutional knowledge that grounds evaluation capacity should be treated as a form of organizational asset that requires succession planning. When the people who built their expertise before AI deployment retire or move on, the question of where the evaluative capacity to replace them will come from needs to be answered before they leave, not after.
Third, authorization documents for AI deployments in high-stakes domains should explicitly identify the evaluation capacity they depend on — which expertise, held by whom, in what minimum quantity — and specify what happens to the deployment if that capacity falls below the threshold required for meaningful oversight.
The accountable AI deployment is not the one with the most rigorous formal oversight structure. It is the one whose oversight structure is designed on the assumption that the capacity of those performing oversight will degrade — and that has taken explicit steps to prevent, detect, and respond to that degradation before it makes accountability impossible.
AI deployment progressively erodes the institutional knowledge that makes oversight meaningful. As AI agents absorb the daily practice through which evaluative expertise is built, the humans formally responsible for oversight lose the capacity to recognize systematic errors — leaving the oversight structure intact and substantively hollow. The organizational forgetting problem is distinct from individual skill atrophy: the loss is invisible to organizational metrics, distributed across the workforce, self-reinforcing, and difficult to reverse. Accountability design for consequential AI must treat institutional evaluation capacity as an asset that requires active maintenance, succession planning, and explicit minimum thresholds — not an assumed property of the organizations it is deployed into.
每一套AI问责框架都建立在一个未言明的假设之上:承担监督职责的人能够识别出AI系统的错误。这个假设在部署时看起来显而易见——构建系统的工程师理解它,批准使用的领域专家能发现不合理的输出,运营团队见过足够多的原始数据,能够识别模型产生的异常。监督在上线时并非空洞的。
改变的是此后发生的事情。
随着AI智能体接管一项任务——标记异常、起草照护评估、批准访问请求、识别硬件缺陷——原本执行这项任务的人类停止了亲手实践。他们停下来,不是因为被移出了流程,而是因为流程不再像以前那样需要他们。他们审查输出,批准建议,仍然承担正式责任。但那些在过去积累起独立评估能力的每日实践,已不再进行。使监督具有实质意义的专业知识,正被对执行监督所用界面的熟悉感悄然取代。
数月之后,再经过数年,机构便失去了问责架构赖以存在的基础。这种失去并不戏剧化,不是通过突然的崩溃,而是通过真实判断被审查输出缓慢替代而发生。当AI系统产生某类原本应该被监督流程捕获的错误时,监督流程中已不再有人能够捕获它。
为何这与技能退化在结构上不同
这个问题的个体层面——依赖机器人辅助的外科医生失去手术操作技能、依赖自动化系统的飞行员失去手动驾驶能力——已有充分记录。组织层面的版本在结构上截然不同,且更为危险。
个体技能退化影响一个人的能力,可以通过个人再培训来解决。当一个组织失去评估AI系统所需的机构性知识时,多种情况同时成立。这种知识是分布式的:没有单一个体持有全部,也没有哪个人的退化可以解释其缺失。这种损失是不可见的:组织继续执行监督,其指标——审查率、审批时间、上报次数——没有显示任何下降。这种知识不易重建:它存在于那些不再实践基础任务的人的累积判断中,重建它需要重新建立其缺席已经关闭的实践渠道。这种损失是自我强化的:对AI输出进行批判性评估的机构能力越弱,这些输出越被视为权威,进而进一步降低对独立评估的需求,进而进一步侵蚀评估能力。
由此产生的问责缺口,不在于监督的正式结构存在漏洞,而在于正式结构与其实际内容之间的鸿沟。机构拥有监督流程,流程在执行中,执行者具有正式资质。但本应让他们识别系统性错误的知识,已经退化到无法捕获流程本应捕获之错误的程度。
后量子安全交叉点
在安全领域,组织遗忘问题有着特定且严峻的形式。密码系统需要机构性知识才能得到正确评估:评估一个实现是否安全、密钥仪式是否以足够的严格性执行、认证链是否有效、向抗量子算法的迁移是否真正消除了其设计要解决的漏洞。这种知识存在于少数人身上,通过多年接触正确与有缺陷的实现而建立起来。
当一个组织部署AI工具自动化密码审计、证书验证或安全态势评估时,原本执行这些功能的密码学家会逐渐将注意力转移。AI产生输出;他们审查输出。如果AI的输出看起来合理——与以往结果一致、没有明显异常、格式正确——审查便会简短结束。独立验证AI评估是否正确(而非仅仅与以往输出一致)的能力,需要的恰恰是AI部署使之不再必要的那种实践。
其结果是一个正式进行密码监督、却实际上无法检测自身安全态势中系统性错误的组织——包括一次名义上完成、实则密码学上不充分的后量子算法迁移所引入的错误。AI评估迁移已完成。审查者发现评估合理。组织内没有人具备验证评估是否正确的能力。
硬件交叉点
硬件验证是最容易出现组织遗忘的领域之一。评估芯片设计是否安全、固件认证是否有效、硬件信任根是否正确实现——这些能力需要对设计流程及出错时产生的故障模式持续接触。自动化设计验证、安全分析和供应链认证的AI系统,在人类判断最难以维持的关键节点上降低了对人类判断的需求。
硬件生命周期带来了特定的时间风险。今年部署AI验证工具的组织,五年后会发现还记得手动验证是什么样子的工程师已临近退休或已离职。他们的机构性知识——哪些错误呈现何种特征、哪些规格常被误读、哪些供应商引入哪些异常——无法以可恢复的形式转移到AI系统的审计追踪中。它存在于持有者身上,当他们离开时,它便消失了。AI继续产生验证输出。组织继续审查它们。意识到审查已变得毫无意义的能力,已经先行丧失。
物理世界照护交叉点
在照护环境中,组织遗忘问题多了一个维度:其福祉依赖于AI评估准确性的人,往往最难以识别并报告这些评估存在的错误。
部署AI评估居民健康状况、标记病情恶化并生成照护计划的照护机构,会逐渐将临床判断的工作从训练有素的照护人员转移到审查AI输出的任务上。使临床判断成为可能的知识——通过多年观察病情如何微妙地呈现而习得的知识:药物引起的倦怠与感染起病的倦怠之间的差别、有经验的护士在实践中积累的规律——并未记录在AI的输入特征中。它存在于那些通过接触完整临床表现范围而建立起这种知识的人身上。
随着AI输出成为临床决策的主要载体,积累这种判断力所需的直接观察时间减少了。本应发展这种能力的员工转而审查输出,而非实践基础技能。当AI出现系统性错误——对一类未被训练识别的病情恶化信号进行错误分类时——机构可能已不再具备识别该分类错误的临床专业知识。居民无法自行报告错误。员工审查着他们缺乏专业知识去质疑的输出。监督结构在形式上完整,在实质上空洞。
问责设计的回应
组织遗忘问题没有简单的解决方案,但有一个可操作的结构性回应。高风险领域AI部署的问责架构,应将机构评估能力视为需要主动维护的资源,而非劳动力的固有属性。
这在实践中意味着几件事。首先,高风险AI决策的监督流程应包括定期演练,指定的审查者在不查看AI输出的情况下独立评估一批AI决策样本,然后比较结果。这不是对AI系统的审计,而是对负责监督的人员评估能力的审计。如果他们独立判断与AI输出之间的差距在扩大,而他们已无法识别差距的哪一侧是正确的,监督架构便已开始失效。
其次,支撑评估能力的机构性知识应被视为需要接班规划的组织资产。当在AI部署前建立专业知识的人员退休或离职时,关于从哪里获得替代评估能力的问题,需要在他们离开之前而非之后得到回答。
第三,高风险领域AI部署的授权文件应明确标识其所依赖的评估能力——哪种专业知识、由谁持有、以何种最低数量——并规定如果该能力降至有意义监督所需的阈值以下,部署将如何处理。
负责任的AI部署,不是拥有最严格正式监督结构的部署,而是其监督结构在设计时就假设执行监督者的能力会退化——并在退化使问责变为不可能之前,采取了明确措施来预防、检测和响应这种退化的部署。
AI部署逐步侵蚀使监督具有实质意义的机构性知识。随着AI智能体吸收每日实践——评估性专业知识正是在这些实践中建立的——正式负责监督的人类失去了识别系统性错误的能力,留下形式完整、实质空洞的监督结构。组织遗忘问题不同于个体技能退化:这种损失对组织指标不可见,分布于整个员工队伍,具有自我强化性,且难以逆转。高风险AI的问责设计必须将机构评估能力视为需要主动维护、接班规划和明确最低阈值的资产——而非其所部署组织的固有属性。
每一套AI問責框架都建立在一個未言明的假設之上:承擔監督職責的人能夠識別出AI系統的錯誤。這個假設在部署時看起來顯而易見——構建系統的工程師理解它,批准使用的領域專家能發現不合理的輸出,運營團隊見過足夠多的原始數據,能夠識別模型產生的異常。監督在上線時並非空洞的。
改變的是此後發生的事情。
隨著AI智能體接管一項任務——標記異常、起草照護評估、批准存取請求、識別硬體缺陷——原本執行這項任務的人類停止了親手實踐。他們停下來,不是因為被移出了流程,而是因為流程不再像以前那樣需要他們。他們審查輸出,批准建議,仍然承擔正式責任。但那些在過去積累起獨立評估能力的每日實踐,已不再進行。使監督具有實質意義的專業知識,正被對執行監督所用界面的熟悉感悄然取代。
數月之後,再經過數年,機構便失去了問責架構賴以存在的基礎。這種失去並不戲劇化,不是透過突然的崩潰,而是透過真實判斷被審查輸出緩慢替代而發生。當AI系統產生某類原本應該被監督流程捕獲的錯誤時,監督流程中已不再有人能夠捕獲它。
為何這與技能退化在結構上不同
這個問題的個體層面——依賴機器人輔助的外科醫生失去手術操作技能、依賴自動化系統的飛行員失去手動駕駛能力——已有充分記錄。組織層面的版本在結構上截然不同,且更為危險。
個體技能退化影響一個人的能力,可以透過個人再培訓來解決。當一個組織失去評估AI系統所需的機構性知識時,多種情況同時成立。這種知識是分散式的:沒有單一個體持有全部,也沒有哪個人的退化可以解釋其缺失。這種損失是不可見的:組織繼續執行監督,其指標——審查率、審批時間、上報次數——沒有顯示任何下降。這種知識不易重建:它存在於那些不再實踐基礎任務的人的累積判斷中,重建它需要重新建立其缺席已經關閉的實踐管道。這種損失是自我強化的:對AI輸出進行批判性評估的機構能力越弱,這些輸出越被視為權威,進而進一步降低對獨立評估的需求,進而進一步侵蝕評估能力。
由此產生的問責缺口,不在於監督的正式結構存在漏洞,而在於正式結構與其實際內容之間的鴻溝。機構擁有監督流程,流程在執行中,執行者具有正式資質。但本應讓他們識別系統性錯誤的知識,已經退化到無法捕獲流程本應捕獲之錯誤的程度。
後量子安全交叉點
在安全領域,組織遺忘問題有著特定且嚴峻的形式。密碼系統需要機構性知識才能得到正確評估:評估一個實現是否安全、金鑰儀式是否以足夠的嚴格性執行、認證鏈是否有效、向抗量子演算法的遷移是否真正消除了其設計要解決的漏洞。這種知識存在於少數人身上,透過多年接觸正確與有缺陷的實現而建立起來。
當一個組織部署AI工具自動化密碼審計、憑證驗證或安全態勢評估時,原本執行這些功能的密碼學家會逐漸將注意力轉移。AI產生輸出;他們審查輸出。如果AI的輸出看起來合理——與以往結果一致、沒有明顯異常、格式正確——審查便會簡短結束。獨立驗證AI評估是否正確(而非僅僅與以往輸出一致)的能力,需要的恰恰是AI部署使之不再必要的那種實踐。
其結果是一個正式進行密碼監督、卻實際上無法檢測自身安全態勢中系統性錯誤的組織——包括一次名義上完成、實則密碼學上不充分的後量子演算法遷移所引入的錯誤。AI評估遷移已完成。審查者發現評估合理。組織內沒有人具備驗證評估是否正確的能力。
硬體交叉點
硬體驗證是最容易出現組織遺忘的領域之一。評估晶片設計是否安全、韌體認證是否有效、硬體信任根是否正確實現——這些能力需要對設計流程及出錯時產生的故障模式持續接觸。自動化設計驗證、安全分析和供應鏈認證的AI系統,在人類判斷最難以維持的關鍵節點上降低了對人類判斷的需求。
硬體生命週期帶來了特定的時間風險。今年部署AI驗證工具的組織,五年後會發現還記得手動驗證是什麼樣子的工程師已臨近退休或已離職。他們的機構性知識——哪些錯誤呈現何種特徵、哪些規格常被誤讀、哪些供應商引入哪些異常——無法以可恢復的形式轉移到AI系統的審計追蹤中。它存在於持有者身上,當他們離開時,它便消失了。AI繼續產生驗證輸出。組織繼續審查它們。意識到審查已變得毫無意義的能力,已經先行喪失。
物理世界照護交叉點
在照護環境中,組織遺忘問題多了一個維度:其福祉依賴於AI評估準確性的人,往往最難以識別並報告這些評估存在的錯誤。
部署AI評估居民健康狀況、標記病情惡化並生成照護計劃的照護機構,會逐漸將臨床判斷的工作從訓練有素的照護人員轉移到審查AI輸出的任務上。使臨床判斷成為可能的知識——透過多年觀察病情如何微妙地呈現而習得的知識:藥物引起的倦怠與感染起病的倦怠之間的差別、有經驗的護士在實踐中積累的規律——並未記錄在AI的輸入特徵中。它存在於那些透過接觸完整臨床表現範圍而建立起這種知識的人身上。
隨著AI輸出成為臨床決策的主要載體,積累這種判斷力所需的直接觀察時間減少了。本應發展這種能力的員工轉而審查輸出,而非實踐基礎技能。當AI出現系統性錯誤——對一類未被訓練識別的病情惡化信號進行錯誤分類時——機構可能已不再具備識別該分類錯誤的臨床專業知識。居民無法自行報告錯誤。員工審查著他們缺乏專業知識去質疑的輸出。監督結構在形式上完整,在實質上空洞。
問責設計的回應
組織遺忘問題沒有簡單的解決方案,但有一個可操作的結構性回應。高風險領域AI部署的問責架構,應將機構評估能力視為需要主動維護的資源,而非勞動力的固有屬性。
這在實踐中意味著幾件事。首先,高風險AI決策的監督流程應包括定期演練,指定的審查者在不查看AI輸出的情況下獨立評估一批AI決策樣本,然後比較結果。這不是對AI系統的審計,而是對負責監督的人員評估能力的審計。如果他們獨立判斷與AI輸出之間的差距在擴大,而他們已無法識別差距的哪一側是正確的,監督架構便已開始失效。
其次,支撐評估能力的機構性知識應被視為需要接班規劃的組織資產。當在AI部署前建立專業知識的人員退休或離職時,關於從哪裡獲得替代評估能力的問題,需要在他們離開之前而非之後得到回答。
第三,高風險領域AI部署的授權文件應明確標識其所依賴的評估能力——哪種專業知識、由誰持有、以何種最低數量——並規定如果該能力降至有意義監督所需的閾值以下,部署將如何處理。
負責任的AI部署,不是擁有最嚴格正式監督結構的部署,而是其監督結構在設計時就假設執行監督者的能力會退化——並在退化使問責變為不可能之前,採取了明確措施來預防、偵測和回應這種退化的部署。
AI部署逐步侵蝕使監督具有實質意義的機構性知識。隨著AI智能體吸收每日實踐——評估性專業知識正是在這些實踐中建立的——正式負責監督的人類失去了識別系統性錯誤的能力,留下形式完整、實質空洞的監督結構。組織遺忘問題不同於個體技能退化:這種損失對組織指標不可見,分散於整個員工隊伍,具有自我強化性,且難以逆轉。高風險AI的問責設計必須將機構評估能力視為需要主動維護、接班規劃和明確最低閾值的資產——而非其所部署組織的固有屬性。