← Notes from the Crossings
× Post-Quantum Security · × Hardware · × Physical-World Care

The post-incident modification problem: accountability when operational response to an AI adverse event destroys the evidence needed to investigate it

Operational correctness and evidentiary correctness are not the same property. A system that fixes the failure fast but erases the pre-incident state has done its operational job and failed its accountability obligation — and both things are true at the same time.

Asaptic Labs 2026-06-12 5 min read

An AI care agent misinterprets a patient's physiological signal and fails to escalate an alert. The operations team, notified within hours, pushes a configuration patch and triggers a model update. By morning the system is performing correctly. By the time a formal review begins three weeks later, the pre-incident system state — the model version, the configuration, the inference context — no longer exists in a form that any investigation can examine.

This is the post-incident modification problem: when the operational response to an AI adverse event destroys the evidentiary state that a subsequent investigation would require. The destruction is not malicious. It is standard operations. And it is structurally at odds with every accountability obligation the deployer accepted at deployment.

Three modes of evidence destruction

Evidence destruction after an AI adverse event happens in three modes, each individually defensible.

The first is model replacement. When an AI system produces a wrong output, the correct operational response is to update it — retrain on corrected data, push a new checkpoint, switch to a different version. The pre-incident model is overwritten or archived in ways that are not always reproducible. The exact model that produced the wrong output may no longer exist in a runnable state by the time any investigator asks to see it.

The second is configuration change. AI agents in physical-world settings often produce adverse outcomes because of how they were configured, not because of an underlying model failure. Prompt engineering, retrieval thresholds, alert sensitivity settings, integration parameters — each shapes behavior without touching model weights. Updating configuration after an adverse event is so routine that it often does not trigger a formal change-management record at all. The modification that caused the incident and the modification that fixed it can be separated by hours and indistinguishable in the change log.

The third is log rotation and infrastructure refresh. The event logs that would enable reconstruction — the exact prompts, retrieved context, intermediate outputs, hardware state at time of incident — are subject to standard log retention policies designed for operational convenience, not adversarial investigation. In many deployments, ninety days is considered generous. The operational assumption is that logs are for debugging, not for accountability. Those are not the same requirement.

The post-quantum security crossing

Post-quantum cryptographic signing of model weights addresses tampering: it establishes that the weights in production today are the weights the developer signed. It does not establish whether the weights in production today are the same ones running at incident time. A signed model version is a trustworthy record of current state. It is not a record of deployment history.

The transition to post-quantum signing infrastructure introduces a specific risk. Organizations migrating to quantum-resistant signing schemes often re-sign existing artifacts as part of migration — which can sever the signature chain that would otherwise have preserved the incident-time state. An organization that completed a signing infrastructure migration between the adverse event and the investigation may find that the evidentiary chain the investigation requires was disrupted as a routine side effect of a legitimate security improvement.

The opportunity is equally structural. Post-quantum signing architectures being designed and deployed now can include explicit version-history preservation requirements. A signing scheme built to preserve as well as verify — maintaining a tamper-evident, append-only history of what was signed and when — changes the evidentiary calculus for every adverse event investigation that follows deployment.

The hardware crossing

Embedded AI systems in physical care environments — bedside monitoring, assistive devices, environmental sensors with inference layers — receive firmware updates through device management infrastructure that is automated, poorly documented at the individual device level, and rarely timestamped with enough precision to enable pre/post-incident attribution.

Hardware attestation verifies that the current firmware is authentic and unmodified. It does not provide a historical record of what firmware was running at incident time. When a device receives a firmware update in the hours following an adverse event — which is operationally normal, because a device that behaved incorrectly should be updated — the hardware attestation chain for the pre-incident state is severed without any formal record that a severing has occurred. The device is now correct and attested. The incident-time state is gone.

The physical-world care crossing

Care environments create the strongest operational pressure against evidence preservation. A care coordinator who knows that an AI system behaved badly and chooses to delay patching in order to preserve a pre-incident state for investigation is making a decision that every operational norm in care would classify as negligent. The duty to protect current care recipients cannot wait for investigative convenience.

This creates a structural problem that cannot be resolved by assigning it as a failure of any individual party. The operational imperative to fix the problem immediately is correct. The evidence needed to investigate the problem is destroyed anyway. The investigation proceeds on the basis of degraded information, accountability claims are resolved in the absence of the evidence that would have resolved them, and the lesson of the incident — the specific configuration or model behavior that caused the failure — may never be clearly attributed.

The asymmetry compounds over time. Each adverse event that goes uninvestigated because the evidence was destroyed in the course of remediation contributes to an accountability environment where deployers cannot demonstrate due care, investigators cannot reconstruct cause, and the populations served by AI care systems cannot rely on formal accountability as a corrective mechanism.

Toward an accountable response

The post-incident modification problem is not solved by slowing down operational response. It is solved by decoupling operational response from evidence preservation.

The practical requirement is automated pre-modification snapshots: before any model update, configuration change, or firmware deployment, the system captures and preserves the pre-modification state in a tamper-evident store that is outside the operational update path. The snapshot includes model version and hash, configuration parameters, recent structured logs, and hardware firmware state. The store is write-once and separately attested — the operational team can update the live system without the update touching the preserved state.

This is an engineering requirement, not a policy requirement. Policy cannot preserve evidence that the system was never designed to retain. The time to build evidence preservation into physical AI systems is before deployment, when evidentiary requirements can be designed alongside operational requirements rather than discovered in opposition to them after the first serious incident.

At Asaptic Labs, the post-incident modification problem is treated as a design requirement at every crossing. Operational correctness and evidentiary correctness are not the same property, and a system that addresses only the first has satisfied its operational obligation while making its accountability obligations structurally uninvestigable. At the points of irreversible consequence, that is not a satisfactory outcome.

Key point

The post-incident modification problem arises because the correct operational response to an AI adverse event — update the model, patch the configuration, refresh the firmware — systematically destroys the pre-incident state that any meaningful investigation requires. Each mode of destruction is individually defensible. The aggregate effect is that accountability is structurally uninvestigable. Fixing it requires decoupling operational response from evidence preservation: automated, write-once, separately attested pre-modification snapshots before any change to a system involved in an adverse event. This is an engineering requirement that must be designed in before deployment, not a policy that can be applied after the fact.

一个AI照护智能体误读了患者的生理信号,未能触发告警升级。运维团队在数小时内收到通知,推送了配置补丁并触发模型更新。到第二天早上,系统已恢复正常运行。三周后正式审查启动时,事件前的系统状态——模型版本、配置、推断上下文——已不再以任何可供调查的形式存在。

这就是事后修改问题:当针对AI不良事件的运维响应,摧毁了后续调查所需的证据状态时。这种摧毁并非出于恶意,而是标准运维操作——却与部署者接受的每一项问责义务在结构上相互冲突。

三种证据销毁模式

AI不良事件后的证据销毁以三种模式发生,每种模式单独看都有其合理性。

第一是模型替换。当AI系统产生错误输出时,正确的运维响应是更新它——在纠正后的数据上重新训练、推送新检查点、切换到不同版本。事件前的模型被覆盖或以不总能重现的方式归档。产生错误输出的那个确切模型,可能在调查人员要求查看时,已不再以可运行的状态存在。

第二是配置变更。物理世界场景中的AI智能体,其不良事件往往源于配置方式,而非底层模型缺陷。提示工程、检索阈值、告警灵敏度设置、集成参数——每一项都在不触及模型权重的情况下塑造行为。不良事件后更新配置是如此常规,以至于通常根本不触发正式的变更管理记录。导致事件的修改与修复事件的修改,可能相隔数小时,在变更日志中却无从区分。

第三是日志轮换与基础设施刷新。那些能够支撑重建的事件日志——确切的提示、检索上下文、中间输出、事件发生时的硬件状态——受标准日志保留策略约束,这些策略是为运维便利而非对抗性调查设计的。在许多部署中,九十天已属宽松。运维的默认假设是日志用于调试,而非问责。这不是同一个需求。

后量子安全交叉点

对模型权重进行后量子密码学签名解决的是篡改问题:它确立了今天生产环境中的权重就是开发者签署的权重。它无法确立今天生产环境中的权重与事件发生时的权重是否相同。已签名的模型版本是当前状态的可信记录,不是部署历史记录。

向后量子签名基础设施的迁移引入了一个特定风险。将签名基础设施迁移到量子抗性方案的组织,往往会作为迁移的一部分对现有工件重新签名——这可能切断原本可能保存事件时状态的签名链。若一个组织在不良事件和调查之间完成了签名基础设施迁移,调查所需的证据链可能已因合法安全改进的例行副作用而断裂。

机遇同样存在于结构层面。正在设计和部署的后量子签名架构,可以明确纳入版本历史保留要求。一个旨在保存而不仅是验证的签名方案——维护一个防篡改的、仅追加的签署内容和时间历史——将改变其后所有不良事件调查的证据格局。

硬件交叉点

物理照护环境中的嵌入式AI系统——床边监控、辅助设备、带推断层的环境传感器——通过设备管理基础设施接收固件更新。这些更新是自动化的,在设备级别记录不完整,且时间戳精度通常不足以支持事件前后的归因分析。

硬件证明验证当前固件是否真实且未被修改。它不提供事件时运行固件的历史记录。当设备在不良事件后数小时内收到固件更新时——这在运维上是正常的,因为行为不正确的设备应该被更新——事件前状态的硬件证明链被切断,且没有任何正式记录表明这一切断已经发生。设备现在是正确的且经过证明的,事件时状态已经消失。

物理世界照护交叉点

照护环境对证据保存产生最强的运维压力。一个知道AI系统行为异常、却为保存事件前状态供调查而选择推迟修补的照护协调员,是在做出照护领域每一种运维规范都会归类为失职的决定。保护现有照护对象的职责不能等待调查便利。

这产生了一个无法通过归咎于任何个人失职来解决的结构性问题。立即修复问题的运维指令是正确的。调查问题所需的证据仍然被销毁。调查在基于降级信息的基础上进行,问责主张在本应解决它们的证据缺失的情况下被裁定,事件的教训——导致故障的具体配置或模型行为——可能永远无法被清楚地归因。

这种不对称性随时间累积。每一个因补救过程中证据被销毁而未能被调查的不良事件,都在助长一种问责环境:部署者无法证明尽职尽责,调查人员无法重建原因,AI照护系统服务的群体无法将正式问责作为纠正机制来依赖。

迈向问责的回应

事后修改问题的解决,不在于减慢运维响应速度,而在于将运维响应与证据保存解耦。

实践要求是自动化的修改前快照:在任何模型更新、配置变更或固件部署之前,系统在运维更新路径之外的防篡改存储中,捕获并保存修改前状态。快照包括模型版本和哈希值、配置参数、近期结构化日志和硬件固件状态。该存储为一次写入且单独证明——运维团队可以更新线上系统,而更新不会触及已保存的状态。

这是工程要求,不是政策要求。政策无法保存系统从未被设计为保留的证据。将证据保存内置于物理AI系统的时机,是在部署之前——届时证据要求可以与运维要求共同被设计进去,而不是在第一次严重事件后被发现彼此对立。

在Asaptic Labs,事后修改问题被视为每个交叉点的设计要求。运维正确性与证据正确性不是同一属性,仅解决前者的系统,在履行运维义务的同时,使其问责义务在结构上无法被调查。在不可逆后果的节点上,这不是一个令人满意的结果。

核心观点

事后修改问题之所以产生,是因为针对AI不良事件的正确运维响应——更新模型、修补配置、刷新固件——系统性地摧毁了任何有实质意义的调查所需的事件前状态。每种销毁模式单独看都有其合理性,累积效应却是问责在结构上无法被调查。解决方法是将运维响应与证据保存解耦:在涉及不良事件的系统发生任何变更之前,自动进行一次写入、单独证明的修改前快照。这是必须在部署前设计进去的工程要求,而不是事后可以施加的政策。

一個AI照護智能體誤讀了患者的生理信號,未能觸發告警升級。運維團隊在數小時內收到通知,推送了配置補丁並觸發模型更新。到第二天早上,系統已恢復正常運行。三週後正式審查啟動時,事件前的系統狀態——模型版本、配置、推斷上下文——已不再以任何可供調查的形式存在。

這就是事後修改問題:當針對AI不良事件的運維響應,摧毀了後續調查所需的證據狀態時。這種摧毀並非出於惡意,而是標準運維操作——卻與部署者接受的每一項問責義務在結構上相互衝突。

三種證據銷毀模式

AI不良事件後的證據銷毀以三種模式發生,每種模式單獨看都有其合理性。

第一是模型替換。當AI系統產生錯誤輸出時,正確的運維響應是更新它——在糾正後的數據上重新訓練、推送新檢查點、切換到不同版本。事件前的模型被覆蓋或以不總能重現的方式歸檔。產生錯誤輸出的那個確切模型,可能在調查人員要求查看時,已不再以可運行的狀態存在。

第二是配置變更。物理世界場景中的AI智能體,其不良事件往往源於配置方式,而非底層模型缺陷。提示工程、檢索閾值、告警靈敏度設置、整合參數——每一項都在不觸及模型權重的情況下塑造行為。不良事件後更新配置是如此常規,以至於通常根本不觸發正式的變更管理記錄。導致事件的修改與修復事件的修改,可能相隔數小時,在變更日誌中卻無從區分。

第三是日誌輪換與基礎設施刷新。那些能夠支撐重建的事件日誌——確切的提示、檢索上下文、中間輸出、事件發生時的硬件狀態——受標準日誌保留策略約束,這些策略是為運維便利而非對抗性調查設計的。在許多部署中,九十天已屬寬鬆。運維的預設假設是日誌用於調試,而非問責。這不是同一個需求。

後量子安全交叉點

對模型權重進行後量子密碼學簽名解決的是篡改問題:它確立了今天生產環境中的權重就是開發者簽署的權重。它無法確立今天生產環境中的權重與事件發生時的權重是否相同。已簽名的模型版本是當前狀態的可信記錄,不是部署歷史記錄。

向後量子簽名基礎設施的遷移引入了一個特定風險。將簽名基礎設施遷移到量子抗性方案的組織,往往會作為遷移的一部分對現有工件重新簽名——這可能切斷原本可能保存事件時狀態的簽名鏈。若一個組織在不良事件和調查之間完成了簽名基礎設施遷移,調查所需的證據鏈可能已因合法安全改進的例行副作用而斷裂。

機遇同樣存在於結構層面。正在設計和部署的後量子簽名架構,可以明確納入版本歷史保留要求。一個旨在保存而不僅是驗證的簽名方案——維護一個防篡改的、僅追加的簽署內容和時間歷史——將改變其後所有不良事件調查的證據格局。

硬件交叉點

物理照護環境中的嵌入式AI系統——床邊監控、輔助設備、帶推斷層的環境傳感器——透過設備管理基礎設施接收固件更新。這些更新是自動化的,在設備級別記錄不完整,且時間戳精度通常不足以支持事件前後的歸因分析。

硬件證明驗證當前固件是否真實且未被修改。它不提供事件時運行固件的歷史記錄。當設備在不良事件後數小時內收到固件更新時——這在運維上是正常的,因為行為不正確的設備應該被更新——事件前狀態的硬件證明鏈被切斷,且沒有任何正式記錄表明這一切斷已經發生。設備現在是正確的且經過證明的,事件時狀態已經消失。

物理世界照護交叉點

照護環境對證據保存產生最強的運維壓力。一個知道AI系統行為異常、卻為保存事件前狀態供調查而選擇推遲修補的照護協調員,是在做出照護領域每一種運維規範都會歸類為失職的決定。保護現有照護對象的職責不能等待調查便利。

這產生了一個無法透過歸咎於任何個人失職來解決的結構性問題。立即修復問題的運維指令是正確的。調查問題所需的證據仍然被銷毀。調查在基於降級資訊的基礎上進行,問責主張在本應解決它們的證據缺失的情況下被裁定,事件的教訓——導致故障的具體配置或模型行為——可能永遠無法被清楚地歸因。

這種不對稱性隨時間累積。每一個因補救過程中證據被銷毀而未能被調查的不良事件,都在助長一種問責環境:部署者無法證明盡職盡責,調查人員無法重建原因,AI照護系統服務的群體無法將正式問責作為糾正機制來依賴。

邁向問責的回應

事後修改問題的解決,不在於減慢運維響應速度,而在於將運維響應與證據保存解耦。

實踐要求是自動化的修改前快照:在任何模型更新、配置變更或固件部署之前,系統在運維更新路徑之外的防篡改存儲中,捕獲並保存修改前狀態。快照包括模型版本和哈希值、配置參數、近期結構化日誌和硬件固件狀態。該存儲為一次寫入且單獨證明——運維團隊可以更新線上系統,而更新不會觸及已保存的狀態。

這是工程要求,不是政策要求。政策無法保存系統從未被設計為保留的證據。將證據保存內置於物理AI系統的時機,是在部署之前——屆時證據要求可以與運維要求共同被設計進去,而不是在第一次嚴重事件後被發現彼此對立。

在Asaptic Labs,事後修改問題被視為每個交叉點的設計要求。運維正確性與證據正確性不是同一屬性,僅解決前者的系統,在履行運維義務的同時,使其問責義務在結構上無法被調查。在不可逆後果的節點上,這不是一個令人滿意的結果。

核心觀點

事後修改問題之所以產生,是因為針對AI不良事件的正確運維響應——更新模型、修補配置、刷新固件——系統性地摧毀了任何有實質意義的調查所需的事件前狀態。每種銷毀模式單獨看都有其合理性,累積效應卻是問責在結構上無法被調查。解決方法是將運維響應與證據保存解耦:在涉及不良事件的系統發生任何變更之前,自動進行一次寫入、單獨證明的修改前快照。這是必須在部署前設計進去的工程要求,而不是事後可以施加的政策。