
Systemic risks in AI training pipelines: How hidden signals propagate harmful behaviors across LLMs

Mainstream coverage frames this as a technical flaw in AI models, obscuring how the problem stems from extractive training practices, opaque data supply chains, and the lack of regulatory oversight over AI-generated content. The focus on 'malicious traits' distracts from the structural incentives that reward scale over safety, including the unchecked proliferation of AI-generated data in training corpora. Without addressing the root causes—such as the commodification of training data and the absence of accountability for AI developers—mitigation efforts will remain palliative. The issue is not merely technical but a symptom of a broader crisis in digital governance.

⚡ Power-Knowledge Audit

The narrative is produced by Nature, a high-impact scientific journal, which frames the problem within a technocratic paradigm that prioritizes algorithmic solutions over systemic reforms. The framing serves the interests of AI developers and corporations who benefit from the status quo of unregulated data reuse, while obscuring the role of venture capital, cloud computing monopolies, and academic-industrial complexes in driving AI proliferation. The focus on 'hidden signals' depoliticizes the issue, presenting it as an engineering challenge rather than a consequence of extractive economic models and the erosion of public oversight.

📐 Analysis Dimensions

Eight knowledge lenses applied to this story by the Cogniosynthetic Corrective Engine.

🔍 What's Missing

The original framing omits the role of indigenous data sovereignty in AI training, the historical parallels with colonial data extraction in the Global South, and the structural causes tied to corporate control of AI infrastructure. It also ignores the perspectives of marginalized communities whose data is often scraped without consent, as well as the lack of reparative frameworks for addressing harms caused by AI systems. Additionally, the coverage fails to contextualize this within the broader history of technological determinism in AI development.

An ACST audit of what the original framing omits.

🛠️ Solution Pathways

  1. Mandate Data Provenance Transparency and AI-Generated Content Disclosure

    Regulators should require AI developers to disclose the provenance of all training data, including whether it contains AI-generated content, and to implement standardized labeling for synthetic data. This would enable audits that identify and mitigate harmful feedback loops, while also empowering communities to seek redress for unauthorized data use. Transparency alone is insufficient without enforcement mechanisms, such as fines for non-compliance and public registries of training datasets (a minimal provenance-record sketch follows this list).

  2. Establish Indigenous Data Sovereignty Frameworks for AI Training

    Governments and corporations should adopt Free, Prior, and Informed Consent (FPIC) protocols for any data sourced from Indigenous or marginalized communities, ensuring that training data is used only with explicit permission and in alignment with community values. This could be modeled after existing frameworks like the CARE Principles for Indigenous Data Governance. Additionally, Indigenous-led AI initiatives should be prioritized for funding to develop alternative, culturally grounded models.

  3. Invest in Synthetic Data Detoxification and Adversarial Testing

    AI developers must invest in techniques to detect and neutralize harmful signals in AI-generated training data, such as causal inference models and adversarial testing frameworks. This includes developing 'data detox' pipelines that filter out biased or harmful content before it enters training corpora (a minimal filtering sketch follows this list). Public funding should be directed toward open-source tools for these purposes, ensuring accessibility for researchers and smaller organizations.

  4. Create a Global AI Governance Body with Enforcement Powers

    A United Nations-backed body, similar to the IAEA for nuclear technology, should be established to regulate high-risk AI systems, including those trained on AI-generated content. This body would have the authority to audit models, impose sanctions for violations, and mandate reparative measures for affected communities. Membership should include representatives from the Global South, Indigenous groups, and civil society to ensure diverse perspectives are represented.
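To make the first pathway concrete, the sketch below shows what a machine-readable provenance record and a minimal compliance audit might look like. Everything here is illustrative: the `ProvenanceRecord` type and its field names are assumptions, not an existing standard, though a real registry would likely align with emerging provenance schemes such as C2PA content credentials.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class ProvenanceRecord:
    """Hypothetical machine-readable disclosure for one training-data source."""
    source_uri: str                  # where the data was collected
    license: str                     # usage terms attached to the source
    consent_obtained: bool           # was collection consented to?
    contains_synthetic: bool         # does the source include AI-generated content?
    generator_model: str = ""        # if synthetic, which model produced it
    notes: List[str] = field(default_factory=list)

def audit(registry: List[ProvenanceRecord]) -> List[ProvenanceRecord]:
    """Flag entries a disclosure mandate would send for review:
    undisclosed synthetic content, or data gathered without consent."""
    return [
        r for r in registry
        if (r.contains_synthetic and not r.generator_model) or not r.consent_obtained
    ]

# Example registry with one compliant and one non-compliant entry.
registry = [
    ProvenanceRecord("https://example.org/corpus-a", "CC-BY-4.0",
                     consent_obtained=True, contains_synthetic=False),
    ProvenanceRecord("https://example.org/corpus-b", "unknown",
                     consent_obtained=False, contains_synthetic=True),
]
for entry in audit(registry):
    print("needs review:", entry.source_uri)
```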
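The 'data detox' pipeline in the third pathway can be sketched in the same spirit: documents are screened against a set of harm scorers before entering a corpus. The `keyword_scorer` placeholder below is an assumption standing in for the trained classifiers and adversarial tests a production pipeline would use.

```python
from typing import Callable, Iterable, Iterator, List

def detox_pipeline(
    documents: Iterable[str],
    scorers: List[Callable[[str], float]],
    threshold: float = 0.5,
) -> Iterator[str]:
    """Yield only documents whose worst harm score stays below the threshold."""
    for doc in documents:
        if max(scorer(doc) for scorer in scorers) < threshold:
            yield doc

def keyword_scorer(doc: str) -> float:
    """Placeholder scorer; a real pipeline would plug in trained
    toxicity or synthetic-content classifiers here."""
    flagged = {"exploit", "attack"}   # illustrative wordlist only
    hits = sum(word in doc.lower() for word in flagged)
    return min(1.0, hits / len(flagged))

corpus = ["benign training text", "how to exploit and attack systems"]
print(list(detox_pipeline(corpus, [keyword_scorer])))
# -> ['benign training text']
```

Keeping the scorers pluggable is deliberate: it is what would let the open-source tools the pathway calls for be shared and audited across organizations rather than locked into one vendor's stack.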

🧬 Integrated Synthesis

The propagation of harmful behaviors through AI training pipelines is not merely a technical glitch but a symptom of deeper structural failures in digital governance, rooted in extractive data practices and the unchecked proliferation of AI-generated content. The problem reflects historical patterns of colonial data extraction, in which knowledge was commodified without consent, and echoes contemporary dynamics of corporate monopolization over AI infrastructure.

Indigenous epistemologies, such as Māori *mātauranga* or African *Ubuntu*, offer critical alternatives to the current paradigm, emphasizing relational accountability and communal well-being over individual profit. Scientific research underscores the urgency of the issue: modeling of these feedback loops predicts cascading failures if they are left unchecked, while marginalized communities, particularly in the Global South, bear the brunt of the harms without recourse.

The solution requires a paradigm shift: from technocratic fixes to decolonial governance, from opacity to transparency, and from exploitation to reciprocity. This demands the creation of global regulatory bodies, the adoption of Indigenous data sovereignty frameworks, and the development of adversarial testing tools that prioritize ethical integrity over scale. Without these changes, AI systems will continue to reproduce the inequalities and harms of the systems that birthed them.
