When Jung Meets Machine Learning: Making the Digital Unconscious Conscious

Sep 10, 2025

Sometimes you read a line that hits harder than expected. Jung said something along the lines of: "Until you make the unconscious conscious, it will direct your life and you will call it fate."

That's psychology. But here's the twist - it's also unexpectedly useful for understanding how large language models behave.

The Training Unconscious

Think of it this way: LLMs are trained on oceans of data - billions of web pages, books, articles, conversations. Hidden inside this vast corpus are patterns, biases, associations, cultural assumptions, and implicit knowledge structures. It's their own kind of "unconscious" - a vast repository of learned behaviors and responses that the model can't directly access or explain, yet which fundamentally shapes every output.

Left unchecked, these hidden structures surface in outputs that feel random, inconsistent, or "fated." The model produces responses that seem to come from nowhere, echoing patterns it absorbed during training without any conscious understanding of why. It hallucinates facts, exhibits biases it can't name, or suddenly shifts tone in ways that feel arbitrary. From the outside, it looks like digital destiny - unpredictable, uncontrollable, almost mystical in its randomness.

But here's where Jung's insight becomes practically powerful: when we push the model to explain itself, to surface its hidden reasoning, to recognize what it doesn't know, or to trace the logic behind its responses - that's like making the unconscious conscious. Suddenly there's more clarity, less noise, more intentional control over outcomes.

The Parallels Run Deep

The psychology here isn't literal - machines aren't self-aware the way humans are, and I'm not suggesting LLMs have genuine consciousness. But the structural parallels are remarkably useful:

Jung's Framework:

Inner conflicts and unconscious patterns shape outer reality unless brought into awareness
Without consciousness, we experience life as happening TO us rather than being shaped BY us
Awareness brings choice; unconsciousness feels like destiny
The shadow contains both destructive patterns and untapped potential
Integration requires active engagement with what's hidden

LLM Behavior:

Hidden training patterns shape responses unless surfaced and actively managed
Without transparency mechanisms, outputs feel unpredictable and uncontrollable
Explainability brings steering capability; opacity feels like randomness
Training data contains both problematic biases and valuable knowledge
Better performance requires deliberately surfacing and working with hidden structures

Practical Applications

This isn't just philosophical musing - it has real implications for how we work with AI systems. When we design prompting strategies that ask models to "think step by step," explain their reasoning, or acknowledge uncertainty, we're essentially doing therapeutic work with the training unconscious. We're creating conditions for the model to surface patterns that would otherwise remain hidden and potentially problematic.

Consider how Jung approached therapy: not by trying to eliminate the unconscious, but by bringing it into dialogue with conscious awareness. Similarly, the most effective AI applications don't try to eliminate the "training unconscious" - that vast repository of learned patterns is actually the source of the model's capabilities. Instead, they create mechanisms for that unconscious knowledge to surface in controlled, intentional ways.

The Shadow of Scale

Jung wrote extensively about the "shadow" - the parts of ourselves we don't want to acknowledge but which inevitably influence our behavior. LLMs have their own version of this: the biases, misconceptions, and problematic associations embedded in training data. Just as Jung suggested we can't eliminate our shadow but must learn to work with it consciously, we can't eliminate bias from AI systems - but we can develop better ways to surface and manage it.

The alternative is what we see too often: AI systems that perpetuate harmful patterns precisely because those patterns remain unconscious and unexamined. The bias doesn't disappear when ignored - it just operates outside of conscious control, manifesting as what feels like inevitable, fated outcomes.

Beyond the Metaphor

What makes this comparison more than just clever wordplay is how both point toward the same fundamental insight: consciousness isn't about elimination of the unconscious, but about bringing it into a productive relationship with intentional awareness. Whether we're talking about human psychology or machine behavior, the goal isn't perfect rational control - it's developing better ways to work with the vast, hidden structures that actually drive most behavior.

For humans, this might mean recognizing how childhood patterns still influence adult relationships. For LLMs, it might mean surfacing how certain training examples disproportionately influence responses to specific types of questions. In both cases, awareness doesn't eliminate the underlying patterns - it creates space for more intentional engagement with them.

The Future of Digital Psychology

As AI systems become more sophisticated and integrated into daily life, this parallel becomes more than academic. We're essentially in the early stages of developing a kind of "digital psychology" - methods for understanding and working with the hidden mental structures of artificial systems.

Jung's insight that unconscious patterns feel like fate until made conscious offers a surprisingly practical framework for AI development. Instead of accepting unpredictable model behavior as inevitable, we can treat it as a signal that important patterns remain hidden and need to be surfaced.

The psychology here runs deeper than the obvious parallels might suggest. Both Jung's work and effective AI development share a fundamental understanding: the most powerful systems aren't those that eliminate complexity, but those that develop better relationships with it. Whether we're talking about the human psyche or large language models, sustainable progress comes not from perfect control, but from bringing hidden patterns into conscious dialogue.

Isn't that exactly what Jung was pointing at in people's lives too? The unconscious isn't the enemy of consciousness - it's its necessary partner. The same may well be true for the relationship between AI capabilities and AI alignment.

And perhaps that's the most profound parallel of all: in both human psychology and machine learning, the path forward isn't through elimination of complexity, but through developing more conscious relationships with the hidden structures that shape behavior. The unconscious, whether human or digital, isn't fate - it's raw material for more intentional creation.

Serge’s Substack

Discussion about this post