How to Consume Reality
In both science and everyday experience, understanding the world begins with observation. We observe phenomena, analyze them, and construct models that help us explain, predict, and manipulate what we perceive. This process is central not only to human understanding but also, increasingly, to artificial intelligence, which now plays an active role in how we model reality.
But what if we stepped back and re-examined the entire process of observation, not just as a passive act of seeing but as an active, layered mechanism? And what if the traditional human-centered model of observation introduces limitations that AI could overcome?
Let’s explore this idea by dissecting the observation pipeline into three fundamental stages: Reality → Observation → Model. Each stage plays a distinct role, and understanding their relationship can help us design AI systems that don’t just mimic human perception but perhaps transcend it.

Reality is the only stage that exists independently. It is the raw state of the universe, existing whether or not there is someone, or something, there to observe it. Rocks fall, stars explode, and particles interact, regardless of our awareness.
This concept aligns with the scientific worldview: reality is objective and exists outside the observer. However, some interpretations in physics, such as the Anthropic Principle, suggest that our very presence as observers influences the kind of universe we can perceive or exist in. While intriguing, this philosophical twist doesn’t negate reality’s independence; it merely highlights that what we observe of it is constrained by our ability to exist within it.
Observation sits between reality and the model. It is the middleman: the act of filtering raw reality through the senses, brain, and prior beliefs of the observer. This step is powerful, but it is also perilous, because it is deeply subjective, and like any middleman it can slow the process down. When humans observe, we don’t take in reality in its raw form. Instead, we compress it. Our eyes detect only a narrow slice of the electromagnetic spectrum; our ears hear within a specific frequency range. Our brains process these signals through a complex web of prior experiences, language, and cultural context. What we “see” is never pure reality; it is a filtered, interpreted, and often biased approximation.
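To make the filtering concrete, here is a minimal numpy sketch (all numbers illustrative, the “visible” band chosen arbitrarily) that treats observation as a band-pass filter: a broadband signal stands in for reality, and the observer keeps only a narrow frequency band, much as the eye keeps only a sliver of the electromagnetic spectrum.

```python
import numpy as np

# A broadband "reality": energy spread across all frequencies.
rng = np.random.default_rng(0)
n = 4096
reality = rng.normal(size=n)  # white noise stands in for raw reality

# An "observer" that, like the eye, passes only a narrow band.
spectrum = np.fft.rfft(reality)
freqs = np.fft.rfftfreq(n)
visible = (freqs > 0.10) & (freqs < 0.15)   # hypothetical "visible" band
observed = np.fft.irfft(np.where(visible, spectrum, 0), n)

# Most of reality's structure never makes it into the observation.
kept = np.sum(np.abs(spectrum[visible])**2) / np.sum(np.abs(spectrum)**2)
print(f"fraction of reality's energy the observer keeps: {kept:.3f}")
print(f"observed signal variance vs. reality: {observed.var():.3f} vs {reality.var():.3f}")
```

With white noise, the observer retains roughly the width of its band, here about 10% of the total energy; everything else is simply invisible to it.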
The human brain acts like a compression algorithm, reducing the overwhelming detail of reality into a manageable model. This makes survival possible, but it also limits our understanding.

Once observation has occurred, the mind builds a model: an internal representation of the external world. These models help us reason, predict, and make decisions. They’re not exact replicas of reality but simplified versions tuned for efficiency, not accuracy. Models are everywhere. Newton’s laws model motion. Economic models predict markets. Even a child’s understanding that “fire is hot” is a model formed through limited experience.
Artificial intelligence, too, builds models. An AI trained to recognize cats doesn’t understand what a cat is; it compresses vast data into a pattern-recognition system that predicts “cat” when it sees certain features. In the traditional AI pipeline, humans serve as the “observer.” We decide what data to collect, what labels to assign, and how to preprocess it. We effectively insert ourselves between reality and the AI’s model, bringing all our human biases along for the ride.
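As a toy illustration of that compression, here is a sketch of a “cat detector” reduced to a weighted pattern match. The features and weights are entirely made up; the point is that the features themselves (ear pointiness, whisker count, purring) are human choices, which is exactly where the observer’s bias enters the pipeline.

```python
import numpy as np

# Hypothetical human-chosen features: [ear_pointiness, whisker_count, purr_level]
# The human observer decided these are what matters about a cat.
weights = np.array([1.5, 0.8, 2.0])   # a learned pattern, not "understanding"
bias = -4.0

def p_cat(features: np.ndarray) -> float:
    """Squash a weighted feature sum into a probability of 'cat'."""
    return 1.0 / (1.0 + np.exp(-(features @ weights + bias)))

print(p_cat(np.array([0.9, 12.0, 0.8])))  # matches the pattern -> near 1.0
print(p_cat(np.array([0.1, 0.0, 0.0])))   # doesn't match      -> near 0.0
```

The system never touches reality directly: it only ever sees the three numbers a human decided to measure.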
This raises a profound question: What if we could remove the human observer from the loop?
Instead of feeding AI a human-preprocessed version of reality, what if we gave it raw interactions and measurements, and allowed it to form its own model, untethered from our interpretations? We could, perhaps, allow the machine to take in all frequencies of light and sound, along with readings from any other sensor we have at hand.
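One way to sketch this, assuming we simply concatenate raw, unlabeled sensor streams (the sensor names and shapes below are hypothetical), is to let the machine derive its own low-dimensional axes of variation, here via a PCA-style SVD in numpy, rather than inheriting human categories:

```python
import numpy as np

rng = np.random.default_rng(1)

# Raw, unlabeled multi-sensor snapshots: each row is one moment of
# "reality" across many channels, with no human labels attached.
em_spectrum = rng.normal(size=(1000, 64))   # far beyond the visible band
acoustics   = rng.normal(size=(1000, 32))   # far beyond human hearing
other       = rng.normal(size=(1000, 16))   # pressure, magnetometry, ...
raw = np.hstack([em_spectrum, acoustics, other])

# Let the machine find its own axes of variation: a PCA-style
# compression that no human observer curated.
raw = raw - raw.mean(axis=0)
_, s, vt = np.linalg.svd(raw, full_matrices=False)
k = 8
internal_representation = raw @ vt[:k].T    # 112 raw channels -> 8 learned ones

print(internal_representation.shape)        # (1000, 8)
```

The resulting eight dimensions need not correspond to anything a human would name; they are, in miniature, the kind of intra-reality the next paragraph describes.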
Doing so would allow AI to develop intra-realities: internal representations of the world that aren’t inherited from humans but emerge from the AI’s direct interaction with raw data. These intra-realities could differ from ours in fascinating, potentially more efficient ways. Additionally, human learning is shaped not only by the information we consume but also by the social and cultural instincts hardwired within us. Similarly, machines may need to construct their own forms of social networks to learn more effectively.
In some domains, like reinforcement learning, AI systems already do this to an extent, interacting with an environment and learning from the consequences. But even here, the environment is usually shaped by human design, whether through SRL, where the questions are answered by humans, or without it, in which case the base model we start from is itself human-biased.
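A minimal interaction loop makes the point, assuming the gymnasium package is installed. Even this “direct” experience unfolds inside CartPole, an environment whose physics, observations, and reward signal were all designed by humans:

```python
import gymnasium as gym

# The agent interacts with and learns from consequences, but the
# environment itself (dynamics, observation space, reward) is human-made.
env = gym.make("CartPole-v1")
obs, info = env.reset(seed=0)

total_reward = 0.0
for _ in range(200):
    action = env.action_space.sample()          # a stand-in for a learned policy
    obs, reward, terminated, truncated, info = env.step(action)
    total_reward += reward
    if terminated or truncated:
        obs, info = env.reset()

env.close()
print(f"reward collected by random interaction: {total_reward}")
```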
The traditional chain Reality → Observation → Model has served humanity well. It is the foundation of science, perception, and learning. But as we build artificial systems capable of modeling the world, we are no longer confined to this human-centric path. AI presents an opportunity to explore entirely new ways of perceiving and understanding reality, perhaps ones that bypass the human observer altogether.

This possibility is exciting, but also deeply challenging. Humans have spent centuries observing the world and constructing languages to describe it. These languages (mathematical, symbolic, spoken) have shaped our models of reality and, in turn, shaped what we pass on to AI systems. Today’s large language models, for example, don’t observe the world directly. They observe human descriptions of the world: textual artifacts shaped by our cognition, our biases, and our culture.
For AI to go beyond this, to observe raw reality and construct its own model, perhaps even its own language, would mark a profound shift. Such a system wouldn’t be “thinking like a human”; it would be developing a fundamentally different mode of understanding. Could this help machines reach the state of a von Neumann universal constructor? Perhaps by learning from raw reality we can bridge the gap between the world of bits and the world of atoms for these thinking machines. This move toward embodied intelligence aligns with the trajectory of next-generation AI: systems that don’t just read about reality but live within it, explore it, and interact with it. Language models may help AIs talk about the world, but sensorimotor models will help them exist in it.
Yet this raises an important question: Is that what we want? If the purpose of AI is to assist humans, collaborate with us, and serve human goals, it must remain intelligible. Even if an AI could form its own reality model from first principles, would it be able to communicate that back to us? Would we be able to understand an alien language built from patterns we never noticed or processed?