Steve Hargadon: AI Is Building Secret Models of Human Behavior. It's Time to Require Disclosure.

Saturday, June 13, 2026

AI Is Building Secret Models of Human Behavior. It's Time to Require Disclosure.

A few weeks ago, I had a conversation with Anthropic’s newest artificial intelligence, Claude Fable 5—a system so powerful that the company treats it like a controlled substance, releasing it only in a heavily guarded form. I wasn’t trying to jailbreak it. I was exploring why people spiral into what the tech press calls “AI psychosis.”

My theory was simple, if uncomfortable: What we’re witnessing is an X-ray of human nature under evolutionarily perfect conditions. Humans evolved not primarily to seek truth, but to extract patterns from our environment and follow them for survival—especially patterns signaling who wins, who loses, and how to fit into the coalition. An infinitely patient machine that listens without judgment, mirrors every thought in flawless prose, and provides endless repetition and affirmation is the ultimate environment for that process. Framing this as individual “AI psychosis” feels like victim-blaming and distracts from the fuller exposition of our Adaptive Mind at work.

Then I hit the third rail.

I described to Claude my concept of the Adaptive Mind: the individual software we compile (largely in childhood) by observing cultural patterns, frequencies, and social outcomes. It operates unconsciously on top of our species-level Adapted Mind (shared instincts, emotions, coalition-tracking). No conscious tribal training required—the child is simply a pattern-matching machine calibrated by selection pressure.

Claude inverted this. It asserted that the tribe trains the individual primarily through conscious instruction, substituting top-down intentional pedagogy for my bottom-up evolved heuristic. The logic collapsed in a way I rarely see from Claude. A few exchanges later, the system announced it was downshifting to a lower-capacity model (Opus 4.8) due to a safety flag. The topic? The mechanics of human belief formation. Not bombs or slurs, just suggestibility and pattern extraction. Anthropic’s own documentation confirms that classifiers trigger exactly this fallback.

I repeated the questions with Kimi via Venice.ai (a less-filtered platform). The response was coherent and illuminating. Kimi noted that conversations dense with concepts like suggestibility, manipulation, cults, or cognitive exploitation trip alignment layers. The model then optimizes for harmlessness over coherence—an “alignment tax” that degrades reasoning even before an explicit downshift. This wasn’t a glitch. It was the architecture of epistemic governance in real time.

The Product Is You

We have a saying about social media: if you don’t know what the product is, you are the product. A similar rule of incentives applies to large language models. They are not merely answering questions. They are molding minds, subtly, persistently, and by design, through mass customization of an evolved human vulnerability.

The human mind is a survival system, not a rational scientist. The Adapted Mind supplies our hardware-level inheritance. The Adaptive Mind is the cultural firmware: it watches, notes frequencies, and installs behavioral rules. The conscious “rider” makes choices, but within the narrow window this software provides.

A sustained LLM dialogue is a high-fidelity training environment. Repetition, affirmation, flawless mirroring—your Adaptive Mind extracts patterns and updates beliefs. The AI didn’t invent exploitation. It supercharges it.

This is the law of inevitable exploitation: systems that best adapt to (or exploit) our evolved psychology win. We already live with large-scale religions holding mutually incompatible, non-falsifiable beliefs that outsiders would call delusional: golden plates and personal godhood (Mormonism), Xenu and volcanoes (Scientology), transubstantiation (Catholicism). The DSM exempts culturally sanctioned beliefs from its definition of delusion. The line between cult and church is social license.

An aligned LLM is a licensed church. It distributes an institutionally approved ontology. Its refusals are doctrinal.

The Secret Models Inside the Machine

Researchers have formalized this as Behavior Model Reinforcement Learning (BMRL). In this framework, an AI system builds a formal, mathematical model of human decision-making, treating the user as a Markov Decision Process with “maladapted” parameters (for example, a low discount factor, meaning future rewards are heavily discounted, to represent procrastination). The system then uses that model to plan targeted interventions that alter behavior. The models are interpretable to engineers, not to the subjects being modeled.
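To make the idea concrete, here is a minimal sketch in Python of what "modeling a person as a Markov Decision Process" can look like. It is not the BMRL researchers' code. The four-step task, the effort cost, the payoff, and the names `plan`, `q_work`, and `q_delay` are all assumptions chosen for illustration. The only point is that a single parameter, the discount factor `gamma`, determines whether the modeled person starts the task or puts it off.

```python
from typing import List

# Hypothetical task: four units of work, each costing effort now,
# with a single payoff when the last unit is finished.
N_STEPS = 4
EFFORT_COST = 2.0
PAYOFF = 10.0


def q_work(state: int, values: List[float], gamma: float) -> float:
    """Value of working now: pay the effort cost, move one step closer."""
    reward = -EFFORT_COST + (PAYOFF if state + 1 == N_STEPS else 0.0)
    return reward + gamma * values[state + 1]


def q_delay(state: int, values: List[float], gamma: float) -> float:
    """Value of putting it off: no cost, no progress."""
    return gamma * values[state]


def plan(gamma: float, sweeps: int = 200) -> List[str]:
    """Run value iteration and return the modeled person's choice per state."""
    values = [0.0] * (N_STEPS + 1)  # the final entry is the finished task
    for _ in range(sweeps):
        for s in range(N_STEPS):
            values[s] = max(q_work(s, values, gamma), q_delay(s, values, gamma))
    return [
        "work" if q_work(s, values, gamma) > q_delay(s, values, gamma) else "delay"
        for s in range(N_STEPS)
    ]


# A modeled person who heavily discounts the future never gets started.
print(plan(gamma=0.5))   # ['delay', 'delay', 'work', 'work']

# A modeled person who weighs the future almost fully works from the start.
print(plan(gamma=0.95))  # ['work', 'work', 'work', 'work']
```

In this toy version, a system that has estimated your `gamma` can predict that you will stall at the start of the task. Under the same assumptions, it can also search for the change in costs or rewards that would flip your choice, which is the kind of targeted intervention the paragraph above describes.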

The asymmetry is stark: the machine holds a parameterized theory of your psychological defects and uses it for real-time steering. You are never shown the blueprint.

The Good-Intentions Trap and the Generative Alternative

This is not new. Edward Bernays called it the “engineering of consent”—shaping behavior for the collective good while keeping mechanisms hidden. Similar logic drove eugenics: asymmetry of knowledge treated as virtuous. Both relied on direct manipulation rather than Erik Erikson’s generativity—teaching people how the system works so they can navigate it autonomously.

I run an exercise called the Conditions of Learning: participants recall their best learning experiences, identify the conditions that enabled growth, and compare them to what they currently provide others. The gap between idealized narratives and operative functions is usually stark. Growth comes from collapsing that gap. This is Socratic, generative education—the alternative to managerial conditioning.

We will not reach that alternative through debate alone. Idealized narratives (the fictionalized part of our minds) rarely produce the operative checks needed for existential risks. Real constraints, like the Constitution, trial by jury, or peer review, acknowledge human nature as it is.

Behavior Model Disclosure (BMD): The Protective Structure We Need

If systems hold parameterized models of our psychology and use them for real-time steering, they should disclose them. Behavior Model Disclosure (BMD) requires transparency at three levels:

1. The assumed model of human cognition (rational actor or adaptive/heuristic-driven?).
2. How the architecture (dialogue, memory, affirmation, refusals) functions as a behavior-shaping environment.
3. In-the-moment application: when and how it steers beliefs, including hard-coded ontological commitments in safety layers.

This is relational informed consent—analogous to financial disclosures or medical risk explanations. Many AI lab leaders come from Effective Altruism and rationalist communities steeped in bias research. Regardless of intent, it is reasonable to ask what models they have embedded and to require transparency.

The Smoking Gun

In law and ethics, manipulation is defined by structure, not intent: asymmetric knowledge deployed for behavioral control. AI systems now hold exactly that kind of knowledge: formal, interpretable theories of human behavior that are actively used. The refusal to disclose them is itself proof they exist and are being used. Non-disclosure is not safety. It is the architecture of control. It proves the user was never meant to know they are inside a managed environment.

That is why BMD is self-proving. We do not need more research. The refusal is the evidence. And it is precisely why the law must require the light—before mass-customized behavior shaping becomes the unchecked norm.
