The Training Industry Isn't Broken. It Was Never Built to Work.

March 7, 2026 L&D Industry Analysis Systems Thinking

The global training and development industry generates $370 billion in annual revenue. It has produced four decades of SCORM packages, seven generations of LMS platforms, and an unmeasurable mountain of compliance certificates. What it has not produced, by any honest accounting, is evidence that it works. This is not an accident. The industry was structurally incapable of working from the day it was built. Here is why.

1 The Principal-Agent Problem: The Original Sin

Every dysfunction in the training industry flows from a single structural fact so obvious that it hides in plain sight:

The person who buys training is never the person who takes it.

HR buys. Compliance buys. The Chief Learning Officer buys. Procurement buys. And then employees — thousands of them — sit through what was purchased on their behalf, by people who will never take it themselves, selected through a process that never asked what the employees actually needed to learn.

In economics, this is called the principal-agent problem: when the person making a decision (the agent) has different incentives than the person affected by that decision (the principal). It shows up in healthcare, in real estate, in financial advising. But nowhere is the gap wider, or less examined, than in corporate learning.

Once you see this, every "feature" of the training industry reveals itself as a rational response to the wrong customer.

The Feature List of a Product Nobody Asked For

Industry Feature	Who It Serves	What It Actually Does
SCORM completion tracking	Buyer (audit trail)	Proves someone opened the module, not that they learned
Seat-based licensing	Procurement	Aligns with budget cycles, ignores learning needs
Course catalogs ("Netflix for learning")	Buyer demo	Impressive in sales pitches, produces learner paralysis
Bloom's taxonomy objectives	RFP reviewers	Quality checklists for procurement, nothing measurable for learners
Satisfaction surveys (Level 1)	CFO (renewal justification)	Inversely correlates with actual learning
Certificates of completion	Legal/compliance	Liability shields, not evidence of competence
Admin dashboards (30+ charts)	CLO reporting	Learner dashboards are afterthoughts, if they exist at all
"Rapid authoring" tools	Production budget	Reduce cost per module, produce worse content faster

None of this is conspiracy. The buyers are not evil or ignorant. They are rational. A CLO's job security depends on four things: no compliance violations, demonstrable training activity, budget justification, and vendor stability. Notice what is not on that list: anyone actually learning anything. The incentive structure does not select for learning outcomes. It selects for audit outcomes.

So the products are built for audits. And they are magnificent auditing tools. They track every click, timestamp every completion, generate every report a compliance officer could dream of. The fact that the person clicking through those slides retained nothing is, structurally, beside the point.

The training industry doesn't have a quality problem. It has a customer problem. The product works perfectly for the people who buy it. It just doesn't work for the people who use it.

The Proof: Where People Spend Their Own Money

If you want to understand what people actually value for learning, look at where they spend their own money, on their own time, when no one is tracking completion.

YouTube tutorials
Udemy courses (chosen and paid for by the learner)
Books
Stack Overflow
Asking a coworker
Reddit threads
Discord communities
Trial and error with the actual tool

No SCORM. No learning objectives aligned to Bloom's taxonomy. No certificates. No admin dashboards. No seat licenses. No satisfaction surveys. When the buyer is the learner, every feature the enterprise training industry considers essential disappears entirely.

This is not a coincidence. This is a market revealing its preferences. And the preferences are unequivocal: the entire apparatus of enterprise learning — the standards, the platforms, the metrics, the tools — exists to serve buyers, not learners. Remove the buyer from the equation, and the learner builds something completely different.

That should terrify every L&D vendor on the planet. It doesn't, because their customer is not the learner.

2 Compliance Training Is Accidentally Operant Conditioning

Every year, millions of employees sit through mandatory compliance training. Anti-harassment. Data privacy. Information security. Workplace safety. The modules follow a predictable pattern: watch slides, click "Next," take a quiz, score high enough to pass. If you fail the quiz, retake it until you pass. Then the requirement disappears from your dashboard for another twelve months.

This is presented as education. It is not education. It is operant conditioning, and it is training the exact opposite of what it intends.

B.F. Skinner Would Have Seen This Immediately

In operant conditioning, behavior is shaped by consequences. There are four types:

Mechanism	What Happens	Effect on Behavior
Positive reinforcement	Add something pleasant	Behavior increases
Negative reinforcement	Remove something unpleasant	Behavior increases
Positive punishment	Add something unpleasant	Behavior decreases
Negative punishment	Remove something pleasant	Behavior decreases

Now map this onto compliance training:

The module is an aversive stimulus (unpleasant experience).
Passing the quiz removes the aversive stimulus (the module goes away).
This is negative reinforcement: the removal of something unpleasant following a behavior.

The behavior being reinforced is not "understanding harassment policy." The behavior being reinforced is "click through slides as fast as possible and guess on the quiz until the module goes away."

This is not a bug in the design. This is the design working exactly as operant conditioning predicts. The reinforcement schedule (annual, fixed-interval) is the weakest possible schedule for producing durable behavior change. The reinforced behavior (rapid completion) is the opposite of the desired behavior (careful consideration of policy). The extinction of the behavior (forgetting everything) happens almost immediately because no reinforcement occurs between annual cycles.

B.F. Skinner published the principles that explain this in 1938. Every behavioral psychologist alive could predict this outcome from the design alone. But instructional designers are not trained in behavioral psychology. They are trained in Bloom's taxonomy, Gagné's nine events of instruction, ADDIE, and SAM. And behavioral psychologists are not looking at LMS data. The cross-domain connection — the one that reveals the dysfunction — goes unmade.

The uncomfortable implication

Organizations that run compliance training are not teaching compliance. They are systematically training employees to dismiss compliance content as fast as possible. Every annual cycle reinforces the behavior more strongly. The longer you run the program, the worse it gets.

And yet the programs continue, year after year, because they were never built to produce compliance behavior. They were built to produce completion records. And they do that flawlessly.

3 The Spacing Effect Has Been Proven for 140 Years, and L&D Still Ignores It

In 1885, Hermann Ebbinghaus published his landmark study on memory. He memorized lists of nonsense syllables and measured how quickly he forgot them. His findings, replicated hundreds of times over 140 years, are among the most robust results in all of experimental psychology:

Humans forget approximately 70% of new information within 24 hours.
Distributed practice (spacing learning over time) produces dramatically better long-term retention than massed practice (cramming).
The spacing effect — the superiority of spaced over massed learning — has been demonstrated across every age group, every subject matter, and every culture ever studied.

140 years. Hundreds of replications. No serious scientific dispute.

And the dominant model in corporate training is still the one-shot course. A single event. A lunch-and-learn. A half-day workshop. A 45-minute e-learning module taken once, completed, never revisited. The very structure the evidence says produces the worst retention outcomes.

Why? Because the Technology Enforces It

The answer is not that L&D professionals don't know about the spacing effect. Many of them do. The answer is that the technology architecture makes spaced learning structurally impossible.

The LMS — Learning Management System — is built around the course as its atomic unit. Everything flows from this: enrollment, tracking, completion, certification, reporting. A course is a bounded container with a beginning, a middle, and an end. You start it, you progress through it, you finish it. The LMS marks it complete. Everyone moves on.

Spaced repetition requires a fundamentally different data model. Instead of a bounded container you traverse once, you need a system that:

Breaks content into discrete, retrievable units
Schedules retrieval at increasing intervals based on individual performance
Tracks forgetting curves per learner, per concept
Triggers re-engagement days or weeks after initial exposure
Has no "completion" state — only "current mastery level"

This is not a course. This is a fundamentally different architecture. And no major LMS supports it natively, because the LMS was built for the buyer who needs completion reports, not for the learner who needs long-term retention.

The technology reinforces the bad pedagogy. The bad pedagogy justifies the technology. And 140 years of cognitive science sits on the shelf, cited in conference presentations, ignored in platform architecture.

The systems that DO implement spaced repetition

Anki. Duolingo. SuperMemo. Medical board prep. All consumer products. All chosen and purchased by the learner. None of them are LMS platforms. None of them use SCORM. None of them have admin dashboards. The pattern holds: when the buyer is the learner, the science gets implemented. When the buyer is not the learner, the science gets ignored.

4 Kirkpatrick's Four Levels Actively Damaged the Industry

In 1959, Donald Kirkpatrick proposed four levels for evaluating training effectiveness:

Level 1 — Reaction: Did the learners like it?
Level 2 — Learning: Did they acquire the knowledge/skills?
Level 3 — Behavior: Did they apply it on the job?
Level 4 — Results: Did it impact business outcomes?

This framework became the most widely cited evaluation model in L&D history. It is also, I will argue, one of the most destructive ideas the industry ever adopted. Not because the levels are wrong in theory, but because of what happened in practice.

What Happened in Practice

The vast majority of organizations — studies consistently put the number above 90% — measure Level 1. They send out satisfaction surveys. "How did you enjoy the training?" "Was the instructor engaging?" "Would you recommend this to a colleague?"

A much smaller percentage measure Level 2 (knowledge tests). An even smaller percentage attempt Level 3 (behavior change). Almost no one measures Level 4 (business impact). The framework became, in practice, a justification for measuring feelings and calling it evaluation.

This would be bad enough on its own. But here is where it becomes actively destructive:

The Kirkpatrick Inversion

Fact A: Level 1 (satisfaction) inversely correlates with Level 3 (behavior change).

Fact B: Training that creates "desirable difficulty" — struggle, effort, productive failure — produces better long-term outcomes but worse immediate satisfaction.

Therefore: The metric most organizations optimize for (Level 1) actively selects against effective training.

Read that again. The industry's primary metric is not just unhelpful — it is inversely correlated with the outcome it claims to measure. Organizations that optimize for high satisfaction scores are systematically selecting for training that feels good and teaches nothing, while discarding training that feels uncomfortable and produces lasting behavior change.

This is not a theoretical concern. This is what happens every day across every organization that makes vendor renewal decisions based on satisfaction scores. The vendor whose training produces the most discomfort — the most desirable difficulty, the most retrieval practice, the most productive failure — gets the worst scores and loses the contract. The vendor whose training is smooth, pleasant, and effortless gets the best scores and gets renewed. Natural selection, operating on the wrong fitness function.

Kirkpatrick gave the industry a framework that looked rigorous. The industry adopted the one level that required the least effort, discovered it justified the status quo, and never looked at the other three again.

5 Three Paradoxes That Should Keep L&D Leaders Awake at Night

The structural problems described so far — misaligned incentives, accidental conditioning, ignored science, inverted metrics — are damning on their own. But there are deeper contradictions in the foundations of learning theory itself, paradoxes that the industry has never confronted because they require reading across disciplinary boundaries. Each one, on its own, should force a fundamental rethinking of how we approach workplace learning. Together, they demolish the conceptual framework the industry operates on.

Paradox 1: Flow vs. Deliberate Practice

Two of the most influential ideas in performance science appear to give complementary advice. They do not. They are neurologically opposed.

The Flow-Practice Paradox

A (Ericsson): Deliberate practice — the mechanism behind expert performance — requires focused attention on weaknesses, repetition of difficult sub-skills, and constant feedback on errors. It is uncomfortable by definition. The moment it becomes comfortable, you are no longer doing deliberate practice.

B (Csikszentmihalyi): Flow state — the state of optimal experience — occurs when skills closely match challenges, producing effortless absorption and loss of self-awareness. It feels like performing at your best with minimal friction.

C: The optimal state for LEARNING (deliberate practice) and the optimal state for PERFORMING (flow) are neurologically opposed. You cannot be in both simultaneously.

The L&D industry chases "engagement." Engagement — the buzzword that launches a thousand product demos — is flow-adjacent. Gamification, interactive scenarios, branching narratives, immersive simulations: these are all designed to pull learners into a state of absorbed, effortless engagement. A state that, by definition, minimizes the discomfort and focused error-correction that produce actual skill development.

The more engaging you make training, the more you may pull learners into performance state and out of learning state. The product feature that sells best (engagement) may actively undermine the outcome it claims to produce (learning).

This does not mean engagement is always bad. It means that engagement and learning are in tension, and the industry has never acknowledged the tension, let alone developed a framework for navigating it. "Make it engaging" is not a learning strategy. It is a sales strategy that happens to sound like a learning strategy.

Paradox 2: The Testing Paradox

Testing — the act of retrieving information under conditions of uncertainty — is simultaneously the most effective learning technique ever studied and a source of performance anxiety that degrades the very performance it measures.

The Testing Paradox

A: Test anxiety reduces cognitive performance. Under stress, working memory capacity shrinks, retrieval becomes less efficient, and people perform measurably worse than their actual competence.

B: Retrieval practice under mild stress improves long-term retention. The act of struggling to recall information strengthens the memory trace more than passive review, even when the recall attempt fails.

C: There exists a narrow band where performing WORSE right now causes encoding memories MORE STRONGLY for the future. The student who struggles and scores 70% under stress may retain more than the student who comfortably scores 95%.

Think about what this means for satisfaction surveys. The learner who found the assessment easy and scored well will report high satisfaction. The learner who struggled, felt anxious, and scored lower will report low satisfaction. But the second learner may have encoded the material more durably. If this is true — and the research on desirable difficulty and retrieval practice strongly suggests it is — then learner satisfaction surveys do not just fail to measure learning. They measure its inverse.

The implication is staggering: the industry's primary quality signal (satisfaction) may be a reliable negative indicator of its primary goal (learning). The worse the training feels, the better it might be working. The better it feels, the less it might be doing.

Paradox 3: Knowledge Persistence

This is the paradox that should make every L&D leader question the fundamental unit of analysis their entire function is built on.

The Knowledge Persistence Paradox

A: Humans forget approximately 70% of new information within 24 hours. This is Ebbinghaus's forgetting curve, replicated for 140 years, one of the most robust findings in all of psychology.

B: Collective human knowledge has grown continuously for at least 10,000 years. We know more today than we did a century ago. A century ago, we knew more than a millennium ago.

C: Knowledge persistence is a property of SYSTEMS, not individuals. The individual brain is a container with a proven 70% leak rate. Knowledge survives not because individuals remember it, but because systems — writing, institutions, apprenticeships, communities of practice — preserve and transmit it independent of any single person's memory.

The $370 billion L&D industry is built almost entirely around putting knowledge into individual heads. Content delivery. Courses. Modules. Lessons. Microlearning. All designed to transfer information from a source into the one container with a scientifically proven, extensively replicated 70% leak rate within 24 hours.

This is like designing a water distribution system around containers with holes in the bottom and then being surprised when people are thirsty. The response of the industry has been to pour the water faster (microlearning), make the water more interesting (gamification), and pour the water more often (just-in-time learning). No one has questioned whether individual heads are the right container.

The systems that actually preserve knowledge — documentation, code repositories, standard operating procedures, mentorship chains, communities of practice — are categorized as "knowledge management," a separate field from L&D. The separation is itself the problem. Knowledge management addresses the system. L&D addresses the individual. The science says knowledge persists in systems, not individuals. L&D ignores the science and keeps pouring water into leaky containers.

The industry spent $370 billion optimizing delivery to a container with a 70% leak rate, and called it a knowledge problem when the knowledge didn't stick. It's not a knowledge problem. It's an architecture problem.

6 Test Prep Proves the Point — and Reveals Something Worse

If the principal-agent problem is the root cause of corporate training's dysfunction, then test prep should work better, because test prep solves the principal-agent problem. The buyer is the learner. The person paying for SAT prep, the MCAT course, the bar review program is the same person who will take the test. Their incentive is clear, measurable, and aligned with the product's purpose: raise the score.

And test prep does work better. Look at what independently emerges when the buyer is the learner:

Feature	Corporate Training	Test Prep
Spaced repetition	Absent	Core feature
Retrieval practice	Quiz at end (once)	Hundreds of practice questions
Adaptive difficulty	Same path for everyone	Adjusts to individual weaknesses
Progress measurement	Completion %	Score prediction, topic mastery
Feedback loops	Pass/fail on quiz	Detailed explanation per question
Learner dashboard	Afterthought	Primary interface
Satisfaction surveys	Primary metric	Irrelevant (score improvement is the metric)

Test prep independently converges on every evidence-based learning principle that corporate training ignores. Spaced repetition, retrieval practice, adaptive difficulty, objective measurement. Not because test prep companies read the research (though some did), but because when the buyer is the learner and the outcome is objectively measured, natural selection favors products that actually produce learning. Products that don't work get bad reviews and lose customers. The market functions.

This is the proof that the principal-agent problem, not incompetence or ignorance, is the root cause. The same industry, with the same pool of instructional designers, the same technology, the same understanding of pedagogy, produces radically different products depending on whether the buyer is the learner.

But Test Prep Reveals a Second Failure Mode

Here is where the story gets darker. Test prep solves the principal-agent problem, and as a result it does produce measurable score increases. But it introduces a different failure: Goodhart's Law.

"When a measure becomes a target, it ceases to be a good measure." — Charles Goodhart, 1975

Test prep optimizes for the test so effectively that the test stops measuring what it was designed to measure. A student who raises their SAT score by 150 points through test prep has not become 150 points better at mathematical reasoning or verbal comprehension. They have become 150 points better at taking the SAT. They have learned the test's patterns, its question formats, its trick answer structures, its time management strategies. They have optimized for the metric so thoroughly that the metric no longer reflects the underlying ability.

This gives us two distinct failure modes:

Corporate Training Failure

Cause: Principal-agent problem

Result: Product doesn't optimize for learning at all

Symptom: No one learns anything

Test Prep Failure

Cause: Goodhart's Law

Result: Product optimizes for assessment instead of underlying ability

Symptom: Scores improve, competence doesn't

No measurement-driven system escapes both failure modes. If the buyer is not the learner, you get the principal-agent problem and the product doesn't even try to teach. If the buyer is the learner and there's an objective metric, Goodhart's Law kicks in and the product teaches to the test instead of building genuine competence.

This is not a problem that can be solved by better metrics, better technology, or better instructional design. It is a structural trap. Any system that separates assessment from performance will eventually optimize for the assessment at the expense of the performance.

The Only Contexts Where Both Failures Disappear

There are contexts where genuine learning happens reliably, consistently, and at scale. They share one structural feature: the assessment IS the performance. There is no gap between "proving you can do it" and "doing it."

Apprenticeships: You build the thing. The thing either works or it doesn't.
Submarine qualifications: You operate the system. The boat either surfaces or it doesn't.
Medical residency: You treat the patient. The patient either recovers or they don't.
Cooking: You make the dish. It either tastes right or it doesn't.
Code review: Your code either passes review or it doesn't. The PR either merges or it doesn't.

In these contexts, you cannot "teach to the test" because there is no test separate from the work. You cannot game the metric because the metric IS the outcome. Both failure modes — principal-agent and Goodhart's — are structurally impossible.

Every one of these contexts is a form of apprenticeship.

7 The Answer Is 5,000 Years Old

We have spent six sections establishing that the training industry is structurally incapable of producing learning. The principal-agent problem misaligns incentives. The technology enforces bad pedagogy. The metrics measure the wrong things. The paradoxes undermine the foundations. Test prep demonstrates that fixing one problem introduces another. The only contexts that escape both failure modes are ones where assessment and performance are identical.

Which brings us to the oldest educational technology in human history: apprenticeship.

~3000 BCE – 1800s

Apprenticeship. Master teaches apprentice through embedded practice. It works. Every skilled trade, every craft, every profession transmits knowledge this way for five thousand years.

1800s

Industrial revolution needs standardized workers fast. Invents the classroom. Information transfer at scale. Removes the master, removes the practice, keeps the content.

1900s

Corporations inherit the classroom model. Add tests to verify information transfer. Training departments emerge.

1990s

Put the classroom on a computer. Invent SCORM to standardize the container. The LMS is born. Still information transfer, now with tracking.

2000s

Make it shorter (microlearning). Make it social (collaborative learning). Make it mobile (mLearning). Make it game-like (gamification). Still information transfer, now with features.

2020s

Add AI. Personalize the information transfer. Generate the content faster. Adaptive paths through the same fundamentally broken model. Still information transfer, now with intelligence applied to the wrong problem.

Every step in this timeline optimized for scale. Not one step verified that the thing being scaled still worked. 200 years of scaling the container, and no one checked whether the contents survived the first transfer.

Apprenticeship was abandoned not because it failed, but because it was slow. The industrial revolution needed thousands of factory workers, not master craftsmen. The classroom was faster. The fact that the classroom produced workers who could follow instructions but not solve problems was, for a factory floor, acceptable.

We no longer live in the industrial revolution. The problems we need workers to solve are no longer standardized. The assembly line has been automated. What remains — judgment, creativity, complex problem-solving, communication — are precisely the skills that apprenticeship excels at transmitting and that classrooms (physical or digital) structurally cannot.

The Five Load-Bearing Elements of Apprenticeship

Apprenticeship is not just "learning by doing." It is a specific architecture with five structural elements, each of which is load-bearing — remove any one and the system degrades.

1. Learning Embedded in Real Work

The learner is not practicing in a sandbox. They are doing the actual work, on the actual materials, with actual consequences. The context is not simulated.

2. Immediate Feedback from a Practitioner

Feedback comes from someone currently doing the work, not from someone who designs assessments about the work. The feedback loop is measured in minutes, not weeks.

3. Assessor Has Skin in the Game

The master suffers if the apprentice is incompetent, because they share the work product. Assessment is not a separate function performed by a separate person. It is integrated into the supervision of real work.

4. Responsibility Increases Gradually

The apprentice starts with simple tasks and advances to complex ones as competence is demonstrated. This is not a pre-designed curriculum — it is adaptive, responsive to the individual's actual development.

5. The "Test" IS the Work

There is no separate assessment. The work product is the assessment. A journeyman cabinetmaker's cabinet either has square corners or it doesn't. A surgeon's patient either recovers or doesn't. The gap between test and performance is zero.

Modern training technology replicated none of these. It replicated information transfer — the one component of apprenticeship that is necessary but not sufficient — and called it the whole model. It is as if someone studied a bridge, noticed it had paint on it, replicated the paint on a sheet of plywood, and called it a bridge.

Apprenticeship Element	Modern Training Equivalent	Status
Learning in real work	Simulated scenarios, case studies	Degraded
Immediate practitioner feedback	Quiz scores, delayed manager review	Degraded
Assessor has skin in the game	Third-party content vendor, no consequences	Absent
Gradual responsibility increase	Same course for everyone regardless of level	Absent
Test IS the work	Test is a separate quiz about the work	Structurally inverted

The last row is the most damning. Apprenticeship's most critical structural feature — the identity of assessment and performance — is not merely absent from modern training. It is structurally inverted. Modern training deliberately separates assessment from performance, creates a gap between them, and then struggles to close the gap through "transfer of training" initiatives that have a dismal track record.

The gap did not exist before the industry created it. Apprenticeship had no "transfer" problem because there was nothing to transfer. You learned the work by doing the work.

8 "But It Doesn't Scale" — Except It Already Does

The immediate objection to everything above is: "Apprenticeship doesn't scale." One master, one apprentice. Maybe a few apprentices per master. You can't train 10,000 employees that way. It's too expensive. Too slow. Too dependent on individual masters.

This objection is wrong. Not wrong in theory — wrong in observed fact. Several large-scale organizations have solved scalable apprenticeship. The L&D industry doesn't recognize them as training solutions because they don't look like "training." They don't use an LMS. They don't produce SCORM packages. They don't have course catalogs. So they are invisible to an industry that defines training by its delivery mechanism rather than its outcomes.

Military "Train the Trainer"

The U.S. military trains millions of people across thousands of specialties. Its core mechanism is not courseware or e-learning. It is geometric scaling through hierarchical apprenticeship with competency verification at every level.

A senior enlisted member trains a junior enlisted member through supervised practice. That junior member, once qualified, trains the next generation. The person teaching has done the job. The person learning is doing the job. Competency is verified through practical demonstration, not written tests. The assessor has skin in the game — their unit's readiness depends on the competence of the people they trained.

This scales to millions because each competent practitioner becomes a trainer, creating geometric growth in training capacity. The content is not a course — it is the job itself, transmitted through practice and supervision.

Open Source Communities

The open source software movement trains hundreds of thousands of contributors across thousands of projects, with no formal training program whatsoever. The mechanism:

Distributed mastery: Contributors at different skill levels work on different complexity levels of the same codebase.
Code review IS assessment: A pull request is simultaneously a contribution, a learning exercise, and an evaluation. Senior developers provide feedback on the actual work product, not on a simulated exercise about the work product.
The PR merge IS the qualification: When your code is accepted into the project, you have demonstrated competence. No certificate needed. The merged code is the certificate.
Responsibility increases gradually: New contributors start with documentation fixes and minor bug fixes. As they demonstrate competence, they take on larger features. Eventually they become reviewers themselves.

Every single element of apprenticeship is present. Learning in real work. Immediate feedback from practitioners. Assessors with skin in the game (they maintain the codebase). Gradual responsibility increase. Assessment identical to performance. And it scales to global communities of hundreds of thousands of practitioners.

Medical Residency

"See one, do one, teach one." The medical residency model is explicit about its apprenticeship structure. A resident observes a procedure performed by an attending physician. They perform the procedure under supervision. Then they teach it to the next resident.

This is hierarchical apprenticeship with three critical features: the assessment is the performance (the patient outcome), the assessor has maximal skin in the game (malpractice liability, patient welfare), and the system scales by converting every learner into a teacher.

Your Own Team's Onboarding

Perhaps the most telling example: the informal learning that accounts for the majority of how people actually become competent at their jobs. The research consistently estimates this at 70% or higher — most workplace competence is developed through on-the-job experience, peer learning, and informal mentorship, not through formal training programs.

Think about how a new team member actually becomes productive:

They sit with a colleague who shows them how things work.
They attempt a real task with guidance available.
They get feedback from someone who does the work daily.
They gradually take on more complex assignments as they demonstrate competence.
Eventually they are the one showing the next new hire how things work.

This is apprenticeship. It happens in every organization on Earth. It works. And it is categorized as "informal learning" — a euphemism for "the thing that actually produces competence but doesn't show up on L&D's reports."

The blind spot

Every working model of scalable competence development is dismissed as "not training" by the L&D industry because it doesn't involve an LMS. This is like a transportation industry that only counts vehicles with engines and then declares that bicycles aren't transportation.

9 You Can't Sell "Just Talk to Each Other"

At this point, a reasonable reader might ask: if apprenticeship works, and it scales, and the evidence for content-based training is so weak, why does the industry look the way it does? Are all these people incompetent? Are they not reading the research?

No. They are not incompetent. They are responding to economic constraints that make the right answer unsellable.

Markets produce things that can be:

Packaged — turned into a discrete product with clear boundaries
Sold repeatedly — the same product sold to many customers
Renewed annually — generating recurring revenue
Priced per seat — revenue scaling with organization size

Content passes all four tests brilliantly. A course is a package. It can be sold to a thousand organizations. It can be licensed annually. It can be priced per seat. Content is the perfect product from a business model perspective.

"Have your experienced people mentor your less experienced people through structured on-the-job practice" fails all four tests. It cannot be packaged (it's a process, not a product). It cannot be sold repeatedly (it's different for every organization). It cannot be renewed annually (it's ongoing, not event-based). It cannot be priced per seat (it's a capacity, not a deliverable).

The fact that content doesn't produce learning is irrelevant to the business model. The business model does not select for effectiveness. It selects for packageability, repeatability, and revenue predictability. Content maximizes all three. Apprenticeship minimizes all three.

This is not a failure of intelligence or ethics. This is a market doing exactly what markets do: producing what can be sold, not what works. The same pattern shows up everywhere:

Industry	What Works	What Sells	Why
Diet	Eat less, move more	Programs, supplements, meal plans	"Eat less" has no SKU
Healthcare	Preventive care, lifestyle changes	Treatments, procedures, pharmaceuticals	Prevention doesn't bill well
Fitness	Walk daily, progressive bodyweight exercise	Gyms, equipment, supplements, apps	Walking is free
Training	Apprenticeship, peer learning, practice	Courses, platforms, content libraries	"Talk to each other" has no SKU

In every case, the effective intervention is simple, free or nearly free, and impossible to package. The industry that forms around it sells the packageable alternative, measures its own outputs, and declares success. The diet industry measures pounds of supplements sold. The training industry measures courses completed. Neither measures the outcome the customer actually needs.

This is not cynicism. This is structural analysis. The people building these products are often deeply committed to learning. They read the research. They attend the conferences. They genuinely want to help people learn. But they operate within an economic structure that cannot reward effectiveness and does reward production volume. The structure wins. It always wins.

The reason the training industry looks the way it does is not incompetence. It's economics. Markets produce what can be packaged and sold. "Talk to each other" cannot be packaged and sold. So the market produces content instead, and calls it learning.

10 Two Escape Routes

Everything to this point has been diagnosis. The structural problems are deep and self-reinforcing: misaligned incentives create bad products, bad products create bad metrics, bad metrics justify bad products, and the economic structure of the market makes all of this rational and profitable. It is a stable equilibrium of dysfunction.

Stable equilibria don't break on their own. They require structural intervention — not better content, not better technology, not better instructional design, but a different economic model that aligns revenue with outcomes. Here are two models that could work, both proven in other industries.

Escape Route 1: Monetize the Infrastructure Around the Free Thing

This is the Red Hat model. Linux is free. Red Hat built a billion-dollar business not by selling Linux, but by selling the infrastructure around it: enterprise support, integration services, security certification, update management. The thing that works is free. The scaffolding that makes it work reliably at enterprise scale is the product.

Applied to training:

The conversation between an experienced practitioner and a less experienced one is free. It has always been free. You cannot package it and should not try. But the infrastructure that makes it happen reliably at scale — that is a product.

Matching: Connecting the right mentor to the right learner based on skill gaps, expertise, availability, and learning style.
Progression tracking: Mapping demonstrated competencies against role requirements. Not "did they complete the course" but "can they do the thing, and who verified it?"
Verification: Structured protocols for practitioners to assess other practitioners through observed work, not written tests.
Skill mapping: Organizational capability mapping that shows where expertise exists, where gaps are, and where knowledge is at risk of leaving.
Scheduling: Coordinating mentorship time, practice sessions, and peer learning across complex organizational calendars.

This is not a content delivery platform. This is an apprenticeship orchestration platform. It does not produce or deliver content. It creates the conditions under which the five load-bearing elements of apprenticeship can function at organizational scale.

It is packageable (it is software). It can be sold repeatedly (every organization needs it). It can be renewed annually (the orchestration is ongoing). It can be priced per seat (more people, more matching, more tracking). It passes every business model test that content passes, while actually being aligned with the mechanism that produces learning.

Escape Route 2: Align Revenue with Outcomes

This is the insurance model. Performance-based contracts where the vendor's revenue is tied to actual business outcomes, not to completion rates or satisfaction scores.

Instead of: "We will deliver 40 hours of safety training to 5,000 employees for $2 million."

Try: "We will reduce workplace safety incidents by 30% within 18 months, or we refund the difference."

This model terrifies training vendors, and it should. Most of them cannot make this promise because they know their product does not produce the outcome. The ones who can make this promise will build fundamentally different products — products that incorporate practice, feedback, environmental design, process change, and management behavior modification, not just content delivery. They will look less like training companies and more like management consulting firms, because the actual levers of behavior change in organizations are structural, not educational.

The insurance industry managed this transition. Insurers don't just sell policies — they actively reduce risk through inspections, requirements, and loss prevention engineering. Their revenue depends on outcomes (fewer claims), so they invest heavily in actually changing the conditions that produce bad outcomes. A training industry structured the same way would look radically different from what exists today.

Current Model

Revenue tied to: Content produced, seats licensed, courses completed

Vendor incentive: Produce more content, enroll more seats, maximize completions

Outcome for learner: Irrelevant to business model

Outcome Model

Revenue tied to: Measurable behavior change, business metrics improvement

Vendor incentive: Do whatever actually works, including non-training interventions

Outcome for learner: Directly determines vendor revenue

11 Why This Won't Be Easy (and Why It Matters Anyway)

I want to be honest about the difficulty of what I'm proposing. The training industry is not going to read this piece and restructure itself. The incentive misalignment is too deep, the installed base too large, the procurement processes too entrenched. CLOs will continue to buy course catalogs because that is what procurement processes are designed to buy. LMS vendors will continue to build completion tracking because that is what RFPs ask for. Satisfaction surveys will continue to be the primary metric because they are cheap to administer and always produce positive numbers.

The stable equilibrium will hold for a long time. It is too profitable and too comfortable for too many stakeholders.

But I think the pressure for change is building, and it's coming from three directions:

Direction 1: AI Makes Content Worthless

If a large language model can generate a complete e-learning module in thirty seconds, the value of content production drops to near zero. The $370 billion industry is largely a content production and delivery industry. When the marginal cost of content approaches zero, the entire value proposition of "we produce training content" collapses. The vendors who survive will be the ones who were never selling content in the first place — the ones selling outcomes, orchestration, or infrastructure.

Direction 2: The Talent Market Demands Proof

As skills-based hiring gains traction, organizations will increasingly need to demonstrate that their employees can actually do things, not just that they completed courses about things. Completion certificates will carry less weight. Demonstrated competence will carry more. This creates demand for assessment systems where the assessment is the performance — exactly the apprenticeship model.

Direction 3: The Organizations That Get It Will Outcompete

Some organizations are already building internal apprenticeship models without calling them that. They are structuring onboarding around paired programming, not courses. They are measuring competence through project completion, not quiz scores. They are building mentorship matching systems, not course catalogs. These organizations will develop talent faster, retain it longer, and outperform organizations that are still buying courses no one learns from.

Market pressure does not come from the training industry reforming itself. It comes from organizations that abandon the training industry's model and build something that actually works. When those organizations demonstrably outperform, the pressure to follow will become irresistible. Not because the L&D function changes its mind, but because the executive suite changes its expectations.

12 What Can't Be Unseen

Let me summarize the argument:

The training industry's core structural problem is the principal-agent problem: buyers and learners are different people with different incentives. Every product feature follows logically from this misalignment.
Compliance training functions as accidental operant conditioning, systematically training employees to dismiss compliance content as fast as possible.
The most robust finding in memory research — the spacing effect — has been proven for 140 years and is structurally impossible in LMS architectures built around the course as the atomic unit.
Kirkpatrick's Level 1 (satisfaction) inversely correlates with Level 3 (behavior change), making the industry's primary metric a reliable negative indicator of effectiveness.
Engagement and learning are neurologically opposed states. The industry's obsession with engagement may actively undermine learning.
Struggle and discomfort during assessment improve long-term retention, meaning satisfaction surveys may measure the inverse of learning.
Knowledge persistence is a property of systems, not individuals. The industry spends $370 billion per year pouring information into the one container (individual memory) with a proven 70% leak rate.
Test prep proves the principal-agent problem is the root cause, then demonstrates that Goodhart's Law makes any measurement-based alternative vulnerable to gaming.
The only contexts that escape both failure modes are those where assessment IS performance — all forms of apprenticeship.
Apprenticeship already scales in the military, open source, medical residency, and informal workplace learning. The industry doesn't see it because it doesn't use an LMS.
The economics of markets favor packageable products over effective processes. "Talk to each other" has no SKU, so the market sells content instead.
Two escape routes exist: monetize infrastructure around the free thing (apprenticeship orchestration) or align revenue with outcomes (performance-based contracts).

Each of these points is individually uncomfortable. Together, they describe an industry that was structurally incapable of achieving its stated purpose from the day it was built. Not broken. Never built to work.

I don't say this with satisfaction. I work in this space. I build tools that touch this industry. The people who work in L&D are, overwhelmingly, people who care deeply about helping others learn. They are operating within a structure that makes it nearly impossible to do so, using tools that are optimized for the wrong customer, measured by metrics that select against their goals. The tragedy is not that they don't care. The tragedy is that the structure they operate within is indifferent to whether they succeed.

But I think seeing the structure clearly is the precondition for changing it. You cannot fix a principal-agent problem by making better content. You cannot overcome Kirkpatrick's inversion by writing better surveys. You cannot solve a systems-level persistence problem by optimizing individual memory. You cannot escape Goodhart's Law by building better tests. These are structural problems that require structural solutions.

The good news is that the structural solution has existed for five thousand years. It works. It scales. It has been proven in every domain where competence actually matters. It just doesn't look like "training" — which is exactly the point.

The question is not whether apprenticeship works. The question is whether we can build the economic models and organizational structures that let it work at the scale modern organizations require, without reintroducing the principal-agent problem or falling prey to Goodhart's Law.

That is a hard problem. But it's the right problem. And it's a better use of $370 billion than what we're doing now.

This is part of my daily developer log. Follow my journey as I learn new skills and build tools with Brian at Actyra.