The Quality of Decisions Depends on the Quality of the Models Behind Them
Every organizational decision is generated by a mental model. The model determines what information is noticed and what is filtered out, what solutions are considered and what is ruled out before it is even articulated, and what counts as success when the decision is evaluated.
Most mental models in use in organizations are implicit. Leaders do not think "I am applying this mental model." They think "this is obviously the right call." The obvious-ness is the mental model operating — framing the situation in a way that makes one course of action appear natural and others appear unnecessary.
The quality of organizational decisions is therefore largely determined by the quality of the mental models that leaders bring to them. Improving organizational decision-making requires improving the mental models, not just the decision process.
The following eight mental models have the most practical value for organizational leaders, with consistent application. I have organized them by domain rather than by difficulty, because the limiting factor in applying good mental models is almost never comprehension — it is the cognitive bias that blocks application even when the model is known.
The Eight-Model Toolkit
Model 1: Second-Order Thinking
What it is: First-order thinking asks "what will happen?" Second-order thinking asks "and then what?" — following the causal chain through the immediate effect to the subsequent effects that the immediate effect will produce.
Organizational application: A common second-order failure: a company freezes hiring to reduce costs. First-order effect: reduced payroll cost. Second-order effects: key projects slip (fewer people to do the work), competitor hires the talent that was queued in your pipeline, remaining employees interpret the freeze as a signal of organizational distress and begin exploring alternatives, attrition increases, and the net cost of the freeze — in delayed revenue, recruiting costs for replacement hires, and knowledge loss — exceeds the payroll savings.
The second-order analysis does not always reverse the decision. Sometimes the first-order benefit outweighs the second-order costs. But the analysis cannot be skipped, because the second-order effects are often the more significant ones.
The failure mode: Taking action that solves an immediate problem while creating larger downstream ones. Cost cuts that destroy capability. Speed-ups that generate rework. Communications that solve one stakeholder's concern while alarming three others.
Model 2: Inversion and the Pre-Mortem
What it is: Instead of asking "how do we succeed?", ask "how could this fail?" — and then work backward to identify the conditions that would produce failure.
Organizational application: Run a pre-mortem before any significant decision or launch. Assume the decision has already been implemented and has failed. Ask the team: "What went wrong?" This framing bypasses the social dynamics that suppress dissent in forward-looking planning discussions (no one wants to be the person who raised doubts about a plan the leader is clearly committed to) because it is not challenging the decision — it is helping the team execute it better.
Pre-mortems consistently surface risks that standard risk assessments miss, because inversion activates a different cognitive mode than forward planning.
The failure mode: Plans that are strong in upside logic but weak in downside identification. Launches that fail for reasons that were predictable but never articulated because the forward-planning process did not create space for them.
Model 3: The Map Is Not the Territory
What it is: Every model, plan, forecast, or framework is a simplified representation of reality. The map is useful only to the extent that it accurately reflects the territory it represents — and all maps are incomplete and partially inaccurate.
Organizational application: The organizational plans and models that get built at planning time become increasingly inaccurate as circumstances change. The organization that updates its territory and adjusts its maps accordingly outperforms the one that holds onto its maps and interprets contrary evidence as execution failure.
The practical discipline: explicitly identify the assumptions embedded in every major organizational model. Which assumptions, if wrong, would most significantly change the conclusion? Monitor those assumptions specifically. When they break, update the model rather than explaining away the evidence.
The failure mode: Defending a plan or model against evidence that the plan or model is wrong. Attributing execution failure to people problems when the plan itself was built on incorrect assumptions. Running strategy processes that produce updated maps without examining whether the territory has changed in ways that make the map-territory gap more important to close.
Model 4: Base Rates and Reference Classes
What it is: When estimating the probability of success for a novel initiative, the most reliable starting point is the historical base rate of success for similar initiatives — not the specific details of this initiative that seem to distinguish it from the historical failures.
Organizational application: Most organizations systematically overestimate the probability of success for their own initiatives because they evaluate those initiatives based on their specific characteristics (our product is better, our team is stronger, our market timing is right) rather than from the base rate (most new products fail, most acquisitions destroy value, most transformation programs underperform).
The reference class forecast — starting with the base rate for a relevant class of similar decisions and then adjusting based on the specific characteristics that are genuinely distinctive — produces more accurate estimates than either pure outside-view (apply the base rate without any adjustment) or pure inside-view (evaluate the specifics and ignore the base rate).
The failure mode: Planning fallacy: consistently underestimating cost, duration, and difficulty because the plan is evaluated based on its specific details rather than the base rate for similar plans. Merger and acquisition valuations that assume above-average synergy realization when the base rate is that most acquisitions underperform. Product launch timelines that are set based on optimistic specifics rather than historical reference classes.
Model 5: Principal-Agent Dynamics
What it is: When one party (the principal) delegates decision-making authority to another party (the agent), the agent's incentives may diverge from the principal's interests. The agent has information the principal does not have, and will use that information in ways that serve the agent's interests, which may or may not align with the principal's.
Organizational application: Most organizational structures involve significant principal-agent relationships: shareholders and executives, executives and managers, managers and employees, companies and external consultants. Understanding where incentive misalignments exist — and designing structures that reduce them — is one of the highest-leverage organizational design problems.
Common organizational principal-agent problems: managers who optimize for their own teams' metrics rather than organizational outcomes; sales teams whose compensation structures reward revenue at the expense of margin or customer success; consultants whose business model incentivizes recommending additional consulting.
The failure mode: Attributing misaligned behavior to character or culture rather than to incentive structure. Adding more monitoring rather than redesigning the incentive structure. Assuming that well-intentioned individuals will override their incentive alignment even when incentives persistently push in a different direction.
Model 6: Opportunity Cost
What it is: The true cost of any decision includes not only the resources directly consumed, but also the value of the best alternative use of those resources that is foregone.
Organizational application: Organizations systematically undercount opportunity cost. A team that spends six months building a feature that produces minimal adoption did not just spend six months — they also did not build whichever alternative would have produced the most value in those six months. The cost of the decision is the sum of both.
The discipline: for any significant resource allocation decision, explicitly identify the best alternative use of those resources and quantify the value of that alternative. This quantification is often imprecise, but even an imprecise opportunity cost estimate changes the decision calculus compared to no opportunity cost estimate.
The failure mode: Sunk cost reasoning (continuing to invest in a failing initiative because resources have already been committed to it) is partially an opportunity cost blindness error — the decision to continue should be made on the basis of future costs and future opportunity costs, not past costs. Organizations that do not track opportunity cost tend to over-invest in existing initiatives and under-invest in new ones, because the cost of the new investment is visible and the cost of not investing is invisible.
Model 7: Correlation Is Not Causation
What it is: The observation that two variables move together does not establish that one causes the other. They may be correlated because a third variable causes both, or because of coincidence, or because the causality runs in the opposite direction from what is assumed.
Organizational application: Organizational data analysis is full of spurious correlations that generate confident causal claims and ineffective interventions. Offices that adopted a particular management practice also experienced above-average performance — but both may have been caused by a third variable (strong general management) rather than the practice itself. Attrition is correlated with compensation — but poor management may be the cause of both attrition and the low compensation levels that result from the limited budget that poor management tends to receive.
The discipline: before acting on a correlation, articulate the specific causal mechanism that would produce the correlation if causality runs in the assumed direction. Then ask: what alternative causal explanations exist for the same correlation? How could you distinguish between them?
The failure mode: Interventions based on spurious correlation — improving compensation in response to an attrition problem that is actually caused by management quality; adding communication programs in response to an engagement problem that is actually caused by organizational structure; improving sales training in response to a revenue problem that is actually caused by product-market fit.
Model 8: Reversibility and Asymmetry
What it is: Decisions differ in their reversibility. Some decisions can be easily undone if they turn out to be wrong; others cannot. The decision-making process should be calibrated to the reversibility of the decision — low rigor for highly reversible decisions, high rigor for irreversible ones.
Organizational application: Jeff Bezos's distinction between one-way and two-way doors is the most well-known articulation of this model: one-way doors (irreversible decisions) require careful deliberation; two-way doors (reversible decisions) should be made quickly at the lowest appropriate organizational level.
The practical discipline: explicitly classify decisions by reversibility before choosing a decision-making process. The cost of applying high rigor to a reversible decision is speed and agility. The cost of applying low rigor to an irreversible decision is potentially severe and lasting. Many organizations apply the same decision-making process to decisions of very different reversibility — producing over-deliberation on decisions that should be made quickly and under-deliberation on decisions that warrant much more care.
The failure mode: Applying committee processes to reversible decisions (producing organizational paralysis) while treating irreversible decisions with insufficient deliberation because they were not identified as structurally different. Strategic acquisitions that close before rigorous diligence is complete (treated as reversible when they are not). Product decisions that cannot be undone without significant customer impact treated as internally reversible.
One Decision, Several Models: A Worked Example
The models are most valuable not in isolation but stacked on a single decision, because real decisions activate several at once and the interaction is where the analysis earns its keep. Take a familiar one: a company is deciding whether to acquire a smaller competitor.
Start with base rates. The inside view is loud — strong strategic fit, a team the leadership admires, a price that looks reasonable against projected synergies. The outside view is quieter and more accurate: most acquisitions underperform, and most synergy projections are not realized. Model 4 does not kill the deal; it reframes the burden of proof. The question is no longer "is this a good acquisition?" but "what is genuinely distinctive here that would justify departing from the base rate that says it probably is not?"
Now run second-order thinking on the integration. The first-order effect is a larger combined entity. The second-order effects are the ones that usually decide the outcome: the acquired team's best people, whose informal knowledge the deal model implicitly depends on, read the acquisition as a signal to leave; two incompatible toolchains have to be merged at the cost of a quarter of roadmap; and the customers of the acquired company, promised continuity, watch their product stagnate during integration. None of these appear in the synergy spreadsheet, and all of them are predictable.
Inversion sharpens this further. Run the pre-mortem: it is eighteen months later and the acquisition is judged a failure — what happened? The team will name things the forward planning never surfaced, because the dynamics of a deal everyone is committed to suppress them. And finally, reversibility reframes the whole decision. An acquisition is a one-way door. That single classification should raise the rigor of every step above and explains why "we can always restructure later" is the most dangerous sentence in the room. No single model decided this. Four of them, stacked, turned an attractive deal into an honestly evaluated one.
Where Mental Models Mislead
A toolkit of mental models is itself a map, and the map-is-not-the-territory model applies recursively to the models themselves. Knowing them does not make a leader reliably better, and used carelessly they make decisions worse in three specific ways worth naming.
The first is sophisticated rationalization. A leader who knows eight models can find one that endorses whatever they already wanted to do. Second-order thinking justifies inaction; opportunity cost justifies action; reversibility justifies both speed and caution depending on which framing flatters the prior. The models give the bias a vocabulary, and a confidently model-labeled decision is harder to challenge than an honest hunch. The defense is to choose which model applies before knowing what it will recommend, not after.
The second is overfitting the frame. Every model is a lens that brings one structure into focus and pushes others out. A leader who reaches for principal-agent dynamics on every people problem will see incentive misalignment where the real issue is unclear authority or simple capability. The model is right about the cases it fits and wrong about the cases it is forced onto. The discipline is to hold more than one model against a situation and notice which one it actually resists.
The third is mistaking the label for the work. "We are in a second-order thinking moment" is not second-order thinking. The model names the move; it does not perform it. The hard work — actually tracing the causal chain, actually finding the reference class, actually classifying the door — is the same difficult thinking it always was. The vocabulary makes that work shareable. It does not make it shorter.
Building a Mental Model Toolkit Across a Leadership Team
Individual mental models are useful. Shared mental models across a leadership team are significantly more valuable, because they create a common analytical language that improves collective decision-making.
Building a shared toolkit requires more than distributing reading material. It requires applying the models explicitly and visibly in actual decisions. The discipline: when a significant decision is being made, explicitly invoke the relevant mental model. "We are in a second-order thinking moment here — what happens after the immediate effect?" "What is the reference class for this acquisition? What is the base rate of success?" This explicit invocation, repeated consistently, builds the shared language.
It also requires surfacing the failure modes. Most mental model failures are not intellectual — leaders can articulate second-order thinking perfectly well. They are motivational: the cognitive bias that blocks application when applying the model would require acknowledging an uncomfortable conclusion. Pre-mortems surface concerns that no one articulated during forward planning because the social dynamics of leadership teams suppressed dissent. Base rate analysis produces lower confidence estimates that may require leaders to revise projections they have already communicated externally.
Building the norm that applying good mental models is valued even when — especially when — the result is an uncomfortable conclusion is the prerequisite for the toolkit being genuinely useful rather than decoratively referenced.
What You Can Adopt This Week
You do not need all eight models running to get value. You need one, applied to one real decision, out loud.
Pick the next significant decision your team is about to make and attach a single model to it before the discussion starts — the one most likely to be neglected. If the team is enthusiastic and aligned, that is base rates: name the reference class and the historical success rate before anyone argues the specifics. If the team is committed and moving fast, that is the pre-mortem: spend ten minutes assuming it has already failed. If the decision is hard to reverse, that is reversibility: say so explicitly, and let that raise the rigor.
The point is not to run a model checklist on every decision — that produces the paralysis the toolkit is supposed to prevent. The point is to install one disciplined interruption at the moment a leadership team is most prone to its characteristic error: deciding by inside-view enthusiasm, by suppressed dissent, or by treating a one-way door as a two-way one. Do that consistently and the vocabulary becomes shared, and the shared vocabulary is what turns eight private models into one collective decision-making capability.
The eight models above are not a complete toolkit for organizational decision-making. They are the eight that consistently produce the highest leverage when applied well and produce the most costly failures when violated. They are the starting point, not the ceiling.
Continue in this series
This piece is part of What Is Organizational Governance? A Systems Practitioner's Complete Guide, my systematic guide to organizational governance and operating systems. Related reading:
- Second-Order Consequences: The Mental Model Most Decision-Makers Skip
- Systems Thinking for Leaders Who Don't Come From Engineering
- Writing That Changes How People Think, Not Just What They Do
- Systems Thinking: A Complete Guide for Organizational Practitioners
Working through this in your own organization? I help technical leaders design it directly — advisory engagements.






