Almost every HR professional has encountered the Myers-Briggs Type Indicator (MBTI). It is the world's most commercially successful personality assessment, taken by an estimated 2.5 million people every year in corporate contexts alone. Yet the scientific community has spent decades pointing out its weaknesses. Meanwhile, the Big Five — also known as OCEAN — has accumulated more than fifty years of peer-reviewed, cross-cultural validation and remains the dominant framework in academic personality research.
So which one should your team actually use? The answer is not simply "whichever feels more intuitive." It depends on what you want to measure, how accurate you need the results to be, and whether you are willing to repeat assessments every few months when people's types change.
How MBTI Works — and Why Its Type System Is Scientifically Problematic
The Myers-Briggs Type Indicator was developed by Isabel Briggs Myers and her mother Katharine Cook Briggs during the 1940s, drawing on Carl Jung's theory of psychological types. Participants answer a series of forced-choice questions and are assigned one of 16 discrete types, each defined by four binary dimensions:
- E/I — Extraversion vs. Introversion
- S/N — Sensing vs. Intuition
- T/F — Thinking vs. Feeling
- J/P — Judging vs. Perceiving
The appeal is obvious: 16 memorable labels (INTJ, ENFP, and so on) are far easier to discuss at a workshop than a set of continuous score distributions. The test is narratively satisfying — it tells a story about who you are. It has entered popular culture as a kind of secular horoscope. The comparison with other popular-but-weaker frameworks is instructive: 16Personalities vs Big Five explores a similar dynamic with a different viral instrument.
MBTI's Test-Retest Problem: 50% of Users Get a Different Type
Here is the core empirical problem with MBTI: when people retake it a few weeks later, a large share of them receive a different four-letter type. A widely cited review found that roughly 50 % of participants receive a different MBTI type when retested just five weeks later. This happens because the instrument forces continuous traits into binary categories: someone who scores right on the E/I borderline will tip to one side or the other depending on how they are feeling that week.
"As many as three-quarters of test takers achieve a different personality type upon retaking the MBTI after only five weeks." — cited across multiple independent meta-analyses reviewing MBTI stability data.
A measure that cannot reproduce its own results is not measuring a stable underlying construct — which is, by definition, what personality is supposed to be. This is not a minor technical criticism. It means teams making decisions about roles, coaching, or communication styles based on MBTI types may be acting on data that will be different next quarter. The history of the Big Five shows how the empirical alternative was developed — through lexical analysis and factor replication rather than theory alone.
Does MBTI Predict Job Performance? What the Research Shows
Even setting aside reliability, a test is useful for teams only if its scores predict outcomes that matter: job performance, collaboration quality, retention. The independent research on MBTI's predictive validity for job performance is consistently weak. No major peer-reviewed meta-analysis has found that MBTI type predicts job performance at the level that justifies its widespread use in hiring or team assignment.
The contrast with the Big Five is stark. Barrick and Mount's landmark 1991 meta-analysis — and dozens of replications since — demonstrated that Conscientiousness (Discipline in Cèrcol's dimension naming) is a robust predictor of job performance across virtually every occupational category. Emotional Stability (the inverse of Neuroticism / Depth) predicts performance in high-stress roles. These relationships hold across cultures, languages, and industries. For a dedicated analysis of why this matters, see what is Conscientiousness: the most consistent predictor of job performance.
Why Big Five Personality Outperforms MBTI on Every Validity Metric
The Big Five is not a single test — it is a scientific model of personality structure that has emerged from factor-analytic studies of natural language and self-report data across decades and cultures. Its five dimensions are:
- Openness to Experience (Vision) — curiosity, creativity, tolerance for ambiguity
- Conscientiousness (Discipline) — self-regulation, organisation, goal-directedness
- Extraversion (Presence) — sociability, positive affect, assertiveness
- Agreeableness (Bond) — cooperation, empathy, trust
- Neuroticism (Depth) — emotional reactivity, anxiety, vulnerability to stress
Unlike MBTI, the Big Five treats each dimension as a continuous spectrum. A person is not simply an "Extravert" or an "Introvert" — they fall somewhere on a distribution, and that nuance matters enormously for predicting how they will behave in a given context. High scorers and low scorers on any given dimension can both perform well; what differs is the type of environment and tasks that suit them.
A 2003 review by Gosling et al. (DOI: 10.1016/S0092-6566(03)00046-1) confirmed that even very brief Big Five measures reliably capture meaningful variance in personality outcomes, demonstrating the model's robustness across measurement formats. The five factors replicate in languages from English and Spanish to Chinese and Yoruba, suggesting they reflect genuine cross-cultural human variation rather than cultural artefacts — a finding documented extensively in Big Five personality across cultures.
The IPIP Advantage: Free, Open, and Scientifically Auditable
Most commercial implementations of the Big Five (NEO-PI-R, BFI) carry licensing fees and restrict access. The International Personality Item Pool (IPIP) is a public-domain repository of personality items that has been independently validated against the major commercial instruments. IPIP-based assessments deliver Big Five measurement quality at zero cost, with full scientific transparency — researchers and developers can inspect every item, every scoring algorithm, every norm.
Cèrcol is built on the IPIP. Our science page documents the specific item pools and validation studies underlying each dimension. See also our explainer on what the IPIP is. It is also worth comparing MBTI to other team-focused frameworks: DISC vs Big Five and Big Five vs DISC vs Belbin examine the broader landscape of alternatives.
MBTI vs Big Five: Head-to-Head Comparison Table
| Feature | MBTI | Big Five (IPIP) |
|---|---|---|
| Test-retest reliability (5 weeks) | ~50 % receive a different type | High (r ≈ 0.75–0.85 across studies) |
| Predictive validity for job performance | Weak / not established | Strong (especially Conscientiousness) |
| Scientific basis | Jungian theory (not empirically derived) | Factor-analytic, cross-cultural replication |
| Cost | Proprietary; official version is paid | IPIP is public domain / free |
| Output format | 16 discrete types | 5 continuous dimensions |
| Independent replication | Limited | Extensive (50+ years of peer review) |
Which Should Your Team Use? A Practical Recommendation
If your goal is a memorable workshop icebreaker and you will not use the results for any decision that affects people's roles or careers, MBTI is broadly harmless — many people find it engaging. But if you want data that will still mean something in six months, that can be compared across teams, and that research links to actual performance outcomes, the Big Five is the only defensible choice.
For teams specifically, the combination of self-assessment and peer-perspective data is more revealing than either alone. Cèrcol's Full Moon assessment pairs IPIP-based self-report with structured Witness feedback, giving you both how people see themselves and how their closest collaborators experience them — all built on the same scientifically validated framework.
The choice between MBTI and Big Five is, in the end, a choice between familiarity and validity. Your team deserves both rigour and accessibility. The Big Five delivers the former; a well-designed platform can deliver the latter.
Try the Validated Alternative — Free
If you have been using MBTI and want to see what a scientifically rigorous alternative actually looks like, the simplest step is to take one. Cèrcol's free assessment is built on the IPIP Big Five, takes about fifteen minutes, and produces results that hold up across retests — unlike the type you got last month that mysteriously changed. You will see your scores on Discipline, Bond, Presence, Depth, and Vision as continuous dimensions, not binary boxes. No paywall, no forced email funnel for the results. Try it free at cercol.team.
Sources: Myers-Briggs Type Indicator — Wikipedia · Big Five personality traits — Wikipedia · Gosling, S. D., Rentfrow, P. J., & Swann, W. B. (2003). A very brief measure of the Big-Five personality domains. Journal of Research in Personality, 37(6), 504–528. DOI: 10.1016/S0092-6566(03)00046-1
Further reading
- 16Personalities vs Big Five: the viral test that gets it half right
- DISC vs Big Five: why four styles aren't enough
- Big Five vs DISC vs Belbin: a scientist's comparison
- What is reliability and validity in personality testing?
- Forced-choice personality assessment: getting more honest data
- The history of the Big Five: from Allport to Goldberg