Beta launch — 500 free Full Moon licences remaining. Help us find bugs.
Claim free access

MBTI vs Big Five: which one should your team use?

MBTI is the world's most-used personality test; Big Five is the most validated. Half of MBTI users get a different type on retest. Here is what that means.

Miquel Matoses·8 min read

Almost every HR professional has encountered the Myers-Briggs Type Indicator (MBTI). It is the world's most commercially successful personality assessment, taken by an estimated 2.5 million people every year in corporate contexts alone. Yet the scientific community has spent decades pointing out its weaknesses. Meanwhile, the Big Five — also known as OCEAN — has accumulated more than fifty years of peer-reviewed, cross-cultural validation and remains the dominant framework in academic personality research.

So which one should your team actually use? The answer is not simply "whichever feels more intuitive." It depends on what you want to measure, how accurate you need the results to be, and whether you are willing to repeat assessments every few months when people's types change.


How MBTI Works — and Why Its Type System Is Scientifically Problematic

50%
MBTI type changes within 5 weeks (test-retest failure)
r < 0.20
MBTI correlation with job performance
r = 0.22
Big Five Conscientiousness → job performance
40+ years
of peer-reviewed Big Five validation research

The Myers-Briggs Type Indicator was developed by Isabel Briggs Myers and her mother Katharine Cook Briggs during the 1940s, drawing on Carl Jung's theory of psychological types. Participants answer a series of forced-choice questions and are assigned one of 16 discrete types, each defined by four binary dimensions:

  • E/I — Extraversion vs. Introversion
  • S/N — Sensing vs. Intuition
  • T/F — Thinking vs. Feeling
  • J/P — Judging vs. Perceiving

The appeal is obvious: 16 memorable labels (INTJ, ENFP, and so on) are far easier to discuss at a workshop than a set of continuous score distributions. The test is narratively satisfying — it tells a story about who you are. It has entered popular culture as a kind of secular horoscope. The comparison with other popular-but-weaker frameworks is instructive: 16Personalities vs Big Five explores a similar dynamic with a different viral instrument.


MBTI's Test-Retest Problem: 50% of Users Get a Different Type

Here is the core empirical problem with MBTI: when people retake it a few weeks later, a large share of them receive a different four-letter type. A widely cited review found that roughly 50 % of participants receive a different MBTI type when retested just five weeks later. This happens because the instrument forces continuous traits into binary categories: someone who scores right on the E/I borderline will tip to one side or the other depending on how they are feeling that week.

"As many as three-quarters of test takers achieve a different personality type upon retaking the MBTI after only five weeks." — cited across multiple independent meta-analyses reviewing MBTI stability data.

A measure that cannot reproduce its own results is not measuring a stable underlying construct — which is, by definition, what personality is supposed to be. This is not a minor technical criticism. It means teams making decisions about roles, coaching, or communication styles based on MBTI types may be acting on data that will be different next quarter. The history of the Big Five shows how the empirical alternative was developed — through lexical analysis and factor replication rather than theory alone.


Does MBTI Predict Job Performance? What the Research Shows

Even setting aside reliability, a test is useful for teams only if its scores predict outcomes that matter: job performance, collaboration quality, retention. The independent research on MBTI's predictive validity for job performance is consistently weak. No major peer-reviewed meta-analysis has found that MBTI type predicts job performance at the level that justifies its widespread use in hiring or team assignment.

The contrast with the Big Five is stark. Barrick and Mount's landmark 1991 meta-analysis — and dozens of replications since — demonstrated that Conscientiousness (Discipline in Cèrcol's dimension naming) is a robust predictor of job performance across virtually every occupational category. Emotional Stability (the inverse of Neuroticism / Depth) predicts performance in high-stress roles. These relationships hold across cultures, languages, and industries. For a dedicated analysis of why this matters, see what is Conscientiousness: the most consistent predictor of job performance.


Why Big Five Personality Outperforms MBTI on Every Validity Metric

The Big Five is not a single test — it is a scientific model of personality structure that has emerged from factor-analytic studies of natural language and self-report data across decades and cultures. Its five dimensions are:

  • Openness to Experience (Vision) — curiosity, creativity, tolerance for ambiguity
  • Conscientiousness (Discipline) — self-regulation, organisation, goal-directedness
  • Extraversion (Presence) — sociability, positive affect, assertiveness
  • Agreeableness (Bond) — cooperation, empathy, trust
  • Neuroticism (Depth) — emotional reactivity, anxiety, vulnerability to stress

Unlike MBTI, the Big Five treats each dimension as a continuous spectrum. A person is not simply an "Extravert" or an "Introvert" — they fall somewhere on a distribution, and that nuance matters enormously for predicting how they will behave in a given context. High scorers and low scorers on any given dimension can both perform well; what differs is the type of environment and tasks that suit them.

A 2003 review by Gosling et al. (DOI: 10.1016/S0092-6566(03)00046-1) confirmed that even very brief Big Five measures reliably capture meaningful variance in personality outcomes, demonstrating the model's robustness across measurement formats. The five factors replicate in languages from English and Spanish to Chinese and Yoruba, suggesting they reflect genuine cross-cultural human variation rather than cultural artefacts — a finding documented extensively in Big Five personality across cultures.


The IPIP Advantage: Free, Open, and Scientifically Auditable

Most commercial implementations of the Big Five (NEO-PI-R, BFI) carry licensing fees and restrict access. The International Personality Item Pool (IPIP) is a public-domain repository of personality items that has been independently validated against the major commercial instruments. IPIP-based assessments deliver Big Five measurement quality at zero cost, with full scientific transparency — researchers and developers can inspect every item, every scoring algorithm, every norm.

Cèrcol is built on the IPIP. Our science page documents the specific item pools and validation studies underlying each dimension. See also our explainer on what the IPIP is. It is also worth comparing MBTI to other team-focused frameworks: DISC vs Big Five and Big Five vs DISC vs Belbin examine the broader landscape of alternatives.


MBTI vs Big Five: Head-to-Head Comparison Table

FeatureMBTIBig Five (IPIP)
Test-retest reliability (5 weeks)~50 % receive a different typeHigh (r ≈ 0.75–0.85 across studies)
Predictive validity for job performanceWeak / not establishedStrong (especially Conscientiousness)
Scientific basisJungian theory (not empirically derived)Factor-analytic, cross-cultural replication
CostProprietary; official version is paidIPIP is public domain / free
Output format16 discrete types5 continuous dimensions
Independent replicationLimitedExtensive (50+ years of peer review)

Which Should Your Team Use? A Practical Recommendation

If your goal is a memorable workshop icebreaker and you will not use the results for any decision that affects people's roles or careers, MBTI is broadly harmless — many people find it engaging. But if you want data that will still mean something in six months, that can be compared across teams, and that research links to actual performance outcomes, the Big Five is the only defensible choice.

For teams specifically, the combination of self-assessment and peer-perspective data is more revealing than either alone. Cèrcol's Full Moon assessment pairs IPIP-based self-report with structured Witness feedback, giving you both how people see themselves and how their closest collaborators experience them — all built on the same scientifically validated framework.

The choice between MBTI and Big Five is, in the end, a choice between familiarity and validity. Your team deserves both rigour and accessibility. The Big Five delivers the former; a well-designed platform can deliver the latter.


Try the Validated Alternative — Free

If you have been using MBTI and want to see what a scientifically rigorous alternative actually looks like, the simplest step is to take one. Cèrcol's free assessment is built on the IPIP Big Five, takes about fifteen minutes, and produces results that hold up across retests — unlike the type you got last month that mysteriously changed. You will see your scores on Discipline, Bond, Presence, Depth, and Vision as continuous dimensions, not binary boxes. No paywall, no forced email funnel for the results. Try it free at cercol.team.


Sources: Myers-Briggs Type Indicator — Wikipedia · Big Five personality traits — Wikipedia · Gosling, S. D., Rentfrow, P. J., & Swann, W. B. (2003). A very brief measure of the Big-Five personality domains. Journal of Research in Personality, 37(6), 504–528. DOI: 10.1016/S0092-6566(03)00046-1

Further reading

Related articles

Cèrcol uses only functional cookies — no analytics, no advertising trackers. Privacy policy