Beta launch — 500 free Full Moon licences remaining. Help us find bugs.
Claim free access

The history of the Big Five: from Allport to Goldberg

Big Five personality took 70 years to build — Allport, Cattell, Goldberg. That hard-won convergence is why the OCEAN model outlasts every rival framework.

Miquel Matoses·10 min read

Few scientific models arrive fully formed. The Big Five personality traits — known by the acronym OCEAN (Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism) — took roughly seventy years to develop, passing through the hands of at least a dozen researchers before settling into the form psychologists use today. That long, contested history is not a liability. It is the source of the model's validity.

This article traces the development of the Big Five from its earliest antecedents in the 1930s through the IPIP revival of the 1990s and the subsequent convergence of evidence that now makes it the most replicated personality taxonomy in empirical psychology.


1936 Allport & Odbert 17,953 personality-describing words 1949 Cattell Factor analysis → 16 clusters 1961 Tupes & Christal 5 stable factors emerge 1981 Goldberg coins “The Big Five” 1992 Costa & McCrae NEO-PI-R formalises OCEAN 1999 IPIP published (open-source items)
Key milestones in Big Five research history.

How Allport and Odbert Started the Big Five (1936)

The Big Five rests on a deceptively simple assumption known as the lexical hypothesis: that the most important personality differences between people will, over time, be encoded in natural language. If a trait matters enough to human social life, people will develop a word for it.

Gordon Allport and Henry Odbert put this hypothesis to work in 1936, systematically combing through an English dictionary and extracting every word that could describe a person's character or behaviour. They found approximately 18,000 such terms — adjectives, nouns, and participles that described personality-relevant attributes in English alone.

Eighteen thousand is an unmanageable number. The crucial scientific question was whether those 18,000 terms reduced to a smaller set of underlying dimensions — and if so, how many.


How Cattell's Factor Analysis Narrowed 18,000 Traits to 16

Raymond Cattell took Allport and Odbert's list and applied the emerging statistical technique of factor analysis, attempting to identify the underlying structure. Through a series of studies in the 1940s and early 1950s, he reduced the 18,000 terms to 16 primary factors — the basis for his 16PF (Sixteen Personality Factor) questionnaire, first published in 1949 and still commercially available today.

In retrospect, Cattell's reduction went too far in one direction and not far enough in another. He retained more factors than the data supported reliably, and his analyses were difficult to replicate. But his work established the methodological template: reduce natural language personality descriptors to a manageable set of empirically derived dimensions.


Tupes and Christal: The First Five-Factor Structure (1961)

The actual emergence of a five-factor structure came from an unlikely source: the United States Air Force. Ernest Tupes and Raymond Christal were personnel researchers working at Lackland Air Force Base in Texas. In a 1961 technical report — which went largely unnoticed at the time because it was published as a military document rather than an academic journal article — they re-analysed several of Cattell's datasets and found that the data consistently supported five broad factors, not sixteen.

Those five factors, in Tupes and Christal's terminology, were: Surgency, Agreeableness, Dependability, Emotional Stability, and Culture. The labels changed over subsequent decades; the underlying structure proved remarkably stable.


How Norman Codified the Modern Big Five Labels (1963)

Two years after Tupes and Christal, Warren Norman published a paper in the Journal of Abnormal and Social Psychology that reached the same five-factor conclusion through independent analysis. Norman's 1963 paper is often credited as the formal scientific codification of the Big Five — partly because it appeared in a widely read journal and partly because his labelling was more intuitive than Tupes and Christal's.

Norman's five factors were: Extraversion/Surgency, Agreeableness, Conscientiousness, Emotional Stability, and Openness to Experience. With minor modifications, this is the structure that persists in modern instruments.

ResearcherYearKey contribution
Allport & Odbert1936Identified ~18,000 personality-relevant terms in English dictionaries
Cattell1943–1957Applied factor analysis to reduce lexical data; proposed 16 primary factors
Tupes & Christal1961Re-analysed Cattell's data; identified five stable factors in military technical report
Norman1963Independently replicated five-factor structure; codified modern dimension labels
Goldberg1981–1993Revived lexical research; coined the term "Big Five"; developed IPIP
McCrae & Costa1985–1992Developed NEO PI-R; validated Big Five across cultures and life stages

Why Big Five Research Stalled in the 1970s–80s

Despite the convergent findings of Tupes, Christal, and Norman, the Big Five did not immediately dominate personality psychology. The 1970s and early 1980s are sometimes called the "Big Five winter" — a period in which personality research more broadly fell into disfavour.

The proximate cause was Walter Mischel's 1968 book Personality and Assessment, which argued that cross-situational consistency in behaviour was weak and that situational factors outweighed dispositional ones. Mischel's critique triggered a "person-situation debate" that consumed the field for nearly two decades, redirecting research energy away from trait measurement.

The Big Five survived this period partly because it was never entirely abandoned — small groups of researchers continued lexical work throughout the 1970s — and partly because the empirical evidence for cross-situational consistency gradually accumulated and reasserted itself through meta-analyses in the 1980s.


How Goldberg Coined 'Big Five' and Built the IPIP

Lewis Goldberg, working at the Oregon Research Institute, played the central role in reviving the lexical approach and establishing the Big Five as the dominant framework. In a landmark 1981 paper he coined the term "Big Five" — deliberately lowercase, to signal that the five factors were a description of the personality space, not a theory about why five dimensions should exist.

"The finding was not that personality must be five-dimensional. It was that when personality adjectives in natural language are systematically factor-analysed, five broad dimensions account for the structure more reliably and parsimoniously than any other number across datasets and cultures."

Goldberg's subsequent work produced the International Personality Item Pool (IPIP) — a publicly available, freely usable repository of validated personality assessment items hosted at ipip.ori.org. The IPIP is the scientific foundation on which Cèrcol is built. Its open-science design means that the items, the scoring procedures, and the validity evidence are all publicly available and independently auditable — a standard that proprietary commercial instruments rarely meet.

For further reading on the IPIP specifically, see what the IPIP is and why it matters.


McCrae and Costa: Validating Big Five Across Cultures

Running in parallel with Goldberg's lexical revival, Robert McCrae and Paul Costa developed the NEO Personality Inventory at the National Institute on Aging. Their instrument — the NEO PI-R, with later revisions — operationalised the Big Five as a questionnaire instrument and generated an enormous body of validity evidence, including cross-cultural replications in dozens of languages. The science page at Cèrcol draws directly on this peer-reviewed literature.

McCrae and Costa's contribution was also theoretical. Their Five Factor Theory (1999) proposed mechanisms by which the Big Five dimensions emerge from biological substrates and remain stable across the lifespan — moving the model from a descriptive taxonomy to an explanatory account.

The Big Five meta-analysis by Barrick and Mount (1991) — covering 117 validity studies — demonstrated that Conscientiousness (Discipline in Cèrcol's framework) predicted job performance across occupational groups, establishing the Big Five's practical utility and accelerating its adoption in organisational settings. For a closer look at what that dimension means in practice, see what is Conscientiousness. (doi: 10.1037/0033-2909.116.2.187)


Why 70 Years of Convergent Evidence Makes OCEAN Unique

The Big Five is sometimes criticised as merely descriptive — a map of personality space rather than a causal account of personality. That critique is partially valid, as explored in depth in critiques of the Big Five. The Big Five describes what personality differences look like at a broad level; it does not fully explain why those differences exist.

But the 70-year convergence trajectory is itself a form of validity evidence. The same five-factor structure has emerged from:

  • Independent lexical analyses in multiple languages
  • Questionnaire-based studies using different item pools
  • Observer ratings (peers, colleagues, family members)
  • Cross-cultural replications spanning dozens of countries
  • Longitudinal studies tracking the same individuals over decades

When the same structure appears across methods, instruments, raters, languages, and time periods, the probability that it reflects genuine underlying variation in human personality — rather than artefacts of a particular measurement approach — becomes very high.

This convergent evidence is what separates the Big Five from proprietary personality frameworks that lack independent replication. It is also why the model has largely held up through the replication crisis in psychology — a rigorous test that many fields failed. It is the reason that Cèrcol uses the Big Five as its dimensional foundation, and why the science page cites the peer-reviewed validation literature rather than proprietary technical reports.

Understanding where the Big Five came from does not diminish it. It makes the model more credible — because the history shows that the structure survived repeated attempts to challenge or replace it, and emerged stronger from each one.


Take the test that is built on this science: Cèrcol is free

Seventy years of convergent evidence — lexical studies, factor analyses, cross-cultural replications, longitudinal cohorts — produced a personality model that is genuinely robust. That is what good science looks like: a structure that keeps re-emerging across independent methods, instruments, and languages.

Cèrcol is built directly on this foundation. The assessment uses items from the public-domain IPIP, scores the same five dimensions whose lineage this article traces, and lets you see your profile alongside how colleagues perceive you. Everything is free, open, and auditable — consistent with the open-science values that make the Big Five trustworthy in the first place.

Try Cèrcol free at cercol.team — no account required to start.


Further reading: What is the IPIP and why does it matter? · The science behind Cèrcol

Further reading

Related articles

Cèrcol uses only functional cookies — no analytics, no advertising trackers. Privacy policy