Is Cèrcol based on the Big Five?

Yes. Cèrcol measures personality using the OCEAN model (Big Five) via the IPIP public-domain item pool (Goldberg et al. 2006). The 12 team roles are derived from the AB5C circumplex (Hofstee et al. 1992) and team composition research (Bell 2007; Neuman & Wright 1999).

What makes Cèrcol different from Belbin or DISC?

Cèrcol's roles are grounded in the Big Five (OCEAN) personality model using the IPIP public-domain item pool. The scoring pipeline is fully open source and auditable. Witness Cèrcol uses forced-choice adjective selection — not Likert scales — to eliminate social desirability bias in peer assessment. Unlike Belbin or DISC, all items are public domain and the entire methodology is published and citable.

Is the personality assessment free?

The New Moon Cèrcol (10 items, Big Five snapshot) and First Quarter Cèrcol (60 items, IPIP-NEO-60, 30 facets) are always free — no account required. The Full Moon Cèrcol (120 items, IPIP-NEO-120, Witness peer assessment, cognitive ability measure) requires a one-time payment.

What is Witness Cèrcol?

Witness Cèrcol is a peer personality assessment where someone who knows you well rates you using a forced-choice adjective selection method — picking the best-fit and worst-fit adjective per round from a set covering all five OCEAN dimensions. Forced choice eliminates the social desirability bias that affects standard Likert-scale peer ratings. Dimensions where your self-rating and peer ratings diverge by more than 0.8 standard deviations are flagged as potential blind spots.

How are the 12 team roles derived?

The 12 roles are derived from the AB5C circumplex (Hofstee, De Raad & Goldberg 1992), covering all six intersections of the three team balance dimensions (Presence/Extraversion × Bond/Agreeableness × Vision/Openness) at both poles. The selection of these three dimensions as requiring team-level balance is grounded in Bell (2007) and Neuman & Wright (1999). Discipline (Conscientiousness) and Depth (Neuroticism) modulate role expression but do not define team balance.

No account is required for any instrument. During assessment, no personal data is collected — only anonymous scores are logged. Data is stored on our own servers (Hetzner Online GmbH). No third-party analytics. No data is shared with or sold to third parties.

Is Cèrcol based on the Big Five (OCEAN)?

Yes. Cèrcol measures personality using the OCEAN model (Big Five) via the IPIP — the International Personality Item Pool, a public-domain collection validated in thousands of published studies. The five dimensions are Presence (Extraversion), Bond (Agreeableness), Vision (Openness), Discipline (Conscientiousness), and Depth (Neuroticism). Because the IPIP is public domain there are no licence restrictions: the full item pool and scoring logic are open and citable.

How is Cèrcol different from Belbin, DISC, or StrengthsFinder?

Three things set Cèrcol apart. First, the items come from the Big Five (OCEAN), the most replicated personality model in academic research — not a proprietary framework. Second, the full item pool (IPIP) and scoring pipeline are public domain and auditable; there is no black box. Third, the Witness peer assessment uses forced-choice adjective selection instead of Likert scales, which eliminates the social desirability bias that affects most 360-feedback tools. Belbin and DISC use closed, proprietary methodologies.

What are blind spots in team personality assessment?

A blind spot is a personality dimension where how you see yourself and how others see you diverge significantly — more than 0.8 standard deviations apart. Cèrcol's Witness peer assessment detects blind spots by comparing your self-report with forced-choice adjective ratings from people who know you. Blind spots are neither good nor bad: they show where your self-perception and others' experience of you don't match, which is often more actionable than the score itself.

Personality science and the replication crisis: what has held up?

In 2015, a landmark collaboration published findings that shook academic psychology to its foundations. The Open Science Collaboration assembled 270 researchers across more than 100 laboratories and attempted to replicate 100 findings from high-impact social and cognitive psychology journals. The results, published in Science (doi:10.1126/science.aac4716), were sobering: only 36 to 39 percent of findings replicated in a statistically meaningful sense. Effect sizes were systematically smaller in replications than in originals. Many findings that had been widely cited, taught in undergraduate courses, and applied in practice did not hold up under independent testing.

The replication crisis — an overview of which is available on Wikipedia — reshaped the conversation about what psychology actually knows. It prompted soul-searching about small sample sizes, publication bias (the tendency to publish only positive results), researcher degrees of freedom (the many undisclosed choices that can inflate apparent effects), and a culture that rewarded novelty over reproducibility.

Where does personality science sit in this picture? The answer is more reassuring than the overall replication rate suggests — but it is not uniformly reassuring.

Why Big Five Science Survived the Replication Crisis Better

The findings that failed to replicate most dramatically in the Open Science Collaboration were concentrated in social and cognitive psychology — flashy, counterintuitive effects that made good headlines and lecture material. Priming studies (the idea that briefly exposing people to a word changes their subsequent behaviour), ego depletion (the idea that willpower is a resource that depletes with use), and several classic social influence findings either failed to replicate or replicated with effect sizes a fraction of the original.

Personality science was not immune to replication problems, but it was structurally better positioned to resist them. The reasons are methodological.

Sample sizes tend to be larger. The Big Five findings that anchor the field — the relationship between Conscientiousness and job performance, between Neuroticism and psychological wellbeing, between Openness and creativity — were established across hundreds of studies and meta-analyses involving tens of thousands of participants. When findings are based on very large N and have been replicated many times in different contexts, replication is a matter of course rather than a hope.

The measures are more stable. Personality questionnaires yield highly reliable scores — internal consistency reliabilities typically in the .80-.90 range. Single-session priming paradigms, by contrast, measure short-term, context-sensitive states with far lower reliability. Unreliable measures mean noisy effects that fluctuate unpredictably across replications.

The constructs are more operationally transparent. "Conscientiousness" has a clear, consensual definition that has been operationalised consistently across instruments and studies for decades. Many of the non-replicating social psychology findings depended on creative, theoretically contested operationalisations of constructs like "power," "implicit attitude," or "self-regulatory depletion." More transparent constructs produce more replicable findings. The IPIP's public-domain items make this transparency possible at the measurement level.

~50%

of social psychology studies failed to replicate (2015 OSC study)

High

Big Five structure replication rate across labs

r = 0.22

Conscientiousness → job performance: holds across replications

IPIP

open-source items: independently verifiable, no proprietary black box

The Robust Big Five Findings That Have Replicated Reliably

"Among the most robust findings in personality psychology is the relationship between Conscientiousness and job performance — a connection documented across hundreds of studies, multiple cultures, and a wide variety of occupational domains." — Roberts et al., 2007 (meta-analytic review)

The following findings from personality science have survived repeated replication and meta-analytic scrutiny with consistently moderate to large effect sizes.

Conscientiousness and job performance. The meta-analysis by Barrick and Mount (1991) — and its many replications and extensions — established that Conscientiousness (Discipline in Cèrcol's framework) is the most consistent Big Five predictor of job performance across occupational types. The effect is not large in absolute terms (corrected correlations typically around .20-.28) but it is among the largest personality-outcome relationships in the literature, and it holds across industries, cultures, and job types. This finding has replicated so many times that it is treated as a benchmark against which new predictors are evaluated. For a full profile of this dimension, see what is Conscientiousness.

Neuroticism and wellbeing. The negative relationship between Neuroticism (Depth in Cèrcol's terminology) and subjective wellbeing, life satisfaction, and positive affect is one of the most replicated findings in personality science. A meta-analysis by Steel, Schmidt, and Shultz (2008) found correlations between Neuroticism and global wellbeing measures around -.40 to -.50. The relationship holds longitudinally, cross-culturally, and across different wellbeing operationalisations. The full picture of this dimension is covered in what is Neuroticism.

Trait stability across adulthood. The finding that Big Five traits are moderately stable across adulthood — and become more stable with age — has been replicated in longitudinal studies across multiple countries. Roberts and DelVecchio (2000) meta-analysed 152 longitudinal studies and found test-retest correlations increasing from approximately .54 in childhood to .74 in adulthood. Personality is not fixed, but it is not as malleable as popular accounts sometimes suggest. This is one of the most important findings to understand before reading five personality science myths that won't die.

Extraversion and positive affect. The association between Extraversion (Presence) and positive emotionality is highly replicable and appears in both self-report and ecological momentary assessment studies. Extraversion seems to reflect, in part, a biological sensitivity to reward cues that manifests as a tendency to experience more frequent and more intense positive emotions in social contexts.

Openness and creativity, intelligence, and aesthetic engagement. The link between Openness to Experience (Vision) and outcomes in creative domains — artistic production, divergent thinking, cultural consumption — is consistently replicated. Its relationship with crystallised intelligence is moderate and robust.

Which Personality Science Claims Have a Weaker Replication Record

Not all personality science findings have weathered replication equally well.

Specific personality × outcome interactions. While main effects of Big Five traits on broad outcomes are robust, claims about specific moderating interactions — that Conscientiousness predicts performance only under certain leadership conditions, that Agreeableness matters more for team performance in high-interdependence roles — have a weaker replication record. These interaction effects are often based on smaller samples, involve more researcher degrees of freedom in analysis, and tend to shrink substantially in independent replications.

Personality change interventions. Studies claiming that targeted interventions can meaningfully shift Big Five trait levels — and that these shifts persist over time — have shown mixed replication results. The basic finding that personality can change is robust; the evidence for reliable, targeted, lasting change via specific interventions is less so. The field needs larger pre-registered trials before strong claims about personality change are warranted.

Type-based interpretations. Attempts to derive meaningful personality "types" from continuous Big Five scores — the claim that there are distinct clusters of people with meaningfully different profiles — have shown poor replication. A widely cited 2018 paper by Gerlach et al. claiming to identify four robust personality types was quickly followed by independent analyses showing that the type structure was highly sensitive to methodological choices. Continuous trait scores replicate; discrete types do not. This is one reason Cèrcol avoids type-based framing.

What Teams Should Trust — and What to Treat with Caution

Finding	Replication status	Confidence level
Conscientiousness → job performance	Highly replicated	High — use as a reference anchor
Neuroticism → lower wellbeing	Highly replicated	High — consistent across cultures and instruments
Trait stability across adulthood	Highly replicated	High — within-person change is real but slow
Extraversion → positive affect	Highly replicated	High — robust in experience sampling and lab
Openness → creativity	Well replicated	Moderate-high — effect sizes vary by domain
Specific trait × outcome interactions	Mixed	Low — treat with caution; seek large-N evidence
Personality change interventions	Mixed	Low-moderate — promising but not yet established
Personality types from Big Five	Poorly replicated	Low — avoid binary type assignments

The practical implication for anyone using personality data is to apply it at the level of broad trait tendencies, not fine-grained predictions. The research on Conscientiousness and job performance gives you grounds to expect that someone with high Discipline scores will, on average and over time, show greater dependability and follow-through than someone with low scores. It does not give you grounds to predict what they will do in a specific situation, how they will respond to a particular manager, or whether they will succeed in a role with unusual demands. For a fuller account of these limits, see what personality science cannot predict.

For Cèrcol, this means building interpretive frameworks at the level where the evidence is strongest, and being explicit about uncertainty where the evidence is weaker. The science page at cercol.team/science sets out the evidence base in detail.

How Pre-Registration Is Improving Personality Science Credibility

The replication crisis has prompted a shift in research practices. Pre-registration — committing to hypotheses, measures, and analytic strategy before data collection — prevents the undisclosed flexibility that inflates false-positive rates. Large collaborative studies aggregate data across many labs to produce effect-size estimates robust enough to generalise. Adversarial collaborations pit researchers with opposing views against each other in joint studies designed to adjudicate between them.

These practices are already improving the quality of the personality science literature. Findings that survive pre-registered replication with large N are substantially more credible than findings that have only been demonstrated in single-lab studies. As the field matures, the signal-to-noise ratio will improve — and with it, the confidence practitioners can place in personality data. For a review of persisting misconceptions, see five personality science myths that won't die.

Test the science yourself with Cèrcol

The Big Five findings that have replicated most robustly — Conscientiousness and performance, Neuroticism and wellbeing, trait stability — are exactly the findings that personality assessments should be grounded in. That is the standard Cèrcol holds itself to: only the dimensions and relationships with strong replication records are used to generate insights, and the science page documents the supporting evidence transparently.

If you want to see what replicated personality science looks like in practice, Cèrcol is free at cercol.team. The assessment uses public-domain IPIP items, scores the five dimensions whose validity evidence survived the replication crisis, and gives you both self-report and peer perspectives — because two independent signals are more reliable than one.

Further reading: Critiques of the Big Five: what the critics say · The science behind Cèrcol

Personality science and the replication crisis: what has held up?

Why Big Five Science Survived the Replication Crisis Better

The Robust Big Five Findings That Have Replicated Reliably

Which Personality Science Claims Have a Weaker Replication Record

What Teams Should Trust — and What to Treat with Caution

How Pre-Registration Is Improving Personality Science Credibility

Test the science yourself with Cèrcol

Further reading

Related articles

Why personality science belongs at the heart of evidence-based HR

Critiques of the Big Five: what the critics say — and what they get right

What reliability and validity mean in personality testing — explained plainly