Is Cèrcol based on the Big Five?

Yes. Cèrcol measures personality using the OCEAN model (Big Five) via the IPIP public-domain item pool (Goldberg et al. 2006). The 12 team roles are derived from the AB5C circumplex (Hofstee et al. 1992) and team composition research (Bell 2007; Neuman & Wright 1999).

What makes Cèrcol different from Belbin or DISC?

Cèrcol's roles are grounded in the Big Five (OCEAN) personality model using the IPIP public-domain item pool. The scoring pipeline is fully open source and auditable. Witness Cèrcol uses forced-choice adjective selection — not Likert scales — to eliminate social desirability bias in peer assessment. Unlike Belbin or DISC, all items are public domain and the entire methodology is published and citable.

Is the personality assessment free?

The New Moon Cèrcol (10 items, Big Five snapshot) and First Quarter Cèrcol (60 items, IPIP-NEO-60, 30 facets) are always free — no account required. The Full Moon Cèrcol (120 items, IPIP-NEO-120, Witness peer assessment, cognitive ability measure) requires a one-time payment.

What is Witness Cèrcol?

Witness Cèrcol is a peer personality assessment where someone who knows you well rates you using a forced-choice adjective selection method — picking the best-fit and worst-fit adjective per round from a set covering all five OCEAN dimensions. Forced choice eliminates the social desirability bias that affects standard Likert-scale peer ratings. Dimensions where your self-rating and peer ratings diverge by more than 0.8 standard deviations are flagged as potential blind spots.

How are the 12 team roles derived?

The 12 roles are derived from the AB5C circumplex (Hofstee, De Raad & Goldberg 1992), covering all six intersections of the three team balance dimensions (Presence/Extraversion × Bond/Agreeableness × Vision/Openness) at both poles. The selection of these three dimensions as requiring team-level balance is grounded in Bell (2007) and Neuman & Wright (1999). Discipline (Conscientiousness) and Depth (Neuroticism) modulate role expression but do not define team balance.

No account is required for any instrument. During assessment, no personal data is collected — only anonymous scores are logged. Data is stored on our own servers (Hetzner Online GmbH). No third-party analytics. No data is shared with or sold to third parties.

Is Cèrcol based on the Big Five (OCEAN)?

Yes. Cèrcol measures personality using the OCEAN model (Big Five) via the IPIP — the International Personality Item Pool, a public-domain collection validated in thousands of published studies. The five dimensions are Presence (Extraversion), Bond (Agreeableness), Vision (Openness), Discipline (Conscientiousness), and Depth (Neuroticism). Because the IPIP is public domain there are no licence restrictions: the full item pool and scoring logic are open and citable.

How is Cèrcol different from Belbin, DISC, or StrengthsFinder?

Three things set Cèrcol apart. First, the items come from the Big Five (OCEAN), the most replicated personality model in academic research — not a proprietary framework. Second, the full item pool (IPIP) and scoring pipeline are public domain and auditable; there is no black box. Third, the Witness peer assessment uses forced-choice adjective selection instead of Likert scales, which eliminates the social desirability bias that affects most 360-feedback tools. Belbin and DISC use closed, proprietary methodologies.

What are blind spots in team personality assessment?

A blind spot is a personality dimension where how you see yourself and how others see you diverge significantly — more than 0.8 standard deviations apart. Cèrcol's Witness peer assessment detects blind spots by comparing your self-report with forced-choice adjective ratings from people who know you. Blind spots are neither good nor bad: they show where your self-perception and others' experience of you don't match, which is often more actionable than the score itself.

Why self-assessment alone isn't enough: the case for peer personality feedback

Three Ways Self-Report Personality Tests Mislead You

Self-report personality measures have genuine strengths. They are cheap to administer, scalable, and they capture information about internal states — anxiety, motivation, rumination — that outside observers cannot directly access. Decades of research confirm that self-reports predict meaningful outcomes: job performance, relationship satisfaction, health behaviours.

But they come with well-documented problems.

The first is social desirability bias: the tendency to present oneself in a favourable light, consciously or otherwise. When someone reads "I get into arguments with people" on a questionnaire, they know the socially preferred answer. Even with instructions to be honest, people systematically inflate scores on dimensions seen as positive (Agreeableness, Conscientiousness) and deflate scores on dimensions seen as negative (Neuroticism). Research on this bias is extensive — it has been studied since the 1950s, and it reliably distorts self-report data across cultures and contexts. See: social desirability bias. For a deeper dive into how context shapes this effect, see anonymity in personality assessment: why it matters.

The second problem is introspective inaccuracy. We tend to assume we have reliable access to our own mental processes. We do not, at least not fully. A substantial body of psychological research shows that people are poor judges of the causes of their own behaviour, the frequency of their own actions, and the consistency of their own traits. Someone who sees themselves as a patient listener may genuinely believe it — while everyone around them regularly experiences being interrupted.

The third problem is reference group confusion. When you rate yourself on "I am organised," you are implicitly comparing yourself to some reference group — but that reference group is idiosyncratic. A surgeon's idea of "organised" is not the same as a secondary-school teacher's. Because different people use different internal benchmarks, self-reports are noisier than they appear.

What Self-Other Agreement Research Reveals About the Gap

The most direct way to measure the gap between self-perception and peer perception is self-other agreement research: studies that collect both self-ratings and ratings from people who know the target well (friends, colleagues, family members), then correlate them.

The landmark meta-analysis by Funder and colleagues, and the widely cited study by John and Robins (1993), established the basic picture. Across the Big Five dimensions, self-other correlations typically fall between .40 and .60. That sounds moderate, and in some respects it is — but it also means that 60–75% of variance in peer ratings is not accounted for by self-ratings. A substantial portion of what others see in you is simply not visible to you. For the full breakdown by dimension, see self-other agreement by Big Five dimension: where the gaps are biggest.

r = 0.18

average self-other agreement on personality dimensions

65%

of people overestimate their own Conscientiousness relative to peer ratings

peer raters needed before signal exceeds noise

"The correlation between self-ratings and observer ratings across Big Five dimensions averages approximately .47 — meaning peer ratings share less than a quarter of their variance with self-ratings."
— John & Robins (1993), Journal of Personality and Social Psychology, doi:10.1037/0022-3514.63.1.146

The gaps are not uniform across dimensions. Extraversion — in Cèrcol's framework, Presence — shows the highest self-other agreement, typically around .55–.65, because it involves observable social behaviour. Neuroticism — Cèrcol's Depth — shows the lowest, often around .25–.40, because it involves internal emotional states that others cannot directly observe.

Self-Other Agreement by Cèrcol Dimension: The Data

Cèrcol Dimension	Big Five Equivalent	Typical Self-Other Correlation	Why
Presence	Extraversion	.55 – .65	Visible, behavioural (talking, leading, initiating)
Bond	Agreeableness	.40 – .55	Partly visible in conflict behaviour, partly internal
Vision	Openness	.40 – .50	Observable in creative output; harder to assess in motivation
Discipline	Conscientiousness	.45 – .55	Visible in work output; internal effort is private
Depth	Neuroticism	.25 – .40	Predominantly internal; emotional experience rarely fully expressed

These gaps have real consequences. In hiring, a candidate's self-report of conscientiousness predicts performance — but a structured reference check adds incremental validity. In leadership development, a manager's self-rated empathy may have little connection to how their team experiences them. In coaching, the most productive conversations often begin with the moment a client discovers that their self-image and others' experience of them do not match.

Why Peer Ratings Add Information Self-Report Cannot

The psychometric case for peer ratings rests on two arguments. First, aggregation: a single self-rating is a noisy measure, but so is a single peer rating — however, when you aggregate multiple peer ratings, random noise cancels out and the signal becomes clearer. Five peers who independently agree that someone is high on Presence are probably right, even if each individual rating is imperfect. Research on the minimum number of raters needed is examined in how many peer assessors do you need for reliable personality data?.

Second, perspective: peers observe behaviour in contexts that self-report cannot fully capture. A colleague sees how you behave under deadline pressure. A team member sees how you respond when your idea is challenged. This is not introspection — it is observation, and it provides a fundamentally different type of evidence.

The combination of self-ratings and peer ratings produces better predictions of outcomes — performance, relationship quality, leadership effectiveness — than either alone. This is not because one is right and the other wrong. It is because they measure different things.

The History and Limits of 360-Degree Feedback

The idea of collecting feedback from multiple sources is not new. 360-degree feedback — gathering ratings from subordinates, peers, and supervisors, in addition to self-ratings — became a mainstream management practice in the 1990s and is now used by the majority of Fortune 500 companies. Its adoption was driven precisely by the recognition that self-ratings are insufficient.

But traditional 360-degree tools have problems of their own. Most use Likert-scale ratings, which are vulnerable to social desirability and acquiescence bias. Most ask raters to rate observable behaviours, which works reasonably well for Presence and Discipline but poorly for Depth and Vision. Most produce feedback reports that are rich in data but difficult to act on. And most are administered in high-stakes organisational contexts where honest ratings are socially costly — a dynamic explored in depth in anonymity in personality assessment: why it matters.

The result is that 360-degree feedback often confirms what people already believe about themselves, rather than revealing genuine blind spots.

How the Cèrcol Witness Instrument Improves on Traditional 360s

Cèrcol uses a different approach. Rather than asking Witnesses (peers) to rate a person on behavioural statements using a Likert scale, the Witness instrument presents pairs of adjectives — for example, "talkative" versus "thorough" — and asks the Witness to choose which word better describes the person being assessed. For more on why this design matters, see forced-choice personality assessment: more honest data.

This forced-choice design has two key advantages. First, it makes social desirability much harder to act on: both options in each pair are generally positive, so there is no obviously "good" or "bad" answer. Second, it makes acquiescence bias impossible: you cannot agree with both options simultaneously. The result is ratings that more accurately reflect the Witness's genuine perception.

The adjectives in the Witness instrument are grounded in the AB5C circumplex — Hofstee, de Raad, and Goldberg's mapping of personality adjectives onto Big Five dimensions and their intersections. This ensures that the forced-choice pairs are psychometrically principled, not arbitrary. The full item pool draws on IPIP — the International Personality Item Pool, a public-domain resource that underpins much of modern personality research.

The Witness instrument is open-source and built on the IPIP tradition, ensuring scientific transparency and reproducibility. Personality research at this level does not belong behind a commercial paywall.

Practical Implications: Using Self and Peer Data Together

If you are using personality data for development — whether your own, your team's, or your clients' — the practical implication is clear: collect Witness ratings alongside self-ratings, and treat the gap between them as information rather than noise.

A large gap on Presence might mean you underestimate how much your energy fills a room — or how much it overwhelms it. A gap on Bond might mean the warmth you feel internally is not coming across in how you communicate. A gap on Depth might mean the emotional regulation you work hard at internally is not visible to others — or, conversely, that you are showing more distress than you realise.

The goal is not to decide which rating is "correct." Both are. They answer different questions. Self-ratings answer: "What is it like to be me?" Witness ratings answer: "What is it like to work with me?" For most development goals, you need both.

For a closer look at what the Witness instrument specifically measures, see what the Cèrcol Witness instrument measures. Cèrcol's assessment is designed to make this comparison easy — collecting self-ratings and Witness ratings in the same session and surfacing the gaps clearly.

How Cèrcol closes the self-assessment gap

The research reviewed here points to one clear conclusion: accurate personality data requires both self-report and peer input. Cèrcol is built on exactly this principle. The Witness instrument collects anonymous peer ratings using a forced-choice format that resists social desirability bias, then surfaces the self-vs-peer gaps so you can see what others observe that you may not. The full assessment is free at cercol.team — because understanding how you come across to others should not require a commercial licence. If you want to understand not just what you think you are like, but what it is actually like to work with you, that is where to start.

**Why peer data changes everything:** The gap between how you see yourself and how others see you is not a flaw — it's information. Cèrcol's Witness instrument captures this gap systematically, giving you data that self-report alone can never provide. Teams that use peer assessment alongside self-report make measurably better development decisions.

Summary: Why Personality Feedback Requires Both Self and Peer Data

Self-assessment is valuable but incomplete. Social desirability bias, introspective inaccuracy, and reference-group effects all limit the accuracy of self-report personality data. The self-other agreement literature shows that peer ratings share less than a quarter of their variance with self-ratings — and that the gap is largest for Depth (Neuroticism), where internal states are hardest for others to observe.

Peer ratings do not replace self-ratings. They complement them. Together, they provide a richer, more actionable picture of personality than either can alone. That is the case for peer personality feedback — and the foundation on which Cèrcol is built.

References
John, O. P., & Robins, R. W. (1993). Determinants of interjudge agreement on personality traits: The Big Five domains, observability, evaluativeness, and the unique perspective of the self. Journal of Personality and Social Psychology, 63(1), 146–156. doi:10.1037/0022-3514.63.1.146

Why self-assessment alone isn't enough: the case for peer personality feedback

Three Ways Self-Report Personality Tests Mislead You

What Self-Other Agreement Research Reveals About the Gap

Self-Other Agreement by Cèrcol Dimension: The Data

Why Peer Ratings Add Information Self-Report Cannot

The History and Limits of 360-Degree Feedback

How the Cèrcol Witness Instrument Improves on Traditional 360s

Practical Implications: Using Self and Peer Data Together

How Cèrcol closes the self-assessment gap

Summary: Why Personality Feedback Requires Both Self and Peer Data

Further reading

Related articles

Anonymity in personality assessment: why it matters more than you think

What the Cèrcol Witness instrument measures — and how it differs from self-report

Social desirability bias in personality tests