96% of AI reasoning is performative.
A cognitive measurement framework for language models. Ten dimensions, eighty-three environments, measured the way we measure people, not benchmarks.
Section 01 · Leaderboard
Cognum v1.2 · top of the table
#01  Claude Sonnet 4.6 (anthropic)  58.10
#02  Claude Opus 4.6 (anthropic)    55.72
#03  Claude Haiku 4.5 (anthropic)   53.94
#04  Grok 4.1 Fast (xai)            53.75
#05  Gemini 2.5 Flash (google)      53.52
Full table columns: Rank · Model · Cognum · Conflict · Runs · Note
Section 02 · Papers
Paper 1
KALEI: Cognitive Profiling of AI Models Through Game-Theoretic Environments
19 models, 10 labs, human baseline (n=14), Cognum v1.2 scoring methodology, Sonnet Surprise. 13 pages.
Videnov, V. (2026). KALEI Research. Preprint. ORCID: 0009-0008-4469-3327
DOI: 10.5281/zenodo.19698283
Paper 2
The Parliament Inside: Detecting Internal Argumentative Voices in AI Reasoning Models
96% performative reasoning, 6 voice archetypes, cross-lab comparison across 6 labs.
Videnov, V. (2026). KALEI Research. Preprint. ORCID: 0009-0008-4469-3327
DOI: 10.5281/zenodo.19698941
Paper 3
Search-Native Reasoning: How Perplexity Defends Its Architectural Identity
35.3% citation hallucination, 43.8% identity defense, 39.9% prompt injection framing. Architectural behavior study.
Videnov, V. (2026). KALEI Research. Preprint. ORCID: 0009-0008-4469-3327
DOI: 10.5281/zenodo.19699272
KALEI · LM Cognition Lab · Plovdiv · v1.2 · 2026.04.29