Sunday, April 5, 2026
HomeCrypto MiningGPT-5.4 Professional jumps to 150 IQ on MESNA Norway check as OpenAI...

GPT-5.4 Professional jumps to 150 IQ on MESNA Norway check as OpenAI breaks its personal document

OpenAI’s newest GPT-5.4 Professional mannequin has now achieved an IQ rating increased than 99.96% of all human beings, giving markets a contemporary sign that AI functionality beneficial properties are beginning to outpace the same old product-cycle noise.

OpenAI’s GPT-5.4 Professional touches 150 on public IQ benchmark as markets enter one other macro-heavy week

TrackingAI’s public leaderboard now locations OpenAI GPT-5.4 Professional at an IQ rating of 150, a pointy step up from the 136 rating that OpenAI’s o3 posted on the Mensa Norway check final 12 months.

The leap arrives at a second when market consideration has narrowed round Iran, power, labor softness, and the following inflation print. That creates a unique query for the week forward: how shortly is machine intelligence compounding, and when will that acceleration start to overlap with financial positioning?

Why this issues: A transfer from 136 to 150 on a broadly understood benchmark compresses a fancy functionality shift right into a easy sign. For companies, that sign feeds straight into choices round automation, software program budgets, and headcount planning. For markets, it provides one other variable alongside charges, inflation, and development expectations.

OpenAI launched GPT-5.4 as its most succesful and environment friendly frontier mannequin for skilled work, with stronger coding, software use, and laptop use, and a context window of as much as 1 million tokens. In the identical launch, OpenAI stated GPT-5.4 achieved a brand new state-of-the-art on GDPval and exceeded human efficiency on OSWorld-Verified.

These benchmarks are separate from a public IQ check, but the path of journey aligns. Functionality is rising throughout separate measurement methods, and that rise is turning into quick sufficient to affect budgeting, hiring plans, workflow design, and software program spend.

A rating of 150 on a public IQ-style benchmark compresses a broader functionality transfer right into a single, moveable sign. The quantity is straightforward to grasp even earlier than the methodology is debated.

The sooner o3 Mensa outcome established the benchmark and its limits. GPT-4.1’s one-million-token context window confirmed how OpenAI was extending mannequin utility throughout long-horizon code and doc duties, whereas our evaluation of OpenAI’s increasing capital loop linked mannequin progress to {hardware} enlargement, financing loops, and infrastructure demand.

Taken collectively, these developments place the most recent IQ rating inside a broader industrial and financial context. A transfer from 136 to 150 on a public benchmark is placing by itself. A transfer from 136 to 150 whereas OpenAI is pushing deeper into software use, laptop use, enterprise productiveness, and capital-intensive infrastructure carries broader implications.

Public IQ benchmarks are restricted, however the functionality curve continues to be transferring increased

Public IQ-style exams stay imperfect devices for measuring frontier fashions. TrackingAI runs a public Mensa-style benchmark and likewise maintains a more durable non-public offline check.

IQ-style exams compress a slender slice of cognitive efficiency right into a single quantity, obscuring variation throughout reasoning varieties, context dealing with, creativity, and real-world problem-solving.

For AI and people alike, scores are delicate to check design, coaching publicity, and sample familiarity, which makes them a loud proxy for common functionality.

An IQ of 150 sits on the excessive higher tail of the distribution, typically related to people akin to Albert Einstein or Richard Feynman. In sensible phrases, it implies very quick abstraction, sturdy sample recognition, and the flexibility to navigate complicated, multi-step issues with restricted steerage.

The platform experiences scores as rolling averages throughout latest completions, and the methodology raises acquainted questions round immediate construction, reproducibility, training-set contamination, and format familiarity. These considerations have been already seen when o3 reached 136, they usually stay lively now that GPT-5.4 Professional sits at 150.

OpenAI’s o3 scores 136 on Mensa Norway test, surpassing 98% of human populationOpenAI’s o3 scores 136 on Mensa Norway test, surpassing 98% of human population
Associated Studying

OpenAI’s o3 scores 136 on Mensa Norway check, surpassing 98% of human inhabitants

OpenAI’s o3 mannequin reaches Mensa-Degree IQ in unbiased testing.

Apr 17, 2025 · Liam ‘Akiba’ Wright

Even with these limits, the broader sample has change into more durable to dismiss. One remoted benchmark outcome could be defined away as a quirk. A cluster of beneficial properties throughout public IQ-style testing, coding, browser use, desktop navigation, and knowledge-work efficiency carries extra analytical weight.

TrackingAI’s newest leaderboard locations GPT-5.4 Professional on the high of its public IQ board forward of all Cluade, Gemini, Qwen, and Grok fashions, providing an exterior, legible public benchmark that maps shortly onto the broader functionality debate.

Few individuals want an in depth understanding of benchmark design to know that 150 sits in a uncommon vary and traders don’t want to simply accept each premise behind an IQ-style check to acknowledge {that a} leap of this measurement suggests acceleration fairly than drift.

Chart titled “AI IQ Test Results” showing average Mensa Norway IQ scores for major AI models on a bell curve, with OpenAI’s GPT-5.4 variants plotted near the top end of the range.Chart titled “AI IQ Test Results” showing average Mensa Norway IQ scores for major AI models on a bell curve, with OpenAI’s GPT-5.4 variants plotted near the top end of the range.
Chart titled “AI IQ Check Outcomes” displaying common Mensa Norway IQ scores for main AI fashions on a bell curve, with OpenAI’s GPT-5.4 variants plotted close to the highest finish of the vary.

Enterprise patrons additionally don’t must imagine that IQ equals common intelligence to see that methods with stronger sample recognition, stronger software use, and stronger long-horizon process dealing with are transferring towards economically helpful territory, extending far past puzzle-solving.

This factors towards methods that may search, plan, confirm, navigate, and produce actual work throughout prolonged contexts. In that setting, the IQ rating capabilities much less as a novelty quantity and extra as a sign of the density of frontier reasoning.

There may be additionally aggressive worth within the leaderboard itself. A management place on a public benchmark reinforces OpenAI’s standing within the race for seen functionality management, particularly at a second when mannequin differentiation is turning into more durable to discern from structure notes alone.

Benchmark management compresses complexity right into a easy hierarchy. It presents builders a sign, enterprise patrons a story deal with, and traders one other proxy for the place the potential frontier at present sits.

CryptoSlate Day by day Transient

Day by day indicators, zero noise.

Market-moving headlines and context delivered each morning in a single tight learn.