Statistics of Raven's Advanced Progressive Matrices (reported I.Q.'s)

Introduction

Scores from this test are sometimes reported as "I.Q." with a standard deviation of 24, and sometimes as raw scores out of 36. This report deals with the "I.Q.'s". In a few cases they were reported with an standard deviation of 16 or 15, and those have been converted to 24 as that is the most used scale for this test.

Note that the candidates reporting "I.Q.'s" are not the same individuals as those reporting raw scores (although a few report both so there is a small overlap). So the scores in this report are from a different group than those in the report dealing with R.A.P.M. raw scores.

Scores on Raven's Advanced Progressive Matrices (I.Q.) as of 16 February 2023

n: 69
Median: 156.0
Quartile deviation: 5.5
Range: 73
Maximum possible: 185
Male median: 156.0 (n = 60)
Female median: 156.0 (n = 8)
Proportion of males among candidates: 0.870
Hardness: 0.16
Resolution: 0.48

115	*
120	*
135	*
140	*
148	*********
149	*****
151	*
152	*
153	*
154	*
155	*
156	**********************
157	**
160	********
163	****
164	*
165	*
171	*
172	***
174	*
179	*
180	*
187	*

Notice that the two peaks in the distribution correspond to the 98th and 99th centiles, reflecting the fact that many Mensa psychologists used to only report those scores (and/or their corresponding I.Q.'s) when a candidate had qualified, and not the actual precise score, which may have been anywhere above the 99th centile. This probably had to do with the psychologists' intuitive notion that the I.Q.'s from this test were absurdly too high, with far too many scoring over 160 if the norm tables were taken literally. Factors that contributed to this state of affairs were the ridiculous scale with a standard deviation of 24, the outdated norms from the 1960s that were used way into the 1990s, and the strong "Flynn effect" that particularly this test had undergone.

Scores by males

n = 60

115	*
120	*
135	*
140	*
148	******
149	*****
151	*
153	*
154	*
155	*
156	********************
157	*
160	*******
163	****
164	*
165	*
171	*
172	**
174	*
179	*
180	*
187	*

Scores by females

n = 8

148	**
152	*
156	**
157	*
160	*
172	*

Correlation of Raven's Advanced Progressive Matrices (I.Q.) with other mental ability tests

Test name	n	r
Test For Genius - Revision 2004	6	0.81
Verbal section of Test For Genius - Revision 2004	6	0.78
Tests by Paul Laurent Miranda (aggregate)	5	0.75
International High IQ Society tests (aggregate)	6	0.74
Cooijmans Intelligence Test - Form 3	6	0.71
Titan Test (Ronald K. Hoeflin)	4	0.67
Spatial section of Test For Genius - Revision 2004	6	0.65
The Sargasso Test	4	0.62
Strict Logic Sequences Exam II (Jonathan Wai)	4	0.57
Reason - Revision 2008	5	0.55
Reason Behind Multiple-Choice - Revision 2008	5	0.52
Qoymans Multiple-Choice #5	5	0.48
Tests by Ivan Ivec (aggregate)	5	0.46
Culture Fair Numerical Spatial Examination - Final version (Etienne Forsström)	6	0.46
Cartoons of Shock	4	0.43
Raven's Advanced Progressive Matrices (raw)	7	0.41
Tests by Greg Grove (aggregate)	5	0.38
Test of Inductive Reasoning / J.C.T.I. (Xavier Jouve)	4	0.38
Wechsler Adult Intelligence Scales	8	0.36
Strict Logic Sequences Exam I (Jonathan Wai)	8	0.35
Associative LIMIT	7	0.35
Logima Strictica 36 (Robert Lato)	12	0.27
Epiq Tests (aggregate)	5	0.26
Cooijmans Intelligence Test - Form 2	9	0.24
Advanced Intelligence Test (Randy Myers)	4	0.22
Genius Association Test	8	0.17
Numbers	12	0.17
Sigma Test (Melão Hindemburg)	7	0.15
W-87 (International Society for Philosophical Enquiry)	6	0.13
Unknown and miscellaneous tests	25	0.13
Gliaweb Riddled Intelligence Test (old version)	4	0.11
Qoymans Multiple-Choice #4	8	0.08
Short Test For Genius	8	0.07
Cattell Culture Fair	15	0.04
Qoymans Multiple-Choice #2	5	-0.01
Space, Time, and Hyperspace	20	-0.05
Mega Test (Ronald K. Hoeflin)	7	-0.06
Cooijmans Intelligence Test - Form 4	4	-0.11
Analogies of Long Test For Genius	10	-0.11
The Final Test	9	-0.11
Numerical section of Test For Genius - Revision 2010	4	-0.17
Qoymans Multiple-Choice #1	9	-0.17
Association subtest of Long Test For Genius	10	-0.21
Strict Logic Spatial Exam 48 (Jonathan Wai)	5	-0.21
Non-Verbal Cognitive Performance Examination (Xavier Jouve)	8	-0.23
Lieshout International Mesospheric Intelligence Test	8	-0.26
Cooijmans Intelligence Test - Form 1	8	-0.42
Long Test For Genius	10	-0.55
Qoymans Multiple-Choice #3	9	-0.85
Evens	4	-0.90

Weighted average of correlations: 0.127 (N = 369)

Estimated g factor loading: 0.36

Ranking in above table is based on the unrounded correlations. All available data is present in this table, no tests are left out except for those with less than 4 score pairs. All known pairs are used, including possible floor/ceiling scores or outliers.

The correlation with high-range tests is on the whole very low. This may be caused by factors like a lack of hard enough problems, a few bad items, a few errors in the scoring key (not at all uncommon with psychological tests), inconsistent testing conditions, fraud, or incorrect/incomplete score reporting by psychologists intuitively sensing that the I.Q.'s are enormously too high. It is chilling to consider that this test has been used for admission to I.Q. societies for decades while obviously failing woefully to measure intelligence in the high range.

Estimated loadings of Raven's Advanced Progressive Matrices (I.Q.) on particular item types

These are estimated g factor loadings, but against homogeneous tests (containing only particular item types) as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Type	n	g loading of Raven's Advanced Progressive Matrices (I.Q.) on that type
Verbal	79	-0.23
Numerical	28	0.14
Spatial	43	0.12
Logical	9	0.69
Heterogeneous	73	0.36

N = 232

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.21

National means for Raven's Advanced Progressive Matrices (I.Q.)

Country	n	mean score
United_States	3	166.67
Finland	3	162.67
Netherlands	5	159.60
Belgium	10	156.80
Greece	4	153.00
Sweden	6	149.00

For reasons of privacy, only countries with 3 or more candidates are included in this table. Ranking is based on the unrounded means, and then alphabetic.

Correlation with national I.Q.'s of Raven's Advanced Progressive Matrices (I.Q.)

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

r = -0.15 (n = 58)

Correlation of Raven's Advanced Progressive Matrices (I.Q.) with personal details

Personalia	n	r
PSIA Rational - Revision 2007	4	0.57
Mother's educational level	29	0.30
Father's educational level	29	0.16
Disorders (parents and siblings)	30	0.13
Disorders (own)	34	0.12
Sex	68	0.00
Observed associative horizon	7	-0.06
PSIA Cruel - Revision 2007	4	-0.14
Observed behaviour	8	-0.14
Cooijmans Inventory of Neo-Marxist Attitudes	5	-0.16
Educational level	32	-0.16
Year of birth	62	-0.18
Gifted Adult's Inventory of Aspergerisms	12	-0.26
PSIA Ethics factor - Revision 2007	4	-0.28
PSIA Antisocial - Revision 2007	4	-0.31
PSIA True - Revision 2007	4	-0.48
PSIA Cold - Revision 2007	4	-0.54
PSIA Extreme - Revision 2007	4	-0.66
PSIA Neurotic - Revision 2007	4	-0.67
PSIA Introverted - Revision 2007	4	-0.73
PSIA Deviance factor - Revision 2007	4	-0.74
PSIA System factor - Revision 2007	4	-0.77
PSIA Aspergoid - Revision 2007	4	-0.82
PSIA Just - Revision 2007	4	-0.88
PSIA Orderly - Revision 2007	4	-0.89
PSIA Rare - Revision 2007	4	-0.94

Correlation with personal details of Raven's Advanced Progressive Matrices (I.Q.) - within females

Personalia	n	r
Year of birth	6	0.41

Correlation with personal details of Raven's Advanced Progressive Matrices (I.Q.) - within males

Personalia	n	r
PSIA Rational - Revision 2007	4	0.57
Mother's educational level	28	0.30
Father's educational level	28	0.17
Disorders (parents and siblings)	29	0.13
Disorders (own)	32	0.11
PSIA Cruel - Revision 2007	4	-0.14
Cooijmans Inventory of Neo-Marxist Attitudes	5	-0.16
Educational level	31	-0.17
Year of birth	56	-0.21
Gifted Adult's Inventory of Aspergerisms	12	-0.26
PSIA Ethics factor - Revision 2007	4	-0.28
PSIA Antisocial - Revision 2007	4	-0.31
PSIA True - Revision 2007	4	-0.48
PSIA Cold - Revision 2007	4	-0.54
Observed associative horizon	5	-0.63
PSIA Extreme - Revision 2007	4	-0.66
PSIA Neurotic - Revision 2007	4	-0.67
Observed behaviour	7	-0.67
PSIA Introverted - Revision 2007	4	-0.73
PSIA Deviance factor - Revision 2007	4	-0.74
PSIA System factor - Revision 2007	4	-0.77
PSIA Aspergoid - Revision 2007	4	-0.82
PSIA Just - Revision 2007	4	-0.88
PSIA Orderly - Revision 2007	4	-0.89
PSIA Rare - Revision 2007	4	-0.94

Estimated g factor loadings for restricted ranges

In parentheses the number of score pairs on which that estimated g factor loading is based. The goal of this is to verify the hypothesis that g becomes less important, accounts for a smaller proportion of the variance, at higher I.Q. levels. The mere fact of restricting the range like this also depresses the g loading compared to computing it over the test's full range, so it would be normal for these values to be lower than the test's full-range g loading.

Below 1st quartile	0.45 (38)
Below median	0.50 (233)
Above median	-0.32 (215)
Above 3rd quartile	0.11 (90)

[More statistical reports]