Statistics of The Test To End All Tests

© March 2014 Paul Cooijmans

Norms

Scores on The Test To End All Tests

Contents type: Verbal.   Period: 1997-present

0 *****
2 **
4 *
5 **
6 *
7 **
8 *****
10 *********
11 **
12 ****
13 *****
14 ****
15 ***
16 **
17 ***
18 *****
19 **
20 *
22 *
26 *

Correlation of The Test To End All Tests with other tests by Paul Cooijmans

(Test index) Test name  n  r
(4) A Paranoiac's Torture: Intelligence Test Utilizing Diabolic Exactitude  5  0.94
(3) Qoymans Multiple-Choice #5  11  0.93
(30) Verbal section of The Marathon Test  6  0.93
(48) Narcissus' last stand  10  0.92
(44) Associative LIMIT  10  0.90
(42) The Marathon Test  5  0.88
(26) Verbal section of Test For Genius - Revision 2004  20  0.87
(45) Numerical and spatial sections of The Marathon Test  5  0.85
(32) Spatial section of The Marathon Test  5  0.84
(2) Cooijmans Intelligence Test - Form 3  10  0.84
(0) Test of the Beheaded Man  10  0.82
(31) Numerical section of The Marathon Test  5  0.82
(35) Intelligence Quantifier by assessment  22  0.81
(40) Reason Behind Multiple-Choice - Revision 2008  11  0.81
(7) The Final Test  34  0.79
(16) Lieshout International Mesospheric Intelligence Test  14  0.73
(62) Reason Behind Multiple-Choice  7  0.72
(36) Reflections In Peroxide  10  0.71
(1) Cartoons of Shock  15  0.69
(66) Test For Genius - Revision 2004  17  0.69
(53) Qoymans Multiple-Choice #3  10  0.64
(18) The Nemesis Test  15  0.63
(25) The Sargasso Test  11  0.62
(21) Psychometric Qrosswords  7  0.61
(85) Cooijmans Intelligence Test - Form 1  8  0.61
(63) Long Test For Genius  12  0.61
(10) Genius Association Test  17  0.59
(75) Analogies of Long Test For Genius  15  0.59
(27) Spatial section of Test For Genius - Revision 2004  21  0.59
(87) Cooijmans Intelligence Test - Form 2  11  0.57
(24) Reason - Revision 2008  11  0.57
(56) Short Test For Genius  7  0.56
(57) Space, Time, and Hyperspace  17  0.51
(83) KIT Intelligence Test - first attempts  7  0.50
(15) Letters  6  0.48
(54) Test of Shock and Awe  10  0.48
(19) Numerical section of Test For Genius - Revision 2010  5  0.44
(79) Association subtest of Long Test For Genius  13  0.42
(77) Analogies #1  9  0.37
(11) Isis Test  13  0.33
(68) Numbers  12  0.30
(80) Qoymans Multiple-Choice #4  15  0.25
(41) The LAW - Letters And Words  5  0.25
(82) Reason  10  0.22
(84) Bonsai Test  9  0.16
(51) Qoymans Multiple-Choice #1  9  0.12
(5) Daedalus Test  7  0.09
(29) Words  6  -0.27
(69) Odds  5  -0.79

Weighted average of correlations: 0.595

Conservatively estimated minimum g loading: 0.77
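The page does not say how the minimum g loading is derived. One reading that is numerically consistent with the two figures above is that it is the square root of the weighted average correlation; this is an assumption made here for illustration, not something the source confirms. A minimal check in Python:

```python
import math

# Weighted average of correlations as published on this page.
weighted_average = 0.595

# ASSUMPTION (not stated in the source): the minimum g loading is taken
# as the square root of the weighted average correlation.
g_estimate = math.sqrt(weighted_average)

print(round(g_estimate, 2))  # 0.77, the same as the published figure
```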

Ranking in the above table is based on the unrounded correlations. All available data is present in this table; no tests are left out except for those with fewer than 5 score pairs. All known pairs are used to obtain the true, honest statistics; correlations have not been artificially inflated by leaving out ceiling scores, outliers or other anomalies.

Correlation of The Test To End All Tests with tests by others

(Test index) Test name  n  r
(229) Mega Test  9  0.71
(237) Sigma Test  5  0.64
(235) Nonverbal Cognitive Performance Examination  6  0.62
(243) Scholastic Aptitude Test (old)  5  0.57
(241) Ultra Test  6  0.52
(211) Culture Fair Numerical Spatial Examination - Final version  5  0.48
(242) Unknown tests  8  0.40
(239) Titan Test  10  0.32
(234) Strict Logic Sequences Exam I  7  0.31
(220) Cattell Culture Fair  5  0.30
(225) Logima Strictica 36  7  0.03

Weighted average of correlations: 0.438
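As a check on how the weighted average can be read, the sketch below weights each correlation in the table of tests by others by its number of score pairs n. That the weights are n is an assumption (the page does not state it), and the values are the rounded r figures from the table, reading each row as test name, n, r. Under those assumptions the published figure is reproduced:

```python
# (n, r) pairs copied from the table of tests by others (rounded r values).
pairs = [
    (9, 0.71), (5, 0.64), (6, 0.62), (5, 0.57), (6, 0.52), (5, 0.48),
    (8, 0.40), (10, 0.32), (7, 0.31), (5, 0.30), (7, 0.03),
]

def weighted_average(pairs):
    """Average of r weighted by n (ASSUMED weighting, not stated in the source)."""
    total_n = sum(n for n, _ in pairs)
    return sum(n * r for n, r in pairs) / total_n

print(round(weighted_average(pairs), 3))  # 0.438
```

Because the table shows rounded correlations while the page works from unrounded ones, such a check will not necessarily match to the last digit for every table.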

Ranking in the above table is based on the unrounded correlations. All available data is present in this table; no tests are left out except for those with fewer than 5 score pairs. All known pairs are used to obtain the true, honest statistics; correlations have not been artificially inflated by leaving out ceiling scores, outliers or other anomalies.

Please be aware that correlations with these external tests are in most cases affected (depressed, typically) by one or more of the following: (1) Little overlap with the object test because of the much lower ceilings and inherent ceiling effects of the tests used in regular psychology; (2) Candidates reporting scores selectively, for instance only the higher ones while withholding lower ones; (3) Candidates reporting, or having been reported by psychometricians, incorrect scores.

Estimated loadings of The Test To End All Tests on particular item types

These are estimated g factor loadings, but against homogeneous tests containing only particular item types, as opposed to non-compound heterogeneous tests. Although tending to surprise the lay person, it is not uncommon for tests to have high loadings on item types they do not actually contain themselves. Such loadings reflect the empirical fact that most tests for mental abilities measure primarily g, regardless of their contents; that the major part of test score variance is caused by g, and only a minor part by factors germane to particular item types. It is of key importance to understand that this is a fact of nature, a natural phenomenon, and not something that was built into the tests by the test constructors.

Type           g loading of The Test To End All Tests on that type
Verbal         0.76
Numerical      0.47
Spatial        0.79
Logical        0.57
Heterogeneous  0.78

Compound tests have been left out of this table to avoid overlap.

Balanced g loading = 0.68

Correlation of The Test To End All Tests with personal details

Personalia  n  r
Observed associative horizon  12  0.35
Gifted Adult's Inventory of Aspergerisms  13  0.32
P.S.I.A. Rational  18  0.26
Observed behaviour  16  0.23
Sex  60  0.17
Educational level  39  0.17
P.S.I.A. Antisocial  18  0.14
P.S.I.A. Cold  18  0.13
P.S.I.A. Cruel  18  0.11
Mother's educational level  36  0.09
P.S.I.A. Rare  18  0.07
P.S.I.A. Neurotic  18  0.06
P.S.I.A. True  18  0.04
P.S.I.A. Orderly  18  0.04
P.S.I.A. Deviance factor  24  -0.08
Year of birth  57  -0.09
P.S.I.A. Aspergoid  18  -0.10
P.S.I.A. Ethics factor  24  -0.11
P.S.I.A. Just  18  -0.15
P.S.I.A. Extreme  18  -0.15
P.S.I.A. Introverted  18  -0.15
Disorders (parents and siblings)  39  -0.16
P.S.I.A. System factor  11  -0.16
Disorders (own)  41  -0.18
Father's educational level  36  -0.27
Candidate's self-estimated I.Q.  3  -0.50

Correlation with national I.Q.'s of The Test To End All Tests

Correlation of this test with national average I.Q.'s published by Lynn and Vanhanen:

Estimated g factor loadings upward and downward of particular scores

The number of score pairs on which each estimated g factor loading is based is given in parentheses. The goal is to test the hypothesis that g becomes less important, that is, accounts for a smaller proportion of the variance, at higher I.Q. levels. Restricting the range in this way in itself depresses the g loading compared to computing it over the test's full range, so it would be normal for both values to be lower than the test's full-range g loading.

Raw score  Upward g (n)  Downward g (n)
0          0.77 (535)    NaN (0)
8.2        0.70 (345)    0.80 (59)
12         0.62 (221)    0.72 (203)
15.8       0.70 (98)     0.74 (290)
35         NaN (0)       0.77 (535)
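The remark that range restriction by itself depresses the loading can be illustrated with a small simulation. This is a toy model, not the procedure used for the table: it assumes a normally distributed g, a simulated test score with a true loading of 0.77 on g, and a split at the median score. All names and parameters are hypothetical.

```python
import random
import statistics

def pearson_r(xs, ys):
    """Plain Pearson correlation between two equally long sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5

def simulate(n=20000, loading=0.77, seed=1):
    """Return (g, score) pairs where score correlates about `loading` with g."""
    rng = random.Random(seed)
    noise_weight = (1 - loading ** 2) ** 0.5
    pairs = []
    for _ in range(n):
        g = rng.gauss(0, 1)
        score = loading * g + noise_weight * rng.gauss(0, 1)
        pairs.append((g, score))
    return pairs

pairs = simulate()
cut = statistics.median(score for _, score in pairs)

# Upward: only scores at or above the cut; downward: only scores below it.
upward = [(g, s) for g, s in pairs if s >= cut]
downward = [(g, s) for g, s in pairs if s < cut]

r_full = pearson_r(*zip(*pairs))
r_upper = pearson_r(*zip(*upward))
r_lower = pearson_r(*zip(*downward))

print(f"full range: {r_full:.2f}")
print(f"upward:     {r_upper:.2f}")
print(f"downward:   {r_lower:.2f}")
```

In this model the loading is the same at every level by construction, yet both restricted correlations come out lower than the full-range one. A drop in the upward value alone is therefore not evidence that g matters less at higher levels; the comparison between the upward and downward values is the informative one.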

Reliability

Error

Scores by age

Age class  n   median score
60 to 64   1   7.0
55 to 59   1   13.0
50 to 54   3   8.0
45 to 49   8   12.0
40 to 44   7   14.0
35 to 39   7   13.0
30 to 34   7   13.0
25 to 29   10  12.5
22 to 24   10  11.5
20 or 21   1   10.0
18 or 19   1   10.0
17         1   10.0
16         1   14.0

Scores by year taken

Year taken  n   median score
1997        3   5.0
1998        2   7.0
2000        2   9.0
2001        4   13.5
2002        5   13.0
2003        5   13.0
2004        13  10.0
2005        4   15.5
2006        3   0.0
2007        3   14.0
2008        4   20.0
2009        1   10.0
2010        4   7.0
2011        1   13.0
2012        1   20.0
2013        4   2.5
2014        1   10.0

r (year taken × median score) = 0.15 (n = 60)

Robustness and overall test quality

Item analysis

Item statistics are not published as that would help future candidates. To detect bad items, answers and comments from candidates are studied, as well as, for each problem, the correlation with total score and the proportion of candidates getting it wrong (hardness of the item). Possible bad items are removed or revised, resulting in a revised version of the test.
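The two per-item figures mentioned above, correlation with total score and hardness, can be sketched as follows. The response matrix is invented for illustration (actual item data is not published), the function names are hypothetical, and the total score here includes the item itself, which is one of several possible conventions.

```python
from statistics import fmean

def pearson_r(xs, ys):
    """Pearson correlation; returns NaN when either variable has no variance."""
    mx, my = fmean(xs), fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / (sxx * syy) ** 0.5

def item_statistics(responses):
    """responses: one row per candidate, 1 = right, 0 = wrong.

    Returns one (hardness, item_total_r) tuple per item, where hardness is
    the proportion of candidates getting the item wrong.
    """
    totals = [sum(row) for row in responses]
    stats = []
    for j in range(len(responses[0])):
        item = [row[j] for row in responses]
        hardness = 1 - fmean(item)
        stats.append((hardness, pearson_r(item, totals)))
    return stats

# Invented data: 5 candidates, 3 items.
responses = [
    [1, 1, 0],
    [1, 0, 0],
    [1, 1, 1],
    [0, 0, 1],
    [1, 1, 0],
]

stats = item_statistics(responses)
for number, (hardness, r) in enumerate(stats, start=1):
    print(f"item {number}: hardness {hardness:.2f}, r with total {r:.2f}")
```

In this invented example the third item is the hardest and has the lowest correlation with total score, which is the kind of pattern that would prompt a closer look at an item, alongside the candidates' answers and comments.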