A little more than half a century ago: Mervyn Stone’s 1974 paper on cross validation
Many of Stone’s publications are remarkable for their deep impact in theoretical and applied statistics, and more recently in science in general, and data science in particular. I would like to comment on his pioneer paper Stone M (1974) Cross–Validatory Choice and Assessment of Statistical Predictions. JRSS-B, 36: 111-147 (referred hereafter as S74) which was a discussion paper read in the RSS on December 1973 and published a little more than half a century ago, in January 1974. There were 13 discussants and their questions and Stone’s replies are of very high quality.
S74 formalises cross-validation (CV) methods; in particular, the leave-one-out cross validation procedure is firstly defined there. The paper also deals with generalising estimates of prediction errors. The paper starts with an erudite Introduction providing the historical context in which CV developed. Stone refers to his preference of the concept of assessment of statistical predictions as opposed to validation, which ‘has a ring of excessive confidence around it’, a semantic distinction which should be borne in mind but seldom is. Besides the originality of the ideas defined and developed the paper has a didactic style. In section 2 there are numerous theoretical examples further to general statements. This is followed by a section looking at well-known statistical problems, e.g. predicting the location of a single univariate sample, predictors for the the k-group problem, and the choice of variables for least-squares prediction in which a ‘cross-validatory assessment measure’ is defined. These examples are defined and analysed in detail through the lens of the methods outlined in the previous section and are then applied to real data. The last section discusses the relation of previous work on CV to the paper’s methods, presents a portrait of “a conventional paradigm for a large portion of statistical activity including a deviation to consider how the Bayesian paradigm might be accomodated in this scheme, and concudes with a series of quotations addressing philosophical and practical statistical issues.
There is no substitute to reading the paper to realise its elegance and usefulness but a quick way to demonstrate its growing importance over 50+ years is to look at its number of citations and the journals in which papers citing S74 have been published. I looked at data on such papers obtained from Web of Science (WoS), which claims to be “the world’s most trusted publisher-independent global citation database”; citations figures were downloaded on 2nd November 2024 and are shown against year of publication in Figure 1. Note that the last point in the Figure does not refer to the definite number of citations for 2024. Table 1 shows the journals with the largest number of papers citing S74. There is a clear shift from S74 being cited in the statistical methods literature to specialised scientific publications, notably in chemometrics, and more recently to being referenced in general science journals. This process is similar to the three principles of translational medicine: benchside, bedside, and community. This arc from basic science to generalised scientific practice has been experienced by other important statistical papers, notably by David Cox’s 1972 paper on the proportional hazards model whose remarkable trend in number of citations can be seen here.
Mervyn Stone played an important role in UCL, and his contributions to the theory and application of statistical methods continue to be increasingly useful in many scientific areas. I hope this short paper draws attention to his life and work.