Lauri Mehtätalo (email)

Reporting modern statistical analyses: reproducible and transparent

Mehtätalo L. (2019). Reporting modern statistical analyses: reproducible and transparent. Silva Fennica vol. 53 no. 3 article id 10257. https://doi.org/10.14214/sf.10257

Author Info
  • Mehtätalo, University of Eastern Finland, Faculty of Science and Forestry, School of Computing, P.O. Box 111, FI-80101 Joensuu, Finland E-mail lauri.mehtatalo@uef.fi (email)

Received 7 October 2019 Accepted 8 October 2019 Published 8 October 2019

Views 9806

Available at https://doi.org/10.14214/sf.10257 | Download PDF

Creative Commons License CC BY-SA 4.0 full-model-article10257

Large majority of the articles published in Silva Fennica include statistical analyses of empirical quantitative data. In reporting of the materials and methods in such an article, an important requirement is reproducibility: the reader should be able to repeat the data collection and analyses based on the description in the article. A closely related requirement is transparency: the authors should give the necessary information for readers to evaluate whether the method is justified and has been implemented correctly. In Silva Fennica, we want to promote reproducibility and transparency of scientific publishing also in the future. To illustrate what it means in practice, I will discuss analyses based on linear models in more detail. In this context, analysis of variance and (t-) tests of sample means are seen as special cases of the classical linear model.

The use of classical linear model has a long history, dating back to the works of Karl Pearson, R. A. Fisher and G. U. Yule in the early 20th century. For example, Fisher’s classical textbook “Statistical methods for research workers” already presented the t-tests, analysis of variance, and linear regression as basic tools to analyze empirical data. These basic methods are fully optimal whenever the implicit assumptions about the independence and constant variance of the data are met. The use of those rather simple and widely known methods is straightforward and their use in a research report can be described, by writing e.g., “the logarithmic data were analyzed by one-way ANOVA and Tukey’s post-hoc tests; the applied transformation showed constant error variance according to standard diagnostic plots”. There is usually no need for a formal presentation of the implicitly assumed classical linear model. Of course, the empirical data and data collection procedures should be described transparently so that the reader can critically evaluate whether the selected method is justified.

Nowadays methods are widely available to take into account such properties of the data that could not be taken into account in the classical linear model. For example, there are good methods to analyze dependent data with heteroscedastic errors. However, a data set can be independent and homoscedastic in only one way, but dependent and heteroscedastic in infinitely many ways. Therefore, secondary sub-models are needed for dependence and heteroscedasticity. Furthermore, there may be several alternative methods for parameter estimation and inference. For example, in grouped data sets generalized linear mixed-effects models can be used to model non-normal grouped data. Formulating a linear mixed-effects model involves choices about the levels of grouping and random-effects structure for each level of grouping, in addition to the variance-covariance structure of the residual errors. For non-normal data, additional choices are needed about the link function, parameter estimation methods, applied methods for inference, and the models about zero-inflation and overdispersion. All these choices can have a major effect on the results about the factors of main interest and should therefore be reported.

Transparent publication of today’s statistical analyses requires reporting and justification of all non-trivial choices made in the model selection. Reproducible and transparent reporting of such an analysis is seldom possible without formal presentation of the model. Also tables showing the estimates of all model parameters are often useful, as well as carefully selected diagnostic graphs about model fit. The space limitations of the papers are no more a problem for a sufficiently detailed reporting of the methods and models, because they can be included as an electronic supplementary file.

Lauri Mehtätalo
Associate Editor for Biometry and Methods


Register
Click this link to register to Silva Fennica.
Log in
If you are a registered user, log in to save your selected articles for later access.
Contents alert
Sign up to receive alerts of new content

Your selected articles
Send to email
Donis J., Kitenberga M. et al. (2018) Factors affecting windstorm damage at the stand .. Silva Fennica vol. 52 no. 4 article id 10009 (remove) | Edit comment
Kubin E., Kemppainen L. (1994) Effect of soil preparation of boreal spruce fore.. Acta Forestalia Fennica vol. 0 no. 244 article id 7506 (remove) | Edit comment
Pesonen M., (1995) Non-industrial private forest landowners’ choice.. Acta Forestalia Fennica vol. 0 no. 247 article id 7509 (remove) | Edit comment
Niiranen J., (1980) Methods used in cutting propagation of forest tr.. Silva Fennica vol. 14 no. 1 article id 5065 (remove) | Edit comment
Kosenius A.-K., Juutinen A. et al. (2020) The role of state-owned commercial forests and f.. Silva Fennica vol. 54 no. 1 article id 10051 (remove) | Edit comment
Kanzian C., (2023) Are productivity studies in forest operations ol.. Silva Fennica vol. 57 no. 3 article id 23074 (remove) | Edit comment
Mecklin A., (1939) Timber measurement legislation Silva Fennica vol. no. 52 article id 4554 (remove) | Edit comment
Fetouab A., Fenton N. J. et al. (2024) Planting density and mechanical site preparation.. Silva Fennica vol. 58 no. 2 article id 23029 (remove) | Edit comment
Granvik B.-A., (1967) The preparation of coniferous sawn goods using c.. Acta Forestalia Fennica vol. 84 no. 3 article id 7184 (remove) | Edit comment
Eid J., (1981) Forest as a capital asset. Silva Fennica vol. 15 no. 1 article id 5105 (remove) | Edit comment
Pohjakallio O., Vaartaja O. (1948) Occurrence and spore production of Coleosporium .. Acta Forestalia Fennica vol. 55 no. 2 article id 7390 (remove) | Edit comment
Oinonen E., (1963) Sanajalan (Pteridium aquilinum (L.) Kuhn.) nekta.. Silva Fennica vol. 0 no. 113 article id 4709 (remove) | Edit comment
Heräjärvi H., (2019) New age of discovery in wood science Silva Fennica vol. 53 no. 2 article id 10216 (remove) | Edit comment
Your search results