If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Unité de Génétique Humaine et Fonctions Cognitives, Département de Neuroscience, Institut Pasteur, Paris, FranceNeuroscience Paris Seine, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Institut National de la Santé et de la Recherche Médicale, Université Pierre et Marie Curie, Sorbonne Universités, Paris, FranceGenes, Synapses and Cognition, Unité Mixte de Recherche 3571, Centre National de la Recherche Scientifique, Institut Pasteur, Paris, FranceHuman Genetics and Cognitive Functions, University Paris Diderot, Sorbonne Paris Cité, Paris, France
Unité de Génétique Humaine et Fonctions Cognitives, Département de Neuroscience, Institut Pasteur, Paris, FranceDépartement de Psychiatrie de l’Enfant et de l’Adolescent, Hôpital Robert Debré, L’Assistance Publique-Hôpitaux de Paris, Paris, France
Unité de Génétique Humaine et Fonctions Cognitives, Département de Neuroscience, Institut Pasteur, Paris, FranceGenes, Synapses and Cognition, Unité Mixte de Recherche 3571, Centre National de la Recherche Scientifique, Institut Pasteur, Paris, FranceHuman Genetics and Cognitive Functions, University Paris Diderot, Sorbonne Paris Cité, Paris, FranceFoundation Fondamentale, Créteil, France
Unité de Génétique Humaine et Fonctions Cognitives, Département de Neuroscience, Institut Pasteur, Paris, FranceDépartement de Psychiatrie de l’Enfant et de l’Adolescent, Hôpital Robert Debré, L’Assistance Publique-Hôpitaux de Paris, Paris, France
Neuroscience Paris Seine, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Institut National de la Santé et de la Recherche Médicale, Université Pierre et Marie Curie, Sorbonne Universités, Paris, France
Neuroscience Paris Seine, Institut de Biologie Paris Seine, Centre National de la Recherche Scientifique, Institut National de la Santé et de la Recherche Médicale, Université Pierre et Marie Curie, Sorbonne Universités, Paris, France
Unité de Génétique Humaine et Fonctions Cognitives, Département de Neuroscience, Institut Pasteur, Paris, FranceGenes, Synapses and Cognition, Unité Mixte de Recherche 3571, Centre National de la Recherche Scientifique, Institut Pasteur, Paris, FranceHuman Genetics and Cognitive Functions, University Paris Diderot, Sorbonne Paris Cité, Paris, France
The neuroanatomical bases of autism spectrum disorder remain largely unknown. Among the most widely discussed candidate endophenotypes, differences in cerebellar volume have been often reported as statistically significant.
Methods
We aimed at objectifying this possible alteration by performing a systematic meta-analysis of the literature and an analysis of the ABIDE (Autism Brain Imaging Data Exchange) cohort. Our meta-analysis sought to determine a combined effect size of autism spectrum disorder diagnosis on different measures of the cerebellar anatomy as well as the effect of possible factors of variability across studies. We then analyzed the cerebellar volume of 328 patients and 353 control subjects from the ABIDE project.
Results
The meta-analysis of the literature suggested a weak but significant association between autism spectrum disorder diagnosis and increased cerebellar volume (p = .049, uncorrected), but the analysis of ABIDE did not show any relationship. The studies meta-analyzed were generally underpowered; however, the number of statistically significant findings was larger than expected.
Conclusions
Although we could not provide a conclusive explanation for this excess of significant findings, our analyses would suggest publication bias as a possible reason. Finally, age, sex, and IQ were important sources of cerebellar volume variability, although independent of autism diagnosis.
Autism spectrum disorder (ASD) affect 1% of the population and is characterized by impairments in social interactions and the variety of interests. Through the years, many reports have suggested that cerebellar abnormalities were implicated in the onset of ASD [reviewed in (
)]. The cerebellum exhibits a highly regular arrangement of neurons and connections, supposed to support massive parallel computing capabilities, in particular through long-term synaptic plasticity (
). It has been traditionally involved in the performance of precise motor behavior, and patients with ASD also present with varying degrees of dyspraxia (
), using in vivo magnetic resonance imaging (MRI), were the first to report a cerebellar abnormality in a patient with ASD. Since then, various studies comparing the volumes and areas of cerebellar subregions between patients with ASD and control subjects have reported significant differences. However, whereas many articles report statistically significant differences, many others fail to detect differences. These discrepancies could be due to many factors, for example, the high heterogeneity in the etiology of ASD, differences in the inclusion criteria across studies, or differences in the eventual comorbidities affecting patients in different groups. The discrepancies could also reveal methodological bias, such as differences in MRI sequences, segmentation protocols, or statistical analyses. Finally, they could also result from chance when small sample sizes lead to noisy estimations of mean volumes.
Our aim was to objectify the alterations of cerebellar volumes in ASD. In the first part of this article, we present a systematic meta-analysis of the literature, examining the differences across previous reports and determining a combined effect size. In the second part, we describe our analysis of cerebellar volume in the ABIDE (Autism Brain Imaging Data Exchange) cohort (
) and study the consistency of these results with results from the meta-analysis. Finally, we describe our analyses of the impact of distinct sources of variability, such as sex, age at inclusion, or IQ, on the volume differences between patients with ASD and control subjects.
Methods and Materials
Meta-analysis of the Literature
Collection and Selection of Articles
We queried PubMed on October 12, 2016, for all articles that met the search criteria “cerebell* AND autis*”. We included articles reporting volumetric or area MRI measurements (mean and standard deviation) on at least one region of the cerebellum for patients with ASD and control subjects (Supplemental Methods).
Meta-analysis
We conducted two meta-analyses. The first compared mean cerebellar volume between patients with ASD and control subjects. Effect sizes were computed as standardized mean differences (Cohen’s d) using Hedges’ g as estimator (
). We combined effect sizes using a random-effects model, where the global estimate is obtained as the average of each estimate weighted by the inverse of its variance (
). The between-study variance τ2 was estimated using the restricted maximum likelihood method, from which we computed the proportion of variance imputable to heterogeneity (I2). To assess the impact of age and IQ on the effect size and thus identify possible sources of heterogeneity, we also conducted a meta-regression with average age and average IQ of patients with ASD as fixed effects (
We then evaluated publication bias and p-hacking in several ways. Publication bias is the tendency to publish preferentially studies reporting statistically significant results. p-Hacking describes the different methods used to achieve significance in a statistical test, such as flexible choice of covariates, flexible inclusion/exclusion criteria, and selective reporting. First, we calculated the rate of studies showing statistically significant differences between patients and control subjects. We compared this rate with the average statistical power obtained by assuming that the actual effect was equal to the effect estimated after meta-regression. Second, we evaluated the asymmetry of the funnel plot (
) using Egger’s test. In a funnel plot, the x axis represents effect size, and the y axis represents standard error. The interpretation of Egger’s test relies on the assumption that studies with small sample sizes are more affected by publication bias than studies with large sample sizes. Publication bias could then appear as an asymmetric distribution of points in the funnel plot, with an excess of small studies reporting large effect sizes (
). We evaluated p curves for inferred power—the most likely statistical power of the studies to get the observed p curve. Computations were performed using R (https://www.r-project.org) with the packages meta (
) along with the p curve app 4.0 (http://www.p-curve.com/app4/). We report statistical significance for an α level of .05. p values were not corrected for multiple comparisons.
Analysis of ABIDE
We analyzed data from the ABIDE I project (the original release of ABIDE), which include MRI scans for 539 patients with ASD and 573 control subjects. From the information provided in the ABIDE website, we concluded that there was no overlap between their subject groups and those included in our meta-analysis. Cerebellar volumes were automatically segmented using FreeSurfer 5.1 (https://surfer.nmr.mgh.harvard.edu). We developed a tool for visual quality control of the segmentations (Figure 1 and Supplemental Methods). We included only subjects for whom the segmentation quality was clearly good. Our quality control focused on the cerebellum, but we combined its results with our previous quality control of the whole brain (
Neuroanatomical diversity of corpus callosum and brain volume in autism: Meta-analysis, analysis of the autism brain imaging data exchange project, and simulation.
) and conserved only the subjects that passed both.
Figure 1Quality control (QC) of the cerebellum automatic segmentation. (A) QC tool. For each subject, we evaluated the quality of the segmentation in comparison with the original image in the three different planes. The values of the different measured volumes along with their deviation from the overall mean is represented in the graph at the bottom left. (B) A segmentation evaluated as correct. (C) A segmentation with an excess of unlabeled regions. (D) A segmentation with major labeling errors.
From the original 1112 subjects, 328 patients (61% of the subjects) and 353 control subjects (62% of the subjects) were retained for further analysis after quality control (see Table 1 for a description of selected subjects by ABIDE site). Among the excluded subjects, 411 subjects did not pass the quality control step, and 20 subjects were excluded because of unavailable full-scale IQ.
Table 1Demographics of ABIDE Subjects Retained for Statistical Analyses (Allometry and Linear Models)
Neuroanatomical diversity of corpus callosum and brain volume in autism: Meta-analysis, analysis of the autism brain imaging data exchange project, and simulation.
), we studied the allometric scaling of cerebellar volume. The division of regional volume measurements by total brain volume (normalization) is often used to control for differences in brain volume between groups. This strategy would be appropriate only if the volume of the cerebellum scaled proportionally to total brain volume. We assessed the scaling factor of the cerebellum with total brain volume using a linear regression.
We evaluated the effect of diagnosis and other factors on total cerebellar volume, cerebellar white matter volume, and cerebellar gray matter volume with two linear models. The first model included group, age, IQ, scanning site, sex, and brain volume as fixed effects; the second model included in addition the interactions of group with age, IQ, scanning site, sex, and brain volume. Because these statistical models were not the same as the models used in the meta-analysis of the literature, we also analyzed the ABIDE data using the same meta-analytical approach as for the literature. In the meta-analysis of ABIDE, the volume estimations of each site were combined using a random-effects model. At each site, we eliminated the minimal number of subjects that would ensure that the age and sex matching were respected, as in the meta-analysis of the literature. We did this iteratively, eliminating one subject after another, until the p values on the differences of age mean, age variance, and sex ratio were each above 0.2. We used JMP Pro 12.1.0 (https://www.jmp.com) for fitting linear models and R version 3.3.1 for the meta-analytical approach. The R scripts that we used for the computation of the meta-analysis are available on the web (https://github.com/neuroanatomy/Cerebellum).
Results
Meta-analysis of the Literature
Selection of Articles
The PubMed queries (“cerebell* AND autis*”) returned 947 items that were combined with the 124 references cited by a systematic review (
) that we found by other means. We selected studies in two steps, first based on their titles and abstracts and then based on their full text (see Supplemental Figure S1 for the Preferred Reporting Items for Systematic Reviews and Meta-Analyses workflow). Our final analyses were based on 30 articles covering seven different regions of interest. The three most reported measures were total cerebellar volume (1050 subjects in total), vermal lobules VI–VII area (965 subjects in total), and vermal lobules I–V area (861 subjects in total) (see Table 2 for a description of the selected articles).
MRI-based morphometry in children with multiple complex developmental disorder, a phenotypically defined subtype of pervasive developmental disorder not otherwise specified.
Abnormality of cerebellar vermian lobules VI and VII in patients with infantile autism: Identification of hypoplastic and hyperplastic subgroups with MR imaging.
IQ for normal control group supposed to be 115 ± 15.
Vermis, I–V, VI–VII
Age and IQ data are presented as mean ± SD. Regions of interest include volumes (Cb, WM, and GM) and midsagittal areas (whole vermis and its lobules numbered I–V, anterior; VI–VII, superior-posterior; and VIII–X, inferior-posterior).
ASD, patients with autism spectrum disorder; Cb, cerebellum; Ctrl, normal control subjects; F, female; GM, cerebellum gray matter; NR, not reported; WM, cerebellum white matter.
a IQ for normal control group supposed to be 115 ± 15.
b Mean and/or SD extrapolated from minimum and maximum.
c IQ for high-functioning autism group supposed to be 100 ± 15.
d Cerebellar volume was estimated for only 16 patients with ASD and 19 control subjects in Hardan et al.
Significant mean effects were found for three of the seven regions studied: total cerebellum (p = .049), white matter (p = .011), and vermal lobules VI–VII (p = .022) (Table 3). Compared with control subjects, patients with ASD displayed larger cerebellar white matter volume (Cohen’s d = 0.31, 95% confidence interval [CI] [0.07, 0.55]) and smaller areas for vermal lobules VI–VII (Cohen’s d = −0.24, 95% CI [−0.44, −0.03]). The effect of diagnosis on total cerebellar volume was barely statistically significant (Cohen’s d = 0.23, 95% CI [0.00, 0.45]) (see forest plot in Figure 2A).
Table 3Standardized Mean Difference: Combination of Effect Sizes
Figure 2Forest plots for the cerebellar volume. For each study, the gray square is centered on the estimated standardized mean difference, the black segment illustrates the 95% confidence interval (CI), and the surface of the square is proportional to the number of subjects in the study. Standardized mean difference is positive when the cerebellar volume is greater in autism spectrum disorder. W(random) represents the weight given to each study for the combination of effect sizes. (A) Meta-analysis of the literature. Heterogeneity: I2 = 65.9%, τ2 = 0.1235, p = .0001. (B) Meta-analysis of ABIDE (Autism Brain Imaging Data Exchange) sites. Heterogeneity: I2 = 32.6%, τ2 = 0.0192, p = .1219.
A statistically significant heterogeneity was found for four of the seven regions under study: total cerebellar volume (p = .0001), vermal lobules I–V area (p = .0049), vermal lobules VI–VII area (p = .0081), and vermal lobules VIII–X area (p = .019). Despite this high heterogeneity, our meta-regression did not show a significant impact of age or IQ on total cerebellar volume. The age of patients with ASD correlated with a reduced volume of cerebellar white matter (p = .025) and an increased volume of cerebellar gray matter (p = .024) compared with control subjects. However, the number of studies included in the meta-regression may be insufficient to obtain reliable results. The IQ of patients with ASD correlated with increased volume of the cerebellar white matter (p = .0176) and increased area of vermal lobules I–V (p = .019). Age and IQ did not seem to be the only factors producing heterogeneity: after the meta-regression, residual heterogeneity was still statistically significant for total cerebellar volume (p = .0020), vermal lobules VI–VII area (p = .039), and vermal lobules VIII–X area (p = .013). Table 3 shows the results of the random-effects models and meta-regressions for the cerebellum regions, and Supplemental Figure S2 shows the observed effect sizes versus the expected effect from the meta-regression on total cerebellar volume.
Statistical Power
Despite a small mean effect size estimated at Cohen’s d = 0.23 for the total cerebellar volume, 44% of the studies reported a significant result. If the actual effect size were fixed at this value, the mean statistical power for all the studies would be only 14%—that is, only 14% chances of detecting such a small effect size. The heterogeneity in age and IQ across studies appeared to limit the statistical power: mean achieved statistical power increased to 20% when taking into account the variations induced by age and IQ estimated by the meta-regression (the value is, however, still much lower than the observed 44% rate of detection, which agrees with the fact that much of the heterogeneity was not explained).
Publication Bias and p-Hacking
Egger’s test detected a statistically significant funnel plot asymmetry only for the whole vermis area (p = .002). None of the remaining regions presented a significant funnel plot asymmetry (Figure 3).
Figure 3Funnel plots for the different cerebellar regions. The dotted line represents the meta-analytic effect size, whereas the two dashed lines represent the boundaries of the 95% confidence interval for effect combined with the fixed effect model. *statistically significant asymmetry of the funnel plot (p = .002). GM, gray matter; WM, white matter.
Our analyses of the p curves were not conclusive on the presence of p-hacking. In five of the seven regions studied, there were only two or three p values < .05, which resulted in very imprecise estimations with wide confidence intervals. Total cerebellar volume and area of lobules VI–VII each had seven p values < .05, which still produced unreliable estimations. p curve analyses suggested low statistical power for both regions (total cerebellum, 22%, 95% CI [5%, 67%], and lobules VI–VII, 38%, 95% CI [6%, 78%]). The statistical powers inferred from the p curves were not incompatible with the respective rates of significant studies, but again, confidence intervals were very wide. Supplemental Table S1 summarizes the results for the different publication bias and p-hacking analyses. Figure 4 shows the observed p curve for the total cerebellar volume.
Figure 4p curve of studies with a significant result for the cerebellar volume. Note the observed p curve includes seven studies with statistically significant results (p < .05). Among them, five had p < .025. Eight additional studies were excluded from the p curve analysis because their results did not pass significance (p > .05). Null of, expected p curve in case of.
A barely statistically significant (uncorrected) effect was found for vermal lobules VIII–X area, suggesting larger volume variations for the ASD groups (p = .049). No significant effect was found for any other region. Heterogeneity was statistically significant only for the total cerebellar volume (p = .0052, uncorrected). As for the meta-analysis of mean differences, taking into account age and IQ in the meta-regression did not significantly reduce the heterogeneity (Supplemental Table S2).
Analysis of ABIDE Data
Allometry
The scaling between cerebellar volume and brain volume was not isometric, i.e., the volume of the cerebellum was not directly proportional to brain volume. Indeed, the scaling factor was estimated at 0.518 (95% CI [0.457, 0.578]), showing that large brains have a proportionally smaller cerebellum (Figure 5). This replicates a recent result by Mankiw et al. (
). Because of this nonproportional relationship, the normalization of cerebellar volumes by total brain volume should not be used to control for group differences. Instead, we used brain volume as covariate in our linear models.
Figure 5Log of cerebellar volume (Cb) vs. log of total brain volume (BV).
A very significant site effect was found for each volume; this effect was also different between patients and control subjects for total cerebellar volume and cerebellum gray matter volume. The effect of diagnosis group for the total cerebellar volume was not significant (estimated at −0.59 cm3, Cohen’s d = −0.04, 95% CI [−0.16, 0.09], in the first model—including group, age, IQ, scanning site, sex, and brain volume as fixed effects; estimated at −1.22 cm3, Cohen’s d = −0.08, 95% CI [−0.27, 0.11], in the second model—including in addition the interactions of group with age, IQ, sex, brain volume, and scanning site). These estimations were more precise than in the meta-analysis of the literature, with narrower confidence intervals. Despite this, we did not find any significant difference between patients and control subjects. We did not find any impact of age, sex, IQ, or brain volume on cerebellar volume that would differ by diagnosis group. The analysis of cerebellar subregions (white and gray matter volumes) did not reveal a statistically significant group effect. See Supplemental Table S3 for the results of the linear model with group as main effect and Supplemental Table S4 for the results model including the interaction of group with the other variables.
ABIDE Meta-analysis
The results of the analyses using the meta-analytical approach agreed with the results of the direct fit of linear models. Despite a smaller number of subjects than in the meta-analysis of the literature (Supplemental Table S5), the confidence intervals for the combined mean differences were narrower. This was due to a smaller estimated between-study variance among ABIDE sites compared with that among articles in the literature. We did not find a statistically significant standard mean difference between patients and control subjects: Cohen’s d = 0.06, 95% CI (−0.12, 0.24). In the meta-regression, no significant effect was found for age (p = .888) or IQ (p = .726). Supplemental Table S5 describes the characteristics of the subjects selected for the preservation of age and sex matching between patients and control subjects. Figure 2B shows the forest plot of ABIDE sites combined with the random-effects model. The meta-analytical approach did not show any statistically significant difference in the variability of volume measures between patients and control subjects (log-variance ratio = 0.17, 95% CI [−0.18, 0.52]). The results of the ABIDE meta-analysis on standardized mean difference are summarized in Table 3, and Supplemental Table S6 summarizes the results of the ABIDE meta-analysis on log-variance ratio.
Discussion
Neuroanatomical diversity appears to account for a substantial proportion of the risk for ASD (
). However, and even though several candidate neuroanatomical biomarkers have been proposed, it is not yet clear exactly which neuroanatomical traits more strongly influence diagnosis. In this report, we looked at one specific structure, the cerebellum, that has been widely discussed in the literature. We performed a meta-analysis of the literature and an analysis of data from the ABIDE project.
The meta-analysis of the literature did not show conclusive evidence for a difference between patients with ASD and control subjects either for the total cerebellar volume or for its subregions. Total cerebellar volume and cerebellar white matter volume appeared slightly larger in ASD, whereas the area of vermal lobules VI–VII was found to be slightly smaller in ASD, but the significance of these results did not survive correction for multiple comparisons. Compared with a previous meta-analysis on total cerebellar volume and vermal areas (
) reported a significant difference in the volume of vermal lobules VIII–X. In our analysis, despite the fact that a larger number of studies was taken into account, we did not replicate their finding.
The combined effect sizes in the meta-analysis of the literature were small in general. The number of articles that reported statistically significant results was larger than expected given the mean achieved power. We investigated whether this excess could be due to publication bias or p-hacking. Publication bias occurs when studies with statistically significant results have higher chances to be published than studies without statistically significant results. This is more likely to happen when the finding reported is central to the hypothesis made by an article. However, the main focus of many of the articles in our meta-analysis was not cerebellar volume, which should decrease the likelihood of bias. We studied publication bias by analyzing the asymmetry of the funnel plot using Egger’s test. This type of analysis is not very sensitive and is able to detect only strong publication bias. We observed statistically significant funnel plot asymmetry only for the vermis area (uncorrected). Whereas this result suggests the presence of publication bias, it could also be due to greater interstudy heterogeneity in this specific measurement (where a few large studies reported negative effect sizes, whereas the majority of the other studies reported positive effect sizes).
We aimed at testing for p-hacking by analyzing the p curves: the distribution of p values < .05. The numbers of significant p values reported, however, were not sufficient to draw a definite conclusion. In the case of the total cerebellar volume, where seven of 16 articles reported statistically significant findings, p curve analysis did not reveal evidence for p-hacking. Overall, the analysis of the literature alone did not allow us to provide a definitive explanation for the excess of statistically significant findings.
One result that appeared very clearly was the strong heterogeneity of the findings in the literature, even compared with the multisite data from the ABIDE project. For cerebellar volume, p value for heterogeneity was .0001, which would remain significant even after correction for multiple testing. A certain degree of variability in the estimations of volume is expected, especially given the small sample sizes used. Heterogeneity tests aim at detecting a degree of variability that would go beyond this expectation. We found statistically significant heterogeneity for the estimations of four of the seven meta-analyzed regions (i.e., total cerebellar volume, lobules I–V, lobules VI–VII, and lobules VIII–IX) (Table 3). Heterogeneity may be due to a combination of technical and physiological causes; for example, differences in MRI equipment, acquisition sequences, and segmentation protocol as well as the age of the subjects, IQ distribution, and so forth. We analyzed the effect of two of these factors, age and IQ, using meta-regression. First, our analysis did not reveal a differential effect of age or IQ on cerebellar volume for patients and control subjects. Second, residual heterogeneity was still statistically significant for three of the seven regions studied (total cerebellar volume, lobules VI–VII, and lobules VIII–IX). This indicates that sources other than age and IQ level may be causing significant heterogeneity in the literature.
The ABIDE project data provide a very interesting point of comparison for previous findings in the literature. The subjects in ABIDE come in most cases from research projects that had already been published and thus should be of similar characteristics as those in our literature meta-analysis. However, because the raw MRI and behavioral data are available, there is no issue of publication bias or p-hacking having an effect on cohort selection. Additionally, the availability of raw data makes it possible to run methodologically homogeneous analyses: same segmentation protocol, quality control procedures, and statistical analyses. There still remain, of course, many additional sources of heterogeneity owing to the grassroots nature of the project (some of them being currently addressed in ABIDE II through an important harmonization effort). Overall, the analyses of ABIDE data should provide a more precise, less heterogeneous estimation. The availability of raw data makes it also possible to engage a community effort to assess the impact of different methodological choices on the same dataset.
We analyzed all 1112 subjects from ABIDE using validated automatic computational neuroanatomy tools. After quality control and additional inclusion criteria, we retained a group of 681 subjects. We had 85% power to detect the Cohen’s d = 0.23 effect obtained from our meta-analysis of the literature (two-sided t test, α = .05). We did not find any statistically significant effect of ASD diagnosis on mean volumes. Our statistical analysis here was a linear model including group, age, sex, IQ, and site as main effects, or additionally the interaction between group and the other covariates. Although the absence of group effect was clear, we repeated our analyses using the same meta-analytical procedure used to study the literature to rule out an eventual methodological artifact. Every ABIDE site was considered as a different source of data, and we computed a meta-analytical effect size using a random-effects model weighting of each site’s estimations by the inverse of the variance (thus giving more weight to sites with larger sample sizes, as in the case of the literature meta-analysis). Our meta-analysis was in agreement with the results of our linear models: a clear absence of group differences.
Interestingly, and although the site effect was substantial, the estimations from the ABIDE data were more precise than the estimations from the literature (tighter confidence intervals) and less heterogeneous. One reason for the lack of power in the literature (in addition to the small sample sizes) could be the significant heterogeneity across studies. This may be an important source of discordant reports and makes it more difficult to draw conclusions from the literature. The public availability of raw data should greatly enhance our ability to understand neuroanatomical variability and increase our chances of detecting reliable neuroimaging phenotypes for neurodevelopmental disorders such as autism. Toward this aim, all our analysis scripts, our software for quality control, and the list of subjects we included have been made openly available on the Web to facilitate the replication, critical appraisal, and extension of our current results (https://github.com/neuroanatomy/Cerebellum).
In conclusion, we did not find evidence, either in the literature or in the ABIDE cohort, for a difference in cerebellar volume between patients with ASD and control subjects. This result does not rule out possible involvement of the cerebellum in the etiology of ASD. In particular, ABIDE includes only subjects without intellectual disability, and a difference could still appear in a population with a wider IQ range (although the meta-analysis of the literature did not suggest such a link). Also, it is possible that some other measurement of cerebellar anatomy, more sophisticated than mere volume measurements, may be linked to autism in the future. However, our current results do not provide evidence to justify a specific focus on the study of the cerebellum instead of any other brain structure. We reached a similar conclusion after analyzing the corpus callosum (
Neuroanatomical diversity of corpus callosum and brain volume in autism: Meta-analysis, analysis of the autism brain imaging data exchange project, and simulation.
), another structure that had traditionally captured the attention of the research community. Based on these experiences, we can advocate only for a broad analysis of all neuroanatomical phenotypes available. For this effort to be successful, our community needs to continue developing the data-sharing initiatives that will allow us to increase statistical power, decrease heterogeneity, and avoid the biases that prevent researchers from benefiting from the work of each other.
Acknowledgments and Disclosures
This work was supported by the Fondation de France project “Développement du plissement cortical dans les troubles du spectre autistique: caractérisation morphométrique et génétique” (NT, AB, TB, RD, RT). The Institut Pasteur group (NT, AB, TB, RD, RT) was funded by the European Research Area Networks Network of European Funding for Neuroscience Research Cofund Programme under “Horizon 2020” (SynPathy project), Institut Pasteur, Bettencourt-Schueller Foundation, Centre National de la Recherche Scientifique, University Paris Diderot, Agence Nationale de la Recherche programme “Investissements d’avenir,” Laboratoire d'excellence de Biologie pour la Psychiatrie (BioPsy; reference: ANR-11-IDEX-0004-02), Laboratoire d'excellence en Génomique Médicale, Conny-Maeva Charitable Foundation, Cognacq Jay Foundation, Orange Foundation, and Fondamental Foundation. The Neuroscience Paris Seine group (NT, LR-R, A-LP) is a member of BioPsy and École de Neuroscience de Paris foundation.
The authors report no biomedical financial interests or potential conflicts of interest.
Neuroanatomical diversity of corpus callosum and brain volume in autism: Meta-analysis, analysis of the autism brain imaging data exchange project, and simulation.
MRI-based morphometry in children with multiple complex developmental disorder, a phenotypically defined subtype of pervasive developmental disorder not otherwise specified.
Abnormality of cerebellar vermian lobules VI and VII in patients with infantile autism: Identification of hypoplastic and hyperplastic subgroups with MR imaging.