If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Imaging research has not yet delivered reliable psychiatric biomarkers. One challenge, particularly among youth, is high comorbidity. This challenge might be met through canonical correlation analysis designed to model mutual dependencies between symptom dimensions and neural measures. We mapped the multivariate associations that intrinsic functional connectivity manifests with pediatric symptoms of anxiety, irritability, and attention-deficit/hyperactivity disorder (ADHD) as common, impactful, co-occurring problems. We evaluate the replicability of such latent dimensions in an independent sample.
Methods
We obtained ratings of anxiety, irritability, and ADHD, and 10 minutes of resting-state functional magnetic resonance imaging data, from two independent cohorts. Both cohorts (discovery: n = 182; replication: n = 326) included treatment-seeking youth with anxiety disorders, with disruptive mood dysregulation disorder, with ADHD, or without psychopathology. Functional connectivity was modeled as partial correlations among 216 brain areas. Using canonical correlation analysis and independent component analysis jointly we sought maximally correlated, maximally interpretable latent dimensions of brain connectivity and clinical symptoms.
Results
We identified seven canonical variates in the discovery and five in the replication cohort. Of these canonical variates, three exhibited similarities across datasets: two variates consistently captured shared aspects of irritability, ADHD, and anxiety, while the third was specific to anxiety. Across cohorts, canonical variates did not relate to specific resting-state networks but comprised edges interconnecting established networks within and across both hemispheres.
Conclusions
Findings revealed two replicable types of clinical variates, one related to multiple symptom dimensions and a second relatively specific to anxiety. Both types involved a multitude of broadly distributed, weak brain connections as opposed to strong connections encompassing known resting-state networks.
The current study combines the dimensional assessment of psychiatric symptoms, potentially providing a better fit to neural measures than diagnostic categories (
) with resting-state functional magnetic resonance imaging (rsfMRI). We used advanced multivariate statistical techniques to identify highly correlated latent dimensions of psychopathology and brain connectivity. This is essential in identifying neural mechanisms that mediate clinical symptoms and thus represent appropriate targets for novel interventions. Such work is particularly needed among youth, in whom seeds of later-life psychopathology present as common, often co-occurring problems (
). Specifically, the study focuses on the neural correlates of pediatric irritability, attention-deficit/hyperactivity disorder (ADHD), and anxiety given prior evidence of both shared and distinct neural correlates among these symptom domains (
). This approach might be more sensitive to complex relationships among symptom and neural data than alternative approaches. This includes approaches used previously that first model covariance structure among clinical dimensions before relating these latent symptom dimensions to preselected brain networks (
), we target treatment-seeking youth identified by clinicians. Second, prior work in youth confirmed that this method can differentiate well-established, but vastly distinct, clinical domains such as psychosis and emotional and behavioral problems. Here, we focus on three more closely related and often comorbid domains: irritability, ADHD, and anxiety. We test a hypothesis consistent with previous work using other latent variable approaches combined with task-based fMRI (
): CCA yields latent phenotypes that capture both unique and shared aspects of irritability, ADHD, and anxiety. However, unlike past work, brain connectivity is not evoked by highly controlled tasks in the current study. Thus, more broadly distributed neural circuitry correlates are expected in the current study, as compared with correlates in previous studies.
Finally, as a third extension of past work, we evaluate the latent variables' replicability using novel sampling and analytic techniques. Prior CCA studies find replicable associations when discovery and replication cohorts represent subsets of the same sample (
). Robustness against sampling variability is essential for clinical applications of CCA, which possesses exploratory components that can make replication difficult. Thus, the current study used data from 2 independent cohorts of treatment-seeking youth assessed with similar methods. We treated the smaller sample (n = 182) as the discovery dataset, because it was assessed with homogeneous imaging parameters. The larger cohort (n = 326), assessed with heterogeneous imaging parameters, served as a replication dataset (
). Moreover, we employed analytic techniques that leverage independent component analysis (ICA) to improve interpretability of the canonical variates (
) that addresses limitations in other CCA studies concerning the handling of nuisance variables and possible inflation of type I errors.
Methods and Materials
Participants
Both samples comprised healthy volunteers and youth diagnosed with an anxiety disorder, disruptive mood dysregulation disorder, or ADHD by licensed clinicians using the Kiddie Schedule for Affective Disorders and Schizophrenia (K-SADS) (
Schedule for Affective Disorders and Schizophrenia for School-age Children—Present and Lifetime version (K-SADS-PL): Initial reliability and validity data.
J Am Acad Child Adolesc Psychiatry.1997; 36: 980-988
). Exclusion criteria were neurological disorders, autism and bipolar spectrum disorders, psychosis, substance use, MRI contraindications, and Full Scale IQ < 70. Anxiety was assessed by using the parent- and youth-reported ratings of the five subscales of the Screen for Child Anxiety Related Disorders (
) in the replication sample. These 29 ratings of anxiety, irritability, and disruptive behavior (18 parent-reported, 11 self-reported) were used as input for the joint CCA+ICA.
Samples were similar in terms of sex ratios, proportions of anxiety disorders, oppositional defiant disorder, medication-free-to-medication-use ratios, and levels of parent-reported symptoms of irritability, ADHD, and anxiety. However, the discovery sample was older and had a higher IQ, a lower proportion of ADHD cases, a higher proportion of diagnosis-free and disruptive mood dysregulation disorder cases, and lower self-reported irritability and anxiety. Both cohorts were ethnically diverse and were recruited from urban, semi-rural, and rural areas (Figure 1, Supplement).
Figure 1Demographics and clinical characteristics of the discovery and replication samples. ADHD, attention-deficit/hyperactivity disorder; ARI, Affective Reactivity Index; CBCL, Child Behavior Checklist; CMI, Child Mind Institute; DMDD, disruptive mood dysregulation disorder; DX, diagnosis; NIMH, National Institute of Mental Health; ODD, oppositional defiant disorder; SCARED, Screen for Child Anxiety Related Disorders.
Discovery sample data were acquired at one site with two identical 3T General Electric Signa scanners (GE Healthcare, Chicago, IL). The replication sample comprised data from a 1.5T Siemens Avanto scanner (Siemens Healthineers, Erlangen, Germany), a 3T Siemens Tim Trio scanner, and a 3T Siemens Prisma scanner. A high-resolution T1-weighted structural image and 10 minutes of blood oxygen level–dependent changes during rest were collected from all participants, although sequences varied between samples and within the replication dataset (Supplement). Quality of the imaging data was assessed using MRIQC (
), augmented by eight subcortical regions per hemisphere obtained using FreeSurfer segmentation (nucleus accumbens, nucleus caudatus, pallidum, putamen, amygdala, hippocampus, thalamus, and ventral diencephalon) (
), but not global signal, were regressed out from the time series. Functional connectivity was quantified using partial correlations, which offer an estimate of direct (as opposed to indirect or shared) connectivity between each pair of nodes (edges). Because the resulting network matrices are symmetric, only half of the edges (i.e., 23,220) were analyzed.
CCA, ICA, and Permutation Testing
Covariates (age, sex, race, IQ, psychotropic medication, and scanner for the discovery sample; additionally, site and sequence type for the replication sample) were regressed out from both imaging and clinical variables before dimensionality reduction. All 29 symptom ratings were included; dimensionality of rsfMRI was reduced using principal component analysis (PCA) before CCA (
). Residuals were projected to a lower dimensional space where data are exchangeable, thus mitigating spurious dependencies among observations introduced by residualization (
Given two sets of variables (here, imaging data [Y] and symptom ratings [X]), CCA seeks linear mixtures within each set (i.e., canonical variables [CVs]; U = Y × A and V = X × B), such that each resulting mixture (U) from one set is maximally correlated with a corresponding mixture (V) from the other set, but uncorrelated with all other mixtures in either set. We use upper-case letters U and V to represent the whole set of canonical variables on the imaging and clinical side, respectively, and lower-case letters followed by subscripts to indicate the order of the canonical correlations, from higher to smaller, uk and vk, to denote specific latent variables.
Small perturbations in the original data could possibly lead to arbitrary rotations of the CCA solutions. To mitigate the problem and aid interpretability, we subjected the stacked CVs to ICA, seeking CVs that were not only orthogonal but also statistically independent. The joint CCA+ICA procedure was performed using a modification of a recently proposed algorithm for permutation inference for CCA (
), thus allowing not only characterization and better disambiguation of the resulting CVs but also valid statistical inference (details in the Supplement). Below, where we refer to results of CCA, these are to be understood as results of the joint inference using CCA+ICA.
Because the number of CVs is determined by the smallest input dataset, we obtained 29 CVs. Statistical significance was determined using 10,000 permutations. In the permutation test, for each estimated CV (post-ICA), variance already explained by CVs with stronger, significant canonical correlations were removed in an iterative procedure (
). Canonical correlations were considered significant at alpha = .05 after familywise error rate (FWER) correction using a closed testing procedure. The nonsymmetric redundancy index (
), which gives the mean variance of the clinical data explained by imaging data, and vice versa, is reported in the Supplement.
Replicability
Replicability of the CVs was determined based on three criteria: 1) stability within the same dataset across variations in the number of PCA components that entered CCA relative to the sample size (input-to-participant ratio), 2) similarities of latent clinical patterns, and 3) similarities of latent connectivity patterns identified independently in the two samples. A prior CCA study in youth reported replicability only for clinical but not for rsfMRI patterns (
); thus, we decided to evaluate the replicability of the clinical and the connectivity patterns as separate criteria.
To evaluate the first criterion, we performed three analyses that varied the input-to-participant ratios. The primary analyses used an input-to-participant ratio of 1:2, which translated into 64 rsfMRI components, explaining 75% of the between-subject variance in rsfMRI connectivity in the discovery sample. In the replication cohort, 134 rsfMRI components were used; these explained only 57% of the variance, possibly owing to more unstructured noise in this dataset. This primary analysis was supplemented by two secondary analyses using input-to-participant ratios of 1:3 and 1:4, thereby reducing risks of overfitting, at the expense of explaining less variance. This was accomplished by using fewer imaging principal components as input to the CCA. Results were compared across the three ratios by examining cross-correlations among CCA components (e.g., corr[v1|1:2, YD×a1|1:3] and corr[u1|1:2, XD×b1|1:3]). Statistical significance was determined using 10,000 permutations, with a threshold of pFWER < .05 within each set of comparisons. Because psychiatric symptoms might relate to components that explain relatively little variance in the imaging data, we also discuss CVs that solely replicated at the 1:3 ratio but could be found in the replication cohort.
To test the second and third criteria, we used joint CCA+ICA in the replication dataset. Canonical weights from each dataset were applied to the input data from the other dataset; these products were then correlated with the CVs identified in that dataset (e.g., corr[v1|D, YD × a1|R] and corr[u1|D, XD×b1|R]). Clinical and connectivity patterns were considered replicable when both the application of weights from the discovery to the replication dataset and the application of weights from the replication to discovery dataset yielded statistically significant associations. We used 10,000 permutations to establish significance. However, thresholds differed for clinical and connectivity patterns. We used a stringent threshold of pFWER < .05 to determine replicability of the imaging and clinical patterns; additionally, we also investigated a more lenient threshold of puncorr < .05 for replicability of the connectivity pattern. This decision was motivated by two factors. First, a prior CCA study finding replicable clinical patterns did not report replicable connectivity patterns across two subsets of a single sample (
). This raises questions as to whether any evidence of replicability can be detected with even liberal statistical thresholds. Second, in the current study, significant differences exist between cohorts in all metrics, quantifying the quality of the imaging data (Figure S1); this contrasts with the broadly similar profiles for symptom ratings (Table S1 and Figure S1).
Interpretation of CVs
To interpret the significant CVs, we investigated their correlations with the residualized input data. These correlations between latent variables derived from the CCA and input data (i.e., symptom ratings, connectivity matrices) are henceforth referred to as canonical loadings. Consistent with prior studies (
), we focused on clinical items with loadings |r| > 0.2, resembling a small to moderate effect, to interpret and label key CVs. However, we extended this approach by limiting our focus to replicating clinical loadings, i.e., loadings |r| > 0.2 that could be observed across samples. Similarly, we emphasized edge loadings that replicated across samples. However, given the differences in the quality of the imaging data across samples, we applied a more lenient threshold of |r| > 0.15 to the replication cohort.
Results
In the discovery cohort, seven CVs associated symptoms with rsfMRI-connectivity (CV1|D: r = .74, pFWER = .0029; CV2|D: r = .73, pFWER = .0045; CV3|D: r = .73, pFWER = .0092; CV4|D: r = .70, pFWER = .0175; CV5|D: r = .69, pFWER = .0245; CV6|D: r = .69, pFWER = .0340; CV7|D: r = .68, pFWER = .0497) (Figure S2). These seven CVs explained 27.7% of symptom-level variance and 8.1% of rsfMRI-connectivity variance. Joint CCA+ICA in the replication dataset generated five CVs (CV1|R: r = .75, pFWER = .0198; CV2|R: r = .72, pFWER = .0311; CV3|R: r = .72, pFWER = .0321; CV4|R: r = .72, pFWER = .0364; CV5|R: r = .71, pFWER = .0440) (Figure S11) that represented 18.8% of the variance in the clinical data and 0.9% of the variance in the rsfMRI-connectivity data.
Our key hypothesis concerned identifying both cross-dimension and specific variates. Consistent with this hypothesis, latent clinical phenotypes (v1-7|D and v1-5|R) could be differentiated in terms of specificity of associated symptoms. In both datasets, we observed latent variables that loaded |r| > 0.2 exclusively on several items from the anxiety domain (v7|D, v1|R) and others capturing aspects shared across two (v3|D, v4|D, v3|R) or all three domains (v1|D, v2|D, v5|D, v6|D, v2|R, v4|R, v5|R) (Figure 2, Figure S12, Tables S7 and S12).
Figure 2Clinical loadings in the discovery dataset. Associations between the latent dimensions and symptoms in the domains of anxiety, irritability, and behavioral problems. Each of the seven concentric circles shows the positive (solid fill) and negative correlations (transparent fill) between the canonical variate and the original symptom ratings as bars. The length of the bar indicates the strength of the association. Exact numbers of the loadings are provided in Table S3. Canonical correlation analysis results are characterized by sign indeterminacy, meaning that it is valid to flip the sign for an entire latent dimension, which will affect the directions of the correlations.
All latent variables (u1-7|D and u1-5|R) involved distinct, albeit broadly distributed, connectivity patterns with many connections between well-known resting-state networks within and across both hemispheres. All u1-7|D and u1-5|R showed equal numbers of negative and positive correlations with edges. Connectivity patterns were denser in the discovery relative to the replication dataset applying a threshold of |r| > 0.2 (discovery dataset: u1|D: 1404 connections, u2|D: 2068 connections, u3|D: 5602 connections, u4|D: 2490 connections, u5|D: 2968 connections, u6|D: 2656 connections, and u7|D: 3226 connections; replication dataset: u1|R: 150 connections; u2|R: 130 connections, u3|R: 74 connections; u4|R: 74 connections; u5|R: 132 connections) (Figure 3, Figure 4, Figure 5, Figures S5–S10, S13–S17).
Figure 3Replicable, transdimensional latent variable (CV3|D, CV5|R). (A) Scatter plots show canonical variate (CV) 3 from the discovery dataset (D) and 5 from the replication dataset (R), which represent linear combinations of brain connectivity scores obtained during resting-state functional magnetic resonance imaging in the horizontal axis, and linear combinations of clinical scores derived from symptom ratings in the vertical axis. (B) Clinical loadings |r| > 0.2 in for both datasets, showing the same symptoms but an informant effect. Dark red indicates symptoms associated with the latent dimension in both datasets. (C) Edges in red that load strongly positively on u3|D and u5|R. Edges that load strongly negatively on u3|D and u5|R are depicted in blue. Given baseline differences in the strength of the connectivity patterns, connectivity maps were thresholded at |r| > 0.2 for the discovery sample and at |r| > 0.15 for the replication sample. Only edges that loaded highly positively or negatively in both datasets were retained for this figure. ADHD, attention-deficit/hyperactivity disorder; P, parent; Y, youth.
Figure 4Replicable, shared aspects of disruptive behavior and irritability (CV4|D, CV4|R). (A) Scatter plots show canonical variate (CV) 4 from the discovery dataset (D) and 4 from the replication dataset (R), which represent linear combinations of brain connectivity scores obtained during resting-state functional magnetic resonance imaging in the horizontal axis, and linear combinations of clinical scores derived from symptom ratings in the vertical axis. (B) Clinical loadings |r| > 0.2 in for both datasets. Dark red indicates symptoms associated with the latent dimension in both datasets. (C) Edges in red that load strongly positively on u4|D and u4|R. Edges that load strongly negatively on u4|D and u4|R are depicted in blue. Given baseline differences in the strength of the connectivity patterns, connectivity maps were thresholded at |r| > 0.2 for the discovery sample and at |r| > 0.15 for the replication sample. Only edges that loaded highly positively or negatively in both datasets were retained for this figure. P, parent; Y, youth.
Figure 5Replicable, anxiety-specific latent variable (CV7|D, CV3|R). (A) Scatter plots show canonical variate (CV) 7 from the discovery dataset (D) and 3 from the replication dataset (R), which represent linear combinations of brain connectivity scores obtained during resting-state functional magnetic resonance imaging in the horizontal axis, and linear combinations of clinical scores derived from symptom ratings in the vertical axis. (B) Clinical loadings |r| > 0.2 in for both datasets. Dark red indicates symptoms associated with the latent dimension in both datasets. (C) Edges in red load strongly positively on u7|D and u3|R. Edges that load strongly negatively on u7|D and u3|R are depicted in blue. Given baseline differences in the strength of the connectivity patterns, connectivity maps were thresholded at |r| > 0.2 for the discovery sample and at |r| > 0.15 for the replication sample. Only edges that loaded highly positively or negatively in both datasets were retained for this figure. GAD, generalized anxiety disorder; P, parent; Y, youth.
We highlight three CVs passing all three replicability criteria highlighted in the Methods and Materials section and two latent dimensions passing only the first two replicability criteria (stability within the same dataset and replicability of clinical patterns). Full results concerning replicability appear in the Supplement (Tables S8–S11, S13–S20). We describe replicable CVs based on the specificity of the clinical patterns ranging from shared between all three clinical domains to anxiety-specific.
Irritability, Anxiety, and ADHD (CV2|D, CV3|D, and CV5|R)
In both samples, CV2|D, CV3|D, and CV5|R were robust to variations in the participant-to-input ratios (all r > .67, all pFWER = .0001) (Tables S8–S11, S13–S16). Clinical patterns associated with v3|D and v5|R were negatively associated across samples (corr[v5|R, XR × b3|D]: r = −.20, puncorr = .0006, pFWER = .0175; corr[v3|D, XD × b5|R]: r = −.19, puncorr = .0141, pFWER = .3797). Both v3|D and v5|R loaded >.20 on the irritability and anxiety domains, where close inspection suggested informant effects; v3|D captured youth-reported whereas v5|R loaded on parent-reported irritability and anxiety (Figure 3, Tables S7 and S12). Yet, connectivity patterns correlated across samples (corr[u5|R, YR × a3|D]: r = .12, puncorr = .0375, pFWER = .7425; corr[u3|D, YD×a5|R]: r = .32, puncorr = .0001, pFWER = .0007). Inspection of the connectivity loadings showed that in both samples this transdimensional phenotype was associated with edges interconnecting established resting-state networks within and across both hemispheres (Figure S18).
Interestingly, v2|D was also robust against variations in participant-to-input ratios and showed substantial positive loadings >.20 on the same three parent-report items from the irritability domain (“Often loses temper,” “Angry for a long time,” “Loses temper easily”) and one from the ADHD domain as v5|R (“Talks excessively”) (Figure 2 and Figure S12, Tables S7 and S12). Moreover, clinical loadings for CV2|D and CV5|R significantly correlated across cohorts (corr[v5|R, XR × b2|D]: r = .25, puncorr = .0001, pFWER = .0004; corr[v2|D, XD × b5|R]: r = .45, puncorr = .0001, pFWER = .0001). However, within each sample, connectivity patterns associated with the two latent phenotypes were different, although brain connectivity data informed latent clinical dimensions.
Disruptive Behavior and Irritability (CV4|D and CV4|R)
In both cohorts, CV4|D and CV4|R were robust to input-to-participant ratio variations (all r > .31, pFWER = .0001) (Tables S8–S11, S13–S16). Similarities between CV4|D and CV4|R arose when applying clinical weights from the discovery to the replication dataset (corr[v4|R, XR × b4|D]: r = .19, puncorr = .0006, pFWER = .0229) and vice versa (corr[v4|D, XD × b4|R]: r = .27, puncorr = .0009, pFWER = .0190). Connectivity patterns were also associated using an uncorrected threshold, when applying weights from the replication to the discovery sample (corr[u4|D, YD×a4|R]: r = .19, puncorr = .0108, pFWER = .3494).
Across samples, CV4|D and CV4|R loaded >.20 on three items characterizing disruptive behavior from the ADHD domain (“Can’t sit still,” “Impulsive,” “Loud”) and one item from the domain of irritability (“Loses temper easily”). Furthermore, both CV4|D and CV4|R loaded negatively on one irritability item (“Angry most of the time”) (Figure 4, Tables S7 and S12). Inspection of substantial edge loadings in both samples indicated strong representations in the variate of connections among nodes in motor, attention, default mode, and temporal-parietal networks (Figure S19).
Anxiety (CV7|D and CV1|R, CV3|R)
The last set of replicable CVs comprised CV7|D in the discovery cohort, which correlated with both CV1|R and CV3|R in the replication data set. All three CVs emerged in analyses using input-to-participant ratios of 1:2 and 1:3 (all r > .31, all pFWER = .0001) (Tables S8–S11, S13–S16). Associations manifested between v7|D and v3|R for clinical (corr[v7|D, XD × b3|R]: r = .31, puncorr = .0001, pFWER = .0017; corr[v3|R, XR × b7|D]: r = .44, puncorr = .0001, pFWER = .0001) and for connectivity patterns, when applying an uncorrected threshold (corr[u7|D, YD × a3|R]: r = .19, puncorr = .0136, pFWER = .4120; corr[u3|R, YR × a7|D]: r = .10, puncorr = .0696, pFWER = 1). For V7|D and v3|R, both variates loaded >.20 on the same three anxiety items (“parent-reported generalized anxiety disorder,” “youth-reported generalized anxiety disorder,” “youth-reported panic”) (Figure 5, Tables S7 and S12). Replicable edges connected subcortical structures with the dorsal-attention and motor network as well as the control and default mode networks with sensory, motor, and attention networks (Figure S20).
Similarities in the clinical patterns were also observed between v7|D and v1|R. Both loaded >.20 on four items measuring anxiety (“parent-reported generalized anxiety disorder,” “parent-reported panic,” “parent-reported school avoidance,” “youth-reported panic”; corr[v7|D, XD × b1|R]: r = .45, puncorr = .0001, pFWER = .0001; corr[v1|R, XR × b7|D]: r = .51, puncorr = .0001, pFWER = .0001) (Figure 2 and Figure S12; Tables S7 and S12). However, unlike for u3|R, associated connectivity patterns between u7|D and u1|R were uncorrelated even when the uncorrected threshold was applied.
Discussion
Three key findings emerge from this study. First, analyses found seven CVs in a discovery dataset; four showed stability within the discovery dataset and replicability of clinical patterns in an independent sample; three CVs demonstrated at least weak signs of replicability for the associated rsfMRI connectivity patterns. This suggests the presence of meaningful relations between patterns of intrinsic brain connectivity and psychiatric symptom dimensions in youth. Second, the three most strongly replicable CVs from the discovery dataset varied in clinical specificity; one loaded on all three domains, the second captured shared aspects of irritability and ADHD, and the third loaded specifically on anxiety. Finally, CVs showed weak to modest associations, with multiple edges spanning widely distributed brain areas.
). Dimensions of irritability, ADHD, and anxiety are particularly closely interrelated. Understanding of these cross-dimension relations may follow from research on shared and unique neural correlates. Past work in this area assessed symptom covariation independent of imaging data before then relating symptoms to task-based imaging patterns (
). Using these measures, the current rsfMRI study identified two variates loading strongly on multiple clinical dimensions and a third loading strongly only on anxiety items. Thus, consistent with our hypotheses based on past studies, current findings demonstrate coexisting cross-dimensional and domain-specific neural correlates in treatment-seeking youth.
The detection of only anxiety but not irritability or ADHD-specific neural correlates in the current study could reflect many factors. These include differences between task-based and rsfMRI methods, differential sensitivity in CCA to particular domain-specific features, or biological features of anxiety that generate specific rsfMRI signatures. Additional imaging research might seek to refine clinical groupings based on replicable cross-study patterns for these and other interrelated dimensions.
Findings in the current and past CCA studies exhibited both similarities and differences. Cross-sample correlations for clinical loadings in the current study were notably similar in magnitude to those for variables involving emotion symptoms in the only other study of cross-domain pediatric psychopathology (
). Given differences across the two studies, such consistency speaks to the robust nature of pediatric emotional problem manifestations. The previous study also found strong cross-sample replicability for a pure externalizing factor, which did not emerge in the current study. Failure to detect this factor might reflect lesser diversity in targeted symptoms or larger proportions of treatment-seeking cases in the current study. Finally, unlike past research in treatment-seeking adults, the current study showed cross-sample replicability of latent clinical and connectivity patterns, a finding that might reflect age-related differences or distinct analytic approaches.
Interesting rsfMRI patterns manifested. Connectivity related to clinical dimensions was broadly distributed, involving hundreds of relatively weakly loading interhemispheric and within-hemisphere connections spanning distinct networks. Moreover, while within-sample stability was acceptable in the discovery sample, rsfMRI patterns minimally correlated across datasets. Interestingly, such weak replicability manifested alongside stronger replicability for clinical patterns, themselves defined by relations with rsfMRI. Replicable clinical patterns defined by less replicable rsfMRI patterns raise important questions for future studies. First, greater cross-sample differences existed for the fMRI than clinical assessments. Thus, whether homogeneous cross-sample imaging methods could generate improved rsfMRI replicability remains unclear. Second, replicable clinical patterns defined by minimally replicable fMRI patterns could arise from “many-to-one” mappings between neural and clinical variables. Such configurations commonly underlie brain-behavior relationships at many spatial scales. Thus, whether such “many-to-one” patterns also represent a common motif for mental disorders remains unclear.
From the clinical perspective, broadly distributed connectivity disturbances might require a diverse set of approaches to identify targets for novel interventions. Currently, therapies such as cognitive training or neural stimulation target functions in specific networks (
). However, at least for pediatric anxiety, irritability, and disruptive behavior, broadly distributed patterns may better represent the nature of connectivity disturbances during rest than patterns limited to particular networks. The focus on broad connectivity disturbances as opposed to particular networks might increase effect sizes of studies relating clinical domains to intrinsic brain connectivity.
Findings inform analytic decisions in future CCA studies. Different analyses within and across samples used different rsfMRI data, accounted for different amounts of overall rsfMRI variance, and yielded differences in CV structure. That input affects output is not unique to CCA. However, no ground truth informs selection of PCA-based or other input components for CCA. Thus, risk of overfitting is balanced against risk of omitting relevant variance through dimensionality reduction. Overfitting is reduced by ensuring proportionally more research participants than variables (
). However, particularly in moderately sized datasets, dimensionality reduction can exclude rsfMRI variance components that, even if small, powerfully relate clinical dimensions to connectivity patterns. Such factors create challenges that likely impact findings. The presence of modestly replicable clinical loadings across analyses in the current study suggests the promise of continued iterative work targeting these challenges.
One major limitation of the current study are the medium sample sizes. Also, differences in scanners, imaging-acquisition parameters, and data-quality indices introduced noise that decreased the probability of fully replicating findings across datasets. In effect, larger proportions of rsfMRI connectivity variance are explained in the smaller but homogeneous discovery sample, as evidenced by PCA. Furthermore, we did not include youth ratings of ADHD symptoms or ratings of depressive symptoms, another highly prevalent symptom dimension in youth.
Our findings implicate co-occurring transdimensional and anxiety-specific neural features in pediatric psychopathology. Results further suggest that pediatric clinical dimensions reflect widely distributed brain connectivity patterns. Thus, as with genetic correlates, neural correlates of some pediatric psychopathology dimensions may reflect hundreds of individually small associations.
Acknowledgments and Disclosures
This work is supported by the National Institute of Mental Health (NIMH) Intramural Research Program (Grant Nos. ZIAMH002786 [to EL], ZIAMH002778 [to EL], and ZIAMH002782 [to DSP]), conducted under NIH Clinical Study Protocols described at ClinicalTrials.gov (NCT02531893, NCT00025935, and NCT00018057).
The authors report no biomedical financial interests or potential conflicts of interest.
Schedule for Affective Disorders and Schizophrenia for School-age Children—Present and Lifetime version (K-SADS-PL): Initial reliability and validity data.
J Am Acad Child Adolesc Psychiatry.1997; 36: 980-988