Structural Effect Size in Psychometric Networks: An Asymptotic Bias–Recovery Framework for Deriving a Sample Size Design Law
DOI:
https://doi.org/10.55074/hesj.vi55.1890Keywords:
Structural effect size, Psychometric networks, Small-sample bias, Network recovery, Sample size designAbstract
The present study aimed to develop a quantitative analytical framework for defining Structural Effect Size in psychometric networks and examining its statistical properties under limited sample conditions, with the goal of deriving a quantitative design rule for determining sample size in network studies. The proposed framework measures the structural difference between the true network and the estimated network using the normalized Frobenius distance between their adjacency matrices, which is then transformed into a structural similarity index bounded between zero and one. Structural effect size is defined as the degree of deviation from perfect structural similarity, reflecting the structural recovery error of the network system. To examine the statistical behavior of this indicator, a large-scale simulation study was conducted on psychometric networks with different numbers of nodes (pⓜ=10"،" 20"،" 40) and varying network densities (dⓜ=0.2"،" 0.5"،" 0.8), while varying sample size. The results showed that structural recovery error, which represents the structural effect size resulting from network estimation error, increases with the structural complexity of the network and decreases inversely with the square root of the sample size. The analyses also revealed a regular asymptotic relationship that can be expressed as: y≈C√((d" " p)/N) Where y represents the structural effect size of the recovery error, p denotes the number of nodes, d the network density, and N the sample size. The findings indicate that the accuracy of network recovery depends more strongly on the ratio of sample size to structural network complexity than on the absolute sample size alone. Based on this relationship, a quantitative design rule was derived to estimate the sample size required to achieve a targeted level of structural similarity. The study thus provides a theoretical contribution by introducing the concept of structural effect size into psychometric network analysis and a methodological contribution by offering a priori design guidance for planning sample size in psychological network research.Downloads
References
Ávalos-Tejeda, M., & Calderón, C. (2025). Psychometric network model recovery: The effect of sample size, number of items, and number of nodes. European Journal of Investigation in Health, Psychology and Education, 15(11), Article 235. https://doi.org/10.3390/ejihpe15110235
Bates, D., Maechler, M., & Matrix authors. (2023). Matrix: Sparse and dense matrix classes and methods (R package version). https://cran.r-project.org/package=Matrix
Borkulo, C. D. van, Bork, R. van, Boschloo, L., Kossakowski, J. J., Tio, P., Schoevers, R. A., Borsboom, D., & Waldorp, L. J. (2022). Comparing network structures on three aspects: A permutation test. Psychological Methods. https://doi.org/10.1037/met0000476
Borsboom, D. (2017). A network theory of mental disorders. World Psychiatry, 16(1), 5–13. https://doi.org/10.1002/wps.20375
Borsboom, D. (2022). Possible futures for network psychometrics. Psychometrika, 87(1), 253–265. https://doi.org/10.1007/s11336-022-09851-z
Borsboom, D., & Cramer, A. O. J. (2013). Network analysis: An integrative approach to the structure of psychopathology. Annual Review of Clinical Psychology, 9, 91–121. https://doi.org/10.1146/annurev-clinpsy-050212-185608
Borsboom, D., Mellenbergh, G. J., & van Heerden, J. (2003). The theoretical status of latent variables. Psychological Review, 110(2), 203–219. https://doi.org/10.1037/0033-295X.110.2.203
Bringmann, L. F., Vissers, N., Wichers, M., Geschwind, N., Kuppens, P., Peeters, F., Borsboom, D., & Tuerlinckx, F. (2013). A network approach to psychopathology: New insights into clinical longitudinal data. PLOS ONE, 8(4), e60188. https://doi.org/10.1371/journal.pone.0060188
Bühlmann, P., & van de Geer, S. (2011). Statistics for high-dimensional data: Methods, theory and applications.Springer>
https://doi.org/10.1007/978-3-642-20192-9
Casella, G., & Berger, R. L. (2002). Statistical inference (2nd ed.). Duxbury Press.
Castro, D., Gysi, D. M., Ferreira, F., Ferreira-Santos, F., & Ferreira, T. (2024). Centrality measures in psychological networks: A simulation study on identifying effective treatment targets. PLOS ONE, 19(2), e0297058. https://doi.org/10.1371/journal.pone.0297058
Chen, J., & Chen, Z. (2008). Extended Bayesian information criteria for model selection with large model spaces. Biometrika, 95(3), 759–771. https://doi.org/10.1093/biomet/asn034
Christensen, A. P., Choi, J., & Garrido, L. E. (2024). Evaluating network replicability across local, mesoscale, and global structures. PsyArXiv Preprints. https://doi.org/10.31234/osf.io/9d4nm
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Lawrence Erlbaum Associates.
de Ron, J., Fried, E. I., & Epskamp, S. (2021). Psychological networks in clinical populations: investigating the consequences of Berkson's bias. Psychological medicine, 51(1), 168–176.
https://doi.org/10.1017/S0033291719003209
Epskamp, S., Borsboom, D., & Fried, E. I. (2018). Estimating psychological networks and their accuracy: A tutorial paper. Behavior Research Methods, 50(1), 195–212. https://doi.org/10.3758/s13428-017-0862-1
Epskamp, S., Cramer, A. O. J., Waldorp, L. J., Schmittmann, V. D., & Borsboom, D. (2012). qgraph: Network visualizations of relationships in psychometric data. Journal of Statistical Software, 48(4), 1–18. https://doi.org/10.18637/jss.v048.i04
Epskamp, S., Rhemtulla, M., & Borsboom, D. (2017). Generalized network psychometrics: Combining network and latent variable models. Psychometrika, 82(4), 904–927. https://doi.org/10.1007/s11336-017-9557-x
Fonseca-Pedrero, E. (2017). etwork analysis: A new way of understanding psychopathology? Revista de Psiquiatría y Salud Mental, 10(4), 206–215. https://doi.org/10.1016/j.rpsm.2017.06.004
Forbes, M. K., Wright, A. G. C., Markon, K. E., & Krueger, R. F. (2021). Evidence that psychopathology symptom networks have limited replicability. Journal of Abnormal Psychology, 130(7), 691–706. https://doi.org/10.1037/abn0000673
Foygel, R., & Drton, M. (2010). Extended Bayesian information criteria for Gaussian graphical models. In J. Lafferty, C. Williams, J. Shawe-Taylor, R. Zemel, & A. Culotta (Eds.), Advances in Neural Information Processing Systems (Vol. 23, pp. 604–612).
Fried, E. I., & Cramer, A. O. J. (2017). Moving forward: Challenges and directions for psychopathological network theory and methodology. Perspectives on Psychological Science, 12(6), 999–1020. https://doi.org/10.1177/1745691617705892
Friedman, J., Hastie, T., & Tibshirani, R. (2008). Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9(3), 432–441. https://doi.org/10.1093/biostatistics/kxm045
Friedman, J., Hastie, T., & Tibshirani, R. (2008). Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9(3), 432–441. https://doi.org/10.1093/biostatistics/kxm045
Haslbeck, J. M. B., & Waldorp, L. J. (2018). How well do network models predict observations? On the importance of predictability in network models. Behavior Research Methods, 50, 853–861. https://doi.org/10.3758/s13428-017-0910-x
Hastie, T., Tibshirani, R., & Wainwright, M. (2015). Statistical learning with sparsity: The Lasso and generalizations. CRC Press. https://doi.org/10.1201/b18401
Huang, W., & Ribeiro, A. (2018). Network comparison: Embeddings and interiors. IEEE Transactions on Signal Processing, 66(2), 412–427. https://doi.org/10.1109/TSP.2017.2764848
Hullám, G., Juhasz, G., Deakin, B., & Antal, P. (2017). Structural and parametric uncertainties in full Bayesian and graphical lasso based approaches: Beyond edge weights in psychological networks. In IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology. https://doi.org/10.1109/CIBCB.2017.8058566
Isvoranu, A.-M., & Epskamp, S. (2021). Which estimation method to choose in network psychometrics? Deriving guidelines for applied researchers. Psychological Methods. https://doi.org/10.1037/met0000439
Kalgotra, P., Sharda, R., & Luse, A. (2020). Which similarity measure to use in network analysis: Impact of sample size on phi correlation coefficient and Ochiai index. International Journal of Information Management, 55, 102229. https://doi.org/10.1016/j.ijinfomgt.2020.102229
Lauritzen, S. L. (1996). Graphical models. Oxford, UK: Clarendon Press (Oxford University Press).
Liang, F., Song, Q., & Qiu, P. (2015). An equivalent measure of partial correlation coefficients for high-dimensional Gaussian graphical models. Journal of the American Statistical Association, 110(511), 1248–1265. https://doi.org/10.1080/01621459.2015.1012391
Markus, K. A. (2010). Questions about networks, measurement, and causation. Behavioral and Brain Sciences, 33(2–3), 164–165. https://doi.org/10.1017/S0140525X10000658
Miasnikof, P., & Shetopaloff, A. Y. (2025). A statistical test for network similarity. arXiv. https://doi.org/10.48550/arXiv.2508.14399
R Core Team. (2024). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/
Ravikumar, P., Wainwright, M. J., & Lafferty, J. D. (2010). High-dimensional Ising model selection using ℓ1-regularized logistic regression. The Annals of Statistics, 38(3), 1287–1319. https://doi.org/10.1214/09-AOS691
Robinaugh, D. J., Hoekstra, R. H. A., Toner, E. R., & Borsboom, D. (2020). The network approach to psychopathology: A review of the literature 2008–2018 and an agenda for future research. Psychological Medicine, 50(3), 353–366. https://doi.org/10.1017/S0033291719003404
Santhanam, N. P., & Wainwright, M. J. (2012). Information-theoretic limits of selecting binary graphical models in high dimensions. IEEE Transactions on Information Theory, 58(7), 4117–4134. https://doi.org/10.1109/TIT.2012.2191659
Schieber, T. A., Carpi, L. C., Díaz-Guilera, A., Pardalos, P. M., Masoller, C., & Ravetti, M. G. (2017). Quantification of network structural dissimilarities. Nature Communications, 8, 13928.
https://doi.org/10.1038/ncomms13928
Ulitzsch, E., Khanna, S., Rhemtulla, M., & Domingue, B. W. (2023). A graph theory based similarity metric enables comparison of subpopulation psychometric networks. Psychological Methods. https://doi.org/10.1037/met0000625
Wainwright, M. J. (2019). High-dimensional statistics: A non-asymptotic viewpoint. Cambridge University Press.
http://dx.doi.org/10.1017/9781108627771
Wickham, H. (2016). ggplot2: Elegant graphics for data analysis. Springer.
Wickham, H., François, R., Henry, L., Müller, K., & Vaughan, D. (2023). dplyr: A grammar of data manipulation (R package version). https://cran.r-project.org/package=dplyr
Williams, D. R., & Rast, P. (2020). Back to the basics: Rethinking partial correlation network methodology. British Journal of Mathematical and Statistical Psychology, 73(2), 187–212.
https://doi.org/10.1111/bmsp.12173
Zhang, J., Zhang, J., & Chen, Y. (2017). A hypothesis testing framework for modularity-based network community detection. Statistica Sinica, 27(1), 437–456. https://doi.org/10.5705/SS.202015.0040













