Sci Rep. 2026 Jun 10.
Extracellular matrix (ECM) remodeling contributes to retinal vascular basement membrane thickening, an early structural hallmark of diabetic retinopathy (DR). This study aimed to identify key ECM-related genes (ECMGs) associated with DR. Transcriptomic data of DR and ECMGs from MatrixDB were integrated to identify differentially expressed ECMGs. Six machine learning (ML) models, including Extra Trees (ET), Logistic Regression, Adaptive Boosting, Random Forest, Extreme Gradient Boosting, and naive Bayes classifier, were employed to construct DR classification models, with SHapley Additive exPlanation (SHAP) used to interpret feature contributions. Functional enrichment analysis using GSEA and immune infiltration analysis using CIBERSORT were conducted to explore the potential mechanisms by which key ECMGs regulate DR. Regulatory networks were constructed using predicted miRNAs, lncRNAs, and transcription factors (TFs) via the ENCORI, miRWalk, and miRNet databases. Drug-key ECMGs-DM-related diseases interactions were further explored using the DGIdb and CTD databases. Nine candidate ECMGs were identified by overlapping 356 DM-associated DEGs, 1,626 DR-associated DEGs, and 1,023 ECMGs, including CILP2, FN1, DEFA3, COL17A1, CRISP3, TPSAB1, SFRP1, GPHA2, and ECM2. Among the six ML algorithms, the ET classifier exhibited the best overall performance, and five ECMGs (SFRP1, CILP2, FN1, TPSAB1, and ECM2) with non-zero SHAP values were retained as key genes. These genes showed distinct expression patterns across the healthy, DM, and DR groups, and were enriched in neural-related pathways, such as axon guidance, glycosphingolipid biosynthesis ganglio series, and neuroactive ligand receptor interaction. Immune profiling and correlation analysis revealed that FN1, TPSAB1, and CILP2 were correlated with memory/naive B cells, CD8 + T cells, activated memory CD4 + T cells, Tregs, monocytes, and neutrophils. Additionally, the ceRNA network contained five miRNAs, 7 lncRNAs, and two ECMGs, and further regulatory and pharmacologic analysis further linked key ECMGs to specific TFs, drugs, and diabetes-related diseases. This study identified SFRP1, CILP2, FN1, TPSAB1, and ECM2 as key ECMGs in DR, revealing their coordinated involvement in ECM remodeling, neural signaling, and immune modulation. These findings provide novel insights into DR pathogenesis and potential therapeutic targets.
Keywords: Diabetic retinopathy; Extracellular matrix; Gene expression profiling; Machine learning