Comput Biol Med. 2025 Mar 20. pii: S0010-4825(25)00393-2. [Epub ahead of print]190 110042
AIMS: Congenital disorders of glycosylation (CDG) comprise a diverse group of genetic diseases characterized by aberrant glycosylation that leads to severe multi-systematic effects. Despite advancements in understanding the underlying molecular mechanisms, curative options remain limited. This study employed computational methods to identify key molecular biomarkers for CDG-I and examine the pharmacological effects of Ginkgolide A (GA), a potent bioactive natural compound.
METHODS: We analyzed the GSE8440 microarray dataset to discover differentially expressed genes (DEGs) in patients compared to healthy individuals with CDG-I utilizing GEO2R. Functional enrichments, including gene ontologies (GO) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway analyses, were conducted to contextualize the biological mechanisms and molecular signatures involved in CDG-I (Congenital Disorders of Glycosylation Type-1). The protein-protein interaction (PPI) network for DEGs was constructed using the STRING database, and the central hub genes within the PPI network were identified using Cytohubba. Furthermore, the 3D structure of the top hub gene (P4HB) was predicted by using the Robetta server. The CASTp was employed to evaluate the active sites. Molecular docking of P4HB with GA was carried out to investigate the binding affinity using the PyRx tool, and the stability of the docked complex was validated through MD simulation. The pharmacokinetics, toxicity, and bioactivity score of GA were comprehensively assessed using SwissADME, ProTox-II, and Molinspiration.
RESULTS: Our findings indicated 247 significant DEGs, including 146 up-regulated and 101 down-regulated genes. GO and KEGG pathway analyses confirmed that the up-regulated and hub genes were strongly associated with protein folding, glycoprotein processing in the endoplasmic reticulum, and endoplasmic reticulum stress (ER) pathways. P4HB emerged as the top hub gene in CDG-I, playing a significant role in protein folding and ER stress. The 3D structure of P4HB was refined and validated, achieving 95.8 % residues in the most favored region of the Ramachandran plot, with an overall quality of 92.97 %. The CASTp server predicted the largest active site with an area of 2243.660 Å2 and a volume of 3236.584 Å3. Molecular docking revealed that GA has a strong binding affinity with P4HB (-8.9 kcal/mol). The ADME (Absorption, Distribution, Metabolism, Excretion) and toxicity assessments confirmed promising drug-like characteristics, excellent bioavailability, and minimal toxicity risk.
CONCLUSION: This study emphasizes GA as a potential treatment possibility option to alleviated CDG-I pathology by targeting protein misfolding and ER stress, which are fundamental aspects of the disease. Additionally, our findings indicate that P4HB is a critical molecular target in CDG-I. These results pave the way for future preclinical and clinical investigations aimed at advancing the targeted and tailored treatments for CDG.
Keywords: ADME; Active sites; CDG-I; GA; GO and KEGG pathways; Hub gene (P4HB); MD simulation; Microarray dataset (GSE8440)