bims-aukdir Biomed News
on Automated knowledge discovery in diabetes research
Issue of 2026–03–01
twenty papers selected by
Mott Given



  1. Biomedicines. 2026 Jan 28. pii: 290. [Epub ahead of print]14(2):
      Background: Diabetic retinopathy (DR) poses a significant global health challenge that needs scalable and efficient screening pathways beyond the current limitations of teleophthalmology. This study retrospectively evaluated the diagnostic performance of an artificial intelligence (AI) DRISTi system (Version 2.1) against ophthalmologist grading for more-than-mild diabetic retinopathy (mtmDR), vision-threatening diabetic retinopathy (vtDR), and diabetic macular edema (DME). Methods: The methods involved a retrospective, observational, non-interventional validation comparing the AI DRISTi system's output to ophthalmologist grading on 739 colour fundus images acquired using Topcon NWC 400, CrystalVue NFC 600/700, Canon CR2/CR2 AF, and Zeiss VISUCAM 500 cameras. Results: Primary outcomes included sensitivity and specificity, with statistical analyses utilizing 2 × 2 contingency tables and 95% confidence intervals. The AI system achieved an accuracy of 93.36% (sensitivity 95.03%; specificity 92.90%) for mtmDR, 98.64% (sensitivity 96.92%; specificity 99.01%) for vtDR, and 97.97% (sensitivity 92.85%; specificity 98.88%) for DME. Performance was robust and consistent across all evaluated camera types. Conclusions: In conclusion, the AI DRISTi system (Version 2.1) demonstrates strong diagnostic performance for mtmDR, vtDR, and DME, comparable to leading commercial AI systems, from fundus photographs acquired across multiple camera platforms. This system holds significant promise as an adjunctive screening tool for large-scale DR screening programs, contributing to early detection, appropriate triage, and the prevention of vision loss in at-risk populations.
    Keywords:  DRISTi; artificial intelligence; diabetic macular edema; diabetic retinopathy; screening
    DOI:  https://doi.org/10.3390/biomedicines14020290
  2. Biomedicines. 2026 Feb 04. pii: 359. [Epub ahead of print]14(2):
      Background/Objectives: Diabetic retinopathy (DR) is a chronic, progressive complication of diabetes mellitus and remains one of the leading causes of vision impairment worldwide, particularly when early pathological changes go undetected or untreated. The earliest clinically identifiable biomarkers are microaneurysms, which are minute, round dilatations of capillary walls. Retinal abnormalities of a broad spectrum are indicative of the condition. This paper introduces a novel automated screening system for DR that prioritises the detection of these early indicators. Methods: The proposed approach integrates advanced image processing techniques based on the circular Hough transform and the YOLOv9 model, to localise and detect microaneurysms in colour fundus images. Results: Several system prototype versions were developed and evaluated. The final, best-performing YOLOv9-based model achieved an accuracy of 91%, representing a substantial performance improvement compared with the circular Hough transform. Conclusions: The developed models effectively address the issue of significant image processing challenges in lesion detection as well as small and class imbalance data, which are recurring constraints in medical image analysis.
    Keywords:  YOLO; deep learning; diabetic retinopathy; microaneurysms detection; retinal fundus imaging
    DOI:  https://doi.org/10.3390/biomedicines14020359
  3. Med Eng Phys. 2026 Feb 24. 147(3):
      Diabetic retinopathy (DR) remains a major cause of visual impairment globally, and early, accurate diagnosis is critical for effective intervention. To address the challenges of limited labeled training data, difficulty identifying subtle and dispersed lesions, and redundant feature extraction in retinal images, this paper proposes a DR classification network with a multi-frequency contextual attention module (MFCA-DRNet). First, this paper introduces a self-supervised contrastive learning strategy pre-trained on the EyePACS dataset, eliminating reliance on extensive labeled data and enabling effective feature learning. Next, an adaptive preprocessing method, integrating histogram equalization with non-local means denoising, is designed to enhance image quality by reducing noise and improving lesion visibility. The proposed MFCA module effectively captures long-range contextual relationships and associates dispersed lesion characteristics across retinal images, significantly enhancing lesion recognition. Additionally, the backbone network incorporates an attention mechanism guided by an energy function, emphasizing lesion-specific features while suppressing irrelevant information. Evaluated through downstream classification tasks on DDR, APTOS 2019, and Messidor-2 datasets, MFCA-DRNet achieved strong performance, particularly on the APTOS 2019 dataset, with accuracy, precision, recall, andF1 scores of 87.12%, 81.2%, 85.3%, and 83.16%, respectively. These results highlight MFCA-DRNet's potential to improve DR diagnosis and clinical applicability in diverse imaging conditions.
    Keywords:  contrastive learning; diabetic retinopathy; image classification; self-supervised learning
    DOI:  https://doi.org/10.1088/1873-4030/ae45ab
  4. Sci Rep. 2026 Feb 24.
      Accurate fovea segmentation in fundus images is a critical step in diabetic retinopathy screening; however, it remains a challenging task due to the indistinct boundaries of the fovea. Beyond simple localization, precise segmentation offers essential clinical value for Diabetic Macular Edema (DME) management, as treatment decisions-specifically the choice between intravitreal anti-VEGF injection for center-involved DME and laser therapy for extrafoveal edema-depend on the accurate delineation of the foveal region. While existing methods often rely on increasing model architecture complexity, the potential of anatomical context within the training process remains under-explored. This paper presents a data-centric approach that leverages contextual information to robustly identify the fovea. We demonstrate that progressively incorporating key anatomical landmarks-the optic disc, retina, and blood vessels-into training labels significantly enhances fovea detection. To facilitate this, we developed IDRiD-RETA-FV, a meticulously annotated dataset comprising 81 images (54 training, 27 testing) with complete anatomical structures (inter-observer F1=0.98), and introduce MNv4Fovea, a framework designed to explicitly exploit these anatomical inter-dependencies through a multi-class constraints mechanism. Evaluation on the held-out test set with verified ground truth demonstrates excellent segmentation performance (fovea IoU = 0.812, F1 = 0.894, AED = 4.06 pixels). To demonstrate the efficacy of our synthesis strategy, our GEV-based augmentation technique achieves a detection rate of 98.4% compared to 59.0% for baseline geometric augmentation (paired t-test: t = 8.536, p < 0.001, Cohen's d = 1.093). Cross-dataset evaluation on REFUGE, MESSIDOR, and ARIA demonstrates competitive localization performance, achieving state-of-the-art Average Euclidean Distance on REFUGE (22.46 ± 18.73 pixels) and MESSIDOR (6.52 ± 5.89 pixels) with robust generalization across diverse imaging protocols. These results establish that explicit anatomical context, rather than mere model complexity, is key to accurate fovea segmentation, offering a robust paradigm for medical image analysis.
    Keywords:  Fovea segmentation; MobileNetV4; contextual anatomical labels; data-centric AI; diabetic retinopathy; semantic segmentation
    DOI:  https://doi.org/10.1038/s41598-026-40287-y
  5. Eur J Med Res. 2026 Feb 21.
       BACKGROUND: Persistent diabetic macular edema (DME) remains a leading cause of vision loss in diabetic retinopathy, even with anti-VEGF therapy. About 30-40% of patients respond poorly after standard loading doses, resulting in prolonged disease and increased treatment burden. Early identification of high-risk patients is essential for optimizing therapy and reducing unnecessary interventions. This study aimed to develop and compare machine learning models integrating optical coherence tomography (OCT)-derived biomarkers with clinical features to predict persistent DME after anti-VEGF therapy and identify key predictors.
    METHODS: This retrospective study analyzed 339 patients with DME treated with anti-VEGF injections at Changhai Hospital between January 2018 and December 2022. Eight predictive models (M1-M8), including logistic regression, LASSO, XGBoost, random forest (RF), and support vector machine (SVM), were constructed using baseline clinical and OCT data. Model performance was assessed by AUC, accuracy, precision, recall, and F1-score. DeLong's test was used for AUC comparison, decision curve analysis (DCA) for clinical utility, and SHAP values for model interpretability.
    RESULTS: Models integrating OCT biomarkers and clinical features outperformed single-modality models. SVM (M8), XGBoost (M6), and RF (M7) achieved the highest validation AUCs (0.865, 0.819, and 0.812, respectively), demonstrating strong discrimination in the internal validation cohort. DeLong's test confirmed significant AUC improvements for multimodal and regularized models compared with clinical-only models (P < 0.05). SHAP analysis highlighted diabetes duration, ellipsoid zone (EZ)/external limiting membrane (ELM) disruption, and hyperreflective foci (HF) as key predictors.
    CONCLUSIONS: Machine learning models integrating OCT-derived structural biomarkers with clinical features provide superior predictive accuracy for persistent DME compared with conventional models. SHAP-based interpretation consistently identified diabetes duration, EZ/ELM disruption, and HF as key predictors. Among the developed models, SVM achieved the highest AUC in the internal validation cohort, whereas XGBoost and RF provided a favorable balance between predictive performance and interpretability, supporting their potential use in individualized treatment planning.
    Keywords:  Computer-assisted; Diabetic macular edema; Image processing; Optical coherence tomography; Predictive modeling; Support vector machine learning
    DOI:  https://doi.org/10.1186/s40001-026-04064-x
  6. Clin Ophthalmol. 2026 ;20 581564
       Introduction: Fully autonomous artificial intelligence (AI) can improve access to diabetic retinopathy (DR) screening. In this study, we explore patient perspectives on the use of AI in DR screening.
    Methods: Adults with diabetes at an urban academic medical center were recruited to undergo imaging with a handheld AI fundus camera and respond to a survey on awareness of AI in healthcare, trust in AI systems, perceived efficiency of AI, preferences for personal interaction, and overall receptivity of AI in DR screening. Responses were stratified by sociodemographic and neighborhood characteristics.
    Results: A total of 100 participants were included (mean age 60 years; 52% female; 24% Hispanic, 20% non-Hispanic Black, 31% non-Hispanic White, 25% other). Most were aware of AI (78%) and its use in healthcare (70%), but fewer of its application to eye disease (46%). Most believed AI could improve accuracy and protect confidentiality (both 77%), yet 83% preferred physician oversight and would trust AI more if it were supervised by a doctor. Overall, 76% were comfortable with AI as part of the eye exam and 92% were satisfied with AI-based screening. Despite this, only 31% felt AI could replace a doctor visit and the majority of participants (94%) believed doctors will always remain responsible for diagnosing, even if AI was evaluating the scans.
    Conclusion: We found that while participants were generally comfortable with the use of AI as part of the eye examination, most would trust AI more if it were supervised by a doctor and did not believe that AI-based DR screening could replace a doctor's visit. Implementation of autonomous AI-based DR screening should address the mismatch between the intended use of fully autonomous AI screening and participants' understanding of the role that would mean for physicians.
    Keywords:  artificial intelligence; diabetic retinopathy; fundus photography; health disparities
    DOI:  https://doi.org/10.2147/OPTH.S581564
  7. Commun Med (Lond). 2026 Feb 26.
       BACKGROUND: Deep learning has shown promise in diabetes management but faces challenges in real-world application due to its "black-box" nature, characterized by opaque internal decision-making processes. Explainable artificial intelligence (XAI) methods have been proposed to enhance model transparency. However, most of current XAI methods applied in the medical field often ignore the interaction of features in complex environments and pose deviation from clinical domain knowledge.
    METHODS: Our study used two Electronic Health Record (EHR) cohorts of hospitalized patients with type 2 diabetes (T2DM), including an internal dataset of 1,275 inpatients (mean age 58.5 ± 14.3 years) and an external dataset of 292 patients (mean age 69.3 ± 14.5 years). We introduce an expert-guided XAI framework to improve the transparency and trustworthiness of deep learning models for insulin titration in diabetes management. The framework utilizes a post-hoc XAI model named Shapley Taylor Interaction Index (STII) to capture the impact of feature interactions. Additionally, the model is refined iteratively in a doctor-in-the-loop (DIL) process by encoding clinical constraints to align with medical expertise.
    RESULTS: Here we show that our STII-DIL model could explore the interaction factors and reduce unreasonable explanations compared with other explanation models. The final XAI system explanations demonstrated strong alignment with experts' explanations and increased correctness by expert evaluation An AI-human collaboration study revealed that insulin titration accuracy significantly improved for junior clinicians with STII-DIL assistance, while senior clinicians showed minimal change. Both junior and senior clinicians reported increased confidence when using the STII-DIL system.
    CONCLUSIONS: We present an explainable deep learning framework that combines post-hoc XAI and expert domain knowledge to provide transparent and expert-aligned explanations for insulin titration in type 2 diabetes management. This framework enhances decision-making accuracy and confidence, especially for junior clinicians, and may facilitate broader clinical adoption of AI-assisted decision-making tools.
    DOI:  https://doi.org/10.1038/s43856-026-01449-1
  8. PLoS One. 2026 ;21(2): e0339580
      Metabolic-Associated Fatty Liver Disease (MAFLD) is common among Type 2 Diabetes (T2DM) patients. The coexistence of these conditions increases the risk of MAFLD progression and diabetes complications. Detecting MAFLD early is challenging due to its asymptomatic initial stages. In this study, we aimed to develop a machine learning model to predict MAFLD in T2DM patients. We conducted a cross-sectional study on 3,654 Iranian T2DM patients using their demographic and lab data. This study involved thorough data preprocessing, including evaluating various imputation methods on simulated missingness in a complete subset of the dataset. Additionally, four feature selection methods were applied to eight machine learning models to identify the most effective predictive model. The XGBoost classifier without feature selection achieved the best performance, with an accuracy of 80.6% and an area under the receiver operating characteristic curve (AUC) of 88.9%.Notably, certain features, such as alanine aminotransferase (ALT), platelet count (PLT) and Vitamin D(VitD) influenced the predictive performance.
    DOI:  https://doi.org/10.1371/journal.pone.0339580
  9. Diagnostics (Basel). 2026 Feb 11. pii: 532. [Epub ahead of print]16(4):
      Background/Objectives: Diabetes is one of the most familiar and common diseases among people currently, and is a type of metabolic disease that is caused due to high levels of sugar in the blood for longer periods of time. If the disease is predicted at an earlier stage, the severity and risks associated with diabetes are significantly reduced, which helps to save the lifespan of people. In earlier investigations, various kinds of automated models based on artificial intelligence (AI) were developed for this purpose. However, key issues still revolve around the lack of robustness, dependability, and precise prediction. The motivation behind the proposed study is to design and develop an automated tool for the diagnosis of chronic disease with the use of novel AI methodology. Methods: For this purpose, a new detection framework is introduced, known as the Brass Optimized Learning-Based Diabetes Prediction (BOLD) model for remote healthcare applications. By using this kind of optimization-integrated deep learning technique, the overall performance and efficiency of the diabetes detection system are maximized. This framework preprocesses the input diabetes dataset before performing the data splitting, normalization, and cleaning activities. Next, the best attributes for improving the prognostic performance of the classifier are chosen using the Brassy Pelican Optimization (BPO) procedure. The Hunting Optimized Recurrent Neural Network-Long Short-Term Memory (RNN-LSTM) method is used to categorize the people into those who are diabetic and those who are not based on the chosen attributes. The approach employs a Deer Hunting Optimization (DHO) method to choose the hyperparameters needed to make an informed choice. A variety of parameters have been employed to confirm the results, which are evaluated for performance verification using the PIDD, Indonesia diabetic database, and kidney disease dataset. Results: The BOLD framework is successful to the extent that it has been able to achieve several metrics of comparably good results, such as an RMSE value of 0.015, a Cohen's Kappa measure of 0.99, a precision of 0.991, a recall of 0.99, an accuracy equal to 0.996, and an AUC equal to 0.99. Conclusions: It is also remarkable that a very short time of 0.8 s was enough for it to deliver this kind of performance, making it a neat combination of both time and power efficiency.
    Keywords:  Artificial Intelligence (AI); Brass Optimized Learning-based Diabetes Prediction (BOLD); Brassy Pelican Optimization (BPO); Deep Learning (DL); Deer Hunting Optimization (DHO); Recurrent Neural Network—Long Short Term Memory (LSTM); classification; diabetes prediction; remote healthcare
    DOI:  https://doi.org/10.3390/diagnostics16040532
  10. Brief Bioinform. 2026 Jan 07. pii: bbag083. [Epub ahead of print]27(1):
      Diabetic complications are a major cause of disability and mortality among patients, and early identification of high-risk individuals is essential for precision prevention and management. In recent years, the rapid advancement of artificial intelligence (AI) has provided transformative tools for risk prediction and clinical decision support in diabetes care. In this narrative review, we systematically surveyed studies published between January 2015 and June 2025 in PubMed, Web of Science, and Scopus that applied AI-based predictive modeling for three major diabetic complications: diabetic retinopathy (DR), diabetic nephropathy (DN), and diabetic cardiovascular disease (CVD). A total of 58 studies were included, encompassing models based on clinical features, molecular omics, medical imaging, and multimodal data integration. Cross-scale and multimodal data fusion has emerged as a promising new paradigm, demonstrating improved predictive performance over single-modality approaches in three major diabetic complications. We also summarize the evolution from traditional machine learning to deep learning and, more recently, to large language models and agent-based systems, comparing their methodological characteristics, strengths, and suitable application scenarios. Finally, we proposed an actionable six-step framework and clinical translation pathway for AI in diabetic complications, outlining key steps from data curation and model development to validation, regulatory compliance, and real-world implementation. Together, these insights provide a roadmap toward developing robust, transparent, and clinically deployable AI systems capable of transforming the prevention and management of diabetic complications.
    Keywords:  artificial intelligence; diabetic complications; large language models; machine learning; risk prediction
    DOI:  https://doi.org/10.1093/bib/bbag083
  11. JMIR Diabetes. 2026 Feb 23. 11 e82084
       Background: The global prevalence of type 2 diabetes mellitus (T2DM) poses significant challenges due to its association with increased cardiovascular risk and complications like cardiovascular autonomic neuropathy. Measures derived from heart rate variability (HRV) and cardiorespiratory interactions quantified through frequency response function (FRF) and impulse response (IR) metrics reflect different aspects of autonomic regulation and may provide complementary physiological information relevant to diabetes-related autonomic alterations.
    Objective: The study aimed to investigate whether these metrics, individually or in combination, provide useful physiological features for distinguishing individuals with and without T2DM using machine learning classifiers.
    Methods: Electrocardiogram and respiratory signals from 2 PhysioNet datasets were used to derive 3 domains of autonomic and cardiorespiratory features: (1) spectral HRV indices reflecting overall variability; (2) FRF metrics characterizing frequency-specific respiratory-cardiac transfer properties; and (3) causal IR metrics capturing time-domain responsiveness to respiratory inputs. ML classifiers-logistic regression, support vector machine (SVM) with linear kernel, and SVM with radial basis function (SVM RBF) kernel-assessed the predictive value of individual and combined feature sets under NearMiss-1 (NM) undersampling and Synthetic Minority Oversampling Technique oversampling. This systems-based framework may capture subtle differences in respiratory-cardiac regulation associated with T2DM more effectively than HRV alone by reflecting integrated cardiorespiratory coupling.
    Results: Across classifiers and balancing strategies, IR features frequently produced comparatively strong standalone performance, suggesting that causal, time-domain cardiorespiratory dynamics capture informative physiological differences between groups. With logistic regression and NM, IR features achieved mean accuracy of 0.770 (SD 0.179), precision of 0.783 (SD 0.217), recall of 0.900 (SD 0.224), and F1-score of 0.798 (SD 0.140). While HRV metrics were the least informative standalone feature set, the combined HRV+FRF feature set under NM yielded the highest observed performance, with accuracy of 0.830 (SD 0.172), precision of 0.800 (SD 0.183), recall of 0.933 (SD 0.149), and F1-score of 0.853 (SD 0.145; SVM RBF). Under Synthetic Minority Oversampling Technique, HRV+IR showed the strongest observed combined performance, yielding accuracy of 0.700 (SD 0.128), precision of 0.783 (SD 0.217), recall of 0.683 (SD 0.207), and F1-score of 0.691 (SD 0.097) with SVM RBF, surpassing standalone IR in most metrics, though IR alone retained superior recall (0.950, SD 0.112) and F1-score (0.708, SD 0.038). These results reflect that performance depends on both feature domain and sampling strategy and that combining features capturing complementary physiological aspects of autonomic regulation may enhance discriminative ability.
    Conclusions: HRV, FRF, and IR metrics each reflect distinct dimensions of autonomic and cardiorespiratory regulation. Systems-based approaches incorporating frequency-domain and causal dynamic features may offer richer characterization of diabetes-related regulatory differences than HRV alone. Although preliminary and limited by sample size, these findings highlight promising physiological feature domains and sampling strategies for future investigation. Larger datasets with well-defined autonomic phenotyping are needed to evaluate generalizability and determine clinical relevance.
    Keywords:  cardiorespiratory coupling; diabetic autonomic neuropathy; frequency response function; heart rate variability; impulse response; machine learning; type 2 diabetes
    DOI:  https://doi.org/10.2196/82084
  12. AMIA Annu Symp Proc. 2024 ;2024 1434-1443
      Hepatic fibrosis poses a significant health risk for young adults with type 2 diabetes (T2D). We propose FCFNets, a novel factual and counterfactual learning framework to predict hepatic fibrosis in young adults with T2D that can address class imbalance issue and increase interpretability leveraging electronic health records (EHRs). We designed a hybrid UNDO oversampling strategy, combining random and dissimilar oversampling that improves dataset diversity and model robustness. FCFNets also integrates SHAP-based global and instance-level explanations, alongside feature interaction analysis, providing insights into critical risk factors associated with hepatic fibrosis. The results show our proposed model outperforms various baseline methods with high sensitivity (0.846) and accuracy (0.768), while delivering counterfactual explanations. Hyperparameter tuning and dropout analysis further refine the model, ensuring optimal performance. This study demonstrates FCFNets's potential for early detection and personalized management of hepatic fibrosis, paving the way for interpretable AI applications in precision medicine.
  13. Healthc Technol Lett. 2026 Jan-Dec;13(1):13(1): e70067
      This study investigated the efficacy of pre-trained deep learning models for multi-class classification of eye diseases, namely cataract, diabetic retinopathy, and glaucoma, using fundus images. Although CNN and transformer-based models have been extensively explored separately in ophthalmic diagnostics, a direct comparative analysis remains limited. Moreover, recent high-performing systems frequently rely on heavy backbones, ensembles, or large-scale domain pretraining, which can be impractical for resource-constrained screening pipelines. We evaluated three models, EfficientNetB3, MobileNetV2 and vision Transformer, with tailored modifications. An attention-enhanced feature refinement module and the OpthaHead custom classifier enhanced EfficientNetB3 and MobileNetV2, while META customization optimized vision Transformer. The proposed design explicitly targets two practical bottlenecks observed in ophthalmic transfer learning, insufficient feature selectivity for subtle lesions and structural regions, and overfitting or instability in the final decision layers when training data are limited. The optimized EfficientNetB3 achieved a 10.84% improvement over its baseline with 96.04% accuracy, and MobileNetV2 improved by 11.26%, balancing accuracy and computational efficiency. META customization boosted vision Transformer performance by over 18%, showing that reducing model complexity benefits transformers on limited medical data. This study demonstrates strong performance for AI-driven eye disease classification and highlights the potential of AI tools for early detection, improving clinical decision-making and patient outcomes.
    Keywords:  deep learning; diabetic retinopathy; efficientNet; eye disease classification; transfer learning
    DOI:  https://doi.org/10.1049/htl2.70067
  14. AMIA Annu Symp Proc. 2024 ;2024 257-266
      Helping patients self-managing diseases like type 1 diabetes (T1D) requires informatics tools delivering real-time predictions with explainable, actionable guidance. However, many healthcare AI solutions lack actionable recommendations and user-friendly explanations, limiting clinical impacts. We introduce APEA, a pediatric T1D self-management Ambient-AI assistance tool, integrating glucose multi-trajectory-scenarios Prediction, interactive, context-aware large language model Explanations, and just-in-time adaptive intervention policy optimization for Actionable real-time suggestions through reinforcement learning. Using T1DEXIP dataset (262 pediatric T1D patients, multi-center), our results showed improved glucose control outcomes: 45% over human management, 69% over infusion-pump management. Although constrained by small sample size and severe class imbalance, APEA addresses healthcare AI implementation gaps by bridging what might happen, what can be done about it, and why it makes clinical sense. APEA offers a transferable framework for other chronic conditions that demand continuous, personalized, just-in-time adaptive interventions.
  15. Diabetologia. 2026 Feb 25.
       AIMS/HYPOTHESIS: This study aimed to develop an accessible tool, derived using machine learning, to predict hypoglycaemia risk at the start of exercise and to provide clear, rapid risk assessment to support safer participation in exercise.
    METHODS: Data from four diverse studies were combined, encompassing 16,430 exercise sessions from 834 participants aged 12-80 years using various insulin delivery methods. The XGBoost algorithm was used to develop two models: a comprehensive model and a simplified model for predicting hypoglycaemia during exercise.
    RESULTS: The comprehensive model (406 variables) achieved a mean ROC AUC of 0.89. The simplified model, using only starting glucose, exercise duration and glucose trend arrows, achieved a comparable ROC AUC of 0.87. The simplified model performed consistently across exercise types and insulin delivery methods. In collaboration with individuals with type 1 diabetes, this model was translated into GlucoseGo, a user-friendly traffic-light heatmap displaying hypoglycaemia risk based on the three variables.
    CONCLUSIONS/INTERPRETATION: The GlucoseGo heatmap provides a practical, accessible tool for predicting hypoglycaemia risk immediately before exercise. It may empower individuals with type 1 diabetes to exercise more safely, reduce hypoglycaemic episodes, and increase engagement in physical activity.
    Keywords:  Continuous glucose monitoring; Decision support; Exercise; Hypoglycaemia; Machine learning; Risk prediction; Self-management; Type 1 diabetes
    DOI:  https://doi.org/10.1007/s00125-026-06692-8
  16. Stud Health Technol Inform. 2025 Oct 03. 330 714-749
      Synthetic data generation is a strategy used to address the lack and complex process to acquire clinical data and information, in particular in type 2 diabetes mellitus (T2DM) research. T2DM is characterized by chronic hyperglycemia with macrovascular and microvascular complications. Nevertheless, despite the importance of data to improve diagnostic accuracy, better treatments, and personalized patient care, medical datasets are often restricted by ethical and privacy constraints. In this sense, this chapter evaluates four synthetic data generation techniques, Gaussian Mixture Models (GMM), Generative Adversarial Networks (GAN), Wasserstein GAN (WGAN), and Variational Autoencoders (VAE). The quality of the generated data was assessed through statistical divergence metrics-specifically Jensen-Shannon (JSD) and Kullback-Leibler (KLD)and by analyzing their impact on classification performance. The results indicate that GMM achieved the lowest JSD, showing the best overall distributional similarity, while WGAN obtained the lowest KLD, suggesting a closer alignment in information content with real data. Additionally, GAN and WGAN demonstrated the highest predictive performance in classification tasks, indicating that they better preserved essential relationships within the data. These findings confirm that generative strategies of using synthetic data to improve T2DM research are feasible, offering an alternative to develop diagnosis tools without compromising patient confidentiality. It is possible to conclude that the generation method selection depends on the type of data and research objective, maximizing statistical similarity, optimizing performance, or balancing both aims. Synthetic data generation approaches represent a feasible approach to expand balanced and quality datasets to advance in personalized healthcare for diabetes patients.
    Keywords:  Gaussian Mixture Models (GMM); Generative Adversarial Networks (GAN); Statistical divergence metrics; Synthetic data; Type 2 Diabetes Mellitus (T2DM)
    DOI:  https://doi.org/10.3233/SHTI251459
  17. Diabetologia. 2026 Feb 21.
       AIMS/HYPOTHESIS: Data-driven subtyping of type 2 diabetes has not been translated into clinical practice due to the lack of routine fasting glucose and insulin measurements. We aimed to identify type 2 diabetes subtypes in clinical settings using electronic health records and study their epidemiology.
    METHODS: We identified 727,076 adults (≥18 years) with newly diagnosed type 2 diabetes from Epic Cosmos research platform data across all 50 states and the District of Columbia between 2012 and 2023. Classification models developed in cohort studies were applied to study the sociodemographic distribution of subtypes. Cox proportional hazards regression models, adjusted for age and sex, were used to assess the rates of microvascular complications (retinopathy, neuropathy and nephropathy) and macrovascular complications (severe atherosclerotic cardiovascular disease [ASCVD], other ASCVD and heart failure).
    RESULTS: Among newly diagnosed individuals (mean age 64.4 years [SD 13.3], 52% female), 21.6% were classified as having severe insulin-deficient diabetes (SIDD), 23.8% were classified as having mild obesity-related diabetes (MOD), 40.9% were classified as having mild age-related diabetes and 13.7% were classified as having the mixed subtype. Compared with those classified as having MOD, individuals classified as having SIDD had higher HRs for retinopathy (HR 2.83; 95% CI 2.73, 2.93), neuropathy (HR 1.57; 95% CI 1.54, 1.60), nephropathy (HR 1.34; 95% CI 1.32, 1.37), severe ASCVD (HR 1.49; 95% CI 1.46, 1.53), other ASCVD (HR 1.23; 95% CI 1.21, 1.25) and heart failure (HR 1.17; 95% CI 1.15, 1.20). SIDD and MOD were more prevalent among Hispanics (28.4% and 30.1%, respectively) and non-Hispanic Black people (25.5% and 30.0%, respectively) compared with non-Hispanic White people (20.1% and 21.6%, respectively), and were also more prevalent in the District of Columbia and Utah, respectively, compared with the rest of the country.
    CONCLUSIONS/INTERPRETATION: Individuals with different type 2 diabetes subtypes, identified through electronic health records, differ in terms of their risk of vascular complications. These findings support leveraging routine electronic health record data to improve the characterisation of patient heterogeneity at the time of diabetes diagnosis.
    Keywords:  Clinical translation; Clusters; Electronic health records; Health disparities; Machine learning; Macrovascular complications; Microvascular complications; Real-world evidence; Subphenotypes
    DOI:  https://doi.org/10.1007/s00125-026-06687-5