Skip to main navigation Skip to search Skip to main content

Development of a Machine Learning Model for Predicting Treatment-Related Amenorrhea in Young Women with Breast Cancer

  • Long Song
  • , Zobaida Edib
  • , Uwe Aickelin
  • , Hadi Akbarzadeh Khorshidi
  • , Anne Sophie Hamy
  • , Yasmin Jayasinghe
  • , Martha Hickey
  • , Richard A. Anderson
  • , Matteo Lambertini
  • , Margherita Condorelli
  • , Isabelle Demeestere
  • , Michail Ignatiadis
  • , Barbara Pistilli
  • , H. Irene Su
  • , Shanton Chang
  • , Patrick Cheong Iao Pang
  • , Fabien Reyal
  • , Scott M. Nelson
  • , Paniti Sukumvanich
  • , Alessandro Minisini
  • Fabio Puglisi, Kathryn J. Ruddy, Fergus J. Couch, Janet E. Olson, Kate Stern, Franca Agresta, Lesley Stafford, Laura Chin-Lenn, Wanda Cui, Antoinette Anazodo, Alexandra Gorelik, Tuong L. Nguyen, Ann Partridge, Christobel Saunders, Elizabeth Sullivan, Mary Macheras-Magias, Michelle Peate
  • School of Computing and Information Systems
  • University of Melbourne
  • Royal Women's Hospital
  • Institut Curie
  • Royal Children's Hospital Melbourne
  • Murdoch Children's Research Institute
  • University of Edinburgh
  • IRCCS Ospedale Policlinico San Martino
  • University of Genoa
  • Université libre de Bruxelles
  • UNICANCER Federation
  • University of California at San Diego
  • University of Glasgow
  • University of Pittsburgh
  • Azienda Sanitaria Universitaria Friuli Centrale
  • Mayo Clinic Scottsdale, AZ
  • Melbourne IVF
  • Royal Melbourne Hospital
  • Peter Maccallum Cancer Centre
  • Sydney Children's Hospital
  • University of New South Wales
  • Monash University
  • Dana-Farber Cancer Institute
  • University of Newcastle
  • Breast Cancer Network Australia

Research output: Contribution to journalArticlepeer-review

Abstract

Treatment-induced ovarian function loss is a significant concern for many young patients with breast cancer. Accurately predicting this risk is crucial for counselling young patients and informing their fertility-related decision-making. However, current risk prediction models for treatment-related ovarian function loss have limitations. To provide a broader representation of patient cohorts and improve feature selection, we combined retrospective data from six datasets within the FoRECAsT (Infertility after Cancer Predictor) databank, including 2679 pre-menopausal women diagnosed with breast cancer. This combined dataset presented notable missingness, prompting us to employ cross imputation using the k-nearest neighbours (KNN) machine learning (ML) algorithm. Employing Lasso regression, we developed an ML model to forecast the risk of treatment-related amenorrhea as a surrogate marker of ovarian function loss at 12 months after starting chemotherapy. Our model identified 20 variables significantly associated with risk of developing amenorrhea. Internal validation resulted in an area under the receiver operating characteristic curve (AUC) of 0.820 (95% CI: 0.817–0.823), while external validation with another dataset demonstrated an AUC of 0.743 (95% CI: 0.666–0.818). A cutoff of 0.20 was chosen to achieve higher sensitivity in validation, as false negatives—patients incorrectly classified as likely to regain menses—could miss timely opportunities for fertility preservation if desired. At this threshold, internal validation yielded sensitivity and precision rates of 91.3% and 61.7%, respectively, while external validation showed 92.9% and 60.0%. Leveraging ML methodologies, we not only devised a model for personalised risk prediction of amenorrhea, demonstrating substantial enhancements over existing models but also showcased a robust framework for maximally harnessing available data sources.

Original languageEnglish
Article number1171
JournalBioengineering
Volume12
Issue number11
DOIs
Publication statusPublished - Nov 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • breast cancer
  • cross imputation
  • machine learning
  • risk prediction model
  • treatment-related amenorrhea

Fingerprint

Dive into the research topics of 'Development of a Machine Learning Model for Predicting Treatment-Related Amenorrhea in Young Women with Breast Cancer'. Together they form a unique fingerprint.

Cite this