Identifying predictive factors for neuropathic pain after breast cancer surgery using machine learning

Int J Med Inform. 2020 Sep:141:104170. doi: 10.1016/j.ijmedinf.2020.104170. Epub 2020 Jun 13.

Abstract

Introduction: Neuropathic pain (NP) remains a major debilitating condition affecting more than 26% of breast cancer survivors worldwide. NP is diagnosed using a validated 10-items Douleur Neuropathique - 4 screening questionnaire which is administered 3 months after surgery and requires patient-doctor interaction. To develop an effective prognosis model admissible soon after surgery, without the need for patient-doctor interaction, we sought to [1] identify specific pain characteristics that can help determine which patients may be susceptible to NP after BC surgery, and 2) assess the utility of machine learning models developed in objective [1] as a knowledge discovery tool for downstream analysis.

Methods: The dataset is from a prospective cohort study of female patients scheduled to undergo breast cancer surgery for the first time at the Jewish General Hospital, Montreal, Canada between November 2014 and March 2019. NP was assessed at 3 months after surgery using Douleur Neuropathique - 4 interview scores (in short, DN4-interview; range: 0-7). For the primary analysis, we constructed six ML algorithms (least square, ridge, elastic net, random forest, gradient boosting, and neural net) to identify the most relevant predictors for DN4-interview score; and compared model performance based on root mean square error (RMSE). For the secondary analysis, we built a logistic classification model for neuropathic pain (DN4-interview score ≥ 3 versus DN4-interview score < 3) using the relevant-consensus-predictors from the primary analysis.

Results: Anxiety, type of surgery, preoperative baseline pain and acute pain on movement were identified as the most relevant predictors for DN4 - interview score. The least square regression model (RMSE = 1.43) is comparable in performance with random forest (RMSE = 1.39) and neural network model (RMSE = 1.50). The Gradient boosting model (RMSE = 1.16) outperformed the models compared including the penalized regression models (ridge regressions, RMSE = 1.28; and elastic net, RMSE = 1.31). In the secondary analysis, the preferred logistic regression classier for NP had an area under the curve (AUC) of 0.68 (95% CI = 0.57 to 0.79). Anxiety was significantly associated with the likelihood of NP (odds ratio = 2.18; 95% CI = 1.05-4.49). In comparison to their counterparts, the odds of NP were higher in participants with acute pain on movement or with present preoperative baseline pain or participants who performed total mastectomy surgery, but the differences were not statistically significant.

Conclusions: Modern machine learning models show improvements over traditional least square regression in predicting of DN4-interview score. Penalized regression methods and the Gradient boosting model out-perform other models. As a predictor discovery tool, machine learning algorithms identify relevant predictors for DN4-interview score that remain statistically significant indicators of neuropathic pain in the classification model. Anxiety, type of surgery and acute pain on movement remain the most useful predictors for neuropathic pain.

Keywords: DN4-interview; Machine learning; Neuropathic pain; Prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms* / surgery
  • Canada
  • Female
  • Humans
  • Machine Learning
  • Mastectomy
  • Neuralgia* / diagnosis
  • Neuralgia* / epidemiology
  • Prospective Studies