Developing a ‘Smart’ Recovery
Monitoring and Support System

John J. Curtin, Ph.D.

University of Wisconsin-Madison

April 9, 2025

Precision Mental Health for Continuing Care

Precision Mental Health for Continuing Care



“Could you predict not only who might be at greatest risk for relapse …
… but precisely when that relapse might occur …
… and how best to intervene to prevent it?”



Precision Mental Health for Continuing Care


Precision mental health requires us to provide the right interventions and supports to the right people at the right time, every time


SUD continuing care requires

  • Long-term monitoring
  • Ongoing lifestyle adjustments and support

Precision Mental Health for Continuing Care


Precision mental health requires us to provide the right interventions and supports to the right people at the right time, every timed


SUD continuing care requires

  • Long-term monitoring
  • Ongoing lifestyle adjustments and support


A “Smart” Recovery Monitoring and Support System can provide temporally precise, dynamic, personalized continuing care by combining:

  • Sensing
  • Artificial Intelligence/Machine learning

Model Output: Lapses


Lapses

  • are clearly defined,
  • have a temporally precise onset, and
  • can serve as an early warning sign for relapse (precede and predict)


  • “Abstinence violation effects” can increase relapse risk
  • Even a single lapse can result in overdose and/or death for some drugs

Model Output: Lapses


Lapses

  • are clearly defined,
  • have a temporally precise onset, and
  • can serve as an early warning sign for relapse (precede and predict)


  • “Abstinence violation effects” can increase relapse risk
  • Even a single lapse can result in overdose and/or death for some drugs

Model Inputs: Personal Sensing


Personal sensing collects information from smartphones and wearable sensors to identify a person’s thoughts, feelings, behaviors, and context.

Model Inputs: Personal Sensing


Personal sensing collects information from smartphones and wearable sensors to identify a person’s thoughts, feelings, behaviors, and context.


Sensing allows for “real-world” measurement that

  • Can be sustained for long periods
  • Has very high temporal granularity

Model Inputs: Personal Sensing


Personal sensing collects information from smartphones and wearable sensors to identify a person’s thoughts, feelings, behaviors, and context.


Sensing allows for “real-world” measurement that

  • Can be sustained for long periods
  • Has very high temporal granularity


Practical requirements

  • Feasible/acceptable for long term use
  • Consistent across platforms/devices
  • Stable/Low churn over time in necessary hardware/software

Model Inputs: Personal Sensing


We use smartphone sensing methods to collect

  • Ecological Momentary Assessments (EMA)
  • Contextualized Geolocation
  • Contexualized Smartphone Communications

Model Inputs: Personal Sensing


We use smartphone sensing methods to collect

  • Ecological Momentary Assessments (EMA)
  • Contextualized Geolocation
  • Contexualized Smartphone Communications

Lapse Prediction for AUD


  • 151 individuals with moderate to severe AUD


  • Early in recovery (1-8 weeks)


  • Committed to abstinence throughout study


  • Followed with sensing for up to 3 months
    • Ecological Momentary Assessments
    • Contextualized Geolocation
    • Contexualized Smartphone Communications
    • (also sensed physiology, sleep, coarse self-report)

risk1_pis.png 
niaaa_logo.png 

Lapse Prediction for AUD


  • 151 individuals with moderate to severe AUD


  • Early in recovery (1-8 weeks)


  • Committed to abstinence throughout study


  • Followed with sensing for up to 3 months
    • Ecological Momentary Assessments
    • Contextualized Geolocation
    • Contexualized Smartphone Communications
    • (also sensed physiology, sleep, coarse self-report)

risk1_pis.png 
niaaa_logo.png 

Participant Characteristics

Participant Characteristics


  • All participants met criteria for moderate to severe AUD


  • Reported abstinence goals

Ecological Momentary Assessments


  • Current/Recent Experiences
    • Craving
    • Emotional state
    • Recent past alcohol use
    • Recent risky situations
    • Recent stressful events
    • Recent pleasant event


  • Future Expectations
    • Risky situations
    • Stressful events
    • Abstinence Confidence

Modeling: Feature Engineering


  • Features based on recent past experiences (12, 24, 48, 72, 168 hours)


  • Min, max, and median response (all items)


  • History (count) of past lapses (item 1) and completed EMAs (compliance)


  • Raw scores and change scores (from baseline/all past responses)

Modeling: Predictions


  • Predict hour-by-hour probability of future lapse


  • Lapse window widths
    • 1 week
    • 1 day
    • 1 hour

Modeling: Predictions


  • Predict hour-by-hour probability of future lapse


  • Lapse window widths
    • 1 week
    • 1 day
    • 1 hour

Modeling: Algorithms and Resampling


  • XGBoost - Boosted decision trees
  • Also considered:
    • ElasticNet GLM (e.g., LASSO, ridge regression)
    • Random Forest
    • KNN


  • Using grouped (by participant), nested, repeated k-fold CV
    • 30 “held-out” test sets
    • New participants and observations not used for training

Predicted Lapse Probabilities: Next Week Model


  • Model predicts probability of lapse in next week for “new observations in test sets


  • Can panel predictions by Ground Truth (i.e., true lapse vs. no lapse observations


  • Want high probabilities for true lapses and low probabilities for true no lapses

Model Performance: Area under ROC curve (auROC)


The Area under the Receiver Operating Characteristic Curve (auROC) indicates the probability that any true lapse is scored higher than any true no-lapse by the model

  • Random performance: auROC = 0.5
  • Perfect performance: auROC = 1.0

Model Performance: Area under ROC curve (auROC)


The Area under the Receiver Operating Characteristic Curve (auROC) indicates the probability that any true lapse is scored higher than any true no-lapse by the model

  • Random performance: auROC = 0.5
  • Perfect performance: auROC = 1.0

Model Performance: Next Week Model

Model Performance: Next Day Model

Model Performance: Next Hour Model

Understanding the Models

Understanding the Models: Next Hour Model


  • All EMA items impact lapse probability

Understanding the Models: Next Hour Model


  • All EMA items impact lapse probability


  • Lapse day and lapse hour are useful

Understanding the Models: Next Hour Model


  • All EMA items impact lapse probability


  • Lapse day and lapse hour are useful


  • Demographics not particularly important

Interium Summary and Next Steps


  • Very strong overall performance
  • Temporally precise models for immediate future lapse risk
  • EMA risk features are intepretable and sensible

Next Steps: Algorithmic Fairness

Next Steps: Algorithmic Fairness

Next Steps: Algorithmic Fairness

Algorithmic Fairness

Next Steps: Algorithmic Fairness


  • NIDA project recruited ~ 400 patients in recovery from Opioid Use Disorder
  • National sample (size; diversity: demographics, location)
  • More variation in stage of recovery (1 – 6 months at start)
  • Sensing for 12 months

Next Steps: Algorithmic Fairness

  • Excellent performance: auROC ~ 0.94

Next Steps: Algorithmic Fairness

  • Excellent performance: auROC ~ 0.94

Next Steps: Sensing Geolocation and Communcations

Next Steps: Sensing Geolocation and Communcations

Next Steps: Sensing Geolocation and Communcations

…Imagine my text messages…

Context is Critical

Context is Critical

smartphone_context.png 

Context is Critical

Contextualized Geolocation


  • Location type (e.g., home, home of friend, bar, restaurant, liquor store, work, health care, AA/recovery meeting, gym/fitness center)
  • Is alcohol available at this location
  • Have you drank alcohol at this location?
  • Is your experience at this location generally pleasant, unpleasant, mixed or neutral?
  • This location is (high risk, moderate risk, low risk, no risk) for my recovery

Contextualized Communications


  • Have you drank alcohol with this person?
  • What is their drinking status (e.g., drinker, non-drinker)?
  • Would you expect them to drink in your presence?
  • Are they currently in recovery from alcohol or other substances?
  • Do they know about your recovery goals and if so, are they supportive?
  • Are your experiences with them typically pleasant, unpleasant, mixed or neutral?

Next Steps: Clinical Uses

Next Steps: Clinical Uses


Do NOT provide model output to clinicians

  • Clinicians are over-burdened
  • Not ready for new data streams

Next Steps: Clinical Uses


Do NOT predict class labels (lapse vs. no-lapse)

  • Iatriogenic effects?
  • Information loss

Next Steps: Clinical Uses


Do NOT predict class labels (lapse vs. no-lapse)

  • Iatriogenic effects?
  • Information loss

Next Steps: Clinical Uses


DO use lapse probability

  • auROCs range from 0.90 - 0.94

Next Steps: Clinical Uses


DO use lapse probability

  • auROCs range from 0.90 - 0.94
  • Probabilities are calibrated and ordinal
  • Provides fine gradations of relative risk for clinical decision-making

Next Steps: Personalized Daily Support Recommendations


  • SHAP values from the NEXT DAY model can identify the most important risk features for a specific individual on each day


  • These features can be used to personalize daily support recommendations

Next Steps: Personalized Daily Support Recommendations

Next Steps: Personalized Daily Support Recommendations

Next Steps: Personalized Daily Support Recommendations

Next Steps: Personalized Daily Support Recommendations


  • SHAP values from the NEXT DAY model can identify the most important risk features for a specific individual on each day


  • These features can be used to personalize daily support recommendations


  • We can also eventually learn which interventions are best for which risk

Next Steps: Advanced Warning

  • Previous models only predict immediate future
  • Advanced warning needed for some types of supports

Next Steps: Advanced Warning

  • Previous models only predict immediate future
  • Advanced warning needed for some types of supports


  • Can lag model up to two weeks into the future
  • Performance drops but still remains good

Next Steps: Optimize System Feedback to Patients


  • Sensing EMA and geolocation for four months

  • Model updated each night for next day

    • Lapse probability predictions
    • Important risk features
    • Risk relevant support recommendations
  • Participants receive daily messages varying combinations of these components

  • Measure trust, engagement, and clinical outcomes

Next Steps: Optimize System Feedback to Patients


  • Sensing EMA and geolocation for four months

  • Model updated each night for next day

    • Lapse probability predictions
    • Important risk features
    • Risk relevant support recommendations
  • Participants receive daily messages varying combinations of these components

  • Measure trust, engagement, and clinical outcomes

CRediTs

credits.png 

Acknowledgements (recent projects first)

Co-Investigators



Graduate Students

Staff

Acknowledgements (alphabetized)

Co-Investigators

Graduate Students

Staff