🚨 AI Can Predict Opioid Death Risk… So Why Isn’t It Used in Clinics?
What began as an effort to treat PAIN has, in many parts of the world, evolved into a devastating public health crisis — with prescription opioids contributing to a growing number of opioid-related deaths.
A recent review compiled 44 machine learning studies attempting to predict opioid harm — yet almost NONE have made it into real clinical use.
Interestingly, model performance across these papers was moderate to strong, but 41% lacked proper calibration, meaning their risk predictions may not reflect real-world probabilities.
This UK study adds one more to the pile, using competing risk time-to-event models on over 1 million patients. It predicts opioid-related death with ~82% accuracy.
Top predictors include prior substance abuse, lung/liver co-morbidities, strong opioids at initiation, and gabapentinoid co-prescription.
What they did differently:
Predicted mortality rather than overdose
Implemented a competing risks framework that accounts for deaths from other causes
Tested whether deep learning helps: a 48,500-parameter neural network underperformed a simple LASSO regression
Acknowledged poor calibration in external validation, where models overestimated absolute risk by 2-7×, and designed percentile-based scores as a workaround
Built for deployment: EHR-native features, SHAP interpretability, no data leakage
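For anyone wondering why percentile-based scores help when absolute risk is overestimated, here is a toy sketch. It is not the paper's code; the risk values and the 4× inflation factor are made up, and `percentile_scores` is a hypothetical helper. The idea is that inflating every prediction by a constant factor breaks the absolute numbers but leaves each patient's rank in the cohort untouched.

```python
# Toy illustration only: all numbers are invented, not taken from the study.

def percentile_scores(risks):
    """Map each predicted risk to its percentile rank (0-100) within the cohort.

    Assumes at least two patients and no tied risks, to keep the sketch short.
    """
    n = len(risks)
    order = sorted(range(n), key=lambda i: risks[i])
    scores = [0.0] * n
    for rank, idx in enumerate(order):
        scores[idx] = 100.0 * rank / (n - 1)
    return scores

# Hypothetical "true" risks for five patients.
true_risks = [0.001, 0.002, 0.004, 0.010, 0.050]

# Hypothetical miscalibrated model: every risk overestimated 4x.
inflated_risks = [r * 4 for r in true_risks]

# Absolute risk is off by the inflation factor...
overestimation = sum(inflated_risks) / sum(true_risks)

# ...but the percentile score of each patient does not change.
scores_true = percentile_scores(true_risks)
scores_inflated = percentile_scores(inflated_risks)

print("overestimation factor:", round(overestimation, 2))
print("same percentile scores:", scores_true == scores_inflated)
```

The catch is that a percentile only says where a patient sits relative to the cohort, so it sidesteps the calibration problem rather than fixing it.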
This new paper presents a more rigorous model. Yet it still may not reach patients.
Given the high recall and specificity but low precision, the model works best when used against its design. The "Implementation Irony" holds: it is trained to flag danger, yet it succeeds only at clearing safety.
It can suggest who probably will not die.
It cannot say who WILL.
And in the meantime, it risks overwhelming clinicians with false alarms.
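To make the false-alarm point concrete, here is a rough back-of-the-envelope sketch using Bayes' rule. The sensitivity, specificity and prevalence below are assumptions I picked for illustration, not figures from the paper; the only thing they share with the study is the general shape (decent recall and specificity, very rare outcome).

```python
# Illustrative assumptions only, not values reported in the study.
sensitivity = 0.80   # assumed recall: share of true deaths the model flags
specificity = 0.80   # assumed share of survivors the model correctly clears
prevalence = 0.001   # assumed 1 opioid-related death per 1,000 patients

# Probabilities of each outcome for a randomly chosen patient.
true_pos = sensitivity * prevalence
false_pos = (1 - specificity) * (1 - prevalence)
true_neg = specificity * (1 - prevalence)
false_neg = (1 - sensitivity) * prevalence

# Precision (PPV): of the flagged patients, how many are real cases?
ppv = true_pos / (true_pos + false_pos)

# NPV: of the cleared patients, how many are truly safe?
npv = true_neg / (true_neg + false_neg)

print(f"PPV: {ppv:.4f}")
print(f"NPV: {npv:.4f}")
```

With these made-up inputs the PPV comes out at roughly 0.4%, so only about 1 in 250 flagged patients would be a true case, while the NPV sits above 99.9%. That is the pattern described above: when the outcome is this rare, a "low risk" result is trustworthy and a "high risk" flag mostly is not.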
⸂⸂⸜(രᴗര๑)⸝⸃⸃ Hey everyone!! 👋 Biomed engineering PhD student here — I always enjoy seeing how technology might actually translate into real healthcare impact. Anyways, this study recently caught my attention, and I’m curious to hear what others think.
🤔 Food for thought:
If simple models beat deep learning, why do we keep building bigger ones?
Is negative screening (identifying those safe to proceed) even useful to clinicians?
Thus far, most models are based in the US or the UK. How far would the predictions shift in a new region or culture?