Uncertainty for Neural Networks, and Other Large Complicated Models

Last update: 23 Apr 2025 13:12
First version:

Yet Another Inadequate Placeholder

I'm only interested in uncertainty in the predictions, not the parameters --- for the type of data-mining-ish application I'm thinking of, "all parameters are nuisance parameters"
Neural networks are obviously big these days but I suspect principles / methods will be more generally applicable (and hence more enduring)
That said, neural networks may be especially nice because everything's continuous and it's fit by minimization.

Adversarial Examples
Conformal Prediction [A well-established set of techniques increasingly used for these problems, and probably where I'd start in any actual application]
Propagation of Error to the Rescue? [A half-baked (at best) idea I had years ago and never did any actual work on]

A. Feder Cooper, Katherine Lee, Madiha Zahrah Choksi, Solon Barocas, Christopher De Sa, James Grimmelmann, Jon Kleinberg, Siddhartha Sen, Baobao Zhang, "Is My Prediction Arbitrary? The Confounding Effects of Variance in Fair Classification Benchmarks", arxiv:2301.11562
Janis Postels, Francesco Ferroni, Huseyin Coskun, Nassir Navab, Federico Tombari, "Sampling-free Epistemic Uncertainty Estimation Using Approximated Variance Propagation", arxiv:1908.00598
Peter Schulam, Suchi Saria, "Can You Trust This Prediction? Auditing Pointwise Reliability After Learning", arxiv:1901.00403

Nilesh A. Ahuja, Ibrahima Ndiour, Trushant Kalyanpur, Omesh Tickoo, "Probabilistic Modeling of Deep Features for Out-of-Distribution and Adversarial Detection", arxiv:1909.11786
Charles Corbière, Nicolas Thome, Avner Bar-Hen, Matthieu Cord, Patrick Pérez, "Addressing Failure Prediction by Learning Model Confidence", arxiv:1910.04851
Jean Feng, Arjun Sondhi, Jessica Perry, Noah Simon, "Selective prediction-set models with coverage guarantees", arxiv:1906.05473
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo A. Pires, Rémi Munos, "Neural Predictive Belief Representations", arxiv:1811.06407
Liat Ein-Dor and Ido Kanter, "Confidence in prediction by neural networks", Physical Review E 60 (1999): 799--802
Varun Gupta, Christopher Jung, Georgy Noarov, Mallesh M. Pai, Aaron Roth, "Online Multivalid Learning: Means, Moments, and Prediction Intervals", arxiv:2101.01739
Eyke Hüllermeier, Willem Waegeman, "Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods", arxiv:1910.09457
Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb, "Adaptive, Distribution-Free Prediction Intervals for Deep Networks", arxiv:1905.10634
Max-Heinrich Laves, Sontje Ihler, Karl-Philipp Kortmann, Tobias Ortmaier, "Well-calibrated Model Uncertainty with Temperature Scaling for Dropout Variational Inference", arxiv:1909.13550
Tengyuan Liang, "Universal Prediction Band via Semi-Definite Programming", Journal of the Royal Statistical Society B 84 (2022): 1558--1580 [An alternative]
Zhen Lin, Shubhendu Trivedi, Jimeng Sun
- "Locally Valid and Discriminative Prediction Intervals for Deep Learning Models", arxiv:2106.00225
- "Conformal Prediction Intervals with Temporal Dependence", arxiv:2205.12940
- "Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models", arxiv:2305.19187
Alexander Meinke, Matthias Hein, "Towards neural networks that provably know when they don't know", arxiv:1909.12180
Fabricio Olivetti de Franca, Gabriel Kronberger, "Prediction Intervals and Confidence Regions for Symbolic Regression Models based on Likelihood Profiles", arxiv:2209.06454
Raanan Y. Rohekar, Yaniv Gurwicz, Shami Nisimov, Gal Novik, "Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections", arxiv:1905.13915