Background and Motivation
Modern deep learning models have achieved remarkable success in computer vision and natural language processing. However, they typically produce overconfident predictions and lack reliable mechanisms to quantify uncertainty. This limitation becomes particularly problematic in high-stakes applications, such as healthcare diagnosis, autonomous systems, and scientific discovery.