Proceedings of the National Academy of Sciences, Volume 122, Issue 48, December 2025. As AI systems from decision-making algorithms to generative AI are deployed more widely, computer scientists and social scientists alike are being called on to provide trustworthy quantitative evaluations of AI safety and reliability. These calls have …