[1]
Q. Fang, D. L. Oberski, and D. Nguyen, “PATCH! {P}sychometrics-{A}ssis{T}ed Ben{CH}marking of Large Language Models against Human Populations: A Case Study of Proficiency in 8th Grade Mathematics”, CLIN Journal, vol. 14, pp. 113–134, Jul. 2025.