Fang, Q., D. L. Oberski, and D. Nguyen. “PATCH! {P}sychometrics-{A}ssis{T}ed Ben{CH}marking of Large Language Models Against Human Populations: A Case Study of Proficiency in 8th Grade Mathematics”. Computational Linguistics in the Netherlands Journal, vol. 14, July 2025, pp. 113-34, https://clinjournal.org/clinj/article/view/189.