A Review of Large Language Model Evaluation Methods
SONG Jialei1,2,ZUO Xingquan1,2,ZHANG Xiujian3,4,HUANG Hai1,2
1.School of Computer Science,Beijing University of Posts and Telecommunications,Beijing 100876,China;
2.Key Laboratory of Trustworthy Distributed Computing and Services,Ministry of Education,Beijing 100876,China;
3.Beijing Aerospace Institute for Metrology and Measurement Technology,Beijing 100076,China;
4.Key Laboratory of Artificial Intelligence Measurement and Standards for State Market Regulation,Beijing 100076,China
SONG Jialei, ZUO Xingquan, ZHANG Xiujian, HUANG Hai . A Review of Large Language Model Evaluation Methods[J]. Journal of Astronautic Metrology and Measurement, 2025, 45(2): 1-30.