About
I received my B.Sc. in Computer Science from Harbin Institute of Technology and am currently pursuing my M.Sc. in Computer Science at Fudan University. My research interests lie in Natural Language Processing and model evaluation, particularly focusing on efficient and generalizable evaluation methods for large language models.
Paper List
-
Effieval: Efficient and generalizable model evaluation via capability coverage maximization
-
Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric
-
Do LLMs Signal When They're Right? Evidence from Neuron Agreement