Steffi Chern

🎓 Hi everyone! I’m Steffi, and I recently graduated with a B.S. in Statistics and Machine Learning from Carnegie Mellon University (CMU). I’m seeking opportunities to pursue a Ph.D. degree in Computer Science starting Fall 2025.

🧠 My research interests lie at the intersection of natural language processing (NLP) and machine learning (ML). Specifically, I’m interested in the following topics:

  • Superalignment: Ensuring that superintelligent AI systems autonomously adhere to human goals, values, and safety standards, especially as they surpass human cognitive understanding and operate in scenarios that are hard for humans to control. ScaleEval, OlympicArena, BeHonest, Cognitive Engineering.
  • Trustworthy and Robust AI: Ensuring AI systems are reliable, honest, and safe by developing tools to detect factual inaccuracies, prevent hallucinations, adapt AI behavior to dynamic human norms, and strengthen robustness against adversarial attacks. BeHonest, FacTool, Align on the Fly, Halu-J, Combating Adversarial Attacks.
  • Evaluation and Benchmarking of LLMs: Investigating scalable evaluation methods to assess the performance of LLMs. Emphasis on human-AI collaboration, leveraging AI to enhance humans oversight and evaluate model outputs effectively. ScaleEval, OlympicArena, BeHonest.

✨ I’m fortunate to have received an honorable mention for the NSF Graduate Research Fellowship.

⛳ I was also a student-athlete, playing NCAA Women’s Golf at CMU.

đź“© Feel free to contact me about any research/job opportunities or questions you have!


Academic Service

🔎 Reviewer: NeurIPS (2024), ICLR (2025), AISTATS (2025)


Teaching

đź“š Teaching Assistant for Probability Theory, Fall 2023

đź“š Teaching Assistant for Advanced Methods for Data Analysis, Spring 2024


Publications

⬇️ Below are some of my recent publications:

BeHonest: Benchmarking Honesty in Large Language Models
Steffi Chern, Zhulin Hu, Yuqing Yang, Ethan Chern, Yuan Guo, Jiahe Jin, Binjie Wang, Pengfei Liu
Preprint. [paper] [github] [website]

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, Dahua Lin, Yu Qiao, Pengfei Liu
Accepted to Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS D&B Track) [paper] [github] [website]

Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate
Steffi Chern, Ethan Chern, Graham Neubig, Pengfei Liu
Preprint. [paper] [github]

Halu-J: Critique-Based Hallucination Judge
Binjie Wang, Steffi Chern, Ethan Chern, Pengfei Liu
Preprint. [paper]

FacTool: Factuality Detection in Generative AI - A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios
I-Chun Chern, Steffi Chern, Shiqi Chen, Weizhe Yuan, Kehua Feng, Chunting Zhou, Junxian He, Graham Neubig, Pengfei Liu
Preprint. [paper] [github] [website]

Combating Adversarial Attacks with Multi‑Agent Debate
Steffi Chern *, Zhen Fan *, Andy Liu *
Preprint. [paper]

Align on the Fly: Adapting Chatbot Behavior to Established Norms
Chunpu Xu, Steffi Chern, Ethan Chern, Ge Zhang, Zekun Wang, Ruibo Liu, Jing Li, Jie Fu, Pengfei Liu
Preprint. [paper] [github]

Voice Direction‑of-Arrival Conversion
I‑Chun Chern, Steffi Chern, Heng‑Cheng Kuo, Huan‑Hsin Tseng, Kuo‑Hsuan Hung, Yu Tsao
Accepted to 33rd IEEE Machine Learning for Signal Processing (MLSP) 2023. [paper]

Automated Analysis of Fluency Behaviors in Aphasia.
Davida Fromm, Steffi Chern, Zihan Geng, Mason Kim, Brian MacWhinney, Joel Greenhouse
Accepted to 52nd Clinical Aphasiology Conference (CAC) for poster, and Journal of Speech, Language, and Hearing Research for paper (JSLHR). [poster] [paper]