publications

publications by categories in reversed chronological order.

2025

  1. MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation
    Weihao Xuan, Rui Yang, Heli Qi, and 29 more authors
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
  2. JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community
    Yunze Xiao, Tingyu He, Lionel Z. Wang, and 5 more authors
    Nov 2025
  3. Understanding Aha Moments: from External Observations to Internal Mechanisms
    Shu Yang, Junchao Wu, Xin Chen, and 4 more authors
    Nov 2025
  4. Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems
    Gordon Dai, and Yunze Xiao
    In Advances in Neural Information Processing Systems, Nov 2025
  5. Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics
    Jiarui Liu, Yueqi Song, Yunze Xiao, and 5 more authors
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
  6. Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design
    Yunze Xiao, Lynnette Hui Xian Ng, Jiarui Liu, and 1 more author
    In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
  7. Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens
    Mai AlKhamissi, Yunze Xiao, Badr AlKhamissi, and 1 more author
    Nov 2025
  8. Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction
    Ming Li, Han Chen, Yunze Xiao, and 3 more authors
    Nov 2025

2024

  1. Chinese Offensive Language Detection:Current Status and Future Directions
    Yunze Xiao, Houda Bouamor, and Wajdi Zaghouani
    Nov 2024
  2. ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations
    Yunze Xiao, Yujia Hu, Kenny Tsu Wei Choo, and 1 more author
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
  3. Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs
    David R. Mortensen, Valentina Izrailevitch, Yunze Xiao, and 2 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  4. InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
    Xintao Wang, Yunze Xiao, Jen-tse Huang, and 10 more authors
    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
  5. Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals
    Qingyang Wu, Ying Xu, Tingsong Xiao, and 8 more authors
    Aug 2024

2023

  1. Nexus at ArAIEval Shared Task: Fine-Tuning Arabic Language Models for Propaganda and Disinformation Detection
    Yunze Xiao, and Firoj Alam
    In Proceedings of ArabicNLP 2023, Dec 2023

2022

  1. A Transformer-based Attention Flow Model for Intelligent Question and Answering Chatbot
    Yunze Xiao
    In 2022 14th International Conference on Computer Research and Development (ICCRD), Dec 2022