publications
publications by category in reverse chronological order.
2025
-
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
-
JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community. Nov 2025
-
Understanding Aha Moments: from External Observations to Internal Mechanisms. Nov 2025
-
Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems. In Advances in Neural Information Processing Systems, Nov 2025
-
Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
-
Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
-
Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens. Nov 2025
-
Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction. Nov 2025
2024
-
Chinese Offensive Language Detection: Current Status and Future Directions. Nov 2024
-
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
-
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
-
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
-
Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals. Aug 2024
2023
-
Nexus at ArAIEval Shared Task: Fine-Tuning Arabic Language Models for Propaganda and Disinformation Detection. In Proceedings of ArabicNLP 2023, Dec 2023
2022
-
A Transformer-based Attention Flow Model for Intelligent Question and Answering Chatbot. In 2022 14th International Conference on Computer Research and Development (ICCRD), Dec 2022