publications
Publications by category in reverse chronological order.
2025
- 
      MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation. 2025
- 
      JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community. 2025
- 
      Understanding Aha Moments: from External Observations to Internal Mechanisms. 2025
- 
      Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI Systems. 2025
- 
      Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics. 2025
- 
      Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design. 2025
2024
- 
      Chinese Offensive Language Detection: Current Status and Future Directions. 2024
- 
      ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations. 2024
- 
      Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs. 2024
- 
      InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
- 
      Surveying Attitudinal Alignment Between Large Language Models Vs. Humans Towards 17 Sustainable Development Goals. Aug 2024
2023
- 
      Nexus at ArAIEval Shared Task: Fine-Tuning Arabic Language Models for Propaganda and Disinformation Detection. In Proceedings of ArabicNLP 2023, Dec 2023
2022
- 
      A Transformer-based Attention Flow Model for Intelligent Question and Answering Chatbot. In 2022 14th International Conference on Computer Research and Development (ICCRD), Dec 2022