Yunze (Lorenzo) Xiao
Enterprise Agent Lead, Abaka AI
GHC 5418
4902 Forbes Ave
Pittsburgh, PA 15213
I recently graduated from the Language Technology Institute at Carnegie Mellon University (M.S., May 2026), where I was advised by Prof. Mona Diab. Concurrently, I work as Enterprise Agent Lead at Abaka AI and as a Visiting Research Scientist at the 2077AI Foundation.
Research. I aim to develop large language models that move beyond surface-level fluency toward genuine human-like intelligence — systems that can think, remember, feel, and interact in socially and cognitively coherent ways. This connects three intersecting directions across NLP, Computational Social Science, and HCI:
Anthropomorphism as a modeling dimension. How do training objectives, architectural decisions, and interface design shape the emergence of human-like traits in LLMs? I study anthropomorphism not as a risk to mitigate but as a controllable, analyzable design space.
Anthropomorphism for applications. How can human-like attributes — emotional resonance, persona consistency, contextual memory — improve LLM performance in real-world settings like education, therapy, and collaborative writing?
Architectures for synthetic human-likeness. What design innovations (memory modules, affective simulation, multi-modal grounding) are needed to support truly interactive and situated AI agents that engage users as emotionally-aware collaborators?
Industry. At Abaka AI, I lead development of enterprise agent systems across three intersecting directions:
Agentic evaluation. Benchmarks and frameworks for measuring agent performance in open-ended, multi-turn enterprise settings — where standard accuracy metrics break down.
Agents for diverse enterprise applications. End-to-end agent pipelines adapted to heterogeneous business contexts: multi-model reasoning, privacy-preserving data synthesis (GDPR / CCPA / PIPL / ISO 27701), and production infrastructure with FastAPI, gRPC, and Kafka.
Human–agent and multi-agent collaboration. Interaction protocols and coordination architectures for human-in-the-loop workflows and multi-agent systems operating under real-world constraints.
Previously I was advised by Prof. Houda Bouamor and Prof. Kemal Oflazer at Carnegie Mellon University in Qatar.
Feel free to reach out — yunzex@andrew.cmu.edu or @LrzNeedResearch.
Open to opportunities
Research Engineer / Applied Scientist — full-time, open to remote
I recently graduated from CMU LTI (M.S., May 2026) and am looking for my next role. I build end-to-end: research (persona collapse, LLM evaluation, synthetic populations) and production (multi-agent pipelines, agentic evaluation, enterprise agent systems). I’m most excited about teams working on LLM evaluation, agent systems, or responsible AI — but I’m open to the right problem. If that sounds like a fit, I’d love to talk.
news
| Apr 26, 2026 | New blog post & interactive microsite — The Chameleon’s Limit summarizes our preprint on persona collapse in LLMs (explore the microsite). |
|---|---|
| Sep 28, 2024 | Happy to share that I am reviewing for ICWSM 2025 and CSCW 2025! |
| Sep 28, 2024 | Happy to share that our work on Cloaked Offensive Language is accepted by EMNLP 2024! |
| Jun 19, 2024 | Our work on Cloaked Offensive Language is on arxiv now! |
selected publications
-
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking PerturbationsIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Nov 2024
-
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological InterviewsIn Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
-
Embracing Contradiction: Theoretical Inconsistency Will Not Impede the Road of Building Responsible AI SystemsIn Advances in Neural Information Processing Systems, Aug 2025
-
Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of DesignIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, Nov 2025
my schedule
Feel free to check my availability. Times shown in Eastern Time (Pittsburgh).
This site has been visited … times since March 2026.