About

I am a last-year Ph.D. student at Arizona State University, co-advised by Prof. Chitta Baral and Prof. Ben Zhou. I am broadly interested in various topics related to natural language processing. My focus is on utilizing synthetic datasets to build effective NLP systems.

Education

B.S. of Computer Science, Worcester Polytechnic Institute, 2014-2018
M.S. of Computer Science, University of Southern California, 2018-2020
Ph.D. of Computer Science, Arizona State University, 2020-2026

Experience

Sep. 2025 - Dec. 2025: Applied Scientist Internship @ Amazon, Seattle, Washington
May. 2024 - Aug. 2024: Applied Scientist Internship @ Amazon, New York, New York
May. 2023 - Aug. 2023: Applied Science Internship @ Amazon, New York, New York
May. 2022 - Aug. 2022: Applied Scientist Internship @ Amazon, New York, New York

Publication

BOW: Reinforcement Learning for Bottlenecked Next-word Prediction
Ming Shen*, Zhikun Xu*, Jacob Dineen, Xiao Ye, Ben Zhou
ArXiv, 2025

Optimizing LLM-Based Multi-Agent System with Textual Feedback: A Case Study on Software Development
Ming Shen, Raphael Shu, Anurag Pratik, James Gung, Yubin Ge, Monica Sunkara, Yi Zhang
COLM 2025 Workshop on AI Agents: Capabilities and Safety

ToW: Thoughts of Words Improve Reasoning in Large Language Models
Zhikun Xu*, Ming Shen*, Jacob Dineen, Zhaonan Li, Xiao Ye, Shijie Lu, Aswin RRV, Chitta Baral, Ben Zhou
in Proceedings of NAACL 2025

QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA
Jacob Dineen, Aswin RRV, Qin Liu, Zhikun Xu, Xiao Ye, Ming Shen, Zhaonan Li, Shijie Lu, Chitta Baral, Muhao Chen, Ben Zhou
in Findings of EMNLP 2025

Rethinking Data Selection for Supervised Fine-Tuning
Ming Shen
ArXiv, 2024

Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models
Man Luo, Shrinidhi Kumbhar, Ming Shen, Mihir Parmar, Neeraj Varshney, Chitta Baral, Pratyay Banerjee, Somak Aditya
ArXiv, 2023

Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization
Ming Shen, Jie Ma, Shuai Wang, Yogarshi Vyas, Kalpit Dixit, Miguel Ballesteros, Yassine Benajiba
in Findings of EACL 2023

Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction
Ming Shen*, Pratyay Banerjee*, and Chitta Baral
in Proceedings of ACL-IJCNLP 2021

CommonGen: A Constrained Text Generation Challenge for Generative Commonsense Reasoning
Bill Yuchen Lin*, Wangchunshu Zhou*, Ming Shen, Pei Zhou, Chandra Bhagavatula, Yejin Choi, and Xiang Ren
in Findings of EMNLP 2020

TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition
Bill Yuchen Lin*, Dong-Ho Lee*, Ming Shen, Ryan Moreno, Xiao Huang, Prashant Shiralkar, and Xiang Ren
in Proceedings of ACL-IJCNLP 2020

Evaluating Medical LLMs by Levels of Autonomy: A Survey Moving from Benchmarks to Applications
Xiao Ye, Jacob Dineen, Zhaonan Li, Zhikun Xu, Weiyu Chen, Shijie Lu, Yuxi Huang, Ming Shen, Phu Tran, Ji-Eun Irene Yum, Muhammad Ali Khan, Muhammad Umar Afzal, Irbaz Bin Riaz, Ben Zhou
ArXiv, 2025

Methods and Mechanisms for Interactive Novelty Handling in Adversarial Environments
Tung Thai, Ming Shen, Mayank Garg, Ayush Kalani, Nakul Vaidya, Utkarsh Soni, Mudit Verma, Sriram Gopalakrishnan, Chitta Baral, Subbarao Kambhampati, Jivko Sinapov, Matthias Scheutz
in Proceedings of AAMAS 2023

An Architecture for Novelty Handling in a Multi-Agent Stochastic Environment: Case Study in Open-World Monopoly
Tung Thai, Ming Shen, Neeraj Varshney, Sriram Gopalakrishnan, Utkarsh Soni, Matthias Scheutz, Chitta Baral, and Jivko Sinapov
in Proceedings of AAAI 2022 Symposium: Designing Artificial Intelligence for Open Worlds

Project

Demo project for ACL 2019 System Demonstrations paper: AlpacaTag: Active Learning-based Crowd Annotation Framework for Sequence Tagging

Awards And Honors

2020 - Present: University Doctoral Fellowship, Arizona State University
2020: CIDSE Doctoral Fellowship, Arizona State University

Service

Reviewer: ACL Rolling Review, ACL, EMNLP, COLING