Publications

Note: I stopped writing academic papers in 2025, so this page is no longer updated. To learn more about my current research and interests, see the Research Blog instead.

Manuscript

Thinking Fast and Right: Balancing Accuracy and Reasoning Length with Adaptive Rewards

Jinyan Su, Claire Cardie

Preprint

Reasoning RL LLM

Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and Correctness in LLMs

Jinyan Su, Jennifer Healey, Preslav Nakov, Claire Cardie

Preprint

Reasoning RL LLM

Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control

Jinyan Su, Jennifer Healey, Preslav Nakov, Claire Cardie

Preprint

Reasoning RAG Human-Centered LLM

Towards More Robust Retrieval-Augmented Generation: Evaluating RAG Under Adversarial Poisoning Attacks

Jinyan Su, Jinpeng Zhou, Zhengxin Zhang, Preslav Nakov, Claire Cardie

Preprint

RAG Safety LLM

Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI

Yuxia Wang, Rui Xing, Jonibek Mansurov, Giovanni Puccetti, Zhuohan Xie, Minh Ngoc Ta, Jiahui Geng, Jinyan Su, Mervat Abassy, Saad El Dine Ahmed, Kareem Elozeiri, Nurkhan Laiyk, Maiya Goloburda, Tarek Mahmoud, Raj Vardhan Tomar, Alexander Aziz, Ryuto Koike, Masahiro Kaneko, Artem Shelmanov, Ekaterina Artemova, Vladislav Mikhailov, Akim Tsvigun, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

Preprint

Human-Centered Alignment LLM

2025

Corpus Poisoning via Approximate Greedy Gradient Descent

Jinyan Su, Preslav Nakov, Claire Cardie

Findings of ACL 2025

RAG Safety LLM

MixUCB: Enhancing Safe Exploration in Contextual Bandits with Human Oversight

Jinyan Su, Rohan Banerjee, Jiankai Sun, Wen Sun, Sarah Dean

RLC 2025

Human-Centered RL Safety Theory

2024

Learning from Streaming Data when Users Choose

Jinyan Su, Sarah Dean

ICML 2024

Human-Centered Alignment Theory

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection

Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Osama Mohammed Afzal, Tarek Mahmoud, Giovanni Puccetti, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov

ACL 2024

Safety LLM

Adapting Fake News Detection to the Era of Large Language Models

Jinyan Su, Claire Cardie, Preslav Nakov

Findings of NAACL 2024

Safety LLM

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection

Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Chenxi Whitehouse, Osama Mohammed Afzal, Tarek Mahmoud, Alham Fikri Aji, Preslav Nakov

EACL 2024 Resource paper award

Safety LLM

2023

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

Jinyan Su, Terry Yue Zhuo, Di Wang, Preslav Nakov

Findings of EMNLP 2023

Safety LLM

Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision

Jinyan Su, Peilin Yu, Jieyu Zhang, Stephen H Bach

IEEE BigData 2023

LLM

Differentially Private Stochastic Convex Optimization in (Non)-Euclidean Space Revisited

Jinyan Su, Changhong Zhao, Di Wang

UAI 2023

Theory

2022

Privacy Model with Public Unlabeled Data

Jinyan Su, Jinhui Xu, Di Wang

ACML 2022 Best paper award

Theory

Faster Rates of Private Stochastic Convex Optimization

Jinyan Su, Lijie Hu, Di Wang

ALT 2022

Theory