ryuryukke

Follow

🚀

Ryuto Koike ryuryukke

🚀

Follow

Visiting Associate at UPenn, PhD-ing @nlp-titech, NLP + AI Safety

2 followers · 0 following

University of Pennsylvania
Philadelphia, PA
sites.google.com/view/ryutokoike
https://tinyurl.com/ryuto-google-scholar
@sponddd

Achievements

Achievements

Highlights

Pro

ryuryukke/README.md

Hi there 👋

I am a CS PhD candidate at Tokyo Institute of Technology, advised by Naoaki Okazaki. My expected graduation date is March 2026. Currently, I am visiting the UPenn NLP, hosted by Chris Callison-Burch. I also work with Preslav Nakov from MBZUAI NLP.

I work on AI safety, specifically improving the safety of large language models (LLMs) from various perspectives, including:

Detecting texts generated by LLMs, particularly in increasing its robustness against adversarial attacks in the wild, like OUTFOX (AAAI 2024);How You Prompt Matters (Findings of EMNLP 2024)
Enhancing LLM-as-a-judge to be more reliable by mitigating its evaluation bias, like Likelihood-based mitigation (Findings of ACL 2024)

In addition, I have broad interests in AI safety, including jailbreak and safe alignment.

📢 I am actively looking for research internships starting in (Summer | Fall | Winter) 2025.

Contact

Personal Website: sites.google.com/view/ryutokoike/
Twitter: @sponddd
email: my_first_name.my_last_name[at]nlp.c.titech.ac.jp

Pinned Loading

OUTFOX OUTFOX Public

[AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples"

Python 36 3
HowYouPromptMatters HowYouPromptMatters Public

[EMNLP 2024] The official repository for our long paper, "How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection"

3