Publications

You can also find my articles on Google Scholar.

DynamixSFT: Dynamic Mixture Optimization of Instruction Tuning Collections
Haebin Shin, Lei Ji, Xiao Liu, Zhiwei Yu, Qi Chen, Yeyun Gong
Preprint
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
Haebin Shin, Lei Ji, Xiao Liu, Yeyun Gong
ICML 2025
Exploring Adversarial Robustness in Classification tasks using DNA Language Models
Hyunwoo Yoo, Haebin Shin, Kaidi Xu, Gail Rosen
ICML 2025 GenBio workshop
Can Large Language Models Classify and Generate Antimicrobial Resistance Genes?
Hyunwoo Yoo, Haebin Shin, Gail Rosen
ACL 2025 BioNLP workshop
Generative Prompt Internalization
Haebin Shin, Lei Ji, Yeyun Gong, Sungdong Kim, Eunbi Choi, Minjoon Seo
NAACL 2025 Oral
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
Seungone Kim, Juyoung Suk, ..., Haebin Shin, ..., Bill Yuchen Lin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo
NAACL 2025 Best Paper
InstructIR: A Benchmark for Instruction Following of Information Retrieval Models
Hanseok Oh, Hyunji Lee, Seonghyeon Ye, Haebin Shin, Hansol Jang, Changwook Jun, Minjoon Seo
ACL 2024 KnowledgeNLP workshop
KTRL+F: Knowledge-Augmented In-Document Search
Hanseok Oh*, Haebin Shin*, Miyoung Ko, Hyunji Lee, Minjoon Seo
NAACL 2024
Intuitive access to smartphone settings using relevance model trained by contrastive learning
Joonyoung Kim, Kangwook Lee, Haebin Shin, Hurnjoo Lee, Sechun Kang, Byunguk Choi, Dong Shin, Joohyung Lee
AAAI 2023 — Innovative Applications of Artificial Intelligence (IAAI-23)
Learning to embed multi-modal contexts for situated conversational agents
Haeju Lee, Oh Joon Kwon, Yunseon Choi, Minho Park, Ran Han, Yoonhyung Kim, Jinhyeon Kim, Youngjune Lee, Haebin Shin, Kangwook Lee, Kee-Eung Kim
NAACL 2022 Findings
Tackling situated multi-modal task-oriented dialogs with a single transformer model
Haeju Lee, Oh Joon Kwon, Yunseon Choi, Jinhyeon Kim, Youngjune Lee, Ran Han, Yoonhyung Kim, Minho Park, Kangwook Lee, Haebin Shin, Kee-Eung Kim
AAAI 2022 DSTC10 workshop