Publications
Eugene Jang, Kimin Lee, Jin-Woo Chung, Keuntae Park, Seungwon Shin. Improbable Bigrams Expose Vulnerabilities of Incomplete Tokens in Byte-Level Tokenizers Preprint. [Paper]
Minkyoo Song, Eugene Jang, Jaehan Kim, Seungwon Shin. Covering Cracks in Content Moderation: Delexicalized Distant Supervision for Illicit Drug Jargon Detection KDD 2025.
Jian Cui, Hanna Kim, Eugene Jang, Dayeon Yim, Kicheol Kim, Yongjae Lee, Jin-Woo Chung, Seungwon Shin, Xiaojing Liao. Tweezers: A Framework for Security Event Detection via Event Attribution-centric Tweet Embedding NDSS 2025. [Paper]
Eugene Jang, Jian Cui, Dayeon Yim, Youngjin Jin, Jin-Woo Chung, Seungwon Shin, Yongjae Lee.
Ignore Me But Don’t Replace Me: Utilizing Non-Linguistic Elements for Pretraining on the Cybersecurity Domain NAACL Findings 2024. [Paper]Hanna Kim, Jian Cui, Eugene Jang, Chanhee Lee, Yongjae Lee, Jin-Woo Chung, Seungwon Shin.
DRAINCLoG: Detecting Rogue Accounts with Illegally-obtained NFTs using Classifiers Learned on Graphs NDSS 2024. [Paper]Youngjin Jin, Eugene Jang, Jian Cui, Jin-Woo Chung, Yongjae Lee, Seungwon Shin.
DarkBERT: A Language Model for the Dark Side of the Internet ACL 2023. [Paper]Youngjin Jin, Eugene Jang, Yongjae Lee, Seungwon Shin, Jin-Woo Chung.
Shedding new light on the language of the dark web NAACL 2022. [Paper]ChaeHun Park, Eugene Jang, Wonsuk Yang, Jong C Park.
Generating negative samples by manipulating golden responses for unsupervised learning of a response evaluation model NAACL 2021. [Paper]Huije Lee, Wonsuk Yang, Chaehun Park, Hoyun Song, Eugene Jang, Jong C Park.
Optimizing Domain Specificity of Transformer-based Language Models for Extractive Summarization of Financial News Articles in Korean PACLIC 2021. [Paper]
Domestic (Korean) Publications
Dayeon Yim, Eugene Jang, Jin-Woo Chung.
Topic classification for cybercrime-related telegram messages using BERT KSC 2023.Eugene Jang, Wonsuk Yang, Jong C Park.
Target-Agnostic Detection of Stances Toward Entities in News Articles HCIK 2021.