Zoher Kachwala

NaN  OSoMe  CNetS  Luddy  IU

prof_pic.jpg

I am a PhD candidate at Indiana University, in the Luddy School of Informatics, Computing and Engineering, advised by Professor Filippo Menczer. I am an active member of the Observatory on Social Media and the NaN research group. I also collaborate with Professor Jisun An and Professor Haewoon Kwak.

My research specializes in LLM post-training and evaluation:

  • Evaluation & Benchmarking: Designed REMATCH (NAACL 2024), a novel AMR graph evaluation metric achieving 5x speedup while ranking first in semantic similarity. Built PLURULE (ACL 2026), a multimodal, multilingual benchmark spanning 2,419 Reddit communities and 3,692 rules in 10 languages.
  • Post-Training Methods: Developed Prefill-Guided Thinking (NeurIPS 2025 Workshop), achieving 24% F1 improvement for zero-shot AI image detection. Designed DisCloze, a distillation method that generalizes reasoning to 100+ unseen Reddit communities.
  • Research to Production: Build and deploy systems using PyTorch, vLLM, DeepSpeed, and multi-GPU infrastructure. Experience scaling LLM training and evaluation pipelines on GPU clusters.

I also contributed to large-scale social media research (MEIU22, ICWSM 2023), releasing multi-platform datasets for political discourse analysis. Currently tracking answer-token logits during chain-of-thought generation to detect reasoning convergence, and comparing prefill vs prompt optimization using GEPA.


Recent GitHub Activity

Contribution activity for the past year

GitHub Contribution Chart

Updated automatically ‱ View on GitHub

news

Dec 15, 2025 Excited to share that our paper Prefilled responses enhance zero-shot detection of AI-generated images has been accepted to the NeurIPS 2025 Workshop on Generative and Protective AI for Content Creation! The paper has also been submitted to ACL ARR. Our Prefill-Guided Thinking (PGT) method improves AI-generated image detection by up to 24% without training data. 🚀
Oct 17, 2024 The results of the first CNetS Chocolate Tasting Workshop are finally live! 🎉 We had a panel of expert taste-testers rate 15 different chocolates on a -5 to 5 scale, and the results are full of surprises. Which chocolate reigned supreme? Which one got the cold shoulder? You’ll have to click through to find out!😏
Jun 06, 2024 My virtual NAACL24 presentation for Rematch is now live on YouTube! In this video, I delve into: 🔍 The significance of graphical representations in language, or “local knowledge graphs.” ⚖ The critical aspects we aim to optimize while keeping computational costs low. 🏆 How our algorithm, REMATCH, outperforms state-of-the-art methods in these areas.
Mar 14, 2024 Our paper REMATCH: Robust and Efficient Knowledge Graph Matching was accepted to NAACL24!
Mar 01, 2024 My research was awarded computing resources worth $160,550 by NSF’s Jetstream2 Project!

selected publications

  1. task_aligned.png
    Prefilled responses enhance zero-shot detection of AI-generated images
    Zoher Kachwala , Danishjeet Singh , Danielle Yang , and 1 more author
    2025
    NeurIPS 2025 Workshop; Under Review - ACL ARR
  2. PLURULE: A Challenging Benchmark for Detecting Rule Violations on Pluralistic Social Media
    Zoher Kachwala , Bao Tran Truong , Rasika Muralidharan , and 3 more authors
    In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics , 2026
  3. Distilling Cloze Reasoning Improves Detecting Violations on Pluralistic Social Media
    Zoher Kachwala , Jisun An , Haewoon Kwak , and 1 more author
    2025
    In Preparation
  4. rematchflow.png
    REMATCH: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity
    Zoher Kachwala , Jisun An , Haewoon Kwak , and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2024 , Jun 2024
  5. multiplatform.png
    A multi-platform collection of social media posts about the 2022 US midterm elections
    Rachith Aiyappa , Matthew R DeVerna , Manita Pote , and 8 more authors
    In Proceedings of the International AAAI Conference on Web and Social Media , Jun 2023
  6. The Inexplicable Efficacy of Language Models
    Rachith Aiyappa , and Zoher Kachwala
    XRDS: Crossroads, The ACM Magazine for Students, Apr 2023