Zoher Kachwala

NaN  OSoMe  CNetS  Luddy  IU

prof_pic.jpg

I am a PhD candidate at Indiana University, in the Luddy School of Informatics, Computing and Engineering, advised by Professor Filippo Menczer. I am an active member of the Observatory on Social Media and the NaN research group. I also collaborate with Professor Jisun An and Professor Haewoon Kwak.

My research specializes in LLM post-training and evaluation:

  • Evaluation & Benchmarking: Designed REMATCH (NAACL’24), a novel AMR graph evaluation metric achieving 5x speedup while ranking first in semantic similarity. Building multimodal benchmarks for community-aware content moderation.
  • Post-Training Methods: Developed Prefill-Guided Thinking (NeurIPS’25 Workshop), achieving 24% F1 improvement for zero-shot AI image detection. Researching structured fine-tuning for cross-domain generalization.
  • Research to Production: Build and deploy systems using PyTorch, vLLM, and multi-GPU infrastructure. Experience scaling LLM training and evaluation pipelines on GPU clusters.

I also contributed to large-scale social media research (MEIU22, ICWSM 2023), releasing multi-platform datasets for political discourse analysis. Currently exploring heuristic-guided decoding for improved reasoning and optimization landscapes of prefills versus prompts.


Recent GitHub Activity

Contribution activity for the past year

GitHub Contribution Chart

Updated automatically ‱ View on GitHub

news

May 15, 2025 Excited to share that our paper Prefilled responses enhance zero-shot detection of AI-generated images has been accepted to the NeurIPS 2025 Workshop on Generative and Protective AI for Content Creation! The paper has also been submitted to ACL ARR. Our Prefill-Guided Thinking (PGT) method improves AI-generated image detection by up to 24% without training data. 🚀
Oct 17, 2024 The results of the first CNetS Chocolate Tasting Workshop are finally live! 🎉 We had a panel of expert taste-testers rate 15 different chocolates on a -5 to 5 scale, and the results are full of surprises. Which chocolate reigned supreme? Which one got the cold shoulder? You’ll have to click through to find out!😏
Jun 06, 2024 My virtual NAACL24 presentation for Rematch is now live on YouTube! In this video, I delve into: 🔍 The significance of graphical representations in language, or “local knowledge graphs.” ⚖ The critical aspects we aim to optimize while keeping computational costs low. 🏆 How our algorithm, REMATCH, outperforms state-of-the-art methods in these areas.
Mar 14, 2024 Our paper REMATCH: Robust and Efficient Knowledge Graph Matching was accepted to NAACL24!
Mar 01, 2024 My research was awarded computing resources worth $160,550 by NSF’s Jetstream2 Project!

selected publications

  1. task_aligned.png
    Prefilled responses enhance zero-shot detection of AI-generated images
    Zoher Kachwala , Danishjeet Singh , Danielle Yang , and 1 more author
    2025
  2. MultiModReddit: A Benchmark for Community-Aware Content Moderation
    Zoher Kachwala , Jisun An , Haewoon Kwak , and 1 more author
    2025
    Under Review – ACL ARR
  3. Cross-Community Generalization through Structured Supervised Fine-Tuning
    Zoher Kachwala , Jisun An , Haewoon Kwak , and 1 more author
    2025
    In Preparation
  4. rematchflow.png
    REMATCH: Robust and Efficient Matching of Local Knowledge Graphs to Improve Structural and Semantic Similarity
    Zoher Kachwala , Jisun An , Haewoon Kwak , and 1 more author
    In Findings of the Association for Computational Linguistics: NAACL 2024 , Jun 2024
  5. multiplatform.png
    A multi-platform collection of social media posts about the 2022 US midterm elections
    Rachith Aiyappa , Matthew R DeVerna , Manita Pote , and 8 more authors
    In Proceedings of the International AAAI Conference on Web and Social Media , Jun 2023
  6. The Inexplicable Efficacy of Language Models
    Rachith Aiyappa , and Zoher Kachwala
    XRDS: Crossroads, The ACM Magazine for Students, Apr 2023