cv

Basics

Name: Zoher Kachwala
Label: PhD Candidate in LLM Post-Training & Evaluation
URL: https://zoher15.github.io/
Summary: PhD candidate specializing in LLM post-training and evaluation. Designed REMATCH (NAACL 2024), a novel AMR graph evaluation metric that achieves a 5x speedup while ranking first in semantic similarity. Developed Prefill-Guided Thinking (NeurIPS 2025 Workshop), which achieves a 24% F1 improvement in zero-shot AI image detection. Built PLURULE (ACL 2026), a multimodal benchmark spanning 2,419 Reddit communities in 10 languages.

Education

Work

  • 2022.08 - 2024.12
    Teaching Assistant
    Introduction to Network Science
    Led lab sections and designed exam problems and homework assignments on network fundamentals, including graph traversal, community detection, shortest paths, and hub analysis.
  • 2020.08 - 2020.12
    Teaching Assistant
    Applied Machine Learning
    Supported a course on classical machine learning methods, including linear and logistic regression, decision trees, and ensemble methods.
  • 2019.08 - 2021.12
    Teaching Assistant
    Elements of Artificial Intelligence
    Designed homework assignments and built autograding infrastructure supporting 300+ graduate students on foundational AI topics, including search algorithms, Markov models, heuristics, and decision trees.
  • 2018.06 - 2018.08
    Technology Consultant Intern
    PwC
    Managed project estimates for phase 2 of a large P&C insurance engagement. Prepared estimate slide decks and sheets for the client CEO.

Skills

Research: LLM Post-Training, Model Evaluation, Benchmarking, Supervised Fine-Tuning (SFT), Distillation, Prompt Engineering, Chain-of-Thought Reasoning, Multimodal AI, RLHF, Model Interpretability
Programming: Python, PyTorch, C++, CUDA, JavaScript, SQL, Bash
ML Tools: PyTorch, Hugging Face Transformers, vLLM, DeepSpeed, Weights & Biases (W&B), Weave, TensorFlow
Infrastructure: Multi-GPU Training, GPU Clusters, Distributed Computing, AWS, Google Cloud Platform (GCP), Batch Processing
Data/Eval: Ray, Benchmark Development, Statistical Analysis, Data Pipeline Design, A/B Testing, Pandas, NumPy