Baktash Ansari — AI / ML Researcher

01

About Me

I'm Baktash Ansari, a Computer Science researcher beginning my PhD at Michigan State University (2026), after completing my MS at the University of Washington with a perfect 4.0 GPA and my BS at Iran University of Science & Technology, where I ranked #1 in my department.

My work sits at the intersection of natural language processing, vision-language modeling, and information retrieval. I'm fascinated by how large language models reason — their theory-of-mind, their blind spots on visual illusions, and how retrieval can make them more grounded and trustworthy.

I've published at venues including CVPR, NeurIPS, NAACL (SemEval), and The Web Conference, and I love building things — from multi-GPU training pipelines to full-stack web apps.

Based inSeattle, WA

Email[email protected]

FocusNLP · VLM · IR

Research Interests

Natural Language Processing
Vision-Language Modeling
Information Retrieval
Medical Imaging
LLM Reasoning & Theory of Mind
Neuro-Symbolic AI

02

Education & Journey

Aug 2026 — 2031

PhD, Computer Science & Engineering

Michigan State University · East Lansing, MI

Incoming PhD researcher in machine learning and reasoning systems.

Sep 2025 — Present

MS, Computer Science GPA 4.0

University of Washington · Bothell, WA

Specialization in AI + Machine Learning. Graduate Research Assistant working on retrieval & LLM reasoning.

Oct 2020 — Jul 2024

BS, Computer Engineering GPA 3.92 · Rank #1

Iran University of Science & Technology · Tehran

Top student of the department among 102 peers. Specialization in AI + Machine Learning.

03

Selected Publications

Google Scholar ↗

WebConf2026

ToxiTwitch: Toward Emote-Aware Hybrid Moderation for Live Streaming Platforms

B. Ansari, E. Martin, A. Mashhadi

Accepted at DHOW-MiLLA Workshop · The Web Conference 2026

Content ModerationLLMsTwitch

CVPR2025

Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions

M. Rostamkhani, B. Ansari, H. Sabzevari, et al.

Accepted at MAR · CVPR 2025 · Spotlight at MAR · NeurIPS 2024

Vision-LanguageMultimodalBenchmark

PDF ↗

NAACL2024

BAMO at SemEval-2024 Task 9: BRAINTEASER

B. Ansari, M. Rostamkhani, S. Eetemadi

Proceedings of the 18th Int'l Workshop on Semantic Evaluation (SemEval-2024) · NAACL 2024

ReasoningMulti-Agent DebateNLP

PDF ↗

CLEF2024

Bilingual Sexism Classification: Fine-Tuned XLM-RoBERTa & GPT-3.5 Few-Shot Learning

A. Azadi, B. Ansari, S. Zamani

Proceedings of CLEF 2024 · EXIST Lab

Text ClassificationFine-TuningFew-Shot

04

Research Experience

Graduate Research Assistant

Sep 2025 — Present

University of Washington · Bothell, WA

Fine-tuned retrieval models (Contriever) with TopicTune, boosting Llama & GPT accuracy on open-domain Reddit/Lemmy data by 10%.
Applied reinforcement learning with LLM feedback for retrieval optimization and generation.
Implemented prompt-tuning on DeepSeek & Llama for toxicity detection in Twitch chat.
Built multi-GPU PyTorch pipelines for inference, analysis & evaluation.

Research Assistant

Aug 2025 — Present

Michigan State University · Remote

Designed a Neuro-Symbolic Agentic Pipeline integrating symbolic reasoning with LLMs for abductive reasoning on complex visual datasets.
Trained CLIP models with contrastive learning for visual-text alignment.

Research Assistant

Jul 2023 — Aug 2024

Iran University of Science & Technology · Tehran

Built 3 pipelines (Multi-Agent Debate, Chain-of-Thought, Fine-tuning) for SemEval-2024 Task 9, reaching 85% accuracy.
Developed sexism-detection methods — 4th in Task 1, 2nd in Task 2 at CLEF 2024 EXIST.
Generated 4 image datasets (4,000+ samples each) with Stable Diffusion to benchmark multimodal models (GPT-4o, Gemini, CLIP, BLIP).

05

Selected Projects

🔍

Hamming Embedding Image Search

Bag-of-Features with Hamming Embedding & Weak Geometric Consistency in C++. Achieved 63.46% mAP on Oxford — a 31% lift over baseline BoF.

C++SIFTImage Retrieval

🎬

Movie Revenue Prediction

End-to-end ML pipeline with 60+ engineered features across 17 models. Best: Extra Trees (R²=0.738), beating XGBoost & deep nets.

PythonScikit-learnOOP

😶

Persian Emotion Detection

Fine-tuned ParsBERT & XLM-RoBERTa Large on ArmanEmo; published models on Hugging Face. Multimodal classification with MVSA + ResNet.

TransformersHuggingFacePersian NLP

🧩

BAMO @ SemEval 2024

Led the team on lateral-thinking reasoning: fine-tuned transformers, chain-of-thought prompting, and multi-agent debate (ReConcile).

LLMsReasoningPublished

🎚️

Fuzzy Pendulum Control

Top-performing fuzzy control system for inverted-pendulum stabilization in a Computational Intelligence course.

Fuzzy LogicControl

✈️

TrekDestiny — Travel Platform

Led 3 devs building a travel web app with auth, real-time chat, notifications & blog. React + Tailwind, Scrum, CI/CD.

ReactTailwindFull-Stack

06

Technical Skills

Languages

PythonC++CJavaJavaScriptSQLC#BashHTML/CSS

ML / AI

PyTorchTensorFlowHuggingFaceCUDANLPComputer VisionRLGenerative AIFine-TuningNeuro-SymbolicInformation Retrieval

Tools & Data

NumPyPandasOpenCVScikit-learnMatplotlibGitLinuxKaggle

Honors

🏆

Top Student — Computer Engineering

Ranked #1 of 102 students at IUST · 2024

🎯

National Entrance Exam (Konkur)

Ranked 523rd — top 0.3% of 155,000+ · 2020

07

Let's Connect

Open to research collaborations, PhD discussions, and interesting problems in NLP, vision-language modeling, and retrieval.

[email protected]

Google Scholar GitHub LinkedIn Download CV