Mohammad Reza Taesiri

Machine Learning Scientist at EA SPORTS Vancouver, specializing in emerging technologies for video games. PhD from the University of Alberta.

Research

I am broadly interested in large language and vision-language models, with a particular focus on post-training and model evaluation. My work involves stress-testing existing models through extensive benchmarking to elucidate the limitations of different architectural designs and training paradigms. Benchmarks I have developed are used by OpenAI, Google DeepMind, ByteDance, NVIDIA, Alibaba, and other leading research labs.

Highlights

Vision language models are blind

Pooyan Rahmanzadehgervi, Logan Bolton, Mohammad Reza Taesiri, Anh Nguyen

ACCV, 2024 Oral

Website Code Dataset

VLMs are Biased

Vision Language Models are Biased

An Vo, Khai-Nguyen Nguyen, Mohammad Reza Taesiri, Vy Tuong Dang, Anh Totti Nguyen, Daeyoung Kim

ArXiv Preprint, 2025

Website Dataset

VideoGameQA-Bench

VideoGameQA-Bench: Evaluating Vision-Language Models for Video Game Quality Assurance

Mohammad Reza Taesiri, Abhijay Ghildyal, Saman Zadtootaghaj, Nabajeet Barman, Cor-Paul Bezemer

NeurIPS Datasets and Benchmarks Track, 2025

Website Dataset

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Jonathan Roberts, Mohammad Reza Taesiri, et al.

ArXiv Preprint, 2025

Website GitHub Dataset

Recent Papers

B-score: Detecting biases in large language models using response history

An Vo, Mohammad Reza Taesiri, Daeyoung Kim, Anh Totti Nguyen

ICML, 2025

Everyday Image Editing

Understanding Generative AI Capabilities in Everyday Image Editing Tasks

Mohammad Reza Taesiri, Brandon Collins, Logan Bolton, Viet Dac Lai, Franck Dernoncourt, Trung Bui, Anh Totti Nguyen

WACV, 2026

HoT: Highlighted Chain of Thought for Referencing Supporting Facts from Inputs

Tin Nguyen, Logan Bolton, Mohammad Reza Taesiri, Anh Nguyen

ArXiv Preprint, 2025

VideoGameBunny: Towards vision assistants for video games

Mohammad Reza Taesiri, Cor-Paul Bezemer

WACV, 2025 Oral

Website Model Dataset

PCNN: Probable-Class Nearest-Neighbor Explanations Improve Fine-Grained Image Classification Accuracy for AIs and Humans

Giang Nguyen, Valerie Chen, Mohammad Reza Taesiri, Anh Nguyen

TMLR, 2024

GlitchBench: Can large multimodal models detect video game glitches?

Mohammad Reza Taesiri, Tianjun Feng, Anh Nguyen, Cor-Paul Bezemer

CVPR, 2024

Website Code Dataset

Allowing humans to interactively guide machines where to look does not always improve a human-AI team's classification accuracy

Giang Nguyen, Mohammad Reza Taesiri, Sunnie S. Y. Kim, Anh Nguyen

CVPR XAI4CV Workshop, 2024

ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification

Mohammad Reza Taesiri, Giang Nguyen, Sarra Habchi, Cor-Paul Bezemer, Anh Nguyen

NeurIPS, 2023

Website Code Dataset

Visual Correspondence XAI

Visual correspondence-based explanations improve AI robustness and human-AI team accuracy

Mohammad Reza Taesiri*, Giang Nguyen*, Anh Nguyen (*equal contribution)

NeurIPS, 2022

Website Demo Code Video

CLIP meets GamePhysics

CLIP meets GamePhysics: Towards bug identification in gameplay videos using zero-shot transfer learning

Mohammad Reza Taesiri, Finlay Macklon, Cor-Paul Bezemer

MSR, 2022

Website Code Demo Dataset

LLM Bug Detectors

Large Language Models are Pretty Good Zero-Shot Video Game Bug Detectors

Mohammad Reza Taesiri, Finlay Macklon, Yihe Wang, Hengshuo Shen, Cor-Paul Bezemer

ArXiv Preprint, 2022

Website Code Dataset