Experience

Feb 2025 – Present
Netflix Research | Los Gatos, CA

Research Scientist

Netflix Research

Research on large multimodal language models: reward modeling, contrastive learning, and fine-tuning for agents.
 
May 2024 – Dec 2024
Netflix, Localization Team | Los Gatos, CA

Research Scientist Intern

Netflix Research

Worked on multimodal contrastive learning for fine-grained audio-visual sync estimation.
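A rough sketch of the underlying idea, with hypothetical names (not the actual Netflix method): temporally aligned audio and video embeddings are pulled together as positives, while misaligned time steps act as negatives under a symmetric InfoNCE loss.

# Illustrative sketch only; all names here are hypothetical.
import torch
import torch.nn.functional as F

def sync_infonce(audio_emb, video_emb, temperature=0.07):
    # audio_emb, video_emb: (T, D) per-time-step embeddings of one clip.
    # The aligned pair (t, t) is the positive; every (t, t') with t != t'
    # serves as a misaligned negative.
    a = F.normalize(audio_emb, dim=-1)
    v = F.normalize(video_emb, dim=-1)
    logits = a @ v.t() / temperature                  # (T, T) similarities
    targets = torch.arange(a.size(0), device=a.device)
    # Symmetric loss: audio-to-video and video-to-audio alignment.
    return 0.5 * (F.cross_entropy(logits, targets)
                  + F.cross_entropy(logits.t(), targets))

At inference, the argmax over each row of the similarity matrix gives a per-step alignment estimate, from which a clip-level sync offset can be read off.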
 
May 2023 – Dec 2023
Yahoo, Visual Intelligence Team | San Francisco

Research Scientist Intern

Yahoo Research

Worked as a research intern on the Visual Intelligence team, solving multimodal retrieval problems.
 
May 2022 – Dec 2022
Adobe, Media Intelligence Lab | San Jose, CA

Research Scientist Intern

Adobe Research

Designed novel pre-training strategies and transformer architectures for large-scale vision-language contrastive pretraining on 100M-scale datasets. Evaluated on downstream tasks such as zero-shot image classification, text-image retrieval, and object detection.
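For context, a minimal sketch of a CLIP-style symmetric image-text contrastive objective, assuming the pretraining follows this general recipe (all names hypothetical; the specific Adobe strategies are not reproduced here):

# Illustrative sketch only; not Adobe's implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ImageTextContrastiveLoss(nn.Module):
    def __init__(self):
        super().__init__()
        # Learnable temperature, initialized to ln(1/0.07) as in CLIP.
        self.logit_scale = nn.Parameter(torch.tensor(2.6593))

    def forward(self, img_emb, txt_emb):
        # img_emb, txt_emb: (B, D) embeddings of paired images and captions.
        img = F.normalize(img_emb, dim=-1)
        txt = F.normalize(txt_emb, dim=-1)
        logits = self.logit_scale.exp() * img @ txt.t()   # (B, B)
        targets = torch.arange(img.size(0), device=img.device)
        # Matched (i, i) pairs are positives; the rest of the batch
        # supplies in-batch negatives.
        return 0.5 * (F.cross_entropy(logits, targets)
                      + F.cross_entropy(logits.t(), targets))

The same embeddings support the listed downstream evaluations, e.g. zero-shot classification by ranking class-name prompts against each image embedding.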
 
Jun 2020 – Present
University at Buffalo | Buffalo, NY

Research Assistant

Department of Computer Science, University at Buffalo

I am a Graduate Research Assistant (SUNY) at the CUBS lab at the University at Buffalo, State University of New York. I am working with Prof. Srirangaraj (Ranga) Setlur and Prof. Venu Govindaraju on an NSF-funded project, the Made@UB ML Toolkit: an easy-to-use, GUI-based application built to shorten the time it takes to prototype ML models and to experiment with the available feature-extraction methods.
 
Jul 2019 – Dec 2020
Persistent Systems | Pune, India

Software Engineer - Machine Learning

Persistent Systems

At Persistent Systems, we build software that drives the business of our customers: enterprises and software product companies with software at the core of their digital transformation.

Recent Publications

Ph.D. dissertation, State University of New York at Buffalo, 2025

Recent Blogs

Google recently released the Gemma 3n models, E4B and E2B. The models are packed with novel components and features from PLEs, ASR and …

Typically, preference alignment in large language models (LLMs) requires a reference model and a warm-up phase of supervised …

LLMs are typically trained on fixed-length sequences, leading to performance degradation when dealing with longer texts due to …

The article explores Grouped Query Attention (GQA), an efficient pre-training strategy for large language models (LLMs) like LLaMA-2 …

Fine-tuning large pre-trained models is computationally challenging, often involving adjustment of millions of parameters. This …

Recent & Upcoming Talks

Lab’s weekly paper presentation | CVPR paper presented at a regular lab meeting.

Azure for App Developers

Facebook Developer Circle Indore | F8 Indore Meetup | PyTorch: Deep Learning Framework

Facebook Developer Circle Indore | Women in Tech.

Getting started with open source and creating your first VR app