Video ai researchers with experience in multimodal learning, diffusion models, a…

completed4 qualified1 runApr 22, 5:25 AMvideo-ai-researchers-with-experience-in-multimodal-learning-1776835546

ParsedOpenAI · 4 topics · Junior · Researcher · United States

Generating seed nodes

0 proposed

Explored 0 queries

0/0 done

Expanding nodes

queued

Qualifying candidates

queued

Qualified Candidates (4)

high hireability

Graduate Researcher@Georgia Institute of Technology

Previously: Student Researcher @ Meta

Atlanta, US

Strong video AI + multimodal researcher: key contributor to Ego4D (1498 citations), Ego-Exo4D, and Gaze-LLE (CVPR 2025 Highlight), with audiovisual egocentric attention work covering multimodal learning
Diffusion model and inference optimization experience not evident in published work
US-based (Atlanta, GA Tech), <3 years industry experience (internships at Adobe Research and Meta only)
Not from OpenAI/DeepMind/xAI
Hireability: HIGH — passed PhD dissertation defense April 8 2026, prime immediate transition window

high hireability

Ph.D. Student@University of North Carolina, Chapel Hill

Previously: Student Researcher @ DeepMind

North Carolina, US

4th-year PhD at UNC Chapel Hill with strong video-language and multimodal AI work: SeViLA (NeurIPS'23, 248 cites), CREMA (ICLR'25, efficient multimodal video reasoning), VideoTree (CVPR'25), Video-RTS (EMNLP'25, RL+test-time scaling for video reasoning)
Diffusion model work via VEGGIE (ICCV'25, MLLM+Diffusion instructional video editing), SAFREE (ICLR'25, safe T2I/T2V generation), and training-free T2V guidance
Note: has Google DeepMind student researcher internships (2025, summer 2026 upcoming) — PhD intern only, not a full-time employee
Based in North Carolina, US. <3 years industry experience (seasonal internships only)
Hireability: HIGH — 4th-year PhD (started 2022), approaching graduation window ~2026-2027, very active output across top venues, website actively maintained

medium hireability

Ph.D. student@Northwestern University

Previously: Undergrad student @ ShanghaiTech University

Evanston, US

4th-year PhD student at Northwestern (ECE) with direct video AI work — Seq2Time (CVPR 2025) on Video LLM temporal grounding, and explicit research pivot in late 2025 to 'Multimodal LLMs: vision-language models, video LLMs, multi-modal agents, time-series reasoning'
DiffBoost (text-guided diffusion model, TMI 2024) shows diffusion experience
US-based (Chicago), not from excluded companies (interned at Meta Reality Lab Summer 2025)
No inference optimization work
Hireability: MEDIUM — 4th-year PhD student, still likely 1-2 years from graduation; active in research (CVPR/ECCV 2026 submissions); GitHub commits as recently as April 2026

medium hireability

Research Assistant@UCF Institute of Artificial Intelligence

Previously: Research Assistant @ UCF Institute of Artificial Intelligence

Orlando, US

Research Assistant at UCF (Chen Chen lab), co-authored CVPR 2025 paper on motion-grounded video reasoning with multimodal LLMs + SAM, plus SciVideoBench (LMM video reasoning benchmark)
Strong on video AI and multimodal learning, but no diffusion models or inference optimization work found
US-based (Orlando), fits <3 years experience as a grad RA
Hireability: MEDIUM — Research Assistant at UCF, likely early-mid PhD, active researcher (3 papers 2024-2026 including CVPR 2025), no open-to-work signals but typical student transition candidate

#1completed0 qualified / 0 foundApr 22, 5:25 AM