Back to dashboard

Video ai researchers with experience in multimodal learning, diffusion models, a…

completed4 qualified1 runApr 22, 5:25 AMvideo-ai-researchers-with-experience-in-multimodal-learning-1776835546
ParsedOpenAI · 4 topics · Junior · Researcher · United States
Generating seed nodes
0 proposed
Explored 0 queries
0/0 done
    3
    Expanding nodes
    queued
    4
    Qualifying candidates
    queued

    Qualified Candidates (4)

    FR

    Fiona Ryan

    high hireability

    Graduate Researcher@Georgia Institute of Technology

    Previously: Student Researcher @ Meta

    Atlanta, US

    • Strong video AI + multimodal researcher: key contributor to Ego4D (1498 citations), Ego-Exo4D, and Gaze-LLE (CVPR 2025 Highlight), with audiovisual egocentric attention work covering multimodal learning
    • Diffusion model and inference optimization experience not evident in published work
    • US-based (Atlanta, GA Tech), <3 years industry experience (internships at Adobe Research and Meta only)
    • Not from OpenAI/DeepMind/xAI
    • Hireability: HIGH — passed PhD dissertation defense April 8 2026, prime immediate transition window
    SY

    Shoubin Yu

    high hireability

    Ph.D. Student@University of North Carolina, Chapel Hill

    Previously: Student Researcher @ DeepMind

    North Carolina, US

    • 4th-year PhD at UNC Chapel Hill with strong video-language and multimodal AI work: SeViLA (NeurIPS'23, 248 cites), CREMA (ICLR'25, efficient multimodal video reasoning), VideoTree (CVPR'25), Video-RTS (EMNLP'25, RL+test-time scaling for video reasoning)
    • Diffusion model work via VEGGIE (ICCV'25, MLLM+Diffusion instructional video editing), SAFREE (ICLR'25, safe T2I/T2V generation), and training-free T2V guidance
    • Note: has Google DeepMind student researcher internships (2025, summer 2026 upcoming) — PhD intern only, not a full-time employee
    • Based in North Carolina, US. <3 years industry experience (seasonal internships only)
    • Hireability: HIGH — 4th-year PhD (started 2022), approaching graduation window ~2026-2027, very active output across top venues, website actively maintained
    BW

    Bin Wang

    medium hireability

    Ph.D. student@Northwestern University

    Previously: Undergrad student @ ShanghaiTech University

    Evanston, US

    • 4th-year PhD student at Northwestern (ECE) with direct video AI work — Seq2Time (CVPR 2025) on Video LLM temporal grounding, and explicit research pivot in late 2025 to 'Multimodal LLMs: vision-language models, video LLMs, multi-modal agents, time-series reasoning'
    • DiffBoost (text-guided diffusion model, TMI 2024) shows diffusion experience
    • US-based (Chicago), not from excluded companies (interned at Meta Reality Lab Summer 2025)
    • No inference optimization work
    • Hireability: MEDIUM — 4th-year PhD student, still likely 1-2 years from graduation; active in research (CVPR/ECCV 2026 submissions); GitHub commits as recently as April 2026
    LS

    Lincoln Spencer

    medium hireability

    Research Assistant@UCF Institute of Artificial Intelligence

    Previously: Research Assistant @ UCF Institute of Artificial Intelligence

    Orlando, US

    • Research Assistant at UCF (Chen Chen lab), co-authored CVPR 2025 paper on motion-grounded video reasoning with multimodal LLMs + SAM, plus SciVideoBench (LMM video reasoning benchmark)
    • Strong on video AI and multimodal learning, but no diffusion models or inference optimization work found
    • US-based (Orlando), fits <3 years experience as a grad RA
    • Hireability: MEDIUM — Research Assistant at UCF, likely early-mid PhD, active researcher (3 papers 2024-2026 including CVPR 2025), no open-to-work signals but typical student transition candidate

    Runs

    #1completed0 qualified / 0 foundApr 22, 5:25 AM