Wisdom O. Ikezogwo

Machine Learning Researcher | Ph.D. Candidate in Computer Science

GitHub

About

Highly accomplished Machine Learning Researcher and Ph.D. Candidate at the University of Washington’s Paul G. Allen School of Computer Science & Engineering, specializing in generative modeling, multimodal representation learning, and large-scale data curation. My research drives state-of-the-art advancements in medical imaging and video generation, backed by prestigious grants from Microsoft and the UW Population Health Initiative. With a First Class B.Sc. in Electronic & Electrical Engineering from Obafemi Awolowo University and impactful research internships at Apple and Mayo Clinic, I bring a proven track record of innovation. My work is published in top-tier venues like CVPR and NeurIPS, spanning critical applications in healthcare, finance, and recommendation systems.

Work Experience

Research Assistant – Graphics and Imaging Laboratory (GRAIL)

University of Washington

Dec 2025 - Present

Leading cutting-edge research in multimodal AI for medical imaging and advanced video generation, driving innovation in state-of-the-art models and dataset creation.

  • Spearheaded research on Multimodal Large Language Models (LLMs) for medical imaging, developing novel medical multimodal datasets (Quilt-1M, MedNarratives) and state-of-the-art models (QuiltNet, Quilt-LLaVA).
  • Engineered a multi-agent AI framework, PathFinder, for clinical diagnosis, achieving performance superior to human experts and establishing improved benchmarks (MedBlink).
  • Directing efforts to integrate Newtonian physics into image and video generative models, focusing on generating large-scale, temporally consistent video scene graph datasets and pipelines.

Teaching Assistant

University of Washington

Dec 2025 - Present

Provided comprehensive instructional and academic support for undergraduate and graduate courses in Computer Science.

  • Taught Data Programming (CSE 160) for Fall 2021, Winter 2022, and Fall 2022 semesters, guiding students through fundamental concepts and practical applications.
  • Provided instruction and support for Introduction to Artificial Intelligence (CSE 473) during Spring 2023, Fall 2023, and Winter 2024 academic terms.

Ph.D. Machine Learning Research Internship

Apple

Dec 2025 - Present

Led research on efficient multimodal representations for egocentric data, focusing on reducing data capture costs.

  • Pioneered research in efficient multimodal representations for egocentric data, including video, text, audio, IMU, and hands.
  • Developed 'Perceive-Predict,' a novel framework leveraging predictive coding between co-occurring modalities to reconstruct missing data, significantly reducing capture costs for expensive modalities like video.

Ph.D. Quantitative Health Sciences Internship

Mayo Clinic

Dec 2025 - Present

Directed research efforts to develop a foundational model for histopathology using large-scale image datasets.

  • Led research towards developing a foundational model for histopathology, training on millions of gigapixel-sized histology images.
  • Scaled-up compute operations on the Argonne National Lab computing cluster, resulting in clinically evaluated models for improved diagnostic capabilities.

ML Engineer

Okra, Inc.

Dec 2025 - Present

Developed machine learning models for financial information extraction and integration into lending systems.

  • Engineered models to extract crucial customer financial information from unstructured banking data.
  • Processed key customer earning and spending data to feed into downstream lending pipelines, including predicting income, analyzing spending patterns, and performing reconciliations.

Data Scientist / ML Engineer

Demz Analytics Limited

Dec 2025 - Present

Designed and implemented production-grade recommendation systems for enhanced user experience.

  • Developed and deployed production recommendation systems utilizing advanced techniques such as attention mechanisms and epsilon-greedy bandit strategies.

UG. Research Assistant – Biosignal Processing, Inst. & Control Lab

Obafemi Awolowo University

Dec 2025 - Present

Conducted research in biosignal processing, focusing on EEG signal analysis and neural network applications.

  • Integrated disparate multivariate time series data, employing spectral component characterization for dynamical dimensionality reduction.
  • Developed and trained neural networks for accurate classification and characterization of brain EEG signals.

Education

Computer Science and Engineering

University of Washington

3.97/4.00

Courses

  • Advanced Machine Learning
  • Computer Vision
  • Natural Language Processing
  • Deep Learning
  • Generative Models

Electronic & Electrical Engineering

Obafemi Awolowo University

4.73/5.00

Courses

  • Digital Signal Processing
  • Control Systems Engineering
  • Applied Electronics
  • Microprocessor Systems
  • Electromagnetic Fields and Waves

Awards

Population Health Initiative AI Pilot Research Grant Award

University of Washington

Awarded $100,000 for pioneering AI research in population health.

Microsoft's Accelerate Foundation Models Research Grant

Microsoft

Received $20,000 grant to advance research in foundational AI models.

IBRO-Simons Computational Neuroscience Summer School Travel Grant

IBRO-Simons

Awarded travel grant to attend a prestigious computational neuroscience summer school in Cape Town.

Prof. Kehinde Prize for the Best Graduating Student in the Control Option

Obafemi Awolowo University

Recognized as the top-performing student in the Control Engineering specialization.

Oyebolu Prize for Best Male Graduating Student

Obafemi Awolowo University

Awarded for outstanding academic achievement as the best male graduating student.

Federal Government Scholarship Award, Nigeria

Federal Government of Nigeria

Received a cumulative scholarship valued at $1500 for academic excellence.

Total/NNPC National Merit Scholarship

Total/NNPC

Awarded a cumulative scholarship valued at $1500 based on national merit.

Etisalat Nigeria Merit Scholarship

Etisalat Nigeria

Received a scholarship valued at $250 for academic merit.

Publications

Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Open-Source Histopathology Videos

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Introduces a novel approach for visual instruction tuning using localized narratives from histopathology videos to enhance multimodal LLMs.

Quilt-1M: One Million Image-Text Pairs for Histopathology

NeurIPS

Presents a large-scale dataset of one million image-text pairs specifically curated for histopathology, enabling advanced medical imaging research.

Multi-modal Masked Autoencoders Learn Compositional Histopathological Representations

Machine Learning for Health (ML4H)

Explores the use of multimodal masked autoencoders to learn compositional representations from histopathological data, improving diagnostic capabilities.

Risk Stratification of Solitary Fibrous Tumor Using Whole Slide Image Analysis

LABORATORY INVESTIGATION, ELSEVIER SCIENCE INC

Applies whole slide image analysis for risk stratification of solitary fibrous tumors, contributing to more precise medical diagnostics.

Recent Advances, Applications and Open Challenges in Machine Learning for Health: Reflections from Research Roundtables

ML4H Symposium

A collaborative reflection on the latest advancements, applications, and challenges in machine learning for health, derived from research roundtables.

PathFinder: A Multi-Modal Multi-Agent Framework for Diagnostic Decision-Making in Histopathology

In Submission to ICCV

Proposes a multi-modal, multi-agent framework to enhance diagnostic decision-making processes in histopathology.

MedicalNarratives: Connecting Medical Vision and Language with Procedural and Localized Narratives across all medical imaging domains

In Submission to ICCV

Explores linking medical vision and language through procedural and localized narratives to create a comprehensive framework for medical imaging analysis.

MedBlink: Probing the Fundamental Medical Imaging Knowledge of Multimodal Language Models

In Submission to ICCV

Investigates the foundational medical imaging knowledge embedded within multimodal language models to assess their understanding and capabilities.

Percieve-Predict: Modality and Time-Aware Egocentric Efficient Multi-Modal Representations

In Preparation for NeurIPS

Developing efficient multi-modal representations for egocentric data, incorporating modality and time awareness for improved predictive capabilities.

VPhysics: Temporally consistent Physics in Video (multiframe) Generation via Alignment

In Preparation for NeurIPS

Focuses on generating temporally consistent video frames by integrating physics principles, ensuring realistic motion and interactions.

Synthetic Video Scene Graph Generation

NeurIPS D&B

Research on generating synthetic video scene graphs to improve understanding and manipulation of complex video content.

Multi-Scale Cross-Attention Multiple Instance Learning (MsCAMIL) Network for Automated Triage of Colorectal Polyps

United States and Canadian Academy of Pathology's (USCAP) 114th Annual Meeting

Introduces MsCAMIL network for automated triage of colorectal polyps, enhancing efficiency in pathological diagnosis.

Comparative Performance of Multi-Scale Cross-Attention Multiple Instance Learning (MsCAMIL) and Pathology Trainees in Colorectal Polyp Diagnosis

United States and Canadian Academy of Pathology's (USCAP) 114th Annual Meeting

Compares the diagnostic performance of MsCAMIL against human pathology trainees in colorectal polyp diagnosis.

Supervised domain generalization for integration of disparate scalp EEG datasets for automatic epileptic seizure detection

Computers in Biology and Medicine

Investigates domain generalization techniques to integrate diverse EEG datasets for improved automatic epileptic seizure detection.

Empirical Characterization of the Temporal Dynamics of EEG Spectral Components

International Journal of Online and Biomedical Engineering (IJOE)

Provides an empirical characterization of the temporal dynamics of EEG spectral components, contributing to a deeper understanding of brain activity.

Languages

English (Native)

Skills

Research & Development

  • Experimental Design
  • Model Development
  • Performance Evaluation
  • Scientific Writing
  • Peer Review
  • Grant Writing

Leadership & Mentorship

  • Project Leadership
  • Team Collaboration
  • Teaching
  • Mentoring

Machine Learning

  • Generative Modeling
  • Multimodal Representation Learning
  • Deep Learning
  • Neural Networks
  • Foundation Models
  • Predictive Coding
  • Recommendation Systems
  • Attention Mechanisms
  • Epsilon-Greedy Bandit Strategy

Computer Vision

  • Medical Imaging
  • Histopathology
  • Video Generation
  • Scene Graph Generation
  • Image Analysis
  • Whole Slide Imaging
  • Egocentric Data

Natural Language Processing

  • Multimodal LLMs
  • Text Analysis
  • Localized Narratives

Data Science

  • Data Curation
  • Large-Scale Datasets
  • Time Series Analysis
  • EEG Signal Processing
  • Dimensionality Reduction
  • Statistical Analysis
  • Data Programming

Programming Languages

  • Python

Machine Learning Frameworks

  • PyTorch
  • TensorFlow

Interests

Research

  • Generative Modeling
  • Multimodal Representation Learning
  • Data Curation
  • Artificial Intelligence in Healthcare