Multi-level Optimization Approaches to Computer Vision

October 14, 2019
Dominic Jack | QUT

Broadly speaking, computer graphics involves representing 3D information in 2D. Computer vision can be thought of as the inverse problem – inferring 3D information from a projected representation. This talk will discuss two deep learning approaches to 3D human pose estimation and single-view object reconstruction that attempt to learn about solution feasibility while incorporating simple computer graphics techniques to ensure consistency with observations. The first approach trains a GAN to produce a parameterization of the feasible solution space, then searches that space for the solution that is maximally consistent with observations. The follow-up approach combines these two optimization steps into a single nested optimization problem.
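
As a rough illustration of the second step of the first approach (searching a learned solution space for the candidate most consistent with the observations), here is a minimal sketch. It is not the speaker's implementation. Everything in it is an assumption made for illustration: `generate` is a hypothetical linear stand-in for a trained GAN generator, `project` is a simple orthographic camera, and `fit_latent` uses plain gradient descent with finite-difference gradients.

```python
# Hypothetical stand-in for a trained GAN generator: it maps a 2-D latent
# code z to three 3-D keypoints. A real generator would be a neural network.
BASE = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0), (0.0, 1.0, 2.0)]
DIRS = [
    [(1.0, 0.0, 0.5), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)],
    [(0.0, 1.0, 0.0), (0.5, 0.0, 1.0), (1.0, 1.0, 0.0)],
]


def generate(z):
    """Return the 3-D keypoints for latent code z (assumed 'feasible' solutions)."""
    return [
        tuple(
            base[k] + sum(z[i] * DIRS[i][j][k] for i in range(len(z)))
            for k in range(3)
        )
        for j, base in enumerate(BASE)
    ]


def project(points):
    """Orthographic projection (an assumption): drop the depth coordinate."""
    return [(x, y) for x, y, _ in points]


def reprojection_loss(z, observed):
    """Sum of squared 2-D distances between projected and observed keypoints."""
    projected = project(generate(z))
    return sum(
        (px - ox) ** 2 + (py - oy) ** 2
        for (px, py), (ox, oy) in zip(projected, observed)
    )


def fit_latent(observed, steps=500, lr=0.05, eps=1e-5):
    """Gradient descent over z using central finite-difference gradients."""
    z = [0.0, 0.0]
    for _ in range(steps):
        grad = []
        for i in range(len(z)):
            z_plus = list(z)
            z_minus = list(z)
            z_plus[i] += eps
            z_minus[i] -= eps
            diff = reprojection_loss(z_plus, observed) - reprojection_loss(z_minus, observed)
            grad.append(diff / (2 * eps))
        z = [zi - lr * gi for zi, gi in zip(z, grad)]
    return z


if __name__ == "__main__":
    true_z = [0.3, -0.2]
    observed = project(generate(true_z))  # synthetic 2-D observations
    z_hat = fit_latent(observed)
    print("recovered latent:", z_hat)
    print("inferred 3-D keypoints:", generate(z_hat))
```

Because the search is over the latent code rather than over raw 3D coordinates, every candidate stays inside the generator's output space, while the projection ties the result back to the 2D observations. The nested formulation mentioned in the abstract would additionally adjust the generator with this inner search in mind; that is not shown here.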

- Andrew Fitzgibbon, Partner Researcher

Research Lab
- Microsoft Research Lab - Cambridge
