Computer Vision – StAR Lecture Series: Object Recognition
The state-of-the-art in object recognition has undergone dramatic changes in the last 20 years. In this talk, I will review the progression of the field and discuss why various approaches both succeeded and failed. The talk will cover visual recognition from the early 90’s, including handwritten digit and face detection, to the current state-of-the-art in deep learning applied to object categorization. Algorithms will be explained at an intuitive level. The talk is aimed at the non-expert in computer vision with some knowledge of machine learning. While deep learning is briefly covered, Ross Girshick will be giving a more detailed StAR talk on the subject at a later date.
Speaker Details
C. Lawrence Zitnick is a senior researcher in the Interactive Visual Media group at Microsoft Research, and is an affiliate associate professor at the University of Washington. He is interested in a broad range of topics related to visual object recognition. His current interests include object detection and semantically interpreting visual scenes. He developed the PhotoDNA technology used by Microsoft, Facebook, Google, and various law enforcement agencies to combat illegal imagery on the web. Before joining MSR, he received the PhD degree in robotics from Carnegie Mellon University in 2003. In 1996, he co-invented one of the first commercial portable depth cameras.
- Series:
- Microsoft Research Talks
- Date:
- Speakers:
- Larry Zitnick
- Affiliation:
- MSR
-
-
Larry Zitnick
Principal Researcher
-
Series: Microsoft Research Talks
-
DIABLo: a Deep Individual-Agnostic Binaural Localizer
Speakers:- Shoken Kaneko
-
A Tale of Two Cities: Software Developers in Practice During the COVID-19 Pandemic
Speakers:- Denae Ford Robinson
-
Recent Efforts Towards Efficient And Scalable Neural Waveform Coding
Speakers:- Kai Zhen
-
-
Audio-based Toxic Language Detection
Speakers:- Midia Yousefi
-
What Kind of Computation is Human Cognition? A Brief History of Thought (Episode 2/2)
Speakers:- Paul Smolensky
-
From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks
Speakers:- Forrest Iandola,
- Sujeeth Bharadwaj
-
Hope Speech and Help Speech: Surfacing Positivity Amidst Hate
Speakers:- Ashique Khudabukhsh
-
What Kind of Computation is Human Cognition? A Brief History of Thought (Episode 1/2)
Speakers:- Paul Smolensky
-
An Ethical Crisis in Computing?
Speakers:- Eric Horvitz,
- Moshe Y. Vardi
-
Towards Mainstream Brain-Computer Interfaces (BCIs)
Speakers:- Brendan Allison
-
-
'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
Speakers:- Peter Clark
-
Checkpointing the Un-checkpointable: the Split-Process Approach for MPI and Formal Verification
Speakers:- Gene Cooperman
-
Learning Structured Models for Safe Robot Control
Speakers:- Subramanian Ramamoorthy
-
Non-linear Invariants for Control-Command Systems
Speakers:- Pierre Roux
-
Distributed Entity Resolution for Computational Social Science
Speakers:- Rebecca C. Steorts
-
The Worst Form Including All Those Others: Canada’s Experiments with Online Voting
Speakers:- Aleksander Essex
-
How Not to Prove Your Election Outcome
Speakers:- Vanessa Teague
-
Dashboard Mechanisms for Online Marketplaces
Speakers:- Jason Hartline
-
Compacting the Uncompactable: The Mesh Compacting Memory Allocator
Speakers:- Emery Berger
-
Tea: A High-level Language and Runtime System for Automating Statistical Analysis
Speakers:- Eunice Jun
-
Resource-Efficient Redundancy for Large-Scale Data Processing and Storage Systems
Speakers:- Rashmi Vinayak
-
Battling Unfair Demons in Peer Review
Speakers:- Nihar Shah
-
Sequential Estimation of Quantiles with Applications to A/B-testing and Best-arm Identification
Speakers:- Aaditya Ramdas