Visipedia – A distributed visual system composed of machines and people
The web is not perfect: while text is easily searched and organized, pictures (the vast majority of the bits that one can find on-line) are not. In order to see how one could improve the web and make pictures first-class citizens of the web, I explore the idea of Visipedia, a visual interface for Wikipedia that is able to answer visual queries and enables experts to contribute and organize visual knowledge. Four distinct groups of humans would interact through Visipedia: users, experts, visual workers and machine vision scientists. The latter would gradually build automata able to interpret images. I will explore some of the technical challenges involved in making Visipedia happen and present our initial results in crowdsourcing visual annotation, building automated field guides and combining machines and humans for discovering, harvesting and organizing visual information. http://vision.caltech.edu/visipedia/index.html
Joint work with S. Belongie, S. Branson, R. Gomes, K. Wah, P. Welinder
Speaker Details
Pietro Perona is Allen E. Puckett Professor of Electrical Engineering and of Computation and Neural Systems at the California Institute of Technology. His interests are in computational vision and in modeling biological vision.
- Series:
- Microsoft Research Talks
- Date:
- Speakers:
- Pietro Perona
- Affiliation:
- California Institute of Technology
Series: Microsoft Research Talks
-
DIABLo: a Deep Individual-Agnostic Binaural Localizer
Speakers:- Shoken Kaneko
-
A Tale of Two Cities: Software Developers in Practice During the COVID-19 Pandemic
Speakers:- Denae Ford Robinson
-
Recent Efforts Towards Efficient And Scalable Neural Waveform Coding
Speakers:- Kai Zhen
-
-
Audio-based Toxic Language Detection
Speakers:- Midia Yousefi
-
What Kind of Computation is Human Cognition? A Brief History of Thought (Episode 2/2)
Speakers:- Paul Smolensky
-
From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks
Speakers:- Forrest Iandola,
- Sujeeth Bharadwaj
-
Hope Speech and Help Speech: Surfacing Positivity Amidst Hate
Speakers:- Ashique Khudabukhsh
-
What Kind of Computation is Human Cognition? A Brief History of Thought (Episode 1/2)
Speakers:- Paul Smolensky
-
An Ethical Crisis in Computing?
Speakers:- Eric Horvitz,
- Moshe Y. Vardi
-
Towards Mainstream Brain-Computer Interfaces (BCIs)
Speakers:- Brendan Allison
-
-
'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
Speakers:- Peter Clark
-
Checkpointing the Un-checkpointable: the Split-Process Approach for MPI and Formal Verification
Speakers:- Gene Cooperman
-
Learning Structured Models for Safe Robot Control
Speakers:- Subramanian Ramamoorthy
-
Non-linear Invariants for Control-Command Systems
Speakers:- Pierre Roux
-
Distributed Entity Resolution for Computational Social Science
Speakers:- Rebecca C. Steorts
-
The Worst Form Including All Those Others: Canada’s Experiments with Online Voting
Speakers:- Aleksander Essex
-
How Not to Prove Your Election Outcome
Speakers:- Vanessa Teague
-
Dashboard Mechanisms for Online Marketplaces
Speakers:- Jason Hartline
-
Compacting the Uncompactable: The Mesh Compacting Memory Allocator
Speakers:- Emery Berger
-
Tea: A High-level Language and Runtime System for Automating Statistical Analysis
Speakers:- Eunice Jun
-
Resource-Efficient Redundancy for Large-Scale Data Processing and Storage Systems
Speakers:- Rashmi Vinayak
-
Battling Unfair Demons in Peer Review
Speakers:- Nihar Shah
-
Sequential Estimation of Quantiles with Applications to A/B-testing and Best-arm Identification
Speakers:- Aaditya Ramdas