Visipedia – A distributed visual system composed of machines and people
- Pietro Perona | California Institute of Technology
The web is not perfect: while text is easily searched and organized, pictures (the vast majority of the bits that one can find online) are not. To see how one could improve the web and make pictures first-class citizens of it, I explore the idea of Visipedia, a visual interface for Wikipedia that can answer visual queries and enables experts to contribute and organize visual knowledge. Four distinct groups of people would interact through Visipedia: users, experts, visual workers and machine vision scientists. The last of these groups would gradually build automata able to interpret images. I will explore some of the technical challenges involved in making Visipedia happen and present our initial results in crowdsourcing visual annotation, building automated field guides and combining machines and humans for discovering, harvesting and organizing visual information.
http://vision.caltech.edu/visipedia/index.html
Joint work with S. Belongie, S. Branson, R. Gomes, K. Wah, P. Welinder
Speaker Details
Pietro Perona is the Allen E. Puckett Professor of Electrical Engineering and of Computation and Neural Systems at the California Institute of Technology. His interests are in computational vision and in modeling biological vision.
Jeff Running
Series: Microsoft Research Talks
Decoding the Human Brain – A Neurosurgeon’s Experience
- Dr. Pascal O. Zinn
Challenges in Evolving a Successful Database Product (SQL Server) to a Cloud Service (SQL Azure)
- Hanuma Kodavalla, Phil Bernstein
Improving text prediction accuracy using neurophysiology
- Sophia Mehdizadeh
Tongue-Gesture Recognition in Head-Mounted Displays
- Tan Gemicioglu
DIABLo: a Deep Individual-Agnostic Binaural Localizer
- Shoken Kaneko
Audio-based Toxic Language Detection
- Midia Yousefi
From SqueezeNet to SqueezeBERT: Developing Efficient Deep Neural Networks
- Forrest Iandola, Sujeeth Bharadwaj
Hope Speech and Help Speech: Surfacing Positivity Amidst Hate
- Ashique Khudabukhsh
Towards Mainstream Brain-Computer Interfaces (BCIs)
- Brendan Allison
Learning Structured Models for Safe Robot Control
- Subramanian Ramamoorthy