From Wayback Machine to WebLab:
New Opportunities for Social Research
- Michael Macy | Cornell University
The Cornell project is creating tools for research on social and information networks based on a largely untapped dataset: the Internet Archive’s 40-billion page collection of Web pages. These snapshots of the Web have been captured and archived every two months for nearly ten years. The project will eventually make very large portions of this massive collection widely accessible for social science research. The flood of available on-line information – from corporate web pages to news groups and blogs – has the potential to open up new frontiers in research on collective behavior of individuals and the diffusion of innovation, as well as practical applications for business and government, such as tracking market trends, the rise and fall of demand, and the spread of consumer preferences. Community watchdog groups will be able to track the spread of “hate sites” and government agencies will be able to trace past and current uses of the Web for organizing and coordinating terrorist attacks. The development of these tools requires the application of advanced research in natural language processing and machine learning algorithms. The project team also includes computer scientists with expertise in the privacy-preserving analysis of data – a basic challenge in making on-line data more readily accessible for research and policy applications in the social and information sciences.
Speaker Details
Michael Macy is Professor and Chair of Sociology at Cornell. In a series of studies funded by the National Science Foundation, his research team used computational models and laboratory experiments with human subjects to explore how threshold effects in network interactions might generate familiar but enigmatic social patterns, such as the emergence and collapse of fads, the spread of self-destructive behaviors, and the polarization of opinion. Macy pioneered the use of agent based models in sociology to explore the effects of heterogeneity, bounded rationality, and network structure on the dynamics and stability of social systems (http://hsd.soc.cornell.edu/Macy.htm). He now heads a team of social, information, and computer scientists who are building tools that will make the Internet Archive (http://www.archive.org/) accessible for research on social and information networks (http://www.news.cornell.edu/stories/Sept05/NSFcybertools.dea.html). He also leads a Cornell initiative to promote cross-disciplinary collaborative research and teaching on social and information networks.
-
-
Jeff Running
-
Watch Next
-
Dion2: A new simple method to shrink matrix in Muon
- Anson Ho,
- Kwangjun Ahn
-
-
-
-
-
-
-
Beyond Swahili: Designing Inclusive AI for Bantu Languages
- Alfred Malengo Kondoro
-
-