Mining Web Data for Public Health
- Mark Dredze | Johns Hopkins University
Recent years have seen the adoption of new Web data sources in a wide range of health areas. Of all areas, public health applications in behavioral medicine have the most potential to change how we conduct research, opening up exciting new opportunities. Fundamentally, behavioral medicine requires understanding how people make health decisions: what influences their decision, how they weigh information, and how social connections impact decisions. Web data sources provide new opportunities for studying these questions.
Answering these questions often requires new data mining methods. In this talk, I will present multi-dimensional topic models of text which jointly capture topic and other aspects of text. We describe Factorial Latent Dirichlet Allocation, a multi-dimensional model in which a document is influenced by K different factors, and each word token depends on a K-dimensional vector of latent variables. I will demonstrate the advantages of this model in the application of mining drug experiences from web forums.
Speaker Details
Mark Dredze is an Assistant Research Professor in Computer Science at Johns Hopkins University and a research scientist at the Human Language Technology Center of Excellence. He is also affiliated with the Center for Language and Speech Processing and the Center for Population Health Information Technology. His research in natural language processing and machine learning has focused on graphical models, semi-supervised learning, information extraction, large-scale learning, and speech processing. His recent work includes health information applications, including information extraction from social media, biomedical and clinical texts. He obtained his PhD from the University of Pennsylvania in 2009.
-
-
Jeff Running
-
Watch Next
-
-
-
Accelerating MRI image reconstruction with Tyger
- Karen Easterbrook,
- Ilyana Rosenberg
-
-
-
-
From Microfarms to the Moon: A Teen Innovator’s Journey in Robotics
- Pranav Kumar Redlapalli
-
-
-