4/10 MITH Digital Dialogue: Jordan Boyd-Graber, "Making Topics More Human(e)" – Maryland Institute for Technology in the Humanities

View Larger Image

Tuesday, April 10, 12:30-1:45PM
MITH Conference Room, McKeldin Library B0135

“Making Topics More Human(e)” by JORDAN BOYD-GRABER

Imagine you need to get the gist of what’s going on in a large text dataset such as all tweets that mention Obama, all e-mails sent within a company, or all newspaper articles published by The New York Times in the 1990s. Topic models, which automatically discover the themes which permeate a corpus, are a popular tool for discovering what’s being discussed. However, topic models aren’t perfect; errors hamper adoption of the model, performance in downstream computational tasks, and human understanding of the data. However, humans can easily diagnose and fix these errors. We present a statistically sound model to incorporate hints and suggestions from humans to iteratively refine topic models to better model large datasets.

We also examine how topic models can be used to understand topic control in debates and discussions. We demonstrate a technique that can identify when speakers are “controlling” the topic of a conversation, which can identify events such as when participants in a debate don’t answer a question, when pundits steer a conversation toward talking points, or when a moderator exerts her influence on a discourse.

This talk will be held in the MITH Conference Room, in the basement of McKeldin Library.

Jordan Boyd-Graber is an assistant professor in Maryland’s iSchool and UMIACS, and a member of the Cloud Computing Center and the Computational Linguistics and Information Processing (CLIP) Lab. His research applies statistical models to natural language problems in ways that interact with humans, learn from humans, or help researchers understand humans. Jordan is an expert in the application of topic models, completely automatic tools that can discover structure and meaning in large, multilingual datasets. He is a contributor to the Natural Language Toolkit (NLTK), a popular tool used in natural language education research. Jordan received his PhD from Princeton University in 2010, advised by Dave Blei, and has bachelors degrees in history and computer science from the California Institute of Technology. He received a best student paper honorable mention at NIPS 2009 and a Computing Innovation Fellowship (declined). His current work is supported by NSF, IARPA, and ARL.

A continuously updated schedule of talks is also available on the Digital Dialogues webpage.

Unable to attend the events in person?

Archived podcasts can be found on the MITH website, and you can follow our Digital Dialogues Twitter account @digdialog as well as the Twitter hashtag #mithdd to keep up with live tweets from our sessions.

All talks free and open to the public. Attendees are welcome to bring their own lunches.

Contact: Emma Millon, Community Lead, MITH (http://mith.umd.edu, mith@umd.edu, 5-9887).

By Emma Millon|2020-10-08T16:02:21-04:00Apr 4, 2012|Community|

NEH Digital Humanities

@NEH_ODH

☀️ Good morning! Did you know that October is National Arts & Humanities Month? We'll be celebrating by highlighting the amazing digital humanities projects we've funded in the past year using #ODHFunded #NAHM20 Follow along! ☀️

2:13 am · October 3, 2020 ·

Retweeted by UMD_MITH

WITNESS

@witnessorg

Your media matters. Protect your media with our in-depth guidance on archiving for the long-term preservation, use, and accessibility of digital video: wit.to/Archive-Guide. twitter.com/document…

1:51 am · October 3, 2020 ·

Retweeted by UMD_MITH

bibliotekah

@tttkay

If you're interested in ethically archiving social media, check out @fromADMwithlove's Social Humans label project through @documentnow docnow.io/social-hum… #archives

1:51 am · October 3, 2020 ·

Retweeted by UMD_MITH

Vernon Mitchell, Jr.

@vcmitchelljr

Really proud of the work that I did with my colleagues at @documentnow. More proud of the new directions the work is going. In our current uncertain sociopolitical moment @DocNow continues to be vitally important to creating new ethical ways to archive social media.

1:48 am · October 3, 2020 ·

Retweeted by UMD_MITH

Trevor Owens 💾🗄🕚

@tjowens

Excited to see “Datasets at the @librarycongress: A Research Guide” by Lynn Weinstein come out! guides.loc.gov/datas… full of great info on the selected datasets collection!

1:41 am · October 3, 2020 ·

Retweeted by UMD_MITH

aleiabrown

@CollardStudies

I’m so glad we are talking about the capaciousness of Black Studies. @amplify285 brings this approach to our work @UMD_AADHum @UMD_MITH, encouraging experimentation/breaking/playing/(re)mixing computational tools in our understanding of Black life. #BlackDH twitter.com/CollardS…

1:32 am · October 3, 2020 ·

Retweeted by UMD_MITH

Matthew Kirschenbaum

@mkirschenbaum

In cooperation with the executor of his literary estate @smtracz, @HyperCardOnline, and the @internetarchive, I am very excited to announce the publication of 14 lost “HyperPoems” by the noted poet William H. Dickey (1928-1994): archive.org/details/… archive.org/details/… pic.twitter.com/0CiO…

1:30 am · October 3, 2020 ·

Retweeted by UMD_MITH

Matthew Kirschenbaum

@mkirschenbaum

The poems, edited for a never-realized posthumous publication, were recovered from the laptop of Deena Larsen at @UMD_MITH by @UMDEnglish Professor Matthew Kirschenbaum. The @internetarchive (fittingly in Dickey’s home city of San Francisco) finally offers us a platform for them.

1:30 am · October 3, 2020 ·

Retweeted by UMD_MITH