Welcome to the fourth iteration of the semi-annual HathiTrust Research Center (HTRC) UnCamp. This is where members of the HTRC community gather to explore the latest developments in using HTRC tools and services to analyze the HathiTrust Digital Library corpus. Visit https://www.hathitrust.org/htrc_uncamp2018 for more information or see our online proceedings at https://osf.io/view/htrc_uncamp2018 hosted by OSF Meetings.

Friday, January 26 • 8:30am - 9:15am
Keynote: David Mimno: Consistency and Confidence in the Million-book library

The promise of digitized million-book libraries is that we can get reliable measurements of complicated historical and cultural processes. In this talk I'll present a general framework for many of the most popular analytics of large-scale text, including topic models and word embeddings. Based on this framework I will show both the promise and potential pitfalls of such analyses. Through several case studies I will present recommendations on how researchers can get the most consistent, confident results, and how we might collectively make HathiTrust more reliable.

David Mimno

Cornell University

Friday January 26, 2018 8:30am - 9:15am
Moffitt Library, 5th floor

