History as a Data Science
Free Two-Week Research Workshop at Columbia’s History Lab
The goal of this workshop is to offer a very practical introduction to getting direct access to the data in digital archives and analyzing it with algorithms. Hands-on exercises will focus on History Lab's Freedom of Information Archive, the world's largest database of declassified documents. But participants will also learn how to get the data most relevant to their own research by “scraping” websites and downloading from online databases.
The workshop will be led by Matthew Connelly and Raymond Hicks and will feature presentations from leading digital historians and data scientists. Hands-on training will use textual data from History Lab, an NSF-funded project that has aggregated the largest database of declassified government documents in the world.
When: January 6, 2020 – January 17, 2020. Sessions will run from 9am to 1pm each weekday. Participants are encouraged to attend digital history-related workshops and panels at the AHA annual meeting the weekend before. If you would like to participate but the timing poses a problem, tell us which dates and times you can commit to attending.
Where: The Institute for Social and Economic Research and Policy, on the Columbia University campus in NYC.
Eligibility: This workshop is open to all ranks; everyone from first-year Ph.D. students through established scholars is encouraged to apply. Priority will be given to history scholars; others will be eligible to participate on a space-available basis.
Financial Support: We are happy to offer a limited amount of financial support to workshop participants for travel and accommodations. Please discuss your budget and tell us about funding sources in your application.
How to Apply: Please see the Call for Applications. We will be admitting participants on a rolling basis, but those who apply before June 15 will get first consideration. Direct questions to Harlowe Wang at [email protected].