Publications

The Archives Unleashed project has a body of published research. Check back for more soon!

Journal Articles

  • Jimmy Lin, Ian Milligan, Jeremy Wiebe, and Alice Zhou. “Warcbase : Scalable Analytics Infrastructure for Exploring Web Archives.” ACM Journal of Computing and Cultural Heritage, Vol. 10, Issue 4, July 2017. [link]
  • Ian Milligan, Nick Ruest, and Anna St.Onge. “The Great WARC Adventure : Using SIPS, AIPS and DIPS to Document SLAAPs.” Digital Studies/Le champ numérique, Vol. 6, 2016. [link]
  • Ian Milligan. “Lost in the Infinite Archive : The Promise and Pitfalls of Web Archives.” International Journal of Humanities and Arts Computing, Vol. 10, No. 1-2 (2016): 87—94. [link] [preprint]

Book

  • Ian Milligan, History in the Age of Abundance? How the Web is Transforming Historical Research. Montreal & Kingston: McGill-Queen’s University Press, 2019. [amazon.ca] [amazon.com] [google books] [publisher]

Peer-Reviewed Conference Publications

  • Ryan Deschamps, Samantha Fritz, Jimmy Lin, Ian Milligan, and Nick Ruest. “The Cost of a WARC : Analyzing Web Archives in the Cloud.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019). [preprint]
  • Ian Milligan, Nathalie Casemajor, Samantha Fritz, Jimmy Lin, Nick Ruest, Matthew S. Weber, and Nicholas Worby. “Building Community and Tools for Analyzing Web Archives through Datathons.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019). [preprint]
  • Ian Milligan, Nick Ruest, and Jimmy Lin. “Content Selection and Curation for Web Archiving : The Gatekeepers vs. the Masses.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 16 (2016): 107—110. [link] [preprint]
  • Andrew Jackson, Jimmy Lin, Ian Milligan, and Nick Ruest. “Desiderata for Exploratory Search Interfaces to Web Archives in Support of Scholarly Activities.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 16 (2016): 103—106. [link] [preprint]
  • Jimmy Lin. “Scaling Down Distributed Infrastructure on Wimpy Machines for Personal Web Archiving.” Proceedings of the 24th International World Wide Web Conference Companion (WWW 2015), pages 1351-1355, May 2015, Florence, Italy. [link]

Peer-Reviewed Posters and Demonstrations

  • Ryan Deschamps, Nick Ruest, Jimmy Lin, Samantha Fritz, and Ian Milligan. “The Archives Unleashed Notebook: Madlibs for Jumpstarting Scholarly Exploration of Web Archives.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019). [preprint]
  • Hsiu-Wei Yang, Linqing Liu, Ian Milligan, Nick Ruest, and Jimmy Lin. “Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019). [preprint]
  • Nick Ruest, Ian Milligan, and Jimmy Lin. “Warclight: A Rails Engine for Web Archive Discovery.” Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Vol. 19 (2019). [preprint]

Conference Presentations

  • Nick Ruest and Ian Milligan. “Lowering the Barrier to Access: The Archives Unleashed Cloud Project.” The Web That Was: Archives, Traces, Reflections, June 2019, Amsterdam, Netherlands. [slides]
  • Nick Ruest and Ian Milligan. “See a Little Warclight : Building an Open-Source Web Archive Portal with Project Blacklight.” International Internet Preservation Consortium Web Archiving Conference, June 2019, Zagreb, Croatia. [slides]
  • Nick Ruest and Ian Milligan. “Project Sustainability and Research Platforms : The Archives Unleashed Cloud Project.” International Internet Preservation Consortium Web Archiving Conference, June 2019, Zagreb, Croatia. [slides]
  • Nick Ruest, “Oh, I Get by with a Little Help from my Friends : Interdisciplinary Web Archive Collaboration.” The Fields Institute Workshop on Quantitative Analysis and the Digital Turn in Historical Studies, February 2019, Toronto, Ontario, Canada.
  • Ian Milligan, “Opening up WARCs: The Archives Unleashed Toolkit and Cloud Projects.” International Internet Preservation Consortium Annual Meeting, November 2018, Wellington, New Zealand.
  • Nick Ruest, “Make it WALK.” Archives Association of Ontario 2018, May 2018, Waterloo, Ontario, Canada.
  • Ryan Deschamps, Jimmy Lin, Nick Ruest, Samantha Fritz, Ian Milligan. “Usability, Accessibility, and Performance: Striking the Right Balance with the Archives Unleashed Toolkit.” CSDH/SCHN Digital Humanities Conference 2018, May 2018, Regina, Saskatchewan, Canada.
  • Ian Milligan, “Too Much Information: Transparency, Metadata, and Search in the Age of Web Archives.” American Historical Association Conference, January 2018, Washington, DC, USA.
  • Nick Ruest, “Warclight.” Blacklight European Summit 2017, October 2017, Copenhagen, Denmark.
  • Ziquan Wang, Borui Lin, Ian Milligan, and Jimmy Lin. “Topic Shifts Between Two US Presidential Administrations.” JCDL 2017 Workshop on Web Archiving and Digital Libraries, June 2017, Toronto, Ontario, Canada. [paper draft here]
  • Nick Ruest and Ian Milligan, “Learning to WALK (Web Archives for Longitudinal Knowledge) : Building a National Web Archiving Collaborative Platform.” International Internet Preservation Consortium/RESAW Conference, June 2017, London, England.
  • Ian Milligan and Nick Ruest, “Warcbase : Using Scalable Web Analytics to Analyze Canadian Collections En Masse.” National Symposium on Web Archiving, February 2017, San Francisco, California, USA
  • Ian Milligan, Jimmy Lin, Jeremy Wiebe, and Alice Zhou. “Exploring and Discovering Archive-It Collections with Warcbase.” Digital Humanities 2016, July 2016, Krakow, Poland. [link]
  • Ian Milligan and Nick Ruest, “Engaging the Public with Web Archives: Providing Access to 10 Years of Political History with WebArchives.ca.” Canadian Society of Digital Humanities/Société canadienne des humanités numériques Conference, May 2016, Calgary, Alberta, Canada.
  • Ian Milligan and Nick Ruest, “Hands on with Warcbase.” International Internet Preservation Consortium Conference, April 2016, Reykjavik, Iceland.

Invited Talks and Lectures

  • Ian Milligan, “Working with Cultural Heritage at Scale: Developing Tools and Platforms to Enable Historians to Explore History in the Age of Abundance.” ACL Special Interest Group on Language Technologies for Socio-Economic Science and Humanities, June 2019, Minneapolis, Minnesota, USA.
  • Nick Ruest, “Hot Tips To Boost Your Interdisciplinary Web Archive Collaboration!” Lewis & Ruth Sherman Centre for Digital Scholarship Speaker Series, April 2018, Hamilton, Ontario, Canada.
  • Nick Ruest, “Boosting Your Interdisciplinary Web Archive Collaboration.” BC Research Libraries Group Lecture Series, February 2018, Vancouver, British Columbia, Canada.
  • Ian Milligan, “Big Data and History (‘Or How this Historian Learned to Stop Worrying and Love Big Data).”, Love Your Data Week 2018, February 2018, Vancouver, British Columbia, Canada.
  • Ian Milligan and Nick Ruest, “Twitter and Web Archive Analysis at Scale.” Data Love-In 2018, February 2018, Vancouver, British Columbia, Canada.
  • Ian Milligan and Nick Ruest, “Capturing the Web Today for Tomorrow : Innovations in Capturing and Analyzing Social Media and Websites for the New Scholarly Record.” University Librarian’s Speaker Series on Emergent Research in Digital Scholarship, March 2017, Toronto, Ontario, Canada.
  • Ian Milligan and Nick Ruest, “Walking the WALK : Facilitating Interdisciplinary Web Archive Collaboration.” University of Alberta, June 2016, Edmonton, Alberta, Canada.

Datasets