Big Data in the Arts and Humanities: Some Arts and Humanities Research Council Projects
The SNAP:DRGN project (Standards for Networking Ancient Prosopography: Data and Relations in Greco-Roman Names, hereafter “SNAP”) was funded under the Big Data programme of the AHRC’s Digital Transformations theme in 2014. In the first pilot year of the project, we produced a robust set of recommendations for sharing person data between the many large datasets containing information about historical/mythical persons and person-like entities such as families, groups, deities or monsters (http://snapdrgn.net/cookbook). We also tested these standards against a moderate body of data in the form of three core datasets and two library person catalogues, using Linked Open Data in the form of an RDF triplestore. The web interface for these resources was still somewhat lacking, not having been high on our priorities until the data was ready.
In spring term 2015, a group of students from the MA Digital Humanities at King’s College London expressed an interest in gaining some research experience on the SNAP project. A few of these worked as part of a formal internship module, but most were volunteers who wanted to improve their coding skills, gain project management experience, and consolidate what they had learned about digital humanities in their studies. As tutors as well as members of the SNAP research team, we considered the pedagogical value of this process to the students to be as important as any improvement to the tools and processes they might provide. There have been significant benefits on both sides.
The students organized themselves according to their interests and priorities, using a GitHub repository, and we populated the issue tracker with 28 microtasks, roughly grouped under the following headings:
The students reported that they particularly enjoyed the sense of contributing in small ways while fitting into the bigger picture, and benefited from the experience of teamwork and of learning from one another. They regarded the skills they acquired as transferable and, most importantly, as complementary to the techniques they were taught in the elementary Python and structured data modules in the MA. They have also contributed useful code to the SNAP project code base, and will all receive a co-author credit on the open source software released by the project.
Research team: King’s College London: Gabriel Bodard, K. Faith Lawrence; University of Southampton: Leif Isaksen; University of Oxford: Sebastian Rahtz; Duke University: Hugh Cayless; KU Leuven: Mark Depauw. With thanks to Francesca Giovanetti, Ethan Jean-Marie, Emma King, Argula Rublack and Katherine Ying.
Image: Student volunteers from King’s College London discuss the design of the website for the SNAP:DRGN project.