Digital Humanities as Gamified Scholarship

The Digital Humanities trace their origins back to Father Roberto Busa’s efforts to analyse the works of Thomas Aquinas in the 1940s, which was then followed by further efforts to perform textual analysis with the aid of computers. Since that time, the Digital Humanities has expanded to encompass a myriad of other activities (and acquired its name in the process) and a devoted community of practitioners. Nevertheless, doubts persist about whether the growth of the Digital Humanities has had, or has the potential to have, any significant impact on scholarship in the Humanities as a whole.  Although I can’t say for certain, my feeling is that when doubters look back at the past, they tend to be thinking primarily of computational textual analysis as the method that has failed to obtain a wide impact. Whether this is a fair assessment of the Digital Humanities, or whether the appropriate criteria have been selected for assessing the significance for even this one area, is worthy of discussion, but my intention here is to look forward, rather than back. Computational textual analysis is beginning to evolve more rapidly, and to become more widely accessible to both students and scholars, meaning that the past should not be taken as an indication of the future.

The potential of computational textual analysis as a pedagogical tool is something that I will pass over quickly because it is a large topic. But I will mention three ways in which computational textual analysis can make an impact through classroom teaching. First, it provides a structure through which students can move between close and contextual readings, helping to students to achieve genuine insight without the benefit of years of study that more advanced students and established scholars enjoy. I have written about this elsewhere. Second, it increases exposure to computational textual analysis. If the method is to impact the broader field, there has to be a critical mass of students who are familiar with it. Third, computational textual analysis, like many skills taught in the Humanities, is transferrable to the workplace. However, in the case of computational analysis (and associated visualisation techniques), the value is arguably easier to perceive (and believe) by those doing the hiring. But enough on pedagogy. That topic is important enough to deserve a blog post of its own.

Here I want to focus on the scholarly impact of computational textual analysis as it is likely to take shape in the coming years. One of the most exciting developments is that the praxis is starting to acquire a theory—at least, something it is starting to look more acceptable qua theory by those who think the Digital Humanities are undertheorised. In their introduction to the recent special issue of DHQ on the Literary and the Digital Humanities, Jessica Pressman and Lisa Swanstrom present Franco Moretti and Jerome McGann as two ends of the methodological spectrum. Moretti’s notion of “distant reading” by constructing computational models of literary texts challenges traditional methodologies to interpret the texts without close analysis of their contents; McGann’s focus on the “textual condition” of the material objects of study forces us to confront the dynamic nature of textual forms, that our interpretations are based on a set of contingent and unique conditions like an individual performance. The latter leads to the concept of textual “deformance”—the intentional disruption of textual form in order to draw attention to meanings we might not have noticed otherwise. As it becomes easier and easier to deform digital texts algorithmically using computers, the possibilities for experimenting with textual deformance grow. This recognition underlies Stephen Ramsay’s concept of “screwmeneutics” (which elaborates more as “algorithmic criticism” in Reading Machines). Computers are tools from “screwing around” with texts, just to see what happens. In a sense, that is what Moretti is doing as well. By reducing texts to data points, and then mapping their relationships to each other, he seeks to discover insights he might have missed without this broad relational overview.

Whilst this screwing around with the text has something in common with the rhetorical “play” emphasised in some theoretical trends of the last few decades, today it is most often linked to the “performative” aspect of reading. But I want to focus here on another type of similarity between computational analysis and play, a similarity to video games, which would seem à propos, given their common use of digital technologies. A video game is an immersive, interactive environment, often a model or simulated version of the real world, but with some elements abstracted. The player enters the world of the game (what Huizinga called the “magic circle”) and manipulates elements of its world. There is now a well-established literature about the similarities and differences between video games and literature, much of it dealing with the native interactivity of video games. But it tends to be more concerned with the general experience of games and literature in the digital and non-digital media than with acts of scholarship as “play”. To understand what this entails, it seems to me useful to side-step to the adjacent field of gamification. Gamification refers to the incorporation of game-like elements into activities not otherwise considered to be games. The concept has a rapidly-growing following in the business world, where gamification is a strategy for increasing productivity, user satisfaction, and other measures of business success. The basic idea is that human activity is enhanced through fun. If our screwing around digitally with texts is like manipulating the elements of a video game for “fun”, then we have essentially gamified the scholarly process.

Some caveats before we get to the implications. I am not suggesting that traditional forms of reading and interpretation of texts are not immersive, or even that they are not interactive. I am merely suggesting that, due to the incorporation of digital technologies, the extent or quality of engagement shares something in common with video games. Nor am I suggesting that this engagement is ontologically different from traditional acts of reading and interpretation. My point is that it is worth exploring the implications of locating computational methods nearer to video games on the continuum.

What then are the implications of postulating a gamified form of computer-based interpretation? In For the Win: How Game Thinking Can Revolutionize Your Business, Werbach and Hunter postulate that gamification involves both an understanding of game design and an understanding of business techniques (9). Translated into the language of scholarship, the latter could easily equate to traditional forms of disciplinary knowledge. The striking addition, then, is the element of design. Perhaps this too has its equivalent in the rhetoric of chosen for publication, but in the digitally-enhanced world it can mean a great deal more. Taking the video game analogy at its most literal level, digital textual analysis requires the scholar to create a world in which the individual components can be manipulated by algorithms in internally consistent ways that both simulate and diverge from reality (i.e. that which lies outwith any world constructed for the scholarly purpose) or from any other world from which these components and algorithms are drawn. This design process again has its analogy in literary theory, but because its activities can seem more like the activities of other disciplines (often computer science and statistics), that analogy is sometimes lost. For those engaged in this world design, the ideological components of their activities tend to recede, prompting criticism from the more ideologically-oriented sectors of the Humanities. For the Humanities as a whole, there is a tendency for some scholars to dismiss this design process as atheoretical or unconcerned with the theories relevant to the Humanities. The former I find convincing. The latter I think reflects a tendency to refuse to accept that there are more things in heaven and earth than are dreamt of in their philosophies—and some of those things might be interesting. Regardless, the quality of design that makes it so distinct from the argumentative form scholarship that has dominated the Humanities since perhaps the latter half of the twentieth century is its practical nature, often leading to the construction of a tangible product the purpose of which is not explicitly persuasive. This is what prompted Stephen Ramsay to define Digital Humanities scholarship as “building”.

For many, this divergence from the standard scholarly paradigm stands alongside the element of “play” as a barrier to the engaging in Digital Humanities work. (In the case of computational textual analysis, fear—often culturally reinforced fear—of mathematics and coding also plays a role.) The impact of a tenure and promotion committee asking, “What is this?” and being told, “It was a game” or “I was screwing around with the texts” should not be underestimated. Gamification literature talks of “serious games” like flight simulators for training pilots, but at best this approach constructs gamified research as Hilfwissenschaft, preparatory work for the real business of scholarship and therefore of secondary importance for the purposes of professional advancement. I will nevertheless leave discussion of institutional barriers of acceptance to others. In my department, faculty come up for tenure and promotion on a five year cycle, and, barring any major blips, a single peer-reviewed article as an Assistant Professor and one as an Associate Professor will generally be enough for promotion to the next level. Yes, there is a price to be paid for this relatively easy process, but my point is that I don’t feel qualified to speak to the pressures faced by my colleagues at other institutions.

Instead, I want to think further about how games can be “serious”. In the business model of gamification, motivation is the key element: motivation for workers to perform better or for customers to engage more with the product. What might this look like for a gamified form of scholarship? A major feature of gamification is motivation. That is, the element of play encourages experimentation and productive (expected) failure, which enhances innovation and creativity. Textual analysis—if conceived of as a “serious game”, could be expected to deliver definitive answers in the same way a flight simulator might expected to help guarantee a safe landing. Running your texts through an algorithm could be considered a “dry run” prior to more traditional forms of interpretation. I don’t want to discount the value of this approach. As anyone who has done topic modelling knows, algorithms can generate lots of junk and noise alongside meaningful results, and the experimentation required to extract the latter is a good way to test one’s theories. If algorithms show female discourse in a text or corpus to be different from male discourse in ways that would surprise a feminist scholar, this needs to be addressed somehow, either through further experimentation or through reconsideration of the theory. The Humanities should provide answers to important questions, but it should also raise at least as many questions as it answers. Gamified methodologies help provoke these questions.

This is but to say that a more gamified type of scholarship can have an impact. But how would this work? In the case of text analysis, an impactful gamification would require transferring internal motivations to external ones. One way to do this is to make tool design part of the process. This is not to ensure that the data/results can be reproduced by others but to ensure that the game can be re-played multiple variations. Each game is not an exact match of the last; that’s not the point. The point is to spread the use of the game. A tool that others find useful (or fun) will be adopted more widely for Humanities scholarship, and it is hard to argue that a tool that is successful in this way has no impact.

But widespread adoption of a tool is unlikely to satisfy critics who see the “results” of its use not contributing to the discourse of the individual Humanities disciplines which initially motivated their creation. It may seem odd that digital humanists should be castigated for looking outward, rather than inward, from their home disciplines, but we do in fact want to make contributions to the fields in which we were trained. But we still have a long way to go in figuring out how the “game world” of can interact with the “real world” of scholarship. An effective means might be the creation of a community in which both circles interact (in the Lexomics project, we are attempting to build a community-based “best-practices” component to address this issue). But managing participatory communities has been a gamification challenge in the business world, and it is no less a challenge in the academic one.

A more easily tackled approach is to make the tool itself fun, motivating the “real world” scholar to temporarily enter the “game world”—just to see what happens. In games, part of the “fun” element of the game can be an aesthetic experience, and the design of the tool can certainly contribute to that experience. In text analysis, visualisations can play an important role. Reproductions of graphs based on text analysis are not themselves sufficient if they are boring to look at. They must excite the imagination. Both the tool designer and the tool user engage in (and collaborate in) acts of visual rhetoric. This product can produce a full range of responses, which become part of the experience.

It remains to be seen, how that experience is transferred to the “real world” of scholarship, transcending Huizinga’s magic circle. The “dry run” approach is one solution. The game is part of the process, not the outcome, and the experience is taken to inform other scholarly decisions. We might also look to the game mechanics. Being forced to analyse texts in an artificial setup requires the scholar to re-think categories of analysis or the status of the materials being analysed. There may be an analogy in this with the theoretical jargon which forms part of the rhetoric of much writing in the Humanities. But, as with theoretical jargon, engagement with the mechanics of the game can distract from the deliverables, so to speak. This was recognised in the recent “just the results” panel at the MLA Convention organised by the Association for Computers and the Humanities. Perhaps a self-consciously gamified form of scholarship will require us to think clearly about the effective separation of the procedural and presentational components of our scholarship.

But what must not be lost in this separation is the meaningfulness of the play. A game must offer meaningful experiences to engage its players, particularly problem solving. For scholarship, that means asking and trying to answer meaningful questions. Here is where there is considerable debate in the Digital Humanities community as to how much these questions have to relate to the traditional questions of the Humanities? Personally, I do not feel prescriptive on this issue. An important part of the game experience is making choices and finding out what happens. I would like to leave this space as wide open as possible. Initially, we needn’t think of our playful experiments as providing any necessary insight into our “real world” scholarship, nor should we let that scholarship impose strict constraints on our play. That defeats the purpose of the game. We are not building Camelot—only a model.

Gamification can suggest a number of strategies for demonstrating the relationship between our play and our scholarly endeavours. Performing at different levels (“levelling up”) and receiving badges—perhaps representing confidence in the statistical validity of our results, and the like—would be typical methods. Just as (ideally) workers create real innovations and businesses provide real-world rewards for progress in the games, so scholars might progress along a similar continuum of activity. But I am sceptical of these strategies (at least, in this relatively undeveloped account) because they inevitably privilege the reward over the process and diminish interest in the intervening steps. I also suspect that many digital humanists would now go further and suggest that those steps are essentially performative and need not be seen in teleological terms (that is, as a means to some higher scholarly end). Incorporating “play” in scholarship eventually blurs the boundaries between analysis, interpretation, and creativity. That is appealing to some, deeply disturbing to others. As of now, I find myself on the fence, wishing to think more deeply about how to negotiate the status of objects I produce through “scholarly play”.

This post was originally written over the summer when I had been working on a major update of the Lexomics textual analysis tool Lexos and had freshly read my friend Kevin Werbach’s For the Win (there’s my full disclosure with respect to the emphasis on gamification). I had also just finished work on the playful Serendip-o-matic for the One Week | One Tool project. However, the receipt of a major grant to produce a digital edition of a medieval manuscript turned my attention to an entirely different type of work: text markup using TEI. The work I have done on that project has delayed this post considerably, possibly at the expense of coherent thought (I hope not). The intellectual issues raised by trying to represent a manuscript in the form of a digital object are not entirely unlike those of computational text analysis, but I haven’t even begun to address them in this post. Rather than delay further and make this post even longer (and possibly even less coherent), I will simply get it into the blogosphere and hope to develop my ideas further in future posts.

A Whirlwind Summer

This summer has been something of a whirlwind, which hasn’t left much time for blogging. It began with a mad dash to re-write the Lexomics software, changing the language from PHP to Python. Whilst I struggled to pick up a new language (I had only skimmed a few Python tutorials), the amazing students at Wheaton were transforming the tool into something truly awesome. I struggled to keep up and add a few visualisations. The finished tool, called Lexos, is a complete text analysis work flow from pre-processing to statistical analysis to visualisation. I was really excited to see the finished tool (as much as any tool is “finished”), and I look forward to using it in my research.

Barely two weeks later, I departed for DH 2013 in Lincoln, NE, the beginning of a three-week trip. This was a really exciting opportunity to see what’s going on in the Digital Humanities world “up close” (and I had never been to Nebraska). The non-DH highlight was definitely the reception in the natural history museum.

 

DH 2013 Reception

The opening reception at DH 2013 took place in the Natural History Museum at the University of Nebraska, Lincoln. There were some strange guests.

The conference was a whirlwind (not because of the 100-degree heat), and I was particularly happy to spend time with Brian Croxall and Mia Ridge with whom I’d be working in a few weeks time on One Week | One Tool. I also got to do some advance planning with Mark LeBlanc for how we could develop Lexos further.

Before I knew, it I was off to Boston, where I met up with my wife. There we had a strange experience–I think they call it vacation. We had a fabulous two days visiting cousins in Marblehead and wandering the Freedom Trail in Boston (the weather even cooled off for us). All the Americana was quite overwhelming; I haven’t really done colonial US tourism since I was a child.

Marblehead

Marblehead just before sunset.

We next picked up a car and drove down to Wheaton for lunch and a little more Lexos planning, eventually ending up in Fairfield, CT to visit my brother and family. On a day trip to New Haven–another place I unhappily hadn’t been since the 80s–I heard the news that we had been awarded the NEH grant for the Archive of Early Middle English. I was standing in the British Art museum when the text message came through. I paused for approximately one minute and then decided that there was no way I could look at art. I think the shock of the news took over quickly, and I still haven’t quite recovered. I don’t think it will be truly be real until we start work.

Meanwhile, there was still the drive to Philadelphia to visit friends and more colonial Americana: Liberty Bell and Independence Hall.

Independence Hall

Independence Hall

We go to Philly fairly regularly but don’t tend to do this kind of stuff, so it was really quite fun. We also got to do some poolside relaxation, much needed before the final portion of my trip.

Next stop was George Mason University, and One Week | One Tool, one of the most intense experiences I’ve had in a long time. Along with a team of eleven other digital humanists (supported by the able staff of George Mason’s Center for History and New Media), I spent six days (sometimes sixteen-hour days) producing a software tool from start to finish.

The CHNM Tower

Much of the planning and coding took place in the room at the top of the tower at CHNM.

The week-long process was a little artificial, but it was an incredible indictment of the models of scholarship (and teaching) we are bound by. Working with people who have adjacent, but very different skills and interests, was the most fun I have had in a long time. Not only did I make new friends and new contacts, I learned new skills and was drawn out of my comfort zone (as I expected to be). All that work with Python earlier in the summer certainly paid off, as it was the language we adopted for Serendip-o-matic. I did all right, but I also had to learn the Django framework on the fly since Lexos used the simpler Flask framework.

I was not initially sold on the idea of Serendip-o-matic, as my natural tendency was to want to make something that I could use for my research. But there wasn’t time to dwell on that, and, after a day of actual development I started to understand how magical the concept was. This was genuinely a search engine, but one that is quite conceptually different from those we are accustomed to.

waiting hippo

One of those weird things that gets scrawled on development white boards. An early hint of what Serendip-o-matic would become.

What makes Serendip-o-matic special is not just the change of perspective caused by serendipitous discoveries but the methodological shift required to use it. The fact that it also provides access to cultural objects which might otherwise go undiscovered is almost secondary (although it’s actually really, really important). Being part of project in the public humanities was new to me, extremely rewarding, and fabulous training for the Archive of Early Middle English project to come. The lessons learned in project management and outreach will be invaluable. Thanks of all the One Week | One Tool, and especially Tom Scheinfeldt, for creating this incredible experience.

Butterfly at One Week | One Tool

One of the insects that didn’t bite me this summer

By then end of the trip, passing through twelve states (and spending time in half of them), I was completely exhausted. And the summer’s still not over yet. I still have a grant application, a consultant job (which arose on the same day I heard about the NEH grant), and preparation for the upcoming semester’s classes. All I can say is thanks to my loving wife and cats for their tolerance. I hope I haven’t neglected them too much.

So Glad You're Back!

NEH Funds the Archive of Early Middle English

I’m excited to announce that I have received an NEH Scholarly Editions and Translations grant, which I will co-direct with Dorothy Kim from Vassar College. The grant will help create an Archive of Early Middle English (AEME). We’ll start with a full digital edition of Oxford, Bodleian Library, Laud Misc 108, with a complete set of images. Other manuscripts will follow, and by the end of the grant we expect to have a full set of metadata and editorial conventions for other to submit materials. AEME will be designed to be flexible. Not everybody can afford to photograph full manuscripts, so we’ll be working to accommodate images as they become available in the public domain (perhaps licence a few that are not). We’ll also take individual texts, in addition to whole manuscripts. And and all ye multilingual enthusiasts, we haven’t forgotten one of the most important developments in recent scholarship. Although we want Early Middle English (about 1066-1350) to be at the centre, we’ll take any materials in any language that is also found in manuscripts containing Early Middle English.

I will write much more once the project gets started. For now, you can learn a little more on the AEME web site.

Introducing Serendip-o-matic

Serendip-o-matic I’m proud to introduce the online search tool Serendip-o-matic. From July 28-August 3, I worked with a fabulous group digital humanists to produce this tool from scratch as part of the One Week | One Tool project.

Serendip-o-matic connects your sources to digital materials located in libraries, museums, and archives around the world. By first examining your research interests, and then identifying related content in locations such as the Digital Public Library of America (DPLA), Europeana, Trove Australia, and Flickr Commons, Serendip-o-matic’s serendipity engine helps you discover photographs, documents, maps and other primary sources.

Whether you begin with text from an article, a Wikipedia page, or a full Zotero collection, Serendip-o-matic’s special algorithm extracts key terms and returns a surprising reflection of your interests. Because the tool is designed mostly for inspiration, search results aren’t meant to be exhaustive, but rather suggestive, pointing you to materials you might not have discovered. At the very least, the magical input-output process helps you step back and look at your work from a new perspective.

Action Shot from One Week | One Tool

The group brainstorming tool ideas. Photo by Mia Ridge.

At some point, I will blog about the experience, but that will have to wait a little because the project has coincided with activity related to the news that I have received an NEH award to create an Archive of Early Middle English. In the mean time, I refer readers to the blog posts of other participants: Brian Croxall, Jack Dougherty, Mia Ridge, Meghan FrazerAmrys Williams, Ray Palin, and Amanda Visconti.

Eliminating Ideas

Crossing off ideas. The one that eventually became Serendip-o-matic is on another whiteboard to my right. Photo by Mia Ridge.