I spent the first couple of weeks familiarizing myself with the issue of “topic modeling” in literary corpora (the area of work that I will be working on), reading previous work (journal and conference papers), and going through the existing code base here at the Maryland Institute of Technology, which focuses on the work.
The work so far was preliminary in nature, having to do with familiarization.
At this point in time, some reflection is in order as to what the necessary skills may be for success in this internship. This is an internship in the digital humanities, which means that, by its very definition, it straddles two areas that, in the academy, do not usually interact very much: computing and the humanities. So, to work successfully in this area, a practitioner needs more than passing familiarity with both the world of the humanities and the world of software design and development. There is a particular reason why these two areas are so disjoint and disparate in the academy. Scholars in the humanities have a (well-founded) fear of, and skepticism towards, reductionism. They tend to favor a holistic approach and resist methodologies that treat the object of their study as in any way separate from its various contexts. Science, however, by its very nature, tends to be analytical, and, as a consequence, reductive. In the field of digital humanities, therefore, these two aspects/approaches are always in tension.
The area of “topic modeling” is particularly interesting in this regard because this new approach, although statistical in nature, is one of the rare computational approaches that are, arguably, *not* reductive. Topic modeling aspires to discover global properties and qualities of the text, while at the same time connecting those global, macro-level qualities to micro-level detail, and is therefore likely to appeal to humanities scholars in a way that reductive approaches do not. It is an approach that, therefore, in addition to being merely a tool for research, that is, for answering pre-existing research questions, is also an approach that is generative of research questions themselves. It also means, however, that someone working in this area cannot simply work in an instrumental way — he or she will have to have a familiarity with both area — both the software development process, and with how research is carried out in the humanities. The people who work here embody these skills — most have been trained academically in both disciplines.
Some reflection is in order at this point with regard to the existing skill sets I bring to the table in this regard. My recent academic training (before starting the master’s program in the School of Information) had been in the field of humanities, and I have a good sense of the questions, issues and mindsets that characterize the humanities. With regard to my skill set in software, however, my skills are a little rusty, since change is very fast in the world of software and I have not been developing software full-time, year-round in the last few years. So, I am finding that I have some catching up to do in order to get to speed on the project — needing to learn some new tools and methods. In particular, I need to focus on web programming, since I had not worked intensively in that area before, and mastery of the techniques of web programming is an important prerequisite to building applications that would be versatile and user-friendly enough to be attractive to humanists to use. I learn best by reading books (in the old-fashioned, i.e. dead-tree way) and so I got myself books on JQuery, git and Scala (technologies that would be needed in the project) to read.




So, it’s the first week of August. I met with Leigh last week, and Dave was also there. We showed her the project and what I had already done. She said what I had functionality-wise was pretty much what she wanted. I suggested some additional features and she liked it, so we’ll be implementing that soon. I’ll be implementing that soon, that is. I already did one of them this morning. Elisabeth, the designer here, sat in for part of it, and she’ll be making the design. That’s kind of nice, because she’s really good, and I’m not a design expert. I can definitely finish my part by the end of the summer. I am looking forward to having the whole project finished so I can put it in my portfolio and present it at SI.
For work this week I have focused on adding content to the catalog and moving the local instance live. On the latter, it’s almost ready! There are a couple of hitches presently but everything has migrated correctly. For the former, I have been finishing up details on the Apple IIe, and adding an Osborne 1 to the site.
As my time at the CDRH nears its end (my last day is next Friday, August 6th), I have begun to reflect on the work I’ve done this summer. I have had the opportunity to exercise a variety of intellectual “muscles.” My tasks have included:
Writing a blog entry is a nice way to start my week. Much has happened since my last entry. I can’t believe it’s already the end of July! I’m really happy about how things are going. I’ve learned so much! Somehow I learned more by working in this internship than I did all of my first year of grad school. Really it’s true what they said, life experience is the true Teacher. I feel like there are so many career opportunities and paths I can take not only next year but also the year after when I graduate. I’m excited about that, because not having anything to tie me down to Michigan I can go anywhere, and I’ll go to the west coast.