Schedule
This schedule is your go-to resource for the most up-to-date syllabus, reading materials, and assignments. While we will strive to stick to this schedule, please understand that it is not set in stone – updates may occur. Should there be any important changes to the schedule, assignments, or reading materials, you’ll receive an email notification!
Here’s a quick overview to the different types of class meetings we’ll have:
- Discussion: Sessions that combine lecturing with discussions, emphasizing critical engagement with the assigned materials.
- Lab: Hands-on sessions applying concepts in a practical setting.
- Present: Sessions where students present their projects or research.
- Field Trip: Visit to Princeton University Library Special Collections.
Week 1 - Intro to and Foundations of Digital Humanities
In this module, we will take our time to become acquinted with the contours that delineate the field known as Digital Humanities. We discuss what are the essential or minimal ingredients (if there are any), and critically examine various definitions. We traverse the landscape of Digital Humanities and explore how early definitions of the field differ from more recent attempts at defining it.
↪ Jan 30 Discussion Acquaintances
↪ Feb 1 Discussion Foundations
- Slides
- Pre-Class Reflection:
- Examine this website, hit the New Quote-button a few times, and read the definition that appears. Some guiding questions:
- How do the definitions you encounter vary each time you refresh the page? What does this diversity tell you about the nature of Digital Humanities as a field? What are the commonalities between the definitions? What are the differences? Consider the role technology plays in these definitions. Is it merely a tool, or does it fundamentally reshape the humanities?
- Liu, Alan. “Imagining the New Media Encounter.” In A Companion to Digital Literary Studies, edited by Ray Siemens and Susan Schreibman, pp. 1–25. Wiley, 2013.
- Kirschenbaum, Matthew. 2012. “What Is Digital Humanities and What’s It Doing in English Departments?” In Debates in the Digital Humanities, 3–11. University of Minnesota Press.
- Post your reflection in the #reflections channel on Slack before 9:00AM on the day of our class.
- Examine this website, hit the New Quote-button a few times, and read the definition that appears. Some guiding questions:
Week 2 - Data Culture(s)
This week we survey the expansive terrain of Digital Humanities. First, we’ll engage in a discussion on ‘data’ as a core concept, and the methodologies that propel the field forward. Later on this week, we focus on practical applications of metadata and data description.
↪ Feb 6 Discussion Getting our data together
- Slides
- Pre-Class Exercise:
- Play around with Google’s Ngram Viewer to investigate two terms of your choice (e.g. ‘car’ vs. ‘automobile’). Analyze and interpret the trends you observe. Consider the following: what intriguing patterns – if any – emerge? Can specific historical events explain shifts in the usage of these terms? You are welcome to do a quick online search to support your interpretations, but keep it concise – no more than three sentences for each term.
- Post your analysis along with a screenshot of your NGram search in the #ngrams channel on Slack before 9:00AM on the day of our class.
- Pre-Class Reflection:
- Rosenberg, Daniel. 2013. “Data before the Fact.” Raw Data Is an Oxymoron, 15–40.
- Ramsay, Stephen. 2014. “The Hermeneutics of Screwing Around; or What You Do with a Million Books.” In Pastplay: Teaching and Learning History with Technology, edited by Kevin B. Kee, 111–20. Ann Arbor: University of Michigan Press.
- Post your reflection in the #reflections channel on Slack before 9:00AM on the day of our class.
↪ Feb 8 Discussion Metadata and data description
- Slides:
- Reflection:
- Manovich, Lev. “Database as Symbolic Form.” Convergence: The International Journal of Research into New Media Technologies, vol. 5, no. 2, June 1999, pp. 80–99.
- Pomerantz, Jeffrey. “Introduction.” Metadata, The MIT Press, 2015, pp. 1–18.
- Gebru, Timnit, et al. “Datasheets for Datasets.” Communications of the ACM, vol. 64, no. 12, Dec. 2021, pp. 86–92.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
- Optional, complimentary reading for further interest:
- Hoffman, Gretchen. “How Are Cookbooks Classified in Libraries? An Examination of LCSH and LCC.” Proceedings from North American Symposium on Knowledge Organization, vol. 4, no. 1, 2013, pp. 100–11.
Week 3 - A Project of Our Own
In this module, we will start on our class-crowdsourced project featuring a dataset that captures the essence of Princeton University: the Princeton University Historical Postcard Collection. Part of this collection has been digitized and is available through the Princeton University Digital Library. We will outline the various stages of this project, from conception to completion, and identify the tools and techniques that will assist us throughout. We will also pay a visit to the Mudd Library where we will view the collection up close. In the second meeting of this week, we investigate the critical procedures involved in data cleaning, and how OpenRefine can be used in this process.
↪ Feb 13 Field Trip Exploring Princeton's Postcards CANCELED DUE TO SNOW STORM ☃️
↪ Feb 15 Field Trip Exploring Princeton's Postcards
- Pre-Class Exercise:
Explore the Princeton University Historical Postcard Collection and select a postcard that you find interesting. Critically analyze it using one of the following guiding questions:
- If available, find the same postcard on a different online platform. Note differences in color, detail, or cropping. What do these variations suggest about digitizing analog materials?
- Assess the quality of the digital images. Are there details potentially lost or misrepresented in the digital version?
- Look at the metadata provided alongside the digital surrogate. How does this information affect your understanding of the postcard?
- Reflect on your experience navigating the digital collection. How does the digital interface impact your exploration and understanding?
- Hypothesize about how viewing the postcards digitally might differ from viewing them in person.
- What do you think the digitization process for this collection was like? What challenges might have been encountered? What choices were made?
Post your reflection in the #postcards channel on Slack before 9:00AM on the day of our class.
- Pre-Class Reflection:
- Listen to this podcast episode, in which Dot Porter (Curator of Digital Research Services at UPenn’s Schoenberg Institute for Manuscript Studies) is being interviewed by Stewart Varner (Managing Director of the Price Lab at UPenn).
- Manžuch, Zinaida. “Ethical Issues in Digitization of Cultural Heritage.” Journal of Contemporary Archival Studies, vol. 4, no. 2, 2017, pp. 1–17.
- Kropf, Evyn. “Will That Surrogate Do? Reflections on Material Manuscript Literacy in the Digital Environment from Islamic Manuscripts at the University of Michigan Library.” Manuscript Studies: A Journal of the Schoenberg Institute for Manuscript Studies, 2016, pp. 52–70.
- Post your reflection in the #reflections channel on Slack before 9:00AM on the day of our class.
Week 4 - Molding and Modeling Data (and cleaning it too after the snow storm)
In this module, we investigate the critical procedures involved in annotating data, thereby preparing it for analysis. These processes invariably entail the making of mindful decisions. As we gear up for the hands-on tutorials, we also explore an integral component of data analysis: data modeling. We will demystify this process, its bearing on our data handling techniques, and what constitutes an effective data model. In the second meeting of this week, students will present their Data Biographies.
↪ Feb 20 Lab Data Cleaning
- Slides
- Reflection:
- Schöch, Christof. “Big? Smart? Clean? Messy? Data in the Humanities.” Journal of Digital Humanities, vol. 2, no. 3, 2013.
- Rawson, Katie, and Muñoz Trevor. “Against Cleaning.” Debates in the Digital Humanities, University of Minnesota Press, 2019, pp. 279–92.
- Broman, Karl W., and Kara H. Woo. “Data Organization in Spreadsheets.” The American Statistician, vol. 72, no. 1, 2018, pp. 2–10. → Perusall annotations are optional for this article.
- Post your reflection in the #reflections channel on Slack before 9:00AM on the day of our class.
↪ Feb 22 Present Data Biographies
- Presentation overview:
- On this day, students will present their Data Biographies. This is a chance to share the intriguing stories behind the datasets you’ve explored!
- Each presentation should last approximately – but also no more than – 4 minutes, followed by a brief (~1 minute) Q&A session.
- Focus on the narrative of your Data Biography, presenting on aspects you find interesting, such as origin, collector(s), collection method(s), intended use, and any limitations or ethical considerations.
- Use visuals or excerpts from your dataset to illustrate your points and engage the audience.
- For more details, refer to the assignment description.
- Presentation guidelines:
- Maximum of five (5) slides per presentation.
- Please add your slides to this shared slide deck before the start of the class.
- Strict time limit of four (4) minutes for each presentation.
- Order of presentations:
Order - Presenter(s) Order - Presenter(s) 1 - Alison 8 - Helen 2 - Layla 9 - Colin + Melissa 3 - Talia 10 - James 4 - Clay 11 - Raphaela + Emanuelle 5 - Andrew 12 - Pippa 6 - Anya 13 - Yaashree 7 - Ethan 14 - Pia
Week 5 - Text Digitization and Markup Languages
During our first session this week, we will look at into the technology behind making text digitally readable. Our historic postcard collection provides the perfect case study for exploring both Optical Character Recognition (OCR) and Handwritten Text Recognition (HTR) technologies. In our subsequent session, we will explore the world of markup languages, such as HTML and XML. We will discuss their significance in structuring and presenting digital content, and explore how these languages serve as the backbone of content representation on the web.
↪ Feb 27 Lab Text Recognition Technologies
- Slides
- Pre-Class Reflection:
- Terras, Melissa. “The Role of the Library When Computers Can Read: Critically Adopting Handwritten Text Recognition (HTR) Technologies to Support Research.” The Rise of AI, edited by Amanda Wheatley and Sandy Hervieux, ACRL - Association of College & Research Libraries, 2022, pp. 137–48.
- Cordell, Ryan. “Q I-Jtb the Raven. Taking Dirty OCR Seriously.” Book History, vol. 20, 2017, pp. 188–225.
- Post your reflection in the #reflections channel on Slack before 9:00AM on the day of our class.
↪ Feb 29 Lab Markup Languages
- Slides
- Pre-Class Reflection:
- Beshero-Bondar, Elisa, Lee Skallerup Bessette, Quinn Dombrowski, and Roopika Risam. “DSC #5: The DSC and the Impossible TEI Quandaries.” The Data-Sitters Club. June 25, 2020.
- Budak, Nick. “Representing Gender in the Shakespeare and Company Project.” Shakespeare and Company Project, Version 1.5.7., 12 Dec. 2019. → Perusall annotations not required for this article.
Week 6 - Distant Reading
This week begins with a lecture on Distant Reading, where we will explore the theoretical framework and methodologies for analyzing large bodies of text. We’ll discuss how this approach can uncover patterns and trends that are not immediately apparent, setting the stage for the hands-on lab session where we will be leveraging the web-based Voyant Tools, known for their user-friendly, powerful capabilities for visual text analysis. This module is designed to be both instructive and engaging, providing you with practical experience using digital tools to analyze large textual data sets. Drawing inspiration from the works of El Khatib and Ross (2022), we will conduct an ecological reading of Emily Brontë’s novel Jane Eyre, and an exploratory thematic analysis of Mary Shelley’s Frankenstein.
↪ Mar 5 Discussion Distant Reading
- Slides
- Reflection:
- Underwood, Ted. “A Genealogy of Distant Reading.” Digital Humanities Quarterly, vol. 11, no. 2, 2017.
- Tahmasebi, Nina. The Strengths and Pitfalls of Large-Scale Text Mining for Literary Studies. Synergies Conference. Copenhagen, 28 September 2020.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
↪ Mar 7 Lab Applying Voyant Tools
- Slides
- Pre-Class Reflection:
- Nguyen, Dong, et al. “How We Do Things With Words: Analyzing Text as Social and Cultural Data.” Frontiers in Artificial Intelligence, vol. 3, article 63, Aug. 2020, p. 1-14.
- Smits, Thomas, and Melvin Wevers. “A Multimodal Turn in Digital Humanities. Using Contrastive Machine Learning Models to Explore, Enrich, and Analyze Digital Visual Historical Collections.” Digital Scholarship in the Humanities, vol. 38, no. 3, Aug. 2023, pp. 1267–80.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
🌴 Spring Break 🌴
Enjoy a well-deserved break!
Week 7 - Topic Modeling
Topic modeling is a useful tool for exploring large collections of text. It can be used to identify themes in a corpus, to identify outliers, and to explore the relationships between different texts. It is a form of unsupervised machine learning, which means that it does not require a human to provide a training set of labeled data. During the first session of this week, we will look at how the sieving out of topics works, and how to interpret the results. In the second session, Laure Thompson will visit our class and we’ll use jsLDA to build a topic model of our own.
↪ Mar 19 Discussion Topic Modeling
- Slides
- Pre-Class Reflection → Perusall annotations not required for these readings.
- Froehlich, Heather. Moby Dick Is About Whales, or Why Should We Count Words?.
- Blei, David M. “Topic Modeling and Digital Humanities.” Journal of Digital Humanities, vol. 2, no. 1, 2012.
- Nelson, Robert K. Mining the Dispatch.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
↪ Mar 21 Lab Topic Modeling with jsLDA
- Slides, data, and other resources (prepared and taught by dr. Laure Thompson):
- jsLDA website
- Data
- Themes identified from 3,000 volumes of Speculative Fiction (SF) from the HathiTrust digital library.
- Pre-Class Reflection → Perusall annotations not required for these readings.
- Boyd-Graber, Jordan, et al. “Applications of Topic Models.” Foundations and Trends in Information Retrieval, vol. 11, no. 2–3, 2017, pp. 143–296. → This is a big one! No need to read in depth; become acquainted with chapters 1.1, 1.2 (+ 1.3 if you want to get a bit more technical), chapter 3, and chapter 6.
- Antoniak, Maria. A Computational Reading of a Birth Stories Community. 5 Nov. 2019.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
Week 8 - Stylometry
In this module, we will discover the world of stylometry, a field that probes the unique stylistic fingerprint of authors. Utilizing the user-friendly package Stylo, we will undertake a critical review and replication of Patrick Juola’s study (2013), which aimed to reveal the author hiding behind the pseudonym Robert Galbraith. In doing so, we will not only explore the power of stylometric techniques, but also discuss the broader implications of this type of research, emphasizing the ethical and scholarly consequences.
↪ Mar 26 Discussion Stylometry
- Slides
- Pre-Class Reflection
- Whissell, Cynthia. “Traditional and Emotional Stylometric Analysis of the Songs of Beatles Paul McCartney and John Lennon.” Computers and the Humanities, vol. 30, no. 3, 1996, pp. 257–65.
- Holmes, David I., and Judit Kardos. “Who Was the Author? An Introduction to Stylometry.” Chance, vol. 16, no. 2, 2003, pp. 5–8.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
↪ Mar 28 Lab Authorship Attribution with Stylo
- Slides
- Pre-Class Reflection
Binongo, José Nilo G. “Who Wrote the 15th Book of Oz? An Application of Multivariate Analysis to Authorship Attribution.” Chance, vol. 16, no. 2, Mar. 2003, pp. 9–17.
Disclaimer
The readings listed below are part of this week’s lab effort to replicate Patrick Juola’s analysis on the authorship of J.K. Rowling. It is however crucial to recognize that J.K. Rowling has become a figure of controversy owing to her remarks concerning the transgender community. The decision to include her work in our study does not equate to an endorsement of her opinions or a promotion of her work. Our goal is to undertake a computational examination of Rowling’s writings, analyzing the stylistic distinctions between works published under her own name and those released under a pseudonym. The readings listed below are part of a broader discussion on the importance and challenges of addressing controversial creators and topics in Digital Humanities. I am curious about your insights. Can we separate our study of J.K. Rowling’s authorship from her personal opinions, allowing the field of Digital Humanities to persist in its computational analysis of her work (and for me to continue incorporating this case in stylometry classes), or should we consider refraining from including her in our research altogether?
- Juola, Patrick. “Rowling and ‘Galbraith’: An Authorial Analysis.” Language Log, 2013.
- McNamara, Mary. “Commentary: Here’s an Idea: Maybe If We All Stop Talking about J.K. Rowling, She’ll Just Go Away.” Los Angeles Times, 21 Feb. 2023.
Dederer, Claire. “Chapter 3. The Fan.” Monsters. A Fan’s Dilemma, Alfred A. Knopf, 2023.
→ Perusall annotations not required for the readings on Rowling.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
Week 9 - Data Visualization and GIS
This module focuses on the visual dimension of Digital Humanities research. Often, the dissemination of results takes the form of graphs or they are displayed via graphical interfaces. We will explore a whole spectrum of data visualization practices, citing examples of the good, the bad, and even the downright ugly. In this sense, the session is aimed to prepare us for the coming labs on mapping and network analysis.
↪ Apr 2 Discussion Data Visualization
- Slides
- Pre-Class Reflection:
- McCandless, David. The Beauty of Data Visualization. TEDGlobal 2010. Video.
- Walker Rettberg, Jill. “Ways of Knowing with Data Visualizations.” Data Visualization in Society, edited by Martin Engebretsen and Helen Kennedy, Amsterdam University Press, 2020, pp. 35–48.
- Rosenberg, Daniel. “Against Infographics.” Art Journal, vol. 74, no. 4, Oct. 2015, pp. 38–57.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
- Pre-Class Exercise:
- Every year since 2012, the Digital Humanities community has recognized outstanding contributions through the Digital Humanities Awards 🏆. On the award website, you’ll find examples of data visualization that are not only stunning but also effective in conveying complex information.
- Explore the nominees from this year, or past years (2012, 2013, 2014, 2015, 2016, 2017, 2018, 2019, 2020, 2021, 2022).
- Choose a visualization that resonates with you.
- Capture a screenshot of this visualization and add it to our session’s slide deck (limit yourself to one slide; for interactive visualizations, include a link to allow us to explore the visualization in detail during class).
- While a formal presentation isn’t required, prepare to discuss the following in class: What is being depicted? In what ways is it presented? Why do you believe this visualization is impactful? Think about its strengths and any potential weaknesses.
- Every year since 2012, the Digital Humanities community has recognized outstanding contributions through the Digital Humanities Awards 🏆. On the award website, you’ll find examples of data visualization that are not only stunning but also effective in conveying complex information.
↪ Apr 4 Lab GIS
- Slides
- Pre-Class Reflection:
- Monmonier, Mark S. “Chapter 11. Data Maps: A Thicket of Thorny Choices.” How to Lie with Maps, Third edition, The University of Chicago Press, 2018, pp. 153–78.
- Glasze, Georg. “Language(s), Discourse(s), Space(s) – and Their Transformations in the Digital Age: Research Approaches from Cultural and Social Geography.” Geographical Research in the Digital Humanities, edited by Finn Dammann and Dominik Kremer, Bielefeld University Press, 2024, pp. 45–62.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
Week 10 - Network Analysis and Intellectual Property Rights
This week, we continue our exploration of visualization in Digital Humanities with a focus on network analysis. Building on our discussion on data visualization, we’ll look at web-based platforms like Flourish and software like Gephi and learn how network analysis provides valuable insights into relational data. In the latter part of the week, we shift our attention to critical issues such as copyright, fair use, and the principles of open access.
↪ Apr 9 Lab Network Analysis CANCELED
↪ Apr 11 Lab Network Analysis
- Slides
- Pre-Class Reflection
- Weingart, Scott B. “Demystifying Networks, Parts I & II.” Journal of Digital Humanities, vol. 1, no. 1, 2011.
- Rhodes Ii, Mark Alan. “Paul Robeson’s Place in YouTube: A Social Spatial Network Analysis of Digital Heritage.” Digital Scholarship in the Humanities, vol. 34, no. 1, Apr. 2019, pp. 174–88.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
Week 11 - Python for the Curious
This week, we will look at the role of coding in the humanities with a gentle introduction to Python, a programming language as approachable as it is powerful. Our two sessions will provide a panoramic view of Python’s syntax and its practical applications in Humanities research. Through hands-on exercises, we’ll learn to appreciate the role of code in analyzing texts, visualizing data, and perhaps most importantly, in sharpening our critical thinking about coding literacy. Ultimately, the goal of this week is to get a taste of programming in Python, with fear being an exception that is not allowed to interrupt the process.
Or, stated in a Pythonic way:
try:
learn_python()
except Fear as obstacle:
handle(obstacle) # apply strategies to overcome fear
else:
print('Congratulations on embracing coding literacy!') # celebrate
↪ Apr 16 Discussion Intellectual Property Rights
- Slides
- Pre-Class Reading (no Perusall annotations or reflection required!):
- Haggerty, Kenneth. “Intellectual Property Guidelines for the Digital Humanities.” Routledge International Handbook of Research Methods in Digital Humanities, edited by Kristen Schuster and Stuart Dunn, Routledge, 2020, pp. 428–40.
- Lerner, Ben. “The Hofmann Wobble. Wikipedia and the Problem of Historical Memory.” Harper’s Magazine, vol. 347, no. 2083, Dec. 2023, pp. 23–32.
↪ Apr 18 Lab Coding Literacy
- Notebook
- Pre-Class Reflection:
- Schmidt, Benjamin M. “Do Digital Humanists Need to Understand Algorithms?” Debates in the Digital Humanities, vol. 53, 2016.
- Vee, Annette. “Introduction. Computer Programming as Literacy.” Coding Literacy: How Computer Programming Is Changing Writing, The MIT Press, 2017.
- Post your reflection in the #reflections channel on Slack before 9am on the day of our class.
Week 12 - Roles, Responsibilities, and the Future
In this concluding week, we begin by exploring the ethical dimensions of Digital Humanities, with a focus on critical issues such as algorithmic bias and transparency in the age of AI. We confront these developments by analyzing their impacts, potential, and the risks they create. In the final session, students will showcase the progress of their own research proposals, which involves elements of data collection, preparation, and analysis methods. Colleagues from the Center for Digital Humanities at Princeton University are invited, providing an opportunity for everyone to gain feedback on their proposals.
↪ Apr 23 Discussion Ethical Considerations
- Slides
- Pre-Class Reflection (no Perusall annotations required!)
- Presner, Todd, et al. Digital Humanities Manifesto 2.0. 2009, pp. 1–15.
- Catherine D’Ignazio, Lauren Klein, Data Feminism: What Does Feminist Data Science Look Like? LSE Online Event. Chair: Fiona Steele. 2021. YouTube.
Post your reflection in the #reflections channel on Slack before 9am on the day of our class.