Tools

This page provides a curated list of tools that can be used for Digital Humanities projects. Whether you’re interested in text analysis, data visualization, mapping, or any other area, the following resources can be incredibly useful.

Remember to explore each tool and dataset to see how they can best fit into your project or research!

Text Analysis

  • Voyant Tools: Web-based text analysis environment.
  • Google Books Ngram Viewer: Graph and compare word usage over time.
  • TAPoR: Discover research tools for studying texts.
  • AntConc: A text analysis toolkit, especially for concordancing
  • Stylo: Tool for stylometric analysis, often used in authorship attribution.
  • MALLET: A Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text (see also **this browser-based topic model, great for experimenting).
  • Natural Language Toolkit: A Python library for natural language processing.
  • spaCy: Another - slightly more advanced - Python library for natural language processing.
  • Stanford CoreNLP: Another library for natural language processing.
  • TextBlob: Yet another Python library for natural language processing.
  • Gensim: A Python library for topic modeling, document indexing, and similarity retrieval with large corpora.
  • Word2Vec: A tool for computing continuous distributed representations of words.
  • WordNet: A lexical database for the English language.

Content Management Systems

  • Omeka: A free, flexible, and open source web-publishing platform for the display of library, museum, archives, and scholarly collections and exhibitions.
  • WordPress: A free and open-source content management system.
  • Drupal: Another free and open-source content management system.
  • Jekyll: A simple, blog-aware, static site generator.
  • Hugo: A fast and modern static site generator.
  • AVAnnotate: Free and open source workflow and application for sharing annotations of audio and video artifacts and making digital exhibits and editions with AV materials.
  • LEAF: LEAF is an editorial environment for open collaboration and publication. It provides web-based tools and online spaces for more scholars, teachers, and students to take part in collaborative digital knowledge production: creating, publishing, preserving, linking and sharing data across the Web.

Text Digitization

  • Transkribus: A platform for the automated recognition, transcription, and searching of historical documents.
  • daiR: R package for Google Document AI, a powerful server-based OCR processor with support for over 60 languages.
  • Kraken: Open source software that specializes in non-Latin alphabets, including bidirectional, right-to-left, and top-to-bottom scripts. Kraken is Python-based and runs on the command line.
  • CERberus: Great tool for string comparison and error detection in HTR’d or OCR’d text. The tool provides an implementation of Character Error Rate calculation between a reference string and a hypothesis string, with various customization options.

Data Visualization

  • Tableau Public: Create interactive data visualizations.
  • Gephi: Open-source software for all kinds of graphs and networks.
  • RAWGraphs: Open-source data visualization framework, turning spreadsheets and into beautiful graphics.
  • Datawrapper: Create charts, maps, and tables.
  • Palladio: A web-based platform for the visualization of complex, multi-dimensional data.
  • NetworkX: A Python library for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks.
  • Network Navigator: A web-based tool designed to give you a first glimpse of your network data, with both metrics and visualizations.
  • Cytoscape: An open source software platform for visualizing complex networks and integrating these with any type of attribute data.
  • WTFcsv: Tells you WTF is going on with your .csv file.
  • Kumu: A powerful data visualization platform that helps you organize complex information into interactive relationship maps.
  • ClioVis: Interactive timeline, mind-mapping and presentation tool.

Mapping and Spatial Analysis

  • QGIS: A free and open-source geographic information system.
  • Leaflet: JavaScript library for interactive maps.
  • StoryMap JS: Create stories with maps.
  • ArcGIS: Create interactive maps and apps.
  • GeoPandas: A Python library for working with geospatial data.
  • GeoJSON.io: A simple tool for creating, viewing, and sharing maps.
  • Mapbox: Create custom maps.
  • OpenStreetMap: A free, editable map of the world.
  • Recogito: A tool for the collaborative annotation of texts and images.
  • GeoJSON.io: Open-source web visual map editor for GeoJSON files.
  • Mapshaper: A tool for editing Shapefile, GeoJSON, TopoJSON, and CSV files.
  • HistoGIS: Historical Central European borders as GeoJSON.

Image Analysis

  • ImageJ: A public domain Java image processing program.
  • Collection Space Navigator: A browser-based visualization tool to explore, research, and curate large collections of visual digital artifacts that are associated with multidimensional data, such as vector embeddings or tables of metadata.

Bibliographic Tools

  • Zotero: Great reference manager.
  • Mendeley: Another great reference manager.
  • EndNote: Yet another great reference manager.
  • Tropy: A tool for managing and organizing research photos.