A text editor very useful for sophisticated searching and encoding of text in TEI or other XML-based format.
Qualitative Research Coding Applications
‘Coding’ software for recording and encoding themes or other important information in texts.
Optical Character Recognition
The Humanities Resource Center
offers access to several OCR packages, including ABBY Fine Reader, OmniPage, ReadIris, and Tesseract.
Very popular open-source tool for audio conversion and manipulation.
Text Analysis / Concordance
Collation software for comparing and contrasting multiple witnesses of the same document. Offers graphical side-by-side views of base text and witness text.
Concordance software for generating word frequency lists, keyword-in-context lists, n-grams and clusters. Easy to use and very useful.
StanfordNER (Named Entity Recognizer)
Software for extracting ‘entities’ such as names of people, places, organizations, or dates.
Software for assigning part-of-speech markers to each word within a text.
Easy to use Topic Modeling toolkit. Attempts to classify text by statistically significant ‘topics’ or themes.
Software for automatically recognizing and disambiguating place names in text. The coordinates of those places can then be plotted on a map.
A tool for cleaning up messy data.
imagemagick / convert
General purpose command line tool for converting and manipulating images.
General purpose video conversion / transcoding tool. Can also convert video to still images.
Very useful command available on Linux/Unix based computers for downloading web content.
Software for the automatic extraction of citations from text.