Text Tools

Home > Research > Education and Training > Text Tools
24 Sep 2019
Kirsten Keister

OpenITI AOCP: The Open Islamicate Texts Initiative Arabic-script OCR Catalyst Project

By |2020-02-28T10:22:16-05:00Sep 24, 2019|

With generous funding from The Andrew W. Mellon Foundation, OpenITI AOCP will create a new digital text production pipeline for Persian and Arabic texts. OpenITI AOCP will catalyze the digitization of the Persian and Arabic written traditions by addressing the central technical and organizational impediments stymying the development of improved OCR for Arabic-script languages.

8 Mar 2018
Raffaele Viglianti

coreBuilder

By |2019-01-15T11:01:30-05:00Mar 8, 2018|

coreBuilder is an open source web-based visual environment for authoring stand-off markup. The tool aims at making the application of stand-off techniques more approachable in the context of Text Encoding Initiative projects dealing with multidimensional representations of text, without substantially disrupting workflows already familiar to TEI encoders.

31 Mar 2014

Princeton Prosody

By |2019-01-15T10:30:44-05:00Mar 31, 2014|

In late 2013, MITH partnered with the Princeton Prosody Archive to build tools and modules for processing and indexing volumes from the HathiTrust Digital Library, with the goal of creating a comprehensive online archive of English-language monographs on verse meter and prosody in the public domain. These tools allow research groups like the Prosody Archive to import HathiTrust volumes into a Drupal installation for browsing, reading, full-text search, and metadata correction.

22 May 2012

Active OCR

By |2019-01-15T10:31:15-05:00May 22, 2012|

Active OCR: Tightening the Loop in Human Computing for OCR Correction will develop a proof-of-concept application that will experiment with the use of active learning and other iterative techniques for the correction of eighteenth-century texts.

15 May 2012

ANGLES

By |2019-05-13T17:05:44-04:00May 15, 2012|

ANGLES proposes a bridge between humanities centers who have greater resources to program scholarly software and the scholars who form the core user community for such software through their teaching and research.

10 Feb 2012

MONK: Humanities Text Mining in the Digital Library

By |2019-01-15T10:32:51-05:00Feb 10, 2012|

MONK stands for Metadata Offer New Knowledge, and was a digital environment designed to help humanities scholars discover and analyze patterns in the texts they study. It supported both micro analyses of the verbal texture of an individual text and macro analyses that let you locate texts in the context of a large document space consisting of hundreds or thousands of other texts.

9 Feb 2012

Ajax XML Encoder (AXE)

By |2019-05-13T17:01:14-04:00Feb 9, 2012|

AXE is a web-based tool for "tagging" text, video, audio, and image files with XML metadata, a process that is now a necessary but onerous first step in the production of digital material.

7 Feb 2012

Text-Image Linking Environment

By |2019-01-15T10:54:26-05:00Feb 7, 2012|

The Text-Image Linking Environment (TILE) is a web-based tool for creating and editing image-based electronic editions and digital archives of humanities texts.

Go to Top