Princeton Geniza Project


The Princeton Geniza Project is a searchable database of documentary Geniza texts.

The corpus

The corpus currently consists of 4,320 transcriptions in Judaeo-Arabic, Hebrew, Aramaic and Arabic.

Transcriptions in the browser come from three sources:

(1) Goitein’s unpublished transcriptions. These are referred to as “Goitein, typed texts."

(2) Transcriptions published in article or book form by scholars. In the case of Goitein’s published transcriptions, the texts here include the handwritten corrections he made to own offprints.

(3) New transcriptions contributed by scholars and doctoral students who have been affiliated with the Geniza Lab over the years.

Citing the PGP

Anyone is welcome to use these transcriptions for research purposes and to cite them. Transcriptions published elsewhere first should acknowledge the original publication. Transcriptions that appear here for the first time should be cited by the name of the scholar responsible for the transcription with the site name appended. For example: “ed. M. R. Cohen, Princeton Geniza Project.”

If you intend to use these editions for your research, please check them against the manuscripts. Errors have crept in, both in transmission (in the transcription process) and in editing (especially when scholars prepared their transcriptions from microfilm, before the advent of digital photography).

Please notify us of any errors you find by sending an email to mrustow {at}

Technical notes

This edition of the Geniza database is based upon the TextGarden web application developed in 2005 by Rafael Alvarado, Manager of Humanities Computing Research Applications at Princeton, a project that itself replaced and incorporated the original browser developed by Peter Batke in the late 1990s.

The current browser was developed and is managed by Ben Johnston, Educational Technology Consultant in Princeton’s McGraw Center for Teaching and Learning.