This repo contains transcriptions of biblical and non-biblical Dead Sea Scrolls with linguistic annotations.
See also about and feature documentation.
You can run your own programs off-line, and publish your work in online notebooks.
This dataset is developed as part of the CACCHT project, which is a collaboration of Martijn Naaijer, Christian Canu Højgaard, Martin Ehrensvärd, Robert Rezetko, Oliver Glanz, and Willem van Peursen. The goal of CACCHT is to prepare and publish ancient Semitic texts digitally, that can be used for research and education.
The main processing tool is Text-Fabric. It is instrumental to turn the analysis of ancient data into computing narratives.
The ecosystem is Python and Jupyter notebooks.
Start with the tutorial.
- Jarod Jacobs at Warner Pacific College
- Martijn Naaijer at Eep Talstra Center for Bible and Computer
- Dirk Roorda at DANS
- Martin Abegg for providing his data files to us;
N.B.: Releases of this repo have been archived at the Software Heritage Archive and Zenodo. Click the badges to be taken to the archives. There you find ways to cite this work.
We will collect feedback for quite some time before we bump the version to 1.0 or higher.
- 2019-06-11 Release 0.6, fixed the fact that the dotless shin was represented as sin, spotted by Oliver Glanz
- 2019-05-20 Release 0.5, new tests.
- 2019-05-09 Release 0.4.
- 2019-05-08 Release 0.3.
- 2019-05-05 Release 0.2.
- bhsa Core data and feature documentation
- phono Phonological representation of Hebrew words
- parallels Links between similar verses
- valence Verbal valence for all occurrences of some verbs
- trees Tree structures for all sentences
- bridging Open Scriptures morphology ported to the BHSA
- pipeline Generate the BHSA and SHEBANQ from internal ETCBC data files
- shebanq Engine of the shebanq website
- dss Dead Sea Scrolls
- sp Samaritan Pentateuch
- extrabiblical Extra-biblical writings from ETCBC-encoded texts
- peshitta Syriac translation of the Hebrew Bible
- syrnt Syriac translation of the New Testament