Skip to content

Latest commit

 

History

History
100 lines (76 loc) · 4.29 KB

README.md

File metadata and controls

100 lines (76 loc) · 4.29 KB

SWH DOI Project Status: Active – The project has reached a stable, usable state and is being actively developed. Project Status: WIP – Initial development is in progress, but there has not yet been a stable, usable release suitable for the public.

The Dead Sea Scrolls in Text-Fabric

This repo contains transcriptions of biblical and non-biblical Dead Sea Scrolls with linguistic annotations.

See also about and feature documentation.

You can run your own programs off-line, and publish your work in online notebooks.

The CACCHT project: Creating Annotated Corpora of Classical Hebrew Texts

This dataset is developed as part of the CACCHT project, which is a collaboration of Martijn Naaijer, Christian Canu Højgaard, Martin Ehrensvärd, Robert Rezetko, Oliver Glanz, and Willem van Peursen. The goal of CACCHT is to prepare and publish ancient Semitic texts digitally, that can be used for research and education.

Software

The main processing tool is Text-Fabric. It is instrumental to turn the analysis of ancient data into computing narratives.

The ecosystem is Python and Jupyter notebooks.

Getting started

Start with the tutorial.

Authors

Acknowledgements

N.B.: Releases of this repo have been archived at the Software Heritage Archive and Zenodo. Click the badges to be taken to the archives. There you find ways to cite this work.

Status

We will collect feedback for quite some time before we bump the version to 1.0 or higher.

  • 2019-06-11 Release 0.6, fixed the fact that the dotless shin was represented as sin, spotted by Oliver Glanz
  • 2019-05-20 Release 0.5, new tests.
  • 2019-05-09 Release 0.4.
  • 2019-05-08 Release 0.3.
  • 2019-05-05 Release 0.2.

Related datasets

BHSA Family

  • bhsa Core data and feature documentation
  • phono Phonological representation of Hebrew words
  • parallels Links between similar verses
  • valence Verbal valence for all occurrences of some verbs
  • trees Tree structures for all sentences
  • bridging Open Scriptures morphology ported to the BHSA
  • pipeline Generate the BHSA and SHEBANQ from internal ETCBC data files
  • shebanq Engine of the shebanq website

Extended family

  • dss Dead Sea Scrolls
  • sp Samaritan Pentateuch
  • extrabiblical Extra-biblical writings from ETCBC-encoded texts
  • peshitta Syriac translation of the Hebrew Bible
  • syrnt Syriac translation of the New Testament