Sefaria Labs

Dataset | To Build

Sefaria Library Training Dataset

Owner: Josh

Transform the full contents of the Sefaria library into a format optimized for use as model training data.


- Optimize chunking and document size
- Include every text
- Include every translation
- Monolingual and bilingual versions
- Text versions with inline commentaries
- Text versions with expanded connections
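
As a rough illustration of the chunking and monolingual/bilingual goals listed above, here is a minimal sketch in Python. It is not the project's implementation: the `Segment` record, the `render` and `chunk` helpers, and the character-based size limit are all hypothetical choices made for the example, and a real pipeline would more likely measure size in tokens and read segments from the library's own data export.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Optional


@dataclass
class Segment:
    """Hypothetical record for one citable unit of text."""
    ref: str                      # citation label for the segment
    source: str                   # text in the original language
    translation: Optional[str] = None  # translated text, if one exists


def render(seg: Segment, bilingual: bool) -> str:
    """Render a segment as monolingual text, or bilingual when a translation exists."""
    if bilingual and seg.translation:
        return f"{seg.ref}\n{seg.source}\n{seg.translation}"
    return f"{seg.ref}\n{seg.source}"


def chunk(
    segments: Iterable[Segment], max_chars: int, bilingual: bool = False
) -> Iterator[str]:
    """Greedily pack whole segments into documents of at most max_chars.

    Segments are never split; a single segment longer than max_chars
    is emitted as its own oversized document.
    """
    buf: List[str] = []
    size = 0
    for seg in segments:
        text = render(seg, bilingual)
        added = len(text) + (2 if buf else 0)  # 2 chars for the "\n\n" separator
        if buf and size + added > max_chars:
            yield "\n\n".join(buf)
            buf, size = [], 0
            added = len(text)
        buf.append(text)
        size += added
    if buf:
        yield "\n\n".join(buf)
```

Keeping segments whole means every training document starts and ends on a citable boundary, at the cost of uneven document sizes. The inline-commentary and expanded-connection variants could plausibly be handled by extending `render`, though that is left out here.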