To download this dataset use the following link: https://1drv.ms/f/s!AsYCZBb59Y5Ni8RXq2ba7s-6cfwbBA?e=1bsGH2
This dataset contains Hebrew texts:
- files- 168,591 files of Hebrew texts.
- volume - 7,719MB.
- words count- 829,610,045 words.
Texts are from Knesset protocols- plenum, committees, legislation. Data range from years 1948 through 2023. Updated February 2024
Files:
- KNS_DocumentBill_extracted_texts.zip - legislation documents, 52,564 files.
- KNS_DocumentCommitteeSession_extracted_texts.zip - committees sessions, 71,003 files.
- KNS_DocumentPlenumSession_extracted_texts.zip - plenum sessions, 45,024 files.
If you want to download files directly from Knesset ODATA system you may use the following code: https://github.com/amit-shkolnik/knesset_odata