Skip to content

amit-shkolnik/Knesset-dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 

Repository files navigation

Knesset-dataset

To download this dataset use the following link: https://1drv.ms/f/s!AsYCZBb59Y5Ni8RXq2ba7s-6cfwbBA?e=1bsGH2

This dataset contains Hebrew texts:

  1. files- 168,591 files of Hebrew texts.
  2. volume - 7,719MB.
  3. words count- 829,610,045 words.

Texts are from Knesset protocols- plenum, committees, legislation. Data range from years 1948 through 2023. Updated February 2024

Files:

  1. KNS_DocumentBill_extracted_texts.zip - legislation documents, 52,564 files.
  2. KNS_DocumentCommitteeSession_extracted_texts.zip - committees sessions, 71,003 files.
  3. KNS_DocumentPlenumSession_extracted_texts.zip - plenum sessions, 45,024 files.

If you want to download files directly from Knesset ODATA system you may use the following code: https://github.com/amit-shkolnik/knesset_odata

About

This dataset contains Knesset protocols

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors