Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material

Bibliographic details
Published in: Journal of data mining and digital humanities, Volume NLP4DH
Main authors: Tannor, Shlomo; Dershowitz, Nachum; Lavee, Moshe
Format: Journal Article
Language: English
Published: Nicolas Turenne, 2023-08-13
Subjects: computer science - computation and language; computer science - machine learning
ISSN: 2416-5999
Online access: https://doaj.org/article/a715a7e436dc463baad8e268f3253bc8
Abstract: Midrash collections are complex rabbinic works that consist of text in multiple languages, which evolved through long processes of unstable oral and written transmission. Determining the origin of a given passage in such a compilation is not always straightforward and is often a matter of dispute among scholars, yet it is essential for scholars' understanding of the passage and its relationship to other texts in the rabbinic corpus. To help solve this problem, we propose a system for classification of rabbinic literature based on its style, leveraging recent advances in natural language processing for Hebrew texts. Additionally, we demonstrate how this method can be applied to uncover lost material from a specific midrash genre, Tanḥuma-Yelammedenu, that has been preserved in later anthologies.
Authors and affiliations:
1. Tannor, Shlomo (Tel Aviv University)
2. Dershowitz, Nachum (Tel Aviv University), ORCID 0000-0003-0363-2735
3. Lavee, Moshe (University of Haifa), ORCID 0000-0001-9077-6176
DOI 10.46298/jdmdh.11375
Discipline Computer Science
EISSN 2416-5999
ISSN 2416-5999
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
License https://arxiv.org/licenses/nonexclusive-distrib/1.0
ORCID 0000-0001-9077-6176
0000-0003-0363-2735
OpenAccessLink https://doaj.org/article/a715a7e436dc463baad8e268f3253bc8
PublicationDate 2023-08-13
PublicationTitle Journal of data mining and digital humanities
PublicationYear 2023
Publisher Nicolas Turenne
SubjectTerms computer science - computation and language
computer science - machine learning
Title Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material
URI https://doaj.org/article/a715a7e436dc463baad8e268f3253bc8
Volume NLP4DH