Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material
| Published in: | Journal of data mining and digital humanities, Volume NLP4DH |
|---|---|
| Main authors: | Tannor, Shlomo; Dershowitz, Nachum; Lavee, Moshe |
| Format: | Journal Article |
| Language: | English |
| Published: | Nicolas Turenne, 2023-08-13 |
| Subjects: | computer science - computation and language; computer science - machine learning |
| ISSN: | 2416-5999 |
| Online access: | https://doaj.org/article/a715a7e436dc463baad8e268f3253bc8 |
| Abstract | Midrash collections are complex rabbinic works that consist of text in multiple languages, which evolved through long processes of unstable oral and written transmission. Determining the origin of a given passage in such a compilation is not always straightforward and is often a matter of dispute among scholars, yet it is essential for scholars' understanding of the passage and its relationship to other texts in the rabbinic corpus. To help solve this problem, we propose a system for classification of rabbinic literature based on its style, leveraging recent advances in natural language processing for Hebrew texts. Additionally, we demonstrate how this method can be applied to uncover lost material from a specific midrash genre, Tanḥuma-Yelammedenu, that has been preserved in later anthologies. |
|---|---|
| Author | Dershowitz, Nachum; Lavee, Moshe; Tannor, Shlomo |
| Author_xml | 1. Tannor, Shlomo (Tel Aviv University); 2. Dershowitz, Nachum (Tel Aviv University; ORCID 0000-0003-0363-2735); 3. Lavee, Moshe (University of Haifa; ORCID 0000-0001-9077-6176) |
| ContentType | Journal Article |
| DBID | AAYXX CITATION DOA |
| DOI | 10.46298/jdmdh.11375 |
| DatabaseName | CrossRef DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ: Directory of Open Access Journal (DOAJ) url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Computer Science |
| EISSN | 2416-5999 |
| ExternalDocumentID | oai_doaj_org_article_a715a7e436dc463baad8e268f3253bc8 10_46298_jdmdh_11375 |
| GroupedDBID | 5VS AAFWJ AAYXX ADBBV ADQAK AFPKN ALMA_UNASSIGNED_HOLDINGS BCNDV CITATION FRP GROUPED_DOAJ KQ8 M~E OK1 |
| ID | FETCH-LOGICAL-c1415-fcacb18b6a11f03714d13cde69d80c73caab2ba4568ea6303f6cdd03082a76d93 |
| IEDL.DBID | DOA |
| ISSN | 2416-5999 |
| IngestDate | Fri Oct 03 12:52:29 EDT 2025 Sat Nov 29 04:10:29 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | English |
| License | https://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c1415-fcacb18b6a11f03714d13cde69d80c73caab2ba4568ea6303f6cdd03082a76d93 |
| ORCID | 0000-0001-9077-6176 0000-0003-0363-2735 |
| OpenAccessLink | https://doaj.org/article/a715a7e436dc463baad8e268f3253bc8 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_a715a7e436dc463baad8e268f3253bc8 crossref_primary_10_46298_jdmdh_11375 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-08-13 |
| PublicationDateYYYYMMDD | 2023-08-13 |
| PublicationDate_xml | – month: 08 year: 2023 text: 2023-08-13 day: 13 |
| PublicationDecade | 2020 |
| PublicationTitle | Journal of data mining and digital humanities |
| PublicationYear | 2023 |
| Publisher | Nicolas Turenne |
| Publisher_xml | – name: Nicolas Turenne |
| SSID | ssj0001548555 |
| Score | 2.2281015 |
| SourceID | doaj crossref |
| SourceType | Open Website Index Database |
| SubjectTerms | computer science - computation and language; computer science - machine learning |
| Title | Style Classification of Rabbinic Literature for Detection of Lost Midrash Tanhuma Material |
| URI | https://doaj.org/article/a715a7e436dc463baad8e268f3253bc8 |
| Volume | NLP4DH |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ: Directory of Open Access Journal (DOAJ) customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: DOA dateStart: 20140101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources (ISSN International Center) customDbUrl: eissn: 2416-5999 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0001548555 issn: 2416-5999 databaseCode: M~E dateStart: 20140101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| linkProvider | Directory of Open Access Journals |