Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset

Uloženo v:
Podrobná bibliografie
Název: Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset
Autoři: Filipović Petrović, Ivana, Beliga, Slobodan
Zdroj: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). :4106-4112
Informace o vydavateli: 2024.
Rok vydání: 2024
Témata: Croatian Idioms, Multilingual Linked Idioms Dataset, Ontolex Lemon, LIdioms, LLOD, Linguistic Linked Open Data, RDF
Popis: Idioms, also referred to as phraseological units in some language terminologies, are a subset within the broader category of multi-word expressions. However, there is a lack of representation of idioms in Croatian, a low-resourced language, in the Linguistic Linked Open Data cloud (LLOD). To address this gap, we propose an extension of an existing RDF-based multilingual representation of idioms, referred to as the LIdioms dataset, which currently includes idioms from English, German, Italian, Portuguese, and Russian. This paper expands the existing resource by incorporating 1,042 Croatian idioms in an Ontolex Lemon format. In addition, to foster translation initiatives and facilitate intercultural exchange, these added Croatian idioms have also been linked to other idioms of the LIdioms dataset, with which they share similar meanings despite their differences in the expression aspect. This addition enriches the knowledge base of the LLOD community with a new language resource that includes Croatian idioms.
Druh dokumentu: Conference object
ISSN: 2951-2093
Přístupové číslo: edsair.dris...01492..ee6cdd1f406bdcef8dccce6f19a6e29f
Databáze: OpenAIRE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Petrovi%C4%87%20F
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsair
DbLabel: OpenAIRE
An: edsair.dris...01492..ee6cdd1f406bdcef8dccce6f19a6e29f
RelevancyScore: 979
AccessLevel: 3
PubType: Conference
PubTypeId: conference
PreciseRelevancyScore: 979.415405273438
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Filipović+Petrović%2C+Ivana%22">Filipović Petrović, Ivana</searchLink><br /><searchLink fieldCode="AR" term="%22Beliga%2C+Slobodan%22">Beliga, Slobodan</searchLink>
– Name: TitleSource
  Label: Source
  Group: Src
  Data: <i>Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)</i>. :4106-4112
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: 2024.
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2024
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Croatian+Idioms%22">Croatian Idioms</searchLink><br /><searchLink fieldCode="DE" term="%22Multilingual+Linked+Idioms+Dataset%22">Multilingual Linked Idioms Dataset</searchLink><br /><searchLink fieldCode="DE" term="%22Ontolex+Lemon%22">Ontolex Lemon</searchLink><br /><searchLink fieldCode="DE" term="%22LIdioms%22">LIdioms</searchLink><br /><searchLink fieldCode="DE" term="%22LLOD%22">LLOD</searchLink><br /><searchLink fieldCode="DE" term="%22Linguistic+Linked+Open+Data%22">Linguistic Linked Open Data</searchLink><br /><searchLink fieldCode="DE" term="%22RDF%22">RDF</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: Idioms, also referred to as phraseological units in some language terminologies, are a subset within the broader category of multi-word expressions. However, there is a lack of representation of idioms in Croatian, a low-resourced language, in the Linguistic Linked Open Data cloud (LLOD). To address this gap, we propose an extension of an existing RDF-based multilingual representation of idioms, referred to as the LIdioms dataset, which currently includes idioms from English, German, Italian, Portuguese, and Russian. This paper expands the existing resource by incorporating 1,042 Croatian idioms in an Ontolex Lemon format. In addition, to foster translation initiatives and facilitate intercultural exchange, these added Croatian idioms have also been linked to other idioms of the LIdioms dataset, with which they share similar meanings despite their differences in the expression aspect. This addition enriches the knowledge base of the LLOD community with a new language resource that includes Croatian idioms.
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: Conference object
– Name: ISSN
  Label: ISSN
  Group: ISSN
  Data: 2951-2093
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsair.dris...01492..ee6cdd1f406bdcef8dccce6f19a6e29f
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsair&AN=edsair.dris...01492..ee6cdd1f406bdcef8dccce6f19a6e29f
RecordInfo BibRecord:
  BibEntity:
    Languages:
      – Text: Undetermined
    PhysicalDescription:
      Pagination:
        PageCount: 7
        StartPage: 4106
    Subjects:
      – SubjectFull: Croatian Idioms
        Type: general
      – SubjectFull: Multilingual Linked Idioms Dataset
        Type: general
      – SubjectFull: Ontolex Lemon
        Type: general
      – SubjectFull: LIdioms
        Type: general
      – SubjectFull: LLOD
        Type: general
      – SubjectFull: Linguistic Linked Open Data
        Type: general
      – SubjectFull: RDF
        Type: general
    Titles:
      – TitleFull: Croatian Idioms Integration: Enhancing the LIdioms Multilingual Linked Idioms Dataset
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Filipović Petrović, Ivana
      – PersonEntity:
          Name:
            NameFull: Beliga, Slobodan
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2024
          Identifiers:
            – Type: issn-print
              Value: 29512093
            – Type: issn-locals
              Value: edsair
          Titles:
            – TitleFull: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
              Type: main
ResultId 1