Predicting Sustainable Development Goals Using Course Descriptions -- from LLMs to Conventional Foundation Models

Published in: Journal of Data Mining and Digital Humanities, Volume NLP4DH
Main authors: Kharlashkin, Lev; Macias, Melany; Huovinen, Leo; Hämäläinen, Mika
Format: Journal Article
Language: English
Published: INRIA; Nicolas Turenne, 29 April 2024
Subjects: Computation and Language; Computer Science
ISSN: 2416-5999
Online access: Get full text
Abstract We present our work on predicting United Nations Sustainable Development Goals (SDGs) for university courses. We use an LLM, PaLM 2, to generate training data from noisy, human-authored course descriptions. We use this data to train several smaller language models to predict SDGs for university courses. This work contributes to better university-level adaptation of SDGs. The best-performing model in our experiments was BART, with an F1-score of 0.786.
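For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of an F1-score for multi-label predictions such as SDG tags. It is an illustration only: the abstract does not state which averaging the authors used, so micro-averaging is an assumption here, and the function name `micro_f1` and the toy label sets are hypothetical and not taken from the paper.

```python
from typing import List, Set


def micro_f1(gold: List[Set[int]], pred: List[Set[int]]) -> float:
    """Micro-averaged F1 over multi-label predictions.

    Each element of `gold` and `pred` is the set of labels for one item.
    Micro-averaging is an assumption; the abstract does not specify it.
    """
    # Pool true positives, false positives and false negatives over all items.
    tp = sum(len(g & p) for g, p in zip(gold, pred))
    fp = sum(len(p - g) for g, p in zip(gold, pred))
    fn = sum(len(g - p) for g, p in zip(gold, pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data: each set holds SDG numbers (1-17) for one course.
gold = [{4, 13}, {3}, {7, 9, 13}]
pred = [{4}, {3, 5}, {7, 13}]
score = micro_f1(gold, pred)
print(f"micro-F1 = {score:.3f}")
```

In this toy case there are 4 true positives, 1 false positive and 2 false negatives, so precision is 4/5 and recall is 4/6.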
Author Kharlashkin, Lev
Hämäläinen, Mika
Macias, Melany
Huovinen, Leo
Author_xml – sequence: 1
  givenname: Lev
  orcidid: 0009-0001-3664-6589
  surname: Kharlashkin
  fullname: Kharlashkin, Lev
– sequence: 2
  givenname: Melany
  surname: Macias
  fullname: Macias, Melany
– sequence: 3
  givenname: Leo
  surname: Huovinen
  fullname: Huovinen, Leo
– sequence: 4
  givenname: Mika
  orcidid: 0000-0001-9315-1278
  surname: Hämäläinen
  fullname: Hämäläinen, Mika
BackLink https://hal.science/hal-04595348$$DView record in HAL
ContentType Journal Article
Copyright Distributed under a Creative Commons Attribution 4.0 International License
Copyright_xml – notice: Distributed under a Creative Commons Attribution 4.0 International License
DBID AAYXX
CITATION
1XC
VOOES
DOA
DOI 10.46298/jdmdh.13127
DatabaseName CrossRef
Hyper Article en Ligne (HAL)
Hyper Article en Ligne (HAL) (Open Access)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList
CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 2416-5999
ExternalDocumentID oai_doaj_org_article_61b3a87b50d9410ea8b0a3ba715928cf
oai:HAL:hal-04595348v1
10_46298_jdmdh_13127
GroupedDBID 5VS
AAFWJ
AAYXX
ADBBV
ADQAK
AFPKN
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
FRP
GROUPED_DOAJ
KQ8
M~E
OK1
1XC
VOOES
ID FETCH-LOGICAL-c1757-b7a1310a41b7e7cf68bf32615b835be993e4348e79d581c64c26d4ac739e419c3
IEDL.DBID DOA
ISSN 2416-5999
IngestDate Fri Oct 03 12:49:23 EDT 2025
Tue Oct 14 20:42:58 EDT 2025
Sat Nov 29 04:10:29 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords multi-label classification
LLM
SDG
Language English
License Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1757-b7a1310a41b7e7cf68bf32615b835be993e4348e79d581c64c26d4ac739e419c3
ORCID 0000-0001-9315-1278
0009-0001-3664-6589
OpenAccessLink https://doaj.org/article/61b3a87b50d9410ea8b0a3ba715928cf
ParticipantIDs doaj_primary_oai_doaj_org_article_61b3a87b50d9410ea8b0a3ba715928cf
hal_primary_oai_HAL_hal_04595348v1
crossref_primary_10_46298_jdmdh_13127
PublicationCentury 2000
PublicationDate 2024-04-29
PublicationDateYYYYMMDD 2024-04-29
PublicationDate_xml – month: 04
  year: 2024
  text: 2024-04-29
  day: 29
PublicationDecade 2020
PublicationTitle Journal of data mining and digital humanities
PublicationYear 2024
Publisher INRIA
Nicolas Turenne
Publisher_xml – name: INRIA
– name: Nicolas Turenne
SSID ssj0001548555
Score 2.2545047
SourceID doaj
hal
crossref
SourceType Open Website
Open Access Repository
Index Database
SubjectTerms Computation and Language
Computer Science
computer science - computation and language
Title Predicting Sustainable Development Goals Using Course Descriptions -- from LLMs to Conventional Foundation Models
URI https://hal.science/hal-04595348
https://doaj.org/article/61b3a87b50d9410ea8b0a3ba715928cf
Volume NLP4DH
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2416-5999
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001548555
  issn: 2416-5999
  databaseCode: DOA
  dateStart: 20140101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2416-5999
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001548555
  issn: 2416-5999
  databaseCode: M~E
  dateStart: 20140101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Predicting+Sustainable+Development+Goals+Using+Course+Descriptions+--+from+LLMs+to+Conventional+Foundation+Models&rft.jtitle=Journal+of+data+mining+and+digital+humanities&rft.au=Lev+Kharlashkin&rft.au=Melany+Macias&rft.au=Leo+Huovinen&rft.au=Mika+H%C3%A4m%C3%A4l%C3%A4inen&rft.date=2024-04-29&rft.pub=Nicolas+Turenne&rft.eissn=2416-5999&rft.volume=NLP4DH&rft_id=info:doi/10.46298%2Fjdmdh.13127&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_61b3a87b50d9410ea8b0a3ba715928cf