Dynamic Topic Models of 'Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique'

Gespeichert in:
Bibliographische Detailangaben
Titel: Dynamic Topic Models of 'Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique'
Autoren: Guillén-Pacho, Ibai, orcid:0000-0001-7801-
Weitere Verfasser: Badenes-Olmedo, Carlos, Corcho, Oscar
Verlagsinformationen: Zenodo
Publikationsjahr: 2024
Bestand: Zenodo
Schlagwörter: Topic Models, Dynamic Topic Models, Dynamic Topic Labelling, Topic Labelling
Beschreibung: This resource includes the models generated for the work Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique. Each zip file has the models with the different configurations (number of topics) for each type and, in addition, an evaluation script (bench.py) and different files necessary for this (localizer, timestamps, CORPUS etc.) are included. The requirements for reusing these models are as follows: Unzip all files and install the required packages ("requirements.txt" file). Download the precompiled DTM implementation of https://github.com/magsilva/dtm/tree/master/bin or compile manually the original implementation https://github.com/blei-lab/dtm Download the DTM wrapper from https://github.com/piskvorky/gensim/releases/tag/3.8.3 ("gensim-3.8.3/gensim/models/wrappers/dtmmodel.py"). Download the DETM python implementation of https://github.com/quynhneo/detm. To run model evaluation: modify the imports in the "bench.py" file to match the DETM and DTM models location and their full path (instructions in the file documentation). To repeat our topic study: follow the "notebook.ipynb" instructions. The overview of this resource is: RESOURCES├── BERTopic│ ├── BERTopic_100│ ├── BERTopic_100_probabilities.npy│ ├── BERTopic_100_topics│ ├── BERTopic_100_topic_words│ ├── BERTopic_200│ ├── BERTopic_200_probabilities.npy│ ├── BERTopic_200_topics│ ├── BERTopic_200_topic_words│ ├── BERTopic_300│ ├── BERTopic_300_probabilities.npy│ ├── BERTopic_300_topics│ ├── BERTopic_300_topic_words│ ├── BERTopic_400│ ├── BERTopic_400_probabilities.npy│ ├── BERTopic_400_topics│ └── BERTopic_400_topic_words│├── DETM│ ├── detm_deberta_model1 # 100 topics model │ ├── detm_deberta_model1_beta.mat│ ├── detm_deberta_model2 # 200 topics model │ ├── detm_deberta_model2_beta.mat│ ├── detm_word2vec_model1 # 100 topics model │ ├── detm_word2vec_model1_beta.mat│ ├── detm_word2vec_model2 # 200 topics model │ ├── detm_word2vec_model2_beta.mat│ └── min_df_3333│ └── .│├── DTM_ALL│ ├── ...
Publikationsart: other/unknown material
Sprache: English
ISSN: 2364-4168
Relation: https://zenodo.org/records/12750327; oai:zenodo.org:12750327; https://doi.org/10.5281/zenodo.12750327
DOI: 10.5281/zenodo.12750327
Verfügbarkeit: https://doi.org/10.5281/zenodo.12750327
https://zenodo.org/records/12750327
Rights: Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode
Dokumentencode: edsbas.B9B268CD
Datenbank: BASE
FullText Text:
  Availability: 0
CustomLinks:
  – Url: https://doi.org/10.5281/zenodo.12750327#
    Name: EDS - BASE (s4221598)
    Category: fullText
    Text: View record from BASE
  – Url: https://resolver.ebscohost.com/openurl?sid=EBSCO:edsbas&genre=article&issn=23644168&ISBN=&volume=&issue=&date=20240101&spage=&pages=&title=Dynamic Topic Models of 'Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique'&atitle=Dynamic%20Topic%20Models%20of%20%27Dynamic%20Topic%20Modelling%20for%20Exploring%20the%20Scientific%20Literature%20on%20Coronavirus%3A%20An%20Unsupervised%20Labelling%20Technique%27&aulast=Guill%C3%A9n-Pacho%2C%20Ibai&id=DOI:10.5281/zenodo.12750327
    Name: Full Text Finder
    Category: fullText
    Text: Full Text Finder
    Icon: https://imageserver.ebscohost.com/branding/images/FTF.gif
    MouseOverText: Full Text Finder
  – Url: https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=EBSCO&SrcAuth=EBSCO&DestApp=WOS&ServiceName=TransferToWoS&DestLinkType=GeneralSearchSummary&Func=Links&author=Guill%C3%A9n-Pacho%20I
    Name: ISI
    Category: fullText
    Text: Nájsť tento článok vo Web of Science
    Icon: https://imagesrvr.epnet.com/ls/20docs.gif
    MouseOverText: Nájsť tento článok vo Web of Science
Header DbId: edsbas
DbLabel: BASE
An: edsbas.B9B268CD
RelevancyScore: 884
AccessLevel: 3
PubType:
PubTypeId: unknown
PreciseRelevancyScore: 884.306396484375
IllustrationInfo
Items – Name: Title
  Label: Title
  Group: Ti
  Data: Dynamic Topic Models of 'Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique'
– Name: Author
  Label: Authors
  Group: Au
  Data: <searchLink fieldCode="AR" term="%22Guillén-Pacho%2C+Ibai%22">Guillén-Pacho, Ibai</searchLink><br /><searchLink fieldCode="AR" term="%22orcid%3A0000-0001-7801-%22">orcid:0000-0001-7801-</searchLink>
– Name: Author
  Label: Contributors
  Group: Au
  Data: Badenes-Olmedo, Carlos<br />Corcho, Oscar
– Name: Publisher
  Label: Publisher Information
  Group: PubInfo
  Data: Zenodo
– Name: DatePubCY
  Label: Publication Year
  Group: Date
  Data: 2024
– Name: Subset
  Label: Collection
  Group: HoldingsInfo
  Data: Zenodo
– Name: Subject
  Label: Subject Terms
  Group: Su
  Data: <searchLink fieldCode="DE" term="%22Topic+Models%22">Topic Models</searchLink><br /><searchLink fieldCode="DE" term="%22Dynamic+Topic+Models%22">Dynamic Topic Models</searchLink><br /><searchLink fieldCode="DE" term="%22Dynamic+Topic+Labelling%22">Dynamic Topic Labelling</searchLink><br /><searchLink fieldCode="DE" term="%22Topic+Labelling%22">Topic Labelling</searchLink>
– Name: Abstract
  Label: Description
  Group: Ab
  Data: This resource includes the models generated for the work Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique. Each zip file has the models with the different configurations (number of topics) for each type and, in addition, an evaluation script (bench.py) and different files necessary for this (localizer, timestamps, CORPUS etc.) are included. The requirements for reusing these models are as follows: Unzip all files and install the required packages ("requirements.txt" file). Download the precompiled DTM implementation of https://github.com/magsilva/dtm/tree/master/bin or compile manually the original implementation https://github.com/blei-lab/dtm Download the DTM wrapper from https://github.com/piskvorky/gensim/releases/tag/3.8.3 ("gensim-3.8.3/gensim/models/wrappers/dtmmodel.py"). Download the DETM python implementation of https://github.com/quynhneo/detm. To run model evaluation: modify the imports in the "bench.py" file to match the DETM and DTM models location and their full path (instructions in the file documentation). To repeat our topic study: follow the "notebook.ipynb" instructions. The overview of this resource is: RESOURCES├── BERTopic│ ├── BERTopic_100│ ├── BERTopic_100_probabilities.npy│ ├── BERTopic_100_topics│ ├── BERTopic_100_topic_words│ ├── BERTopic_200│ ├── BERTopic_200_probabilities.npy│ ├── BERTopic_200_topics│ ├── BERTopic_200_topic_words│ ├── BERTopic_300│ ├── BERTopic_300_probabilities.npy│ ├── BERTopic_300_topics│ ├── BERTopic_300_topic_words│ ├── BERTopic_400│ ├── BERTopic_400_probabilities.npy│ ├── BERTopic_400_topics│ └── BERTopic_400_topic_words│├── DETM│ ├── detm_deberta_model1 # 100 topics model │ ├── detm_deberta_model1_beta.mat│ ├── detm_deberta_model2 # 200 topics model │ ├── detm_deberta_model2_beta.mat│ ├── detm_word2vec_model1 # 100 topics model │ ├── detm_word2vec_model1_beta.mat│ ├── detm_word2vec_model2 # 200 topics model │ ├── detm_word2vec_model2_beta.mat│ └── min_df_3333│ └── .│├── DTM_ALL│ ├── ...
– Name: TypeDocument
  Label: Document Type
  Group: TypDoc
  Data: other/unknown material
– Name: Language
  Label: Language
  Group: Lang
  Data: English
– Name: ISSN
  Label: ISSN
  Group: ISSN
  Data: 2364-4168
– Name: NoteTitleSource
  Label: Relation
  Group: SrcInfo
  Data: https://zenodo.org/records/12750327; oai:zenodo.org:12750327; https://doi.org/10.5281/zenodo.12750327
– Name: DOI
  Label: DOI
  Group: ID
  Data: 10.5281/zenodo.12750327
– Name: URL
  Label: Availability
  Group: URL
  Data: https://doi.org/10.5281/zenodo.12750327<br />https://zenodo.org/records/12750327
– Name: Copyright
  Label: Rights
  Group: Cpyrght
  Data: Creative Commons Attribution 4.0 International ; cc-by-4.0 ; https://creativecommons.org/licenses/by/4.0/legalcode
– Name: AN
  Label: Accession Number
  Group: ID
  Data: edsbas.B9B268CD
PLink https://erproxy.cvtisr.sk/sfx/access?url=https://search.ebscohost.com/login.aspx?direct=true&site=eds-live&db=edsbas&AN=edsbas.B9B268CD
RecordInfo BibRecord:
  BibEntity:
    Identifiers:
      – Type: doi
        Value: 10.5281/zenodo.12750327
    Languages:
      – Text: English
    Subjects:
      – SubjectFull: Topic Models
        Type: general
      – SubjectFull: Dynamic Topic Models
        Type: general
      – SubjectFull: Dynamic Topic Labelling
        Type: general
      – SubjectFull: Topic Labelling
        Type: general
    Titles:
      – TitleFull: Dynamic Topic Models of 'Dynamic Topic Modelling for Exploring the Scientific Literature on Coronavirus: An Unsupervised Labelling Technique'
        Type: main
  BibRelationships:
    HasContributorRelationships:
      – PersonEntity:
          Name:
            NameFull: Guillén-Pacho, Ibai
      – PersonEntity:
          Name:
            NameFull: orcid:0000-0001-7801-
      – PersonEntity:
          Name:
            NameFull: Badenes-Olmedo, Carlos
      – PersonEntity:
          Name:
            NameFull: Corcho, Oscar
    IsPartOfRelationships:
      – BibEntity:
          Dates:
            – D: 01
              M: 01
              Type: published
              Y: 2024
          Identifiers:
            – Type: issn-print
              Value: 23644168
            – Type: issn-locals
              Value: edsbas
            – Type: issn-locals
              Value: edsbas.oa
ResultId 1