Dataset for the recognition of Kurdish sound dialects

Dialect recognition System (DRS) is a highly significant subject within the field of speech analysis. The performance of speech recognition systems is adversely impacted by factors such as the age, gender, and dialect features of the speaker. In order to address variations in dialect, it is possible...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Data in brief Jg. 53; S. 110231
Hauptverfasser: Rawf, Karwan M. Hama, Karim, Sarkhel H. Taher, Abdulrahman, Ayub O., Ghafoor, Karzan J.
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Netherlands Elsevier Inc 01.04.2024
Elsevier
Schlagworte:
ISSN:2352-3409, 2352-3409
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Dialect recognition System (DRS) is a highly significant subject within the field of speech analysis. The performance of speech recognition systems is adversely impacted by factors such as the age, gender, and dialect features of the speaker. In order to address variations in dialect, it is possible to incorporate DRS into speech recognition systems. The system can be configured to utilize the appropriate speech recognition model based on the identification of the spoken dialect. Currently, there is a lack of available datasets suitable for the development of automatic dialect recognition systems specifically tailored for the Kurdish language. The proposed dataset under consideration is assessed using experimental data that has been gathered by personnel associated with the Computer Science Department at the University of Halabja. As the Kurdish language has three main dialects: Northern Kurdish (Badini variation), Central Kurdish (Sorani variant), and Hawrami, three dialects are included in the dataset.
AbstractList Dialect recognition System (DRS) is a highly significant subject within the field of speech analysis. The performance of speech recognition systems is adversely impacted by factors such as the age, gender, and dialect features of the speaker. In order to address variations in dialect, it is possible to incorporate DRS into speech recognition systems. The system can be configured to utilize the appropriate speech recognition model based on the identification of the spoken dialect. Currently, there is a lack of available datasets suitable for the development of automatic dialect recognition systems specifically tailored for the Kurdish language. The proposed dataset under consideration is assessed using experimental data that has been gathered by personnel associated with the Computer Science Department at the University of Halabja. As the Kurdish language has three main dialects: Northern Kurdish (Badini variation), Central Kurdish (Sorani variant), and Hawrami, three dialects are included in the dataset.
Dialect recognition System (DRS) is a highly significant subject within the field of speech analysis. The performance of speech recognition systems is adversely impacted by factors such as the age, gender, and dialect features of the speaker. In order to address variations in dialect, it is possible to incorporate DRS into speech recognition systems. The system can be configured to utilize the appropriate speech recognition model based on the identification of the spoken dialect. Currently, there is a lack of available datasets suitable for the development of automatic dialect recognition systems specifically tailored for the Kurdish language. The proposed dataset under consideration is assessed using experimental data that has been gathered by personnel associated with the Computer Science Department at the University of Halabja. As the Kurdish language has three main dialects: Northern Kurdish (Badini variation), Central Kurdish (Sorani variant), and Hawrami, three dialects are included in the dataset.Dialect recognition System (DRS) is a highly significant subject within the field of speech analysis. The performance of speech recognition systems is adversely impacted by factors such as the age, gender, and dialect features of the speaker. In order to address variations in dialect, it is possible to incorporate DRS into speech recognition systems. The system can be configured to utilize the appropriate speech recognition model based on the identification of the spoken dialect. Currently, there is a lack of available datasets suitable for the development of automatic dialect recognition systems specifically tailored for the Kurdish language. The proposed dataset under consideration is assessed using experimental data that has been gathered by personnel associated with the Computer Science Department at the University of Halabja. As the Kurdish language has three main dialects: Northern Kurdish (Badini variation), Central Kurdish (Sorani variant), and Hawrami, three dialects are included in the dataset.
ArticleNumber 110231
Author Ghafoor, Karzan J.
Abdulrahman, Ayub O.
Rawf, Karwan M. Hama
Karim, Sarkhel H. Taher
Author_xml – sequence: 1
  givenname: Karwan M. Hama
  orcidid: 0000-0002-0350-7435
  surname: Rawf
  fullname: Rawf, Karwan M. Hama
  email: karwan.hamaraouf@uoh.edu.iq
– sequence: 2
  givenname: Sarkhel H. Taher
  surname: Karim
  fullname: Karim, Sarkhel H. Taher
– sequence: 3
  givenname: Ayub O.
  orcidid: 0000-0003-3508-1093
  surname: Abdulrahman
  fullname: Abdulrahman, Ayub O.
– sequence: 4
  givenname: Karzan J.
  surname: Ghafoor
  fullname: Ghafoor, Karzan J.
BackLink https://www.ncbi.nlm.nih.gov/pubmed/38435729$$D View this record in MEDLINE/PubMed
BookMark eNqFkU1v1DAQhi1UREvpD-CCcuSyy4w_4kScUKGlohIXOFuOPW69ysbFdirx78mSUiEO5eTR6HlfWfO8ZEdTmoix1whbBGzf7bY-DlsOXG4RgQt8xk64UHwjJPRHf83H7KyUHQCgkstSvWDHopNCad6fMPXRVluoNiHlpt5Sk8mlmynWmKYmhebLnH0st01J8-QbH-1IrpZX7HmwY6Gzh_eUfb_49O388-b66-XV-YfrjZO9qhsSrfVB42BhGEB4DB2G0KLl5DpnXY9doL6VSooABMC95toDBBGURE3ilF2tvT7ZnbnLcW_zT5NsNL8XKd8Ym2t0IxnkXauk8y5gkN7bDnstSWDXywBg5dL1du26y-nHTKWafSyOxtFOlOZiBCrRQddq_l-U90ILoaWABX3zgM7DnvzjH_9ceAFwBVxOpWQKjwiCOYg0O7OINAeRZhW5ZPQ_GRerPSip2cbxyeT7NUmLlftI2RQXaXLk4yK2LmeLT6R_AYrjtBE
CitedBy_id crossref_primary_10_1016_j_dib_2024_110949
crossref_primary_10_1016_j_mex_2025_103374
crossref_primary_10_1016_j_dib_2025_111826
Cites_doi 10.26480/aim.01.2023.08.14
10.1017/jlg.2017.6
ContentType Journal Article
Copyright 2024
Copyright_xml – notice: 2024
DBID 6I.
AAFTH
AAYXX
CITATION
NPM
7X8
7S9
L.6
DOA
DOI 10.1016/j.dib.2024.110231
DatabaseName ScienceDirect Open Access Titles
Elsevier:ScienceDirect:Open Access
CrossRef
PubMed
MEDLINE - Academic
AGRICOLA
AGRICOLA - Academic
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
PubMed
MEDLINE - Academic
AGRICOLA
AGRICOLA - Academic
DatabaseTitleList
AGRICOLA
MEDLINE - Academic
PubMed

Database_xml – sequence: 1
  dbid: DOA
  name: Directory of Open Access Journals (DOAJ)
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Sciences (General)
Computer Science
EISSN 2352-3409
ExternalDocumentID oai_doaj_org_article_128654cdcf1f4dda81974e31894f00a4
38435729
10_1016_j_dib_2024_110231
S2352340924002026
Genre Journal Article
GroupedDBID 0R~
0SF
4.4
457
53G
5VS
6I.
AACTN
AAEDT
AAEDW
AAFTH
AAIKJ
AALRI
AAXUO
ABMAC
ACGFS
ADBBV
ADEZE
ADRAZ
AEXQZ
AFTJW
AGHFR
AITUG
AKRWK
ALMA_UNASSIGNED_HOLDINGS
AMRAJ
AOIJS
BAWUL
BCNDV
DIK
EBS
EJD
FDB
GROUPED_DOAJ
HYE
IPNFZ
KQ8
M41
M48
M~E
NCXOZ
O9-
OK1
RIG
ROL
RPM
SSZ
AAFWJ
AAYWO
AAYXX
ACVFH
ADCNI
ADVLN
AEUPX
AFJKZ
AFPKN
AFPUW
AIGII
AKBMS
AKYEP
APXCP
CITATION
NPM
7X8
7S9
L.6
ID FETCH-LOGICAL-c495t-e36adf71ba0bb03d1f81ff61a2ec8cac918fe964543f0e002d727d00f3f5417e3
IEDL.DBID DOA
ISICitedReferencesCount 5
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001195607200001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 2352-3409
IngestDate Fri Oct 03 12:52:03 EDT 2025
Thu Oct 02 05:43:19 EDT 2025
Fri Jul 11 15:57:21 EDT 2025
Thu Apr 03 07:02:15 EDT 2025
Tue Nov 18 22:37:03 EST 2025
Sat Nov 29 02:12:37 EST 2025
Sat Apr 13 16:39:53 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords Hawrami
Dialect recognition
Badini
Sorani
Kurdish dialect
Language English
License This is an open access article under the CC BY license.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c495t-e36adf71ba0bb03d1f81ff61a2ec8cac918fe964543f0e002d727d00f3f5417e3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0003-3508-1093
0000-0002-0350-7435
OpenAccessLink https://doaj.org/article/128654cdcf1f4dda81974e31894f00a4
PMID 38435729
PQID 2937337430
PQPubID 23479
ParticipantIDs doaj_primary_oai_doaj_org_article_128654cdcf1f4dda81974e31894f00a4
proquest_miscellaneous_3153808672
proquest_miscellaneous_2937337430
pubmed_primary_38435729
crossref_primary_10_1016_j_dib_2024_110231
crossref_citationtrail_10_1016_j_dib_2024_110231
elsevier_sciencedirect_doi_10_1016_j_dib_2024_110231
PublicationCentury 2000
PublicationDate 2024-04-01
PublicationDateYYYYMMDD 2024-04-01
PublicationDate_xml – month: 04
  year: 2024
  text: 2024-04-01
  day: 01
PublicationDecade 2020
PublicationPlace Netherlands
PublicationPlace_xml – name: Netherlands
PublicationTitle Data in brief
PublicationTitleAlternate Data Brief
PublicationYear 2024
Publisher Elsevier Inc
Elsevier
Publisher_xml – name: Elsevier Inc
– name: Elsevier
References Hama Rawf, Mohammed, Abdulrahman, Abdalla, Ghafor (bib0009) 2023; 7
Abdul (bib0004) 2019; 7
Badawi, Saeed, Ahmed, Abdalla, Hassan (bib0008) 2023; 48
Ghafoor, Hama Rawf, Abdulrahman, Taher (bib0002) 2021; 9
Işik, Artuner (bib0003) 2018
Eppler, Benedikt (bib0007) 2017; 5
Abdalla, Qadir, Shakor, Saeed, Jabar, Salam, Amin (bib0005) 2023; 47
Rawf, Kareem, Abdulrahman, Ghafoor (bib0001) 2023; V2
Al-Talabani, Abdul, Ameen (bib0006) 2017; 5
Badawi (10.1016/j.dib.2024.110231_bib0008) 2023; 48
Abdul (10.1016/j.dib.2024.110231_bib0004) 2019; 7
Eppler (10.1016/j.dib.2024.110231_bib0007) 2017; 5
Hama Rawf (10.1016/j.dib.2024.110231_bib0009) 2023; 7
Abdalla (10.1016/j.dib.2024.110231_bib0005) 2023; 47
Rawf (10.1016/j.dib.2024.110231_bib0001) 2023; V2
Al-Talabani (10.1016/j.dib.2024.110231_bib0006) 2017; 5
Ghafoor (10.1016/j.dib.2024.110231_bib0002) 2021; 9
Işik (10.1016/j.dib.2024.110231_bib0003) 2018
References_xml – volume: 5
  start-page: 20
  year: 2017
  end-page: 23
  ident: bib0006
  article-title: Kurdish dialects and neighbor languages automatic recognition
  publication-title: ARO- Sci. J. Koya Univ.
– volume: 7
  start-page: 566
  year: 2019
  end-page: 572
  ident: bib0004
  article-title: Kurdish speaker identification based on one dimensional convolutional neural network
  publication-title: Comput. Methods Differ. Eq.
– volume: 5
  start-page: 109
  year: 2017
  end-page: 130
  ident: bib0007
  article-title: A perceptual dialectological approach to linguistic variation and spatial analysis of Kurdish varieties
  publication-title: J. Linguist. Geogr.
– volume: 7
  start-page: 08
  year: 2023
  end-page: 14
  ident: bib0009
  article-title: A comparative study using 2D CNN and transfer learning to detect and classify arabic-script-based sign language
  publication-title: Acta Inform. Malaysia
– volume: 47
  year: 2023
  ident: bib0005
  article-title: A vast dataset for Kurdish handwritten digits and isolated characters recognition
  publication-title: Data Br.
– volume: 9
  start-page: 10
  year: 2021
  end-page: 14
  ident: bib0002
  article-title: Kurdish dialect recognition using 1D CNN
  publication-title: ARO- Sci. J. Koya Univ.
– volume: V2
  year: 2023
  ident: bib0001
  article-title: A dataset for the classification of different Kurdish dialects
  publication-title: Mendeley Data
– start-page: 1
  year: 2018
  end-page: 4
  ident: bib0003
  article-title: A dataset for Turkish dialect recognition and classification with deep learning
  publication-title: 2018 26th Signal Processing and Communications Applications Conference (SIU), Izmir, Turkey
– volume: 48
  year: 2023
  ident: bib0008
  article-title: Kurdish News Dataset Headlines (KNDH) through multiclass classification
  publication-title: Data Br.
– volume: 7
  start-page: 08
  issue: 1
  year: 2023
  ident: 10.1016/j.dib.2024.110231_bib0009
  article-title: A comparative study using 2D CNN and transfer learning to detect and classify arabic-script-based sign language
  publication-title: Acta Inform. Malaysia
  doi: 10.26480/aim.01.2023.08.14
– volume: 9
  start-page: 10
  issue: 2
  year: 2021
  ident: 10.1016/j.dib.2024.110231_bib0002
  article-title: Kurdish dialect recognition using 1D CNN
  publication-title: ARO- Sci. J. Koya Univ.
– volume: V2
  year: 2023
  ident: 10.1016/j.dib.2024.110231_bib0001
  article-title: A dataset for the classification of different Kurdish dialects
  publication-title: Mendeley Data
– volume: 47
  year: 2023
  ident: 10.1016/j.dib.2024.110231_bib0005
  article-title: A vast dataset for Kurdish handwritten digits and isolated characters recognition
  publication-title: Data Br.
– volume: 48
  year: 2023
  ident: 10.1016/j.dib.2024.110231_bib0008
  article-title: Kurdish News Dataset Headlines (KNDH) through multiclass classification
  publication-title: Data Br.
– start-page: 1
  year: 2018
  ident: 10.1016/j.dib.2024.110231_bib0003
  article-title: A dataset for Turkish dialect recognition and classification with deep learning
– volume: 7
  start-page: 566
  issue: 4
  year: 2019
  ident: 10.1016/j.dib.2024.110231_bib0004
  article-title: Kurdish speaker identification based on one dimensional convolutional neural network
  publication-title: Comput. Methods Differ. Eq.
– volume: 5
  start-page: 20
  issue: 1
  year: 2017
  ident: 10.1016/j.dib.2024.110231_bib0006
  article-title: Kurdish dialects and neighbor languages automatic recognition
  publication-title: ARO- Sci. J. Koya Univ.
– volume: 5
  start-page: 109
  issue: 2
  year: 2017
  ident: 10.1016/j.dib.2024.110231_bib0007
  article-title: A perceptual dialectological approach to linguistic variation and spatial analysis of Kurdish varieties
  publication-title: J. Linguist. Geogr.
  doi: 10.1017/jlg.2017.6
SSID ssj0001542355
Score 2.2977624
Snippet Dialect recognition System (DRS) is a highly significant subject within the field of speech analysis. The performance of speech recognition systems is...
SourceID doaj
proquest
pubmed
crossref
elsevier
SourceType Open Website
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 110231
SubjectTerms Badini
computer science
data collection
Dialect recognition
gender
Hawrami
human resources
Kurdish dialect
Sorani
speech
Title Dataset for the recognition of Kurdish sound dialects
URI https://dx.doi.org/10.1016/j.dib.2024.110231
https://www.ncbi.nlm.nih.gov/pubmed/38435729
https://www.proquest.com/docview/2937337430
https://www.proquest.com/docview/3153808672
https://doaj.org/article/128654cdcf1f4dda81974e31894f00a4
Volume 53
WOSCitedRecordID wos001195607200001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: Directory of Open Access Journals (DOAJ)
  customDbUrl:
  eissn: 2352-3409
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001542355
  issn: 2352-3409
  databaseCode: DOA
  dateStart: 20140101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2352-3409
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0001542355
  issn: 2352-3409
  databaseCode: M~E
  dateStart: 20140101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LT9wwEB4B6oELAlpgy0OuxKGtFGHHziY58hRSBeLQSnuzHNsjFqFsRbIc-e2M42QFB-DCJYfIeXg8znwTf_4G4JAghnOpxURYk1KCguPEYJYnpspSoyiEGKe6YhP59XUxmZQ3L0p9BU5YlAeOhjsSYeukss6iQOWcoQiWq_DjrlTIuemUQAn1vEim4v5ggglZNixjdoQuN60oH0xVYL6nUrwKRJ1e_6t49Bbe7OLOxTqs9YCRHccX3YAlX2_CRj8lG_az143-9RWyM9NSTGoZ4VBGuI4tyEGzms2Q_ZmTMzS3rAmVlFjYMBKYHN_g38X539PLpK-KkFhKZtrEy7FxmIvK8Kri0gksBOJYmNTbwhpbigJ9GYS6JHJPHzxHEMVxjhIzJXIvt2ClntV-B1ipBDdeSJRlpSpemrHDwoig71J6mbkR8MFE2vaS4aFyxb0euGF3mqyqg1V1tOoIfi8u-R_1Mt5rfBLsvmgYpK67E-QAuncA_ZEDjEANo6Z71BDRAN1q-t6zfwwjrGlGhWUSU_vZvNEEgHIpCVnxt9vIECgoG8zTEWxH91j0QhYEQSln-f4ZvduF1fDSkSu0Byvtw9zvwxf72E6bhwNYzifFQef6dLx6On8GAaoE8Q
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Dataset+for+the+recognition+of+Kurdish+sound+dialects&rft.jtitle=Data+in+brief&rft.au=Rawf%2C+Karwan+M.+Hama&rft.au=Karim%2C+Sarkhel+H.+Taher&rft.au=Abdulrahman%2C+Ayub+O.&rft.au=Ghafoor%2C+Karzan+J&rft.date=2024-04-01&rft.issn=2352-3409&rft.eissn=2352-3409&rft.volume=53+p.110231-&rft_id=info:doi/10.1016%2Fj.dib.2024.110231&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2352-3409&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2352-3409&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2352-3409&client=summon