Sentiment analysis for software engineering how far can we go?

Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit messages. Studies indicate that sentiment analysis tools provide unreliable results when used out-of-the-box, since they are not designed to...

Full description

Saved in:
Bibliographic Details
Published in:2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE) pp. 94 - 104
Main Authors: Lin, Bin, Zampetti, Fiorella, Bavota, Gabriele, Di Penta, Massimiliano, Lanza, Michele, Oliveto, Rocco
Format: Conference Proceeding
Language:English
Published: New York, NY, USA ACM 27.05.2018
Series:ACM Conferences
Subjects:
ISBN:9781450356381, 1450356389
ISSN:1558-1225
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit messages. Studies indicate that sentiment analysis tools provide unreliable results when used out-of-the-box, since they are not designed to process SE datasets. The silver bullet for a successful application of sentiment analysis tools to SE datasets might be their customization to the specific usage context. We describe our experience in building a software library recommender exploiting developers' opinions mined from Stack Overflow. To reach our goal, we retrained---on a set of 40k manually labeled sentences/words extracted from Stack Overflow---a state-of-the-art sentiment analysis tool exploiting deep learning. Despite such an effort- and time-consuming training process, the results were negative. We changed our focus and performed a thorough investigation of the accuracy of commonly used tools to identify the sentiment of SE related texts. Meanwhile, we also studied the impact of different datasets on tool performance. Our results should warn the research community about the strong limitations of current sentiment analysis tools.
AbstractList Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit messages. Studies indicate that sentiment analysis tools provide unreliable results when used out-of-the-box, since they are not designed to process SE datasets. The silver bullet for a successful application of sentiment analysis tools to SE datasets might be their customization to the specific usage context. We describe our experience in building a software library recommender exploiting crowdsourced opinions mined from Stack Overflow (e.g., what is the sentiment of developers about the usability of a library). To reach our goal, we retrained-on a set of 40k manually labeled sentences/words extracted from Stack Overflow-a state-of-the-art sentiment analysis tool exploiting deep learning. Despite such an effort- and time-consuming training process, the results were negative. We changed our focus and performed a thorough investigation of the accuracy of these tools on a variety of SE datasets. Our results should warn the research community about the strong limitations of current sentiment analysis tools.
Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit messages. Studies indicate that sentiment analysis tools provide unreliable results when used out-of-the-box, since they are not designed to process SE datasets. The silver bullet for a successful application of sentiment analysis tools to SE datasets might be their customization to the specific usage context. We describe our experience in building a software library recommender exploiting developers' opinions mined from Stack Overflow. To reach our goal, we retrained---on a set of 40k manually labeled sentences/words extracted from Stack Overflow---a state-of-the-art sentiment analysis tool exploiting deep learning. Despite such an effort- and time-consuming training process, the results were negative. We changed our focus and performed a thorough investigation of the accuracy of commonly used tools to identify the sentiment of SE related texts. Meanwhile, we also studied the impact of different datasets on tool performance. Our results should warn the research community about the strong limitations of current sentiment analysis tools.
Author Lin, Bin
Oliveto, Rocco
Di Penta, Massimiliano
Lanza, Michele
Zampetti, Fiorella
Bavota, Gabriele
Author_xml – sequence: 1
  givenname: Bin
  surname: Lin
  fullname: Lin, Bin
  organization: Università della Svizzera italiana (USI), Switzerland
– sequence: 2
  givenname: Fiorella
  surname: Zampetti
  fullname: Zampetti, Fiorella
  organization: University of Sannio, Italy
– sequence: 3
  givenname: Gabriele
  surname: Bavota
  fullname: Bavota, Gabriele
  organization: Università della Svizzera italiana (USI), Switzerland
– sequence: 4
  givenname: Massimiliano
  surname: Di Penta
  fullname: Di Penta, Massimiliano
  organization: University of Sannio, Italy
– sequence: 5
  givenname: Michele
  surname: Lanza
  fullname: Lanza, Michele
  organization: Università della Svizzera italiana (USI), Switzerland
– sequence: 6
  givenname: Rocco
  surname: Oliveto
  fullname: Oliveto, Rocco
  organization: University of Molise, Italy
BookMark eNqNkDtPwzAUhc1LopTODCwZkVDC9dseUcVLqsQAzJbbXFeG1kF2JNR_T6CZmFjOGb6jM3xn5Dh1CQm5oNBQKuQNpwaolM1vW3lAZlabAQCXiht6SCYDNTVlTB79YadkVso7ADBlhKFqQq5fMPVxO0Tlk9_sSixV6HJVutB_-YwVpnVMiDmm9Tk5CX5TcDb2lLzd373OH-vF88PT_HZRe2ZlX7e21S0oywKFAIZqyWAljF15LUDYpQE0BiXTKlgpW6a518pzQIrAW8H5lFzufyMius8ctz7vnBGSg9IDvdpTv9q6Zdd9FEfB_Zhxoxk3mhmmzT-nbpkjBv4N9IVeLg
CODEN IEEPAD
ContentType Conference Proceeding
Copyright 2018 ACM
Copyright_xml – notice: 2018 ACM
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1145/3180155.3180195
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList

Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 9781450356381
1450356389
EISSN 1558-1225
EndPage 104
ExternalDocumentID 8453067
Genre orig-research
GroupedDBID 6IE
6IF
6IG
6IL
6IM
6IN
AAJGR
ABLEC
ABQGA
ACM
ADPZR
ALMA_UNASSIGNED_HOLDINGS
APO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
GUFHI
IEGSK
IJVOP
LHSKQ
OCL
RIB
RIC
RIE
RIL
RIO
-~X
.4S
.DC
123
23M
29O
5VS
6IH
6IK
8US
AAWTH
ADZIZ
AFFNX
ARCSS
AVWKF
CHZPO
EDO
FEDTE
I-F
I07
IPLJI
M43
RNS
XOL
ID FETCH-LOGICAL-a295t-d9d7d0692f10f0817520c489ca74049b80e88e5276f955d273a76a30e1e03d433
IEDL.DBID RIE
ISBN 9781450356381
1450356389
ISICitedReferencesCount 129
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000454843300013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 02:08:00 EDT 2025
Fri Sep 13 11:04:35 EDT 2024
IsPeerReviewed false
IsScholarly true
Keywords sentiment analysis
software engineering
NLP
Language English
License Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org.
LinkModel DirectLink
MeetingName ICSE '18: 40th International Conference on Software Engineering
MergedId FETCHMERGED-LOGICAL-a295t-d9d7d0692f10f0817520c489ca74049b80e88e5276f955d273a76a30e1e03d433
PageCount 11
ParticipantIDs acm_books_10_1145_3180155_3180195
ieee_primary_8453067
acm_books_10_1145_3180155_3180195_brief
PublicationCentury 2000
PublicationDate 20180527
2018-May
PublicationDateYYYYMMDD 2018-05-27
2018-05-01
PublicationDate_xml – month: 05
  year: 2018
  text: 20180527
  day: 27
PublicationDecade 2010
PublicationPlace New York, NY, USA
PublicationPlace_xml – name: New York, NY, USA
PublicationSeriesTitle ACM Conferences
PublicationTitle 2018 IEEE/ACM 40th International Conference on Software Engineering (ICSE)
PublicationTitleAbbrev ICSE
PublicationYear 2018
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0002684816
ssj0006499
Score 2.4995284
Snippet Sentiment analysis has been applied to various software engineering (SE) tasks, such as evaluating app reviews or analyzing developers' emotions in commit...
SourceID ieee
acm
SourceType Publisher
StartPage 94
SubjectTerms Information systems -- Information retrieval -- Retrieval tasks and goals -- Sentiment analysis
Motion pictures
NLP
Sentiment analysis
Software
Software engineering
Task analysis
Training
Subtitle how far can we go?
Title Sentiment analysis for software engineering
URI https://ieeexplore.ieee.org/document/8453067
WOSCitedRecordID wos000454843300013&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFA7b8OBp6ibOX0QQvNgtaZMm8eJhOHcaAxV3K2n6AjvYSbe5f9-k7aaCIPbSUkopj9e8L8n7vg-ha6JSElsuAqsNCzzXOlAuTQKRUuMOEZnQlmYTYjKRs5maNtDtjgsDAGXzGfT9ZbmXny3M2i-VDSTjHuE2UVOIuOJq7dZTvGqJ9FCnHoVjB-VrKR_K-MClrkcH_fLszSSa2rz9cFQpC8qo_b9POUDdL2Yenu5qziFqQH6E2ltrBlz_qR2HwX0XkH8H3sqOYAdP8ZMbdTe6APxNh_AOjxcbPNIFHuocvwJ-XNx30cvo4Xk4DmqvhECHiq-CTGUiI7EKLSXWlXnBQ2KYVEYL5iYBqSQgJfBQxFZxnjnQokWsIwIUSJSxKDpGrXyRwwnCNALJgMZAWMq0NClRQHgIIU2p1Vb10JWLXeInAcuk4jXzpI5vUse3h27-fCZJiznYHur46CbvlbhGUgf29PfbZ2jfIRZZdRyeo9aqWMMF2jMfq_myuCwz4hNBXa4t
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1dS8MwFL3MKejT_MT5GUHwxWraJk3iiw_inDiHoKJvJU1vYA9u0m3u75t03VQQxL60lFLK5Tb3JLnnHIBjqjKaWC4Cqw0LPNc6UC5NApGFxh0iNpEtzSZEtytfX9VDDU7nXBhELJvP8Mxflnv5-cCM_VLZuWTcI9wFWOSMRXTK1pqvqHjdEunBTjUOJw7MV2I-IePnLnk9Pjgrz95OYkGbtx-eKmVJaTX-9zGrsPnFzSMP86qzBjXsr0NjZs5Aqn91w6Fw3wfk30FmwiPEAVTy6MbdiS6QfFMivCDtwYS0dEGudJ-8ILkZXG7Cc-v66aodVG4JgY4UHwW5ykVOExXZkFpX6AWPqGFSGS2YmwZkkqKUyCORWMV57mCLFomOKYZI45zF8RbU-4M-bgMJY5QMwwQpy5iWJqMKKY8wCrPQaquacORil_ppwDCdMpt5WsU3reLbhJM_n0mzooe2CRs-uun7VF4jrQK78_vtQ1huP9130s5t924XVhx-kdP-wz2oj4ox7sOS-Rj1hsVBmR2fhqCxdA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+IEEE%2FACM+40th+International+Conference+on+Software+Engineering+%28ICSE%29&rft.atitle=Sentiment+Analysis+for+Software+Engineering%3A+How+Far+Can+We+Go%3F&rft.au=Lin%2C+Bin&rft.au=Zampetti%2C+Fiorella&rft.au=Bavota%2C+Gabriele&rft.au=Di+Penta%2C+Massimiliano&rft.date=2018-05-01&rft.pub=ACM&rft.eissn=1558-1225&rft.spage=94&rft.epage=104&rft_id=info:doi/10.1145%2F3180155.3180195&rft.externalDocID=8453067
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450356381/lc.gif&client=summon&freeimage=true
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450356381/mc.gif&client=summon&freeimage=true
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=9781450356381/sc.gif&client=summon&freeimage=true