Source code properties of defective infrastructure as code scripts

Bibliographic Details
Published in: Information and Software Technology, Vol. 112, pp. 148-163
Main Authors: Rahman, Akond; Williams, Laurie
Format: Journal Article
Language: English
Published: Elsevier B.V 01.08.2019
Subjects:
ISSN: 0950-5849, 1873-6025
Abstract In continuous deployment, software and services are rapidly deployed to end-users using an automated deployment pipeline. Defects in infrastructure as code (IaC) scripts can hinder the reliability of the automated deployment pipeline. We hypothesize that certain properties of IaC source code, such as lines of code and hard-coded strings used as configuration values, correlate with defective IaC scripts. The objective of this paper is to help practitioners increase the quality of IaC scripts through an empirical study that identifies source code properties of defective IaC scripts. We apply qualitative analysis to defect-related commits mined from open source software repositories to identify source code properties that correlate with defective IaC scripts. Next, we survey practitioners to assess their level of agreement with the identified properties. We also construct defect prediction models using the identified properties for 2439 scripts collected from four datasets. We identify 10 source code properties that correlate with defective IaC scripts. Of these 10 properties, lines of code and hard-coded string (i.e., using strings as configuration values) show the strongest correlation with defective IaC scripts. According to our survey analysis, a majority of the practitioners agree with two properties: include (the property of executing external modules or scripts) and hard-coded string. Using the identified properties, our defect prediction models achieve a precision of 0.70 to 0.78 and a recall of 0.54 to 0.67. Based on our findings, we recommend that practitioners allocate sufficient inspection and testing effort to IaC scripts that exhibit any of the 10 identified source code properties.
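The two strongest properties named in the abstract, and the precision and recall figures it reports, can be illustrated with a minimal Python sketch. This is not the authors' tooling: the function names, the regular expression, and the sample Puppet-style script are all hypothetical, and the counting rules (skip blank and comment lines; count quoted literals that follow `=>`) are simplifying assumptions made only for illustration.

```python
import re

# Assumption: a "hard-coded string" is approximated as a quoted literal
# assigned to an attribute with "=>". The paper's exact rule is not given here.
HARD_CODED = re.compile(r"""=>\s*(?:'[^']*'|"[^"]*")""")

# Hypothetical Puppet-style script used only as sample input.
SAMPLE_SCRIPT = """\
# Install a package and write its configuration.
package { 'nginx':
  ensure => 'present',
}

file { '/etc/app.conf':
  ensure  => file,
  owner   => 'root',
  content => "port=8080",
}
"""


def extract_properties(script: str) -> dict:
    """Count lines of code and hard-coded string values in a script."""
    # Lines of code: non-blank lines that are not full-line '#' comments.
    code_lines = [
        line for line in script.splitlines()
        if line.strip() and not line.lstrip().startswith("#")
    ]
    hard_coded = sum(1 for line in code_lines for _ in HARD_CODED.finditer(line))
    return {"lines_of_code": len(code_lines), "hard_coded_strings": hard_coded}


def precision_recall(actual, predicted):
    """Precision and recall for binary labels (True means defective)."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a and p)        # defective, flagged
    fp = sum(1 for a, p in pairs if not a and p)    # clean, flagged
    fn = sum(1 for a, p in pairs if a and not p)    # defective, missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    print(extract_properties(SAMPLE_SCRIPT))
    print(precision_recall([True, True, True, False, False],
                           [True, True, False, True, False]))
```

Read this way, a precision of 0.70 would mean that 70% of the scripts a model flags are actually defective, and a recall of 0.54 would mean that 54% of the defective scripts are flagged.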
Author Rahman, Akond
Williams, Laurie
Author_xml – sequence: 1
  givenname: Akond
  orcidid: 0000-0002-5056-757X
  surname: Rahman
  fullname: Rahman, Akond
  email: aarahman@ncsu.edu
– sequence: 2
  givenname: Laurie
  surname: Williams
  fullname: Williams, Laurie
CitedBy_id crossref_primary_10_1016_j_infsof_2021_106662
crossref_primary_10_1145_3712190
crossref_primary_10_1007_s10664_023_10432_6
crossref_primary_10_1016_j_jss_2021_111059
crossref_primary_10_1007_s10664_022_10215_5
crossref_primary_10_3390_s21227535
crossref_primary_10_1016_j_jss_2024_112179
crossref_primary_10_1007_s10664_020_09841_8
crossref_primary_10_1109_TSE_2024_3393070
crossref_primary_10_1109_TSE_2023_3265962
crossref_primary_10_1145_3579639
crossref_primary_10_1109_TSE_2021_3051492
crossref_primary_10_1016_j_jss_2020_110726
crossref_primary_10_1016_j_jss_2023_111935
Cites_doi 10.1109/TSE.2006.38
10.2307/2529310
10.1109/32.177364
10.1109/MS.2017.86
10.1037/0033-2909.114.3.494
10.1109/TSE.2008.35
10.1145/3022671.2984000
10.1016/j.patrec.2010.03.014
10.1109/TSE.2007.70721
10.1214/aoms/1177730491
10.1109/TSE.2007.256941
10.1109/TSE.2016.2599161
10.1080/00437956.1954.11659520
10.1109/TSE.2016.2584050
10.1177/001316446002000104
10.1890/07-0539.1
10.1109/TSE.2011.103
10.1007/s10664-015-9396-2
10.1023/A:1010933404324
10.1145/2980983.2908083
ContentType Journal Article
Copyright 2019 Elsevier B.V.
Copyright_xml – notice: 2019 Elsevier B.V.
DBID AAYXX
CITATION
DOI 10.1016/j.infsof.2019.04.013
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Business
EISSN 1873-6025
EndPage 163
ExternalDocumentID 10_1016_j_infsof_2019_04_013
S0950584919300965
ID FETCH-LOGICAL-c306t-5e3231b325eb51615fbf00ac59cb72945a8dcae195fe1ed86a61b5bde1c44c5d3
ISICitedReferencesCount 33
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000469899100011&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0950-5849
IngestDate Sat Nov 29 07:05:38 EST 2025
Tue Nov 18 21:12:23 EST 2025
Fri Feb 23 02:23:56 EST 2024
IsPeerReviewed true
IsScholarly true
Keywords Infrastructure as code
Configuration as code
Continuous deployment
Empirical study
DevOps
Puppet
Defect prediction
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c306t-5e3231b325eb51615fbf00ac59cb72945a8dcae195fe1ed86a61b5bde1c44c5d3
ORCID 0000-0002-5056-757X
PageCount 16
ParticipantIDs crossref_citationtrail_10_1016_j_infsof_2019_04_013
crossref_primary_10_1016_j_infsof_2019_04_013
elsevier_sciencedirect_doi_10_1016_j_infsof_2019_04_013
PublicationCentury 2000
PublicationDate August 2019
2019-08-00
PublicationDateYYYYMMDD 2019-08-01
PublicationDate_xml – month: 08
  year: 2019
  text: August 2019
PublicationDecade 2010
PublicationTitle Information and software technology
PublicationYear 2019
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
Rahman (10.1016/j.infsof.2019.04.013_bib0004) 2018
Rahman (10.1016/j.infsof.2019.04.013_bib0020) 2017
Menzies (10.1016/j.infsof.2019.04.013_bib0044) 2007; 33
Chillarege (10.1016/j.infsof.2019.04.013_bib0059) 1992; 18
Cliff (10.1016/j.infsof.2019.04.013_bib0039) 1993; 114
Menzies (10.1016/j.infsof.2019.04.013_bib0052) 2007; 33
Tosun Misirli (10.1016/j.infsof.2019.04.013_bib0058) 2011
Garland (10.1016/j.infsof.2019.04.013_bib0043) 1991
Mann (10.1016/j.infsof.2019.04.013_bib0038) 1947; 18
Jiang (10.1016/j.infsof.2019.04.013_bib0006) 2015
Zhang (10.1016/j.infsof.2019.04.013_bib0030) 2016; 21
Pedregosa (10.1016/j.infsof.2019.04.013_bib0047) 2011; 12
Zhang (10.1016/j.infsof.2019.04.013_bib0031) 2017; 43
Henningsson (10.1016/j.infsof.2019.04.013_bib0035) 2004
Charmaz (10.1016/j.infsof.2019.04.013_bib0036) 2014
Porter (10.1016/j.infsof.2019.04.013_bib0055) 1997
Shambaugh (10.1016/j.infsof.2019.04.013_bib0013) 2016; 51
Meneely (10.1016/j.infsof.2019.04.013_bib0056) 2009
Alali (10.1016/j.infsof.2019.04.013_bib0032) 2008
Nagappan (10.1016/j.infsof.2019.04.013_bib0045) 2006
Hanappi (10.1016/j.infsof.2019.04.013_bib0016) 2016; 51
Ghotra (10.1016/j.infsof.2019.04.013_bib0050) 2015
Fukushima (10.1016/j.infsof.2019.04.013_bib0057) 2014
Weiss (10.1016/j.infsof.2019.04.013_bib0017) 2017
Cutler (10.1016/j.infsof.2019.04.013_bib0041) 2007; 88
Genuer (10.1016/j.infsof.2019.04.013_bib0042) 2010; 31
McCune (10.1016/j.infsof.2019.04.013_sbref0014) 2011
Harris (10.1016/j.infsof.2019.04.013_bib0011) 1954; 10
(10.1016/j.infsof.2019.04.013_sbref0026) 2010
Sharma (10.1016/j.infsof.2019.04.013_bib0012) 2016
Munaiah (10.1016/j.infsof.2019.04.013_bib0028) 2017
Tantithamthavorn (10.1016/j.infsof.2019.04.013_bib0053) 2017; 43
Rahman (10.1016/j.infsof.2019.04.013_bib0022) 2018
Lewis (10.1016/j.infsof.2019.04.013_bib0060) 2013
Breiman (10.1016/j.infsof.2019.04.013_bib0010) 2001; 45
Landis (10.1016/j.infsof.2019.04.013_bib0034) 1977; 33
Tan (10.1016/j.infsof.2019.04.013_bib0008) 2005
Zheng (10.1016/j.infsof.2019.04.013_bib0025) 2006; 32
Ray (10.1016/j.infsof.2019.04.013_bib0029) 2016
Rahman (10.1016/j.infsof.2019.04.013_sbref0022) 2019
Rahman (10.1016/j.infsof.2019.04.013_bib0049) 2013
Breiman (10.1016/j.infsof.2019.04.013_bib0007) 1984
Cito (10.1016/j.infsof.2019.04.013_bib0005) 2015
Humble (10.1016/j.infsof.2019.04.013_bib0002) 2010
Rahman (10.1016/j.infsof.2019.04.013_bib0021) 2018
Romano (10.1016/j.infsof.2019.04.013_bib0040) 2006
Pandit (10.1016/j.infsof.2019.04.013_bib0037) 1996; 2
Lessmann (10.1016/j.infsof.2019.04.013_bib0051) 2008; 34
Parnin (10.1016/j.infsof.2019.04.013_bib0003) 2017; 34
Rahman (10.1016/j.infsof.2019.04.013_bib0001) 2015
Freedman (10.1016/j.infsof.2019.04.013_bib0009) 2005
Hummer (10.1016/j.infsof.2019.04.013_bib0018) 2013
Ghotra (10.1016/j.infsof.2019.04.013_bib0046) 2017
10.1016/j.infsof.2019.04.013_bib0015
Cohen (10.1016/j.infsof.2019.04.013_bib0033) 1960; 20
Hall (10.1016/j.infsof.2019.04.013_bib0048) 2012; 38
Nagappan (10.1016/j.infsof.2019.04.013_bib0024) 2005
van der Bent (10.1016/j.infsof.2019.04.013_bib0019) 2018
Zimmermann (10.1016/j.infsof.2019.04.013_bib0026) 2007
Walden (10.1016/j.infsof.2019.04.013_bib0054) 2014
References_xml – reference: P. Labs, Puppet Documentation, 2017, (
– volume: 18
  start-page: 50
  year: 1947
  end-page: 60
  ident: bib0038
  article-title: On a test of whether one of two random variables is stochastically larger than the other
  publication-title: Ann. Math. Stat.
– year: 2019
  ident: bib0023
  article-title: The seven sins: Security smells in infrastructure as code scripts
  publication-title: Proceedings of the 41st International Conference on Software Engineering
– start-page: 66
  year: 1991
  end-page: 70
  ident: bib0043
  article-title: The mid-point on a rating scale: is it desirable
  publication-title: Marketing Bull.
– start-page: 34
  year: 2018
  end-page: 45
  ident: bib0004
  article-title: Characterizing defective configuration scripts used for continuous deployment
  publication-title: 2018 IEEE 11th International Conference on Software Testing, Verification and Validation (ICST)
– year: 2005
  ident: bib0009
  article-title: Statistical Models : Theory and Practice
– volume: 33
  start-page: 2
  year: 2007
  end-page: 13
  ident: bib0044
  article-title: Data mining static code attributes to learn defect predictors
  publication-title: IEEE Trans. Softw. Eng.
– year: 2010
  ident: bib0002
  article-title: Continuous delivery: Reliable Software Releases Through Build, Test, and Deployment Automation
– start-page: 284
  year: 2005
  end-page: 292
  ident: bib0024
  article-title: Use of relative code churn measures to predict system defect density
  publication-title: Proceedings of the 27th International Conference on Software Engineering
– reference: ).[Online; accessed 10-October-2017].
– year: 2018
  ident: bib0022
  article-title: A systematic mapping study of infrastructure as code research
  publication-title: Inf. Softw. Technol.
– volume: 32
  start-page: 240
  year: 2006
  end-page: 253
  ident: bib0025
  article-title: On the value of static analysis for fault detection in software
  publication-title: IEEE Trans. Softw. Eng.
– start-page: 789
  year: 2015
  end-page: 800
  ident: bib0050
  article-title: Revisiting the impact of classification techniques on the performance of defect prediction models
  publication-title: Proceedings of the 37th International Conference on Software Engineering - Volume 1
– year: 2014
  ident: bib0036
  article-title: Constructing Grounded Theory
– volume: 114
  start-page: 494
  year: 1993
  end-page: 509
  ident: bib0039
  article-title: Dominance statistics: ordinal analyses to answer ordinal questions.
  publication-title: Psychol. Bull.
– start-page: 189
  year: 2016
  end-page: 200
  ident: bib0012
  article-title: Does your configuration code smell?
  publication-title: Proceedings of the 13th International Conference on Mining Software Repositories
– start-page: 313
  year: 1997
  end-page: 316
  ident: bib0055
  article-title: Readings in information retrieval
– start-page: 182
  year: 2008
  end-page: 191
  ident: bib0032
  article-title: What’s a typical commit? a characterization of open source software repositories
  publication-title: 2008 16th IEEE International Conference on Program Comprehension
– start-page: 26
  year: 2011
  end-page: 33
  ident: bib0058
  article-title: An explanatory analysis on eclipse beta-release bugs through in-process metrics
  publication-title: Proceedings of the 8th International Workshop on Software Quality
– start-page: 20
  year: 2017
  end-page: 26
  ident: bib0020
  article-title: Which factors influence practitioners’ usage of build automation tools?
  publication-title: Proceedings of the 3rd International Workshop on Rapid Continuous Software Engineering
– start-page: 1
  year: 2015
  end-page: 10
  ident: bib0001
  article-title: Synthesizing continuous deployment practices used in software development
  publication-title: Proceedings of the 2015 Agile Conference
– volume: 34
  start-page: 86
  year: 2017
  end-page: 95
  ident: bib0003
  article-title: The top 10 adages in continuous deployment
  publication-title: IEEE Softw.
– start-page: 428
  year: 2016
  end-page: 439
  ident: bib0029
  article-title: On the “naturalness” of buggy code
  publication-title: Proceedings of the 38th International Conference on Software Engineering
– start-page: 4:1
  year: 2013
  end-page: 4:2
  ident: bib0018
  article-title: Automated testing of chef automation scripts
  publication-title: Proceedings Demo:38; Poster Track of ACM/IFIP/USENIX International Middleware Conference
– volume: 20
  start-page: 37
  year: 1960
  end-page: 46
  ident: bib0033
  article-title: A coefficient of agreement for nominal scales
  publication-title: Edu. Psychol. Meas.
– start-page: 95
  year: 2004
  end-page: 104
  ident: bib0035
  article-title: Assuring fault classification agreement “an empirical evaluation
  publication-title: Proceedings of the 2004 International Symposium on Empirical Software Engineering
– volume: 2
  start-page: 1
  year: 1996
  end-page: 20
  ident: bib0037
  article-title: The creation of theory: a recent application of the grounded theory method
  publication-title: Qualit. Rep.
– volume: 38
  start-page: 1276
  year: 2012
  end-page: 1304
  ident: bib0048
  article-title: A systematic literature review on fault prediction performance in software engineering
  publication-title: IEEE Trans. Softw. Eng.
– year: 1984
  ident: bib0007
  article-title: Classification and Regression Trees
– volume: 12
  start-page: 2825
  year: 2011
  end-page: 2830
  ident: bib0047
  article-title: Scikit-learn: Machine learning in python
  publication-title: J. Mach. Learn. Res.
– volume: 45
  start-page: 5
  year: 2001
  end-page: 32
  ident: bib0010
  article-title: Random forests
  publication-title: Machine Learning
– start-page: 432
  year: 2013
  end-page: 441
  ident: bib0049
  article-title: How, and why, process metrics are better
  publication-title: Proceedings of the 2013 International Conference on Software Engineering
– year: 2011
  ident: bib0014
  article-title: Pro Puppet
– start-page: 372
  year: 2013
  end-page: 381
  ident: bib0060
  article-title: Does bug prediction support human developers? findings from a Google case study
  publication-title: Proceedings of the 2013 International Conference on Software Engineering
– start-page: 453
  year: 2009
  end-page: 462
  ident: bib0056
  article-title: Secure open source collaboration: An empirical study of linus’ law
  publication-title: Proceedings of the 16th ACM Conference on Computer and Communications Security
– volume: 31
  start-page: 2225
  year: 2010
  end-page: 2236
  ident: bib0042
  article-title: Variable selection using random forests
  publication-title: Pattern Recognit. Lett.
– start-page: 23
  year: 2014
  end-page: 33
  ident: bib0054
  article-title: Predicting vulnerable components: Software metrics vs text mining
  publication-title: 2014 IEEE 25th International Symposium on Software Reliability Engineering
– start-page: 16
  year: 2018
  end-page: 22
  ident: bib0021
  article-title: What questions do programmers ask about configuration as code?
  publication-title: Proceedings of the 4th International Workshop on Rapid Continuous Software Engineering
– volume: 10
  start-page: 146
  year: 1954
  end-page: 162
  ident: bib0011
  article-title: Distributional structure
  publication-title: WORD
– start-page: 164
  year: 2018
  end-page: 174
  ident: bib0019
  article-title: How good is your puppet? an empirically defined and validated quality model for puppet
  publication-title: 2018 IEEE 25th International Conference on Software Analysis, Evolution and Reengineering (SANER)
– volume: 43
  start-page: 1
  year: 2017
  end-page: 18
  ident: bib0053
  article-title: An empirical comparison of model validation techniques for defect prediction models
  publication-title: IEEE Trans. Softw. Eng.
– start-page: 452
  year: 2006
  end-page: 461
  ident: bib0045
  article-title: Mining metrics to predict component failures
  publication-title: Proceedings of the 28th International Conference on Software Engineering
– year: 2005
  ident: bib0008
  article-title: Introduction to Data Mining, (First Edition)
– volume: 88
  start-page: 2783
  year: 2007
  end-page: 2792
  ident: bib0041
  article-title: Random forests for classification in ecology
  publication-title: Ecology
– start-page: 172
  year: 2014
  end-page: 181
  ident: bib0057
  article-title: An empirical study of just-in-time defect prediction using cross-project models
  publication-title: Proceedings of the 11th Working Conference on Mining Software Repositories
– start-page: 1
  year: 2010
  end-page: 23
  ident: bib0027
  article-title: Ieee standard classification for software anomalies
  publication-title: IEEE Std 1044-2009 (Revision of IEEE Std 1044-1993)
– volume: 51
  start-page: 416
  year: 2016
  end-page: 430
  ident: bib0013
  article-title: Rehearsal: a configuration verification tool for puppet
  publication-title: SIGPLAN Not.
– volume: 33
  start-page: 637
  year: 2007
  end-page: 640
  ident: bib0052
  article-title: Problems with precision: a response to “comments on ‘data mining static code attributes to learn defect predictors”’
  publication-title: IEEE Trans. Softw. Eng.
– volume: 43
  start-page: 476
  year: 2017
  end-page: 491
  ident: bib0031
  article-title: The use of summation to aggregate software metrics hinders the performance of defect prediction models
  publication-title: IEEE Trans. Softw. Eng.
– volume: 34
  start-page: 485
  year: 2008
  end-page: 496
  ident: bib0051
  article-title: Benchmarking classification models for software defect prediction: a proposed framework and novel findings
  publication-title: IEEE Trans. Softw. Eng.
– volume: 18
  start-page: 943
  year: 1992
  end-page: 956
  ident: bib0059
  article-title: Orthogonal defect classification-a concept for in-process measurements
  publication-title: IEEE Trans. Softw. Eng.
– start-page: 393
  year: 2015
  end-page: 403
  ident: bib0005
  article-title: The making of cloud applications: An empirical study on software development for the cloud
  publication-title: Proceedings of the 2015 10th Joint Meeting on Foundations of Software Engineering
– start-page: 625
  year: 2017
  end-page: 636
  ident: bib0017
  article-title: Tortoise: Interactive system configuration repair
  publication-title: Proceedings of the 32Nd IEEE/ACM International Conference on Automated Software Engineering
– volume: 33
  start-page: 159
  year: 1977
  end-page: 174
  ident: bib0034
  article-title: The measurement of observer agreement for categorical data
  publication-title: Biometrics
– volume: 51
  start-page: 328
  year: 2016
  end-page: 343
  ident: bib0016
  article-title: Asserting reliable convergence for configuration management scripts
  publication-title: SIGPLAN Not.
– volume: 21
  start-page: 2107
  year: 2016
  end-page: 2145
  ident: bib0030
  article-title: Towards building a universal defect prediction model with rank transformed predictors
  publication-title: Empirical Softw. Eng.
– start-page: 1
  year: 2006
  end-page: 3
  ident: bib0040
  article-title: Appropriate statistics for ordinal level data: Should we really be using
  publication-title: annual meeting of the Florida Association of Institutional Research
– start-page: 146
  year: 2017
  end-page: 157
  ident: bib0046
  article-title: A large-scale study of the impact of feature selection techniques on defect classification models
  publication-title: Proceedings of the 14th International Conference on Mining Software Repositories
– start-page: 45
  year: 2015
  end-page: 55
  ident: bib0006
  article-title: Co-evolution of infrastructure and source code: An empirical study
  publication-title: Proceedings of the 12th Working Conference on Mining Software Repositories
– start-page: 9
  year: 2007
  ident: bib0026
  article-title: Predicting defects for eclipse
  publication-title: Proceedings of the Third International Workshop on Predictor Models in Software Engineering
– start-page: 1
  year: 2017
  end-page: 35
  ident: bib0028
  article-title: Curating github for engineered software projects
  publication-title: Empirical Softw. Eng.
– start-page: 284
  year: 2005
  ident: 10.1016/j.infsof.2019.04.013_bib0024
  article-title: Use of relative code churn measures to predict system defect density
– volume: 12
  start-page: 2825
  year: 2011
  ident: 10.1016/j.infsof.2019.04.013_bib0047
  article-title: Scikit-learn: Machine learning in python
  publication-title: J. Mach. Learn. Res.
– start-page: 66
  year: 1991
  ident: 10.1016/j.infsof.2019.04.013_bib0043
  article-title: The mid-point on a rating scale: is it desirable
  publication-title: Marketing Bull.
– volume: 32
  start-page: 240
  issue: 4
  year: 2006
  ident: 10.1016/j.infsof.2019.04.013_bib0025
  article-title: On the value of static analysis for fault detection in software
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2006.38
– volume: 33
  start-page: 159
  issue: 1
  year: 1977
  ident: 10.1016/j.infsof.2019.04.013_bib0034
  article-title: The measurement of observer agreement for categorical data
  publication-title: Biometrics
  doi: 10.2307/2529310
– start-page: 172
  year: 2014
  ident: 10.1016/j.infsof.2019.04.013_bib0057
  article-title: An empirical study of just-in-time defect prediction using cross-project models
– volume: 18
  start-page: 943
  issue: 11
  year: 1992
  ident: 10.1016/j.infsof.2019.04.013_bib0059
  article-title: Orthogonal defect classification-a concept for in-process measurements
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/32.177364
– volume: 34
  start-page: 86
  issue: 3
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0003
  article-title: The top 10 adages in continuous deployment
  publication-title: IEEE Softw.
  doi: 10.1109/MS.2017.86
– volume: 114
  start-page: 494
  issue: 3
  year: 1993
  ident: 10.1016/j.infsof.2019.04.013_bib0039
  article-title: Dominance statistics: ordinal analyses to answer ordinal questions.
  publication-title: Psychol. Bull.
  doi: 10.1037/0033-2909.114.3.494
– volume: 34
  start-page: 485
  issue: 4
  year: 2008
  ident: 10.1016/j.infsof.2019.04.013_bib0051
  article-title: Benchmarking classification models for software defect prediction: a proposed framework and novel findings
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2008.35
– start-page: 393
  year: 2015
  ident: 10.1016/j.infsof.2019.04.013_bib0005
  article-title: The making of cloud applications: An empirical study on software development for the cloud
– start-page: 313
  year: 1997
  ident: 10.1016/j.infsof.2019.04.013_bib0055
– volume: 51
  start-page: 328
  issue: 10
  year: 2016
  ident: 10.1016/j.infsof.2019.04.013_bib0016
  article-title: Asserting reliable convergence for configuration management scripts
  publication-title: SIGPLAN Not.
  doi: 10.1145/3022671.2984000
– volume: 31
  start-page: 2225
  issue: 14
  year: 2010
  ident: 10.1016/j.infsof.2019.04.013_bib0042
  article-title: Variable selection using random forests
  publication-title: Pattern Recognit. Lett.
  doi: 10.1016/j.patrec.2010.03.014
– year: 2018
  ident: 10.1016/j.infsof.2019.04.013_bib0022
  article-title: A systematic mapping study of infrastructure as code research
  publication-title: Inf. Softw. Technol.
– start-page: 1
  year: 2006
  ident: 10.1016/j.infsof.2019.04.013_bib0040
  article-title: Appropriate statistics for ordinal level data: Should we really be using t-test and Cohen’sd for evaluating group differences on the NSSE and other surveys?
– year: 2005
  ident: 10.1016/j.infsof.2019.04.013_bib0009
– start-page: 146
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0046
  article-title: A large-scale study of the impact of feature selection techniques on defect classification models
– volume: 2
  start-page: 1
  issue: 4
  year: 1996
  ident: 10.1016/j.infsof.2019.04.013_bib0037
  article-title: The creation of theory: a recent application of the grounded theory method
  publication-title: Qualit. Rep.
– start-page: 625
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0017
  article-title: Tortoise: Interactive system configuration repair
– start-page: 453
  year: 2009
  ident: 10.1016/j.infsof.2019.04.013_bib0056
  article-title: Secure open source collaboration: An empirical study of linus’ law
– start-page: 9
  year: 2007
  ident: 10.1016/j.infsof.2019.04.013_bib0026
  article-title: Predicting defects for eclipse
– year: 2019
  ident: 10.1016/j.infsof.2019.04.013_sbref0022
  article-title: The seven sins: Security smells in infrastructure as code scripts
– start-page: 452
  year: 2006
  ident: 10.1016/j.infsof.2019.04.013_bib0045
  article-title: Mining metrics to predict component failures
– volume: 33
  start-page: 637
  issue: 9
  year: 2007
  ident: 10.1016/j.infsof.2019.04.013_bib0052
  article-title: Problems with precision: a response to “comments on ‘data mining static code attributes to learn defect predictors”’
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2007.70721
– volume: 18
  start-page: 50
  issue: 1
  year: 1947
  ident: 10.1016/j.infsof.2019.04.013_bib0038
  article-title: On a test of whether one of two random variables is stochastically larger than the other
  publication-title: Ann. Math. Stat.
  doi: 10.1214/aoms/1177730491
– start-page: 428
  year: 2016
  ident: 10.1016/j.infsof.2019.04.013_bib0029
  article-title: On the “naturalness” of buggy code
– start-page: 372
  year: 2013
  ident: 10.1016/j.infsof.2019.04.013_bib0060
  article-title: Does bug prediction support human developers? findings from a Google case study
– start-page: 182
  year: 2008
  ident: 10.1016/j.infsof.2019.04.013_bib0032
  article-title: What’s a typical commit? a characterization of open source software repositories
– start-page: 20
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0020
  article-title: Which factors influence practitioners’ usage of build automation tools?
– start-page: 1
  year: 2015
  ident: 10.1016/j.infsof.2019.04.013_bib0001
  article-title: Synthesizing continuous deployment practices used in software development
– start-page: 95
  year: 2004
  ident: 10.1016/j.infsof.2019.04.013_bib0035
  article-title: Assuring fault classification agreement - an empirical evaluation
– volume: 33
  start-page: 2
  issue: 1
  year: 2007
  ident: 10.1016/j.infsof.2019.04.013_bib0044
  article-title: Data mining static code attributes to learn defect predictors
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2007.256941
– year: 2011
  ident: 10.1016/j.infsof.2019.04.013_sbref0014
– start-page: 26
  year: 2011
  ident: 10.1016/j.infsof.2019.04.013_bib0058
  article-title: An explanatory analysis on Eclipse beta-release bugs through in-process metrics
– start-page: 34
  year: 2018
  ident: 10.1016/j.infsof.2019.04.013_bib0004
  article-title: Characterizing defective configuration scripts used for continuous deployment
– volume: 43
  start-page: 476
  issue: 5
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0031
  article-title: The use of summation to aggregate software metrics hinders the performance of defect prediction models
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2016.2599161
– volume: 10
  start-page: 146
  issue: 2–3
  year: 1954
  ident: 10.1016/j.infsof.2019.04.013_bib0011
  article-title: Distributional structure
  publication-title: WORD
  doi: 10.1080/00437956.1954.11659520
– volume: 43
  start-page: 1
  issue: 1
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0053
  article-title: An empirical comparison of model validation techniques for defect prediction models
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2016.2584050
– ident: 10.1016/j.infsof.2019.04.013_bib0015
– volume: 20
  start-page: 37
  issue: 1
  year: 1960
  ident: 10.1016/j.infsof.2019.04.013_bib0033
  article-title: A coefficient of agreement for nominal scales
  publication-title: Edu. Psychol. Meas.
  doi: 10.1177/001316446002000104
– start-page: 789
  year: 2015
  ident: 10.1016/j.infsof.2019.04.013_bib0050
  article-title: Revisiting the impact of classification techniques on the performance of defect prediction models
– volume: 88
  start-page: 2783
  issue: 11
  year: 2007
  ident: 10.1016/j.infsof.2019.04.013_bib0041
  article-title: Random forests for classification in ecology
  publication-title: Ecology
  doi: 10.1890/07-0539.1
– volume: 38
  start-page: 1276
  issue: 6
  year: 2012
  ident: 10.1016/j.infsof.2019.04.013_bib0048
  article-title: A systematic literature review on fault prediction performance in software engineering
  publication-title: IEEE Trans. Softw. Eng.
  doi: 10.1109/TSE.2011.103
– start-page: 4:1
  year: 2013
  ident: 10.1016/j.infsof.2019.04.013_bib0018
  article-title: Automated testing of Chef automation scripts
– start-page: 1
  year: 2017
  ident: 10.1016/j.infsof.2019.04.013_bib0028
  article-title: Curating GitHub for engineered software projects
  publication-title: Empirical Softw. Eng.
– start-page: 23
  year: 2014
  ident: 10.1016/j.infsof.2019.04.013_bib0054
  article-title: Predicting vulnerable components: Software metrics vs text mining
– year: 2014
  ident: 10.1016/j.infsof.2019.04.013_bib0036
– volume: 21
  start-page: 2107
  issue: 5
  year: 2016
  ident: 10.1016/j.infsof.2019.04.013_bib0030
  article-title: Towards building a universal defect prediction model with rank transformed predictors
  publication-title: Empirical Softw. Eng.
  doi: 10.1007/s10664-015-9396-2
– year: 1984
  ident: 10.1016/j.infsof.2019.04.013_bib0007
– start-page: 45
  year: 2015
  ident: 10.1016/j.infsof.2019.04.013_bib0006
  article-title: Co-evolution of infrastructure and source code: An empirical study
– start-page: 432
  year: 2013
  ident: 10.1016/j.infsof.2019.04.013_bib0049
  article-title: How, and why, process metrics are better
– year: 2010
  ident: 10.1016/j.infsof.2019.04.013_bib0002
– start-page: 1
  year: 2010
  ident: 10.1016/j.infsof.2019.04.013_sbref0026
  article-title: IEEE standard classification for software anomalies
  publication-title: IEEE Std 1044-2009 (Revision of IEEE Std 1044-1993)
– start-page: 16
  year: 2018
  ident: 10.1016/j.infsof.2019.04.013_bib0021
  article-title: What questions do programmers ask about configuration as code?
– volume: 45
  start-page: 5
  issue: 1
  year: 2001
  ident: 10.1016/j.infsof.2019.04.013_bib0010
  article-title: Random forests
  publication-title: Machine Learning
  doi: 10.1023/A:1010933404324
– volume: 51
  start-page: 416
  issue: 6
  year: 2016
  ident: 10.1016/j.infsof.2019.04.013_bib0013
  article-title: Rehearsal: a configuration verification tool for Puppet
  publication-title: SIGPLAN Not.
  doi: 10.1145/2980983.2908083
– start-page: 189
  year: 2016
  ident: 10.1016/j.infsof.2019.04.013_bib0012
  article-title: Does your configuration code smell?
– year: 2005
  ident: 10.1016/j.infsof.2019.04.013_bib0008
– start-page: 164
  year: 2018
  ident: 10.1016/j.infsof.2019.04.013_bib0019
  article-title: How good is your Puppet? an empirically defined and validated quality model for Puppet
StartPage 148
SubjectTerms Configuration as code
Continuous deployment
Defect prediction
Devops
Empirical study
Infrastructure as code
Puppet
Title Source code properties of defective infrastructure as code scripts
URI https://dx.doi.org/10.1016/j.infsof.2019.04.013
Volume 112