Upgrades of Genetic Programming for Data Driven Modelling of Time Series

In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Evolutionary computation Ročník 31; číslo 4; s. 1
Hlavní autori: Murari, A, Peluso, E, Spolladore, L, Rossi, R, Gelfusa, M
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: United States 01.12.2023
Predmet:
ISSN:1530-9304, 1530-9304
On-line prístup:Zistit podrobnosti o prístupe
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of Symbolic Regression (SR) via Genetic Programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modelling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behaviour or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data.
AbstractList In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of symbolic regression (SR) via genetic programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modeling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behavior or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data.In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of symbolic regression (SR) via genetic programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modeling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behavior or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data.
In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of Symbolic Regression (SR) via Genetic Programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modelling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behaviour or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data.
Author Rossi, R
Spolladore, L
Murari, A
Peluso, E
Gelfusa, M
Author_xml – sequence: 1
  givenname: A
  surname: Murari
  fullname: Murari, A
  email: andrea.murari@istp.cnr.it
  organization: Consorzio RFX (CNR, ENEA, INFN, Università di Padova, Acciaierie Venete SpA), Corso Stati Uniti 4, 35127 Padova, Italy andrea.murari@istp.cnr.it
– sequence: 2
  givenname: E
  surname: Peluso
  fullname: Peluso, E
  email: emmanuele.peluso@uniroma2.it
  organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy emmanuele.peluso@uniroma2.it
– sequence: 3
  givenname: L
  surname: Spolladore
  fullname: Spolladore, L
  email: luca.spolladore@uniroma2.it
  organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy luca.spolladore@uniroma2.it
– sequence: 4
  givenname: R
  surname: Rossi
  fullname: Rossi, R
  email: riccardo.rossi.en@gmail.com
  organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy riccardo.rossi.en@gmail.com
– sequence: 5
  givenname: M
  surname: Gelfusa
  fullname: Gelfusa, M
  email: gelfusa@ing.uniroma2.it
  organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy gelfusa@ing.uniroma2.it
BackLink https://www.ncbi.nlm.nih.gov/pubmed/37126579$$D View this record in MEDLINE/PubMed
BookMark eNpNT8tKw0AUHaRiH7pzLbN0E73zSCZZSqutUFGwXYfJzE0ZaTJ1Jin490as4Ooc7nlwz5SMWt8iIdcM7hjL-D0ejS91CSAEnJEJSwUkhQA5-sfHZBrjBwATHNgFGQvFeJaqYkJW28MuaIuR-pouscXOGfoW_HBsGtfuaO0DXehO00VwR2zpi7e43_8oQ2DjGqTvGBzGS3Je633EqxPOyPbpcTNfJevX5fP8YZ0YwYouyUWmlJWa1azIjRVSGquZZMpkygBkmqPUhc0xE5nUgErpquIMa54aXleWz8jtb-8h-M8eY1c2LprhJd2i72PJc8jTPAWhBuvNydpXDdryEFyjw1f5t55_A36QXUQ
CitedBy_id crossref_primary_10_1162_evco_a_00359
crossref_primary_10_1088_1361_6463_ada167
crossref_primary_10_1088_1361_6463_adbe85
ContentType Journal Article
Copyright 2023 Massachusetts Institute of Technology.
Copyright_xml – notice: 2023 Massachusetts Institute of Technology.
DBID NPM
7X8
DOI 10.1162/evco_a_00330
DatabaseName PubMed
MEDLINE - Academic
DatabaseTitle PubMed
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic
PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: 7X8
  name: MEDLINE - Academic
  url: https://search.proquest.com/medline
  sourceTypes: Aggregation Database
DeliveryMethod no_fulltext_linktorsrc
Discipline Engineering
Computer Science
EISSN 1530-9304
ExternalDocumentID 37126579
Genre Journal Article
GroupedDBID ---
.4S
.DC
0R~
36B
4.4
53G
5GY
5VS
6IK
AAJGR
AAKMM
AALFJ
AALMD
AAYFX
ABAZT
ABDBF
ABJNI
ACM
ACUHS
ADL
ADPZR
AEBYY
AENEX
AENSD
AFWIH
AFWXC
AIKLT
ALMA_UNASSIGNED_HOLDINGS
ARCSS
ASPBG
AVWKF
AZFZN
BDXCO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
EAP
EAS
EBC
EBD
EBS
ECS
EDO
EMB
EMK
EMOBN
EPL
EST
ESX
F5P
FEDTE
FNEHJ
GUFHI
HGAVV
HZ~
I-F
I07
IPLJI
JAVBF
LHSKQ
MCG
MINIK
NPM
O9-
OCL
P2P
PK0
RMI
SV3
TUS
W7O
ZWS
7X8
ABVLG
AEJOY
AKRVB
ID FETCH-LOGICAL-c319t-83677d4a1f198cd344cda1417c67c006a2e4a9d8e6364a0e77abb21ef25c2fbd2
IEDL.DBID 7X8
ISICitedReferencesCount 3
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001111960000004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1530-9304
IngestDate Fri Jul 11 07:06:43 EDT 2025
Wed Feb 19 02:24:25 EST 2025
IsPeerReviewed true
IsScholarly true
Issue 4
Keywords Data Driven Theory
Symbolic regression
Evolutionary Computation
Empirical modelling of signals
Genetic Programming
Time Series Analysis
Language English
License 2023 Massachusetts Institute of Technology.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c319t-83677d4a1f198cd344cda1417c67c006a2e4a9d8e6364a0e77abb21ef25c2fbd2
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
PMID 37126579
PQID 2808585037
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2808585037
pubmed_primary_37126579
PublicationCentury 2000
PublicationDate 2023-12-01
PublicationDateYYYYMMDD 2023-12-01
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-12-01
  day: 01
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Evolutionary computation
PublicationTitleAlternate Evol Comput
PublicationYear 2023
SSID ssj0013201
Score 2.3933322
Snippet In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret...
SourceID proquest
pubmed
SourceType Aggregation Database
Index Database
StartPage 1
Title Upgrades of Genetic Programming for Data Driven Modelling of Time Series
URI https://www.ncbi.nlm.nih.gov/pubmed/37126579
https://www.proquest.com/docview/2808585037
Volume 31
WOSCitedRecordID wos001111960000004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV07T8MwELaAMsBAobzKS0ZitepX7WRCiFIYoOpApW6Rn4iBpjSlvx87D9EFCYklky1Fp7vL5e677wPgRgssvXcMScI94twrpEnfoOAeVmlBRMrLReFnORol02k6rhtuRQ2rbHJimahtbmKPvEcTHEdYmMnb-SeKqlFxulpLaGyCFgulTIR0yen6FAHXfKkYpeG_vQG-C9pzK5NnKjJjMvx7cVl-ZIbt_77ePtiry0t4V_nDAdhwsw5oN9INsI7kDthd4yE8BI-T-dtCWVfA3MNIRB1uw3GF3PoIJ2CobOFALRUaLGJ2hFFBLe6xx_NxiQTGJpsrjsBk-PB6_4RqhQVkQugtUcKElJYr4kmaGMs4N1YRTqQR0oR4VNRxldrECSa4wk5KpTUlztO-oV5begy2ZvnMnQJIrKFMMOxSarm2qcaOSytJX6mQJ7TtguvGcFnw4DiWUDOXfxXZj-m64KSyfjavqDYyJgkVfZme_eH2OdiJWvAV1uQCtHyIX3cJts1q-V4srkrXCM_R-OUbckjD7A
linkProvider ProQuest
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Upgrades+of+Genetic+Programming+for+Data-Driven+Modeling+of+Time+Series&rft.jtitle=Evolutionary+computation&rft.au=Murari%2C+A&rft.au=Peluso%2C+E&rft.au=Spolladore%2C+L&rft.au=Rossi%2C+R&rft.date=2023-12-01&rft.issn=1530-9304&rft.eissn=1530-9304&rft.volume=31&rft.issue=4&rft.spage=401&rft_id=info:doi/10.1162%2Fevco_a_00330&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1530-9304&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1530-9304&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1530-9304&client=summon