Upgrades of Genetic Programming for Data Driven Modelling of Time Series
In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are...
Saved in:
| Published in: | Evolutionary computation Vol. 31; no. 4; p. 1 |
|---|---|
| Main Authors: | , , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
United States
01.12.2023
|
| Subjects: | |
| ISSN: | 1530-9304, 1530-9304 |
| Online Access: | Get more information |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of Symbolic Regression (SR) via Genetic Programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modelling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behaviour or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data. |
|---|---|
| AbstractList | In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of Symbolic Regression (SR) via Genetic Programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modelling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behaviour or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data. In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of symbolic regression (SR) via genetic programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modeling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behavior or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data.In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret and model. Genetic programming tools are quite powerful in extracting knowledge from data. In this work, several upgrades and refinements are proposed and tested to improve the explorative capabilities of symbolic regression (SR) via genetic programming (GP) for the investigation of time series, with the objective of extracting mathematical models directly from the available signals. The main task is not simply prediction but consists of identifying interpretable equations, reflecting the nature of the mechanisms generating the signals. The implemented improvements involve almost all aspects of GP, from the knowledge representation and the genetic operators to the fitness function. The unique capabilities of genetic programming, to accommodate prior information and knowledge, are also leveraged effectively. The proposed upgrades cover the most important applications of empirical modeling of time series, ranging from the identification of autoregressive systems and partial differential equations to the search of models in terms of dimensionless quantities and appropriate physical units. Particularly delicate systems to identify, such as those showing hysteretic behavior or governed by delayed differential equations, are also addressed. The potential of the developed tools is substantiated with both a battery of systematic numerical tests with synthetic signals and with applications to experimental data. |
| Author | Rossi, R Spolladore, L Murari, A Peluso, E Gelfusa, M |
| Author_xml | – sequence: 1 givenname: A surname: Murari fullname: Murari, A email: andrea.murari@istp.cnr.it organization: Consorzio RFX (CNR, ENEA, INFN, Università di Padova, Acciaierie Venete SpA), Corso Stati Uniti 4, 35127 Padova, Italy andrea.murari@istp.cnr.it – sequence: 2 givenname: E surname: Peluso fullname: Peluso, E email: emmanuele.peluso@uniroma2.it organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy emmanuele.peluso@uniroma2.it – sequence: 3 givenname: L surname: Spolladore fullname: Spolladore, L email: luca.spolladore@uniroma2.it organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy luca.spolladore@uniroma2.it – sequence: 4 givenname: R surname: Rossi fullname: Rossi, R email: riccardo.rossi.en@gmail.com organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy riccardo.rossi.en@gmail.com – sequence: 5 givenname: M surname: Gelfusa fullname: Gelfusa, M email: gelfusa@ing.uniroma2.it organization: Department of Industrial Engineering, University of Rome "Tor Vergata", via del Politecnico 1, Roma, Italy gelfusa@ing.uniroma2.it |
| BackLink | https://www.ncbi.nlm.nih.gov/pubmed/37126579$$D View this record in MEDLINE/PubMed |
| BookMark | eNpNT8tKw0AUHaRiH7pzLbN0E73zSCZZSqutUFGwXYfJzE0ZaTJ1Jin490as4Ooc7nlwz5SMWt8iIdcM7hjL-D0ejS91CSAEnJEJSwUkhQA5-sfHZBrjBwATHNgFGQvFeJaqYkJW28MuaIuR-pouscXOGfoW_HBsGtfuaO0DXehO00VwR2zpi7e43_8oQ2DjGqTvGBzGS3Je633EqxPOyPbpcTNfJevX5fP8YZ0YwYouyUWmlJWa1azIjRVSGquZZMpkygBkmqPUhc0xE5nUgErpquIMa54aXleWz8jtb-8h-M8eY1c2LprhJd2i72PJc8jTPAWhBuvNydpXDdryEFyjw1f5t55_A36QXUQ |
| CitedBy_id | crossref_primary_10_1162_evco_a_00359 crossref_primary_10_1088_1361_6463_ada167 crossref_primary_10_1088_1361_6463_adbe85 |
| ContentType | Journal Article |
| Copyright | 2023 Massachusetts Institute of Technology. |
| Copyright_xml | – notice: 2023 Massachusetts Institute of Technology. |
| DBID | NPM 7X8 |
| DOI | 10.1162/evco_a_00330 |
| DatabaseName | PubMed MEDLINE - Academic |
| DatabaseTitle | PubMed MEDLINE - Academic |
| DatabaseTitleList | PubMed MEDLINE - Academic |
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: 7X8 name: MEDLINE - Academic url: https://search.proquest.com/medline sourceTypes: Aggregation Database |
| DeliveryMethod | no_fulltext_linktorsrc |
| Discipline | Engineering Computer Science |
| EISSN | 1530-9304 |
| ExternalDocumentID | 37126579 |
| Genre | Journal Article |
| GroupedDBID | --- .4S .DC 0R~ 36B 4.4 53G 5GY 5VS 6IK AAJGR AAKMM AALFJ AALMD AAYFX ABAZT ABDBF ABJNI ACM ACUHS ADL ADPZR AEBYY AENEX AENSD AFWIH AFWXC AIKLT ALMA_UNASSIGNED_HOLDINGS ARCSS ASPBG AVWKF AZFZN BDXCO BEFXN BFFAM BGNUA BKEBE BPEOZ CS3 DU5 EAP EAS EBC EBD EBS ECS EDO EMB EMK EMOBN EPL EST ESX F5P FEDTE FNEHJ GUFHI HGAVV HZ~ I-F I07 IPLJI JAVBF LHSKQ MCG MINIK NPM O9- OCL P2P PK0 RMI SV3 TUS W7O ZWS 7X8 ABVLG AEJOY AKRVB |
| ID | FETCH-LOGICAL-c319t-83677d4a1f198cd344cda1417c67c006a2e4a9d8e6364a0e77abb21ef25c2fbd2 |
| IEDL.DBID | 7X8 |
| ISICitedReferencesCount | 3 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001111960000004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 1530-9304 |
| IngestDate | Fri Jul 11 07:06:43 EDT 2025 Wed Feb 19 02:24:25 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 4 |
| Keywords | Data Driven Theory Symbolic regression Evolutionary Computation Empirical modelling of signals Genetic Programming Time Series Analysis |
| Language | English |
| License | 2023 Massachusetts Institute of Technology. |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c319t-83677d4a1f198cd344cda1417c67c006a2e4a9d8e6364a0e77abb21ef25c2fbd2 |
| Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
| PMID | 37126579 |
| PQID | 2808585037 |
| PQPubID | 23479 |
| ParticipantIDs | proquest_miscellaneous_2808585037 pubmed_primary_37126579 |
| PublicationCentury | 2000 |
| PublicationDate | 2023-12-01 |
| PublicationDateYYYYMMDD | 2023-12-01 |
| PublicationDate_xml | – month: 12 year: 2023 text: 2023-12-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationPlace | United States |
| PublicationPlace_xml | – name: United States |
| PublicationTitle | Evolutionary computation |
| PublicationTitleAlternate | Evol Comput |
| PublicationYear | 2023 |
| SSID | ssj0013201 |
| Score | 2.3934188 |
| Snippet | In many engineering fields and scientific disciplines, the results of experiments are in the form of time series, which can be quite problematic to interpret... |
| SourceID | proquest pubmed |
| SourceType | Aggregation Database Index Database |
| StartPage | 1 |
| Title | Upgrades of Genetic Programming for Data Driven Modelling of Time Series |
| URI | https://www.ncbi.nlm.nih.gov/pubmed/37126579 https://www.proquest.com/docview/2808585037 |
| Volume | 31 |
| WOSCitedRecordID | wos001111960000004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1JSwMxFA5qPejBat3qRgSvoZNlkslJxFo9aOnBQm9DJot4cKZ2an-_ySzYiyB4ySmB8Hjv5W35PgBunHY-75AOCRdRxHCGkdTUImX9W2hjbLmrWEuexXiczGZy0hTcymassvWJlaM2hQ418gFJotDCiqi4nX-iwBoVuqsNhcYm6FAfyoSRLjFb7yJEDV5qhKTP29vBd04GdqWLVAVkTBr9HlxWj8yo-9_r7YO9JryEd7U-HIANm_dAt6VugI0l98DuGg7hIXiczt8WytgSFg4GIGp_Gk7qya0PvwP6yBYO1VKh4SJ4RxgY1MI_9rA_fCKBochmyyMwHT283j-hhmEBaW96S5RQLoRhCjssE20oY9oozLDQXGhvj4pYpqRJLKecqcgKobKMYOtIrInLDDkGW3mR21MAJTc8o8bnu5IyKlzmtHZeE7iJjdA26oPrVnCp1-DQllC5Lb7K9Ed0fXBSSz-d11AbKRWY8FjIsz-cPgc7gQu-njW5AB3n7ddegm29Wr6Xi6tKNfw6nrx8AxCOxLw |
| linkProvider | ProQuest |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Upgrades+of+Genetic+Programming+for+Data-Driven+Modeling+of+Time+Series&rft.jtitle=Evolutionary+computation&rft.au=Murari%2C+A&rft.au=Peluso%2C+E&rft.au=Spolladore%2C+L&rft.au=Rossi%2C+R&rft.date=2023-12-01&rft.issn=1530-9304&rft.eissn=1530-9304&rft.volume=31&rft.issue=4&rft.spage=401&rft_id=info:doi/10.1162%2Fevco_a_00330&rft.externalDBID=NO_FULL_TEXT |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1530-9304&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1530-9304&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1530-9304&client=summon |