Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulati...
Uložené v:
| Vydané v: | Brodogradnja Ročník 76; číslo 2; s. 1 - 22 |
|---|---|
| Hlavní autori: | , , , |
| Médium: | Journal Article Paper |
| Jazyk: | English |
| Vydavateľské údaje: |
Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje
01.01.2025
Faculty of Mechanical Engineering and Naval Architecture |
| Predmet: | |
| ISSN: | 0007-215X, 1845-5859 |
| On-line prístup: | Získať plný text |
| Tagy: |
Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
|
| Abstract | An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability. |
|---|---|
| AbstractList | n adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability. An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability. |
| Author | Xi, Zhaoyong Cui, Zhewen Guan, Wei Zhang, Xianku |
| Author_xml | – sequence: 1 givenname: Wei surname: Guan fullname: Guan, Wei – sequence: 2 givenname: Zhaoyong surname: Xi fullname: Xi, Zhaoyong – sequence: 3 givenname: Zhewen surname: Cui fullname: Cui, Zhewen – sequence: 4 givenname: Xianku surname: Zhang fullname: Zhang, Xianku |
| BookMark | eNpVkUtLA0EQhAeJYHxc_AVzFlbnuTt7DPEVCCio4G3p7enR1WRHZtaA_941EcFTN0XVR0Edskkfe2LsVIpzJVXlLtoUfVUqUe6xqXTGFtbZesKmQoiqUNI-H7CTnLtWiFoYp6yasseZh4-h2xAfErwRDjF9cYz9kOJqRYl7yt1Lz0NM_LNfQ9-T5_kzBUDiG3rtcEWZt5BHOfb8YTYv7heXx2w_wCrTye89Yk_XV4_z22J5d7OYz5YFaq2GQkvUQMpA66WoFHnEyksV2rGedFKA9ITl2N0bJWSNwZIGUwUJxkq0rT5iix3XR3hrPlK3hvTVROiarRDTSwNp-OnYOEKNJghTl9aUrnYkqsqZACS8cSWOrGLHek0I7_9gOyUnpPFttKqdlqP_bOfHFHNOFP4iUjTbOZq_OfQ3ZaF_oQ |
| CODEN | BRODBA |
| ContentType | Journal Article Paper |
| CorporateAuthor | Navigation College, Dalian Maritime University, Dalian, China |
| CorporateAuthor_xml | – name: Navigation College, Dalian Maritime University, Dalian, China |
| DBID | AAYXX CITATION VP8 DOA |
| DOI | 10.21278/brod76206 |
| DatabaseName | CrossRef Portal of Croatian Scientific and Professional Journals – HRČAK DOAJ Directory of Open Access Journals |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Military & Naval Science |
| EISSN | 1845-5859 |
| EndPage | 22 |
| ExternalDocumentID | oai_doaj_org_article_8ec3c4f0496546898e07784fae0d486c oai_hrcak_srce_hr_329831 10_21278_brod76206 |
| GroupedDBID | 2WC AAYXX ABDBF ACUHS ADBBV AENEX ALMA_UNASSIGNED_HOLDINGS BCNDV CITATION E3Z EN8 EOJEC GROUPED_DOAJ KQ8 OBODZ OK1 TR2 VP8 |
| ID | FETCH-LOGICAL-c332t-31c3ae24abd1072edcc7d12fb0901810a1dec6007d42019cf5e3a47f1a451c5b3 |
| IEDL.DBID | DOA |
| ISICitedReferencesCount | 0 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001464258500006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0007-215X |
| IngestDate | Fri Oct 03 12:39:04 EDT 2025 Sat Apr 05 04:18:06 EDT 2025 Sat Nov 29 08:04:36 EST 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Language | English |
| License | cc-by-sa: openAccess |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c332t-31c3ae24abd1072edcc7d12fb0901810a1dec6007d42019cf5e3a47f1a451c5b3 |
| Notes | 329831 |
| OpenAccessLink | https://doaj.org/article/8ec3c4f0496546898e07784fae0d486c |
| PageCount | 22 |
| ParticipantIDs | doaj_primary_oai_doaj_org_article_8ec3c4f0496546898e07784fae0d486c hrcak_primary_oai_hrcak_srce_hr_329831 crossref_primary_10_21278_brod76206 |
| PublicationCentury | 2000 |
| PublicationDate | 2025-01-01 |
| PublicationDateYYYYMMDD | 2025-01-01 |
| PublicationDate_xml | – month: 01 year: 2025 text: 2025-01-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationTitle | Brodogradnja |
| PublicationYear | 2025 |
| Publisher | Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje Faculty of Mechanical Engineering and Naval Architecture |
| Publisher_xml | – name: Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje – name: Faculty of Mechanical Engineering and Naval Architecture |
| SSID | ssib009048252 ssj0000561915 |
| Score | 2.2970476 |
| Snippet | An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface... n adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles... |
| SourceID | doaj hrcak crossref |
| SourceType | Open Website Open Access Repository Index Database |
| StartPage | 1 |
| SubjectTerms | Deep reinforcement learning PID tuning soft actor critic unmanned surface vehicle |
| Title | Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID |
| URI | https://hrcak.srce.hr/329831 https://doaj.org/article/8ec3c4f0496546898e07784fae0d486c |
| Volume | 76 |
| WOSCitedRecordID | wos001464258500006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAON databaseName: DOAJ Directory of Open Access Journals customDbUrl: eissn: 1845-5859 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0000561915 issn: 0007-215X databaseCode: DOA dateStart: 20050101 isFulltext: true titleUrlDefault: https://www.doaj.org/ providerName: Directory of Open Access Journals |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8NAEF6kePAiPml9saB4C012t83mWF8oaBFU6C1Mdmepr1TSVvDfO7uJpZ68eAubkITvm53HMnzD2Im2SHZifEeNNJEqRBqBFRBBolMpaT8JF5i-TYdDPRpl90ujvnxPWC0PXAPX1WikUS72uuaqrzONcZpq5QBjq3TfeO9LWc9SMeUtKSPDFD2xOG3xeXLWjDOI04ji3KjWKvUC57pbkLMir-DHHi1FpyDiT0nruDLwuhR1rjbYepMu8kH9m5tsBcst1r4L0trVFz_lQyBT4c0O3WaPAwsf3oPxWQUv4UT-izft6G9YcRsaNjhlqnxevoN3snw6rxwY5J84Dj1y3Ec2yyclfxicR_c3Fzvs6ery8fw6agYnREZKMSO_aiSgUFBYqu4EWmNSmwhXEC4U0WNILBovTG8Vxf_MuB5KUKlLQPUS0yvkLmuVkxLbjKPKgO4SBxaVo-oMoO-cRgSZeemvDjv-ASz_qPUxcqorAqz5AtYOO_NYLp7wmtZhgZjOG6bzv5jusNPAxK-31CvTyiBd5lJkWiZ7__G1fbYm_KjfcNpywFqzao6HbNV8zp6n1VGwtm8vjNdq |
| linkProvider | Directory of Open Access Journals |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Adaptive+trajectory+controller+design+for+unmanned+surface+vehicles+based+on+SAC-PID&rft.jtitle=Brodogradnja&rft.au=Wei+Guan&rft.au=Zhaoyong+Xi&rft.au=Zhewen+Cui&rft.au=Xianku+Zhang&rft.date=2025-01-01&rft.pub=Faculty+of+Mechanical+Engineering+and+Naval+Architecture&rft.issn=0007-215X&rft.eissn=1845-5859&rft.volume=76&rft.issue=2&rft.spage=1&rft.epage=22&rft_id=info:doi/10.21278%2Fbrod76206&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_8ec3c4f0496546898e07784fae0d486c |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0007-215X&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0007-215X&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0007-215X&client=summon |