Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID

An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulati...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Brodogradnja Ročník 76; číslo 2; s. 1 - 22
Hlavní autoři: Guan, Wei, Xi, Zhaoyong, Cui, Zhewen, Zhang, Xianku
Médium: Journal Article Paper
Jazyk:angličtina
Vydáno: Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje 01.01.2025
Faculty of Mechanical Engineering and Naval Architecture
Témata:
ISSN:0007-215X, 1845-5859
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Abstract An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability.
AbstractList n adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability.
An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles (USV) is proposed in this paper. The gains of the PID controller need to be manually adjusted based on experience in the original formulation. Furthermore, once tuned, these gains remain fixed and making further modifications becomes time-consuming and labor-intensive. To address these limitations, the SAC algorithm is introduced, enabling online tuning of PID gains through agent-environment interaction. Additionally, the strategy of combining SAC algorithm with PID controller mitigates concerns regarding interpretability and security often associated with DRL. In this study, stability analysis of the adaptive trajectory controller based on the SAC-PID algorithm is conducted. This paper horizontally compares the proposed method with traditional PID tuning methods, genetic algorithms (GA), and deep deterministic policy gradient (DDPG) algorithm to highlight the superiority of the SAC-PID approach. Finally, experiments in different scenarios are performed to compare generalization capabilities between DDPG and SAC algorithms. Results demonstrate that the proposed SAC-PID algorithm exhibits excellent stability properties, fast convergence speed, and strong generalization ability.
Author Xi, Zhaoyong
Cui, Zhewen
Guan, Wei
Zhang, Xianku
Author_xml – sequence: 1
  givenname: Wei
  surname: Guan
  fullname: Guan, Wei
– sequence: 2
  givenname: Zhaoyong
  surname: Xi
  fullname: Xi, Zhaoyong
– sequence: 3
  givenname: Zhewen
  surname: Cui
  fullname: Cui, Zhewen
– sequence: 4
  givenname: Xianku
  surname: Zhang
  fullname: Zhang, Xianku
BookMark eNpVkUtLA0EQhAeJYHxc_AVzFlbnuTt7DPEVCCio4G3p7enR1WRHZtaA_941EcFTN0XVR0Edskkfe2LsVIpzJVXlLtoUfVUqUe6xqXTGFtbZesKmQoiqUNI-H7CTnLtWiFoYp6yasseZh4-h2xAfErwRDjF9cYz9kOJqRYl7yt1Lz0NM_LNfQ9-T5_kzBUDiG3rtcEWZt5BHOfb8YTYv7heXx2w_wCrTye89Yk_XV4_z22J5d7OYz5YFaq2GQkvUQMpA66WoFHnEyksV2rGedFKA9ITl2N0bJWSNwZIGUwUJxkq0rT5iix3XR3hrPlK3hvTVROiarRDTSwNp-OnYOEKNJghTl9aUrnYkqsqZACS8cSWOrGLHek0I7_9gOyUnpPFttKqdlqP_bOfHFHNOFP4iUjTbOZq_OfQ3ZaF_oQ
CODEN BRODBA
ContentType Journal Article
Paper
CorporateAuthor Navigation College, Dalian Maritime University, Dalian, China
CorporateAuthor_xml – name: Navigation College, Dalian Maritime University, Dalian, China
DBID AAYXX
CITATION
VP8
DOA
DOI 10.21278/brod76206
DatabaseName CrossRef
Portal of Croatian Scientific and Professional Journals – HRČAK
Directory of Open Access Journals (DOAJ)
DatabaseTitle CrossRef
DatabaseTitleList

CrossRef
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Military & Naval Science
EISSN 1845-5859
EndPage 22
ExternalDocumentID oai_doaj_org_article_8ec3c4f0496546898e07784fae0d486c
oai_hrcak_srce_hr_329831
10_21278_brod76206
GroupedDBID 2WC
AAYXX
ABDBF
ACUHS
ADBBV
AENEX
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
E3Z
EN8
EOJEC
GROUPED_DOAJ
KQ8
OBODZ
OK1
TR2
VP8
ID FETCH-LOGICAL-c332t-31c3ae24abd1072edcc7d12fb0901810a1dec6007d42019cf5e3a47f1a451c5b3
IEDL.DBID DOA
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001464258500006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0007-215X
IngestDate Fri Oct 03 12:39:04 EDT 2025
Sat Apr 05 04:18:06 EDT 2025
Sat Nov 29 08:04:36 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 2
Language English
License cc-by-sa: openAccess
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c332t-31c3ae24abd1072edcc7d12fb0901810a1dec6007d42019cf5e3a47f1a451c5b3
Notes 329831
OpenAccessLink https://doaj.org/article/8ec3c4f0496546898e07784fae0d486c
PageCount 22
ParticipantIDs doaj_primary_oai_doaj_org_article_8ec3c4f0496546898e07784fae0d486c
hrcak_primary_oai_hrcak_srce_hr_329831
crossref_primary_10_21278_brod76206
PublicationCentury 2000
PublicationDate 2025-01-01
PublicationDateYYYYMMDD 2025-01-01
PublicationDate_xml – month: 01
  year: 2025
  text: 2025-01-01
  day: 01
PublicationDecade 2020
PublicationTitle Brodogradnja
PublicationYear 2025
Publisher Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje
Faculty of Mechanical Engineering and Naval Architecture
Publisher_xml – name: Sveučilište u Zagrebu Fakultet strojarstva i brodogradnje
– name: Faculty of Mechanical Engineering and Naval Architecture
SSID ssib009048252
ssj0000561915
Score 2.2970476
Snippet An adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface...
n adaptive proportional integral derivative (PID) controller based on the soft actor-critic (SAC) algorithm for trajectory control of unmanned surface vehicles...
SourceID doaj
hrcak
crossref
SourceType Open Website
Open Access Repository
Index Database
StartPage 1
SubjectTerms Deep reinforcement learning
PID tuning
soft actor critic
unmanned surface vehicle
Title Adaptive trajectory controller design for unmanned surface vehicles based on SAC-PID
URI https://hrcak.srce.hr/329831
https://doaj.org/article/8ec3c4f0496546898e07784fae0d486c
Volume 76
WOSCitedRecordID wos001464258500006&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1845-5859
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0000561915
  issn: 0007-215X
  databaseCode: DOA
  dateStart: 20050101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA4iHryIT1xfBBRvxTZJN8lxXRUFXQQV9lbSZILPrnQf4L93ktZlPXnxVkJpy8z0m2-G4RtCTnKmvJAcEgCTJqLk3UT7Mk98idwbMVkJF9X1b-VgoIZDfb-w6ivMhDXywI3hzhRYboVPg6656CqtIJVSCW8gdUJ1bUBfZD0LxVSIJI2ByXI277YEnqzbdQapTDDPDRut0iBwrs5KBCtEhbD2aCE7RRF_JK3PtTVvC1nnap2stXSR9prP3CBLUG2S3bsorV1_0VM6MBgqtP1Dt8hjz5nPgGB0UpvX2JH_ou04-jvU1MWBDYpMlU6rDxNAlo6ntTcW6Aye44wcDZnN0VFFH3r95P7mYps8XV0-9q-TdnFCYjlnE8RVyw0wYUqH1R0DZ610GfMl2gUzemoyBzYI0zuB-V9bnwM3QvrMiDyzecl3yHI1qmCXUMc9Egqw0MXCw4u8FJo7mWkTdHC8TDvk-MdgxWejj1FgXRHNWszN2iHnwZbzO4KmdTxATxetp4u_PN0hp9ETv57SnIxrC3hZcKYVz_b-4237ZJWFVb-x23JAlif1FA7Jip1NXsb1UYy2bwgx1sE
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Adaptive+trajectory+controller+design+for+unmanned+surface+vehicles+based+on+SAC-PID&rft.jtitle=Brodogradnja&rft.au=Wei+Guan&rft.au=Zhaoyong+Xi&rft.au=Zhewen+Cui&rft.au=Xianku+Zhang&rft.date=2025-01-01&rft.pub=Faculty+of+Mechanical+Engineering+and+Naval+Architecture&rft.issn=0007-215X&rft.eissn=1845-5859&rft.volume=76&rft.issue=2&rft.spage=1&rft.epage=22&rft_id=info:doi/10.21278%2Fbrod76206&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_8ec3c4f0496546898e07784fae0d486c
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0007-215X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0007-215X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0007-215X&client=summon