Celerity-RSim: Porting Light Propagation Simulation to Accelerator Clusters Using a High-Level API
| Published in: | International journal of parallel programming Vol. 53; no. 3; p. 17 |
|---|---|
| Main Authors: | Thoman, Peter; Gschwandtner, Philipp; Molina Heredina, Facundo; Fahringer, Thomas |
| Format: | Journal Article |
| Language: | English |
| Published: | New York: Springer US; Springer Nature B.V, 01.06.2025 |
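| Subjects: | Application programming interface; Clusters; Computer Science; Processor Architectures; Simulation; Software Engineering/Programming and Operating Systems; Theory of Computation |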
| ISSN: | 0885-7458, 1573-7640 |
| Abstract | Time-of-Flight (ToF) camera systems are increasingly capable of analyzing larger 3D spaces and providing more detailed and precise results. To increase the speed-to-solution during development, testing and validation of such systems, light propagation simulation is employed. One such simulation, RSim, was previously performed on single workstations; however, the increase in detail required for newer ToF hardware necessitates cluster-level parallelism in order to maintain an experiment latency which enables productive design work. Celerity is a high-level parallel API and runtime system for clusters of accelerators intended to simplify the development of domain science applications. It automatically manages data and work distribution, while also transparently enabling asynchronous compute and communication overlapping. In this paper, we present a use case study of porting the full RSim application to GPU clusters using the Celerity system. In order to improve scalability, a new parallelization scheme was employed for the core simulation task, and Celerity was extended with a high-level split constraints feature which enables this scheme. We present strong- and weak-scaling experiments for the resulting application on three accelerator clusters and up to 128 GPUs, and also evaluate the relative programming effort required to distribute the application on multiple GPUs using different APIs. |
|---|---|
| ArticleNumber | 17 |
| Author | Thoman, Peter; Molina Heredina, Facundo; Fahringer, Thomas; Gschwandtner, Philipp |
| Author_xml | 1. Thoman, Peter (peter.thoman@uibk.ac.at), University of Innsbruck; 2. Gschwandtner, Philipp, University of Innsbruck; 3. Molina Heredina, Facundo, PH3 GmbH; 4. Fahringer, Thomas, University of Innsbruck |
| ContentType | Journal Article |
| Copyright | The Author(s) 2025; Copyright Springer Nature B.V. Jun 2025 |
| DOI | 10.1007/s10766-025-00787-2 |
| Discipline | Computer Science |
| GrantInformation_xml | – fundername: University of Innsbruck and Medical University of Innsbruck |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 3 |
| Keywords | Distributed memory; Productivity; Celerity; Parallel programming; Light propagation; Multi-gpu; SYCL; HPC; GPU computing |
| OpenAccessLink | https://link.springer.com/10.1007/s10766-025-00787-2 |
| PublicationDateYYYYMMDD | 2025-06-01 |
| PublicationTitle | International journal of parallel programming |
| PublicationTitleAbbrev | Int J Parallel Prog |
| PublicationYear | 2025 |
| Publisher | Springer US Springer Nature B.V |
| StartPage | 17 |
| SubjectTerms | Application programming interface; Clusters; Computer Science; Processor Architectures; Simulation; Software Engineering/Programming and Operating Systems; Theory of Computation |
| Title | Celerity-RSim: Porting Light Propagation Simulation to Accelerator Clusters Using a High-Level API |
| URI | https://link.springer.com/article/10.1007/s10766-025-00787-2 https://www.proquest.com/docview/3180788759 |
| Volume | 53 |