Search Results - "Loop Level Parallelization"
-
1
Multiple execution of the same MPI application: exploiting parallelism at hotspots with minimal code changes
ISSN: 1869-2672, 1869-2680Published: Berlin/Heidelberg Springer Berlin Heidelberg 01.12.2025Published in GEM international journal on geomathematics (01.12.2025)“…For a typical climate model, parallelization based on a domain decomposition is a predominant technique to speed up its computation as an MPI (Message Passing…”
Get full text
Journal Article -
2
INTERNATIONAL PATENT: MYNATIX LTD FILES APPLICATION FOR "HIGH-PERFORMANCE CODE PARALLELIZATION COMPILER WITH LOOP-LEVEL PARALLELIZATION"
Published: Washington, D.C HT Digital Streams Limited 03.11.2024Published in US Fed News Service, Including US State News (03.11.2024)Get full text
Newsletter -
3
Just-in-Time Compilation-Inspired Methodology for Parallelization of Compute Intensive Java Code
ISSN: 0254-7821, 2413-7219Published: Mehran University of Engineering and Technology 01.01.2017Published in Mehran University Research Journal of Engineering and Technology (01.01.2017)“…Compute intensive programs generally consume significant fraction of execution time in a small amount of repetitive code. Such repetitive code is commonly…”
Get full text
Journal Article -
4
Scalable parallel implementation of exact inference in Bayesian networks
ISBN: 9780769526126, 0769526128ISSN: 1521-9097Published: IEEE 2006Published in 12th International Conference on Parallel and Distributed Systems - (ICPADS'06) (2006)“…We present a scalable parallel implementation for exact inference in Bayesian networks. We explore two levels of parallelization: top level parallelization…”
Get full text
Conference Proceeding -
5
Hybrid Approach for Parallelization of Sequential Code with Function Level and Block Level Parallelization
ISBN: 0769525547, 9780769525549Published: IEEE 2006Published in PARELEC 2006 : International Symposium on Parallel Computing in Electrical Engineering : 13-17 September 2006, Bialystok, Poland (2006)“…Automatic parallelization of a sequential code is about finding parallel segments in the code and executing these segments parallely by sending them to…”
Get full text
Conference Proceeding -
6
Acceleration of Semiempirical QM/MM Methods through Message Passage Interface (MPI), Hybrid MPI/Open Multiprocessing, and Self-Consistent Field Accelerator Implementations
ISSN: 1549-9626, 1549-9626Published: United States 08.08.2017Published in Journal of chemical theory and computation (08.08.2017)“…The strategy and implementation of scalable and efficient semiempirical (SE) QM/MM methods in CHARMM are described. The serial version of the code was first…”
Get more information
Journal Article -
7
Programming parallel dense matrix factorizations and inversion for new-generation NUMA architectures
ISSN: 0743-7315, 1096-0848Published: Elsevier Inc 01.05.2023Published in Journal of parallel and distributed computing (01.05.2023)“… by proposing multi-domain implementations for DMFI plus a hybrid task- and loop-level parallelization…”
Get full text
Journal Article -
8
Combining Data Reuse With Data-Level Parallelization for FPGA-Targeted Hardware Compilation: A Geometric Programming Framework
ISSN: 0278-0070, 1937-4151Published: New York IEEE 01.03.2009Published in IEEE transactions on computer-aided design of integrated circuits and systems (01.03.2009)“…) decisions and loop-level parallelization, in the context of field-programmable-gate-array-targeted hardware compilation…”
Get full text
Journal Article -
9
Numerical methods used in fusion science numerical modeling
ISSN: 0022-3778, 1469-7807Published: Cambridge, UK Cambridge University Press 01.04.2015Published in Journal of Plasma Physics (01.04.2015)“…) parallel programing are introduced and the loop-level parallelization is shown as an example…”
Get full text
Journal Article -
10
A Study on the Improvement of Athena++ Solvers for HPC-AI Applications
ISSN: 2768-0940Published: IEEE 27.10.2025Published in International Symposium on Networks, Computers and Communications (27.10.2025)“… Key strategies include selective input array transfer to minimize memory overhead, CPU-side preprocessing of external functions, and loop-level parallelization within CUDA kernels…”
Get full text
Conference Proceeding -
11
Improved Affine Partition Algorithm for Compile-Time and Runtime Performance
ISSN: 1079-8587, 2326-005XPublished: 01.01.2011Published in Intelligent automation and soft computing (01.01.2011)Get full text
Journal Article -
12
Parallel Block Preconditioning Techniques for the Numerical Simulation of the Shallow Water Flow Using Finite Element Methods
ISSN: 0021-9991, 1090-2716Published: Elsevier Inc 01.11.1995Published in Journal of computational physics (01.11.1995)“…In this paper, we report our work on applying Krylov iterative methods, accelerated by parallelizable domain-decomposed (DD) preconditioners, to the solution…”
Get full text
Journal Article -
13
Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 09.05.2023Published in arXiv.org (09.05.2023)“…Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning…”
Get full text
Paper -
14
Two Roads to Parallelism: From Serial Code to Programming with STAPL
ISSN: 1530-2075Published: IEEE 01.05.2019Published in Proceedings - IEEE International Parallel and Distributed Processing Symposium (01.05.2019)“… In the first part we introduce the Hybrid Analysis (HA) compiler framework that can seamlessly integrate static and run-time analysis of memory references into a single framework capable of full automatic loop level parallelization…”
Get full text
Conference Proceeding -
15
An easily-implemented, block-based fast marching method with superior sequential and parallel performance
ISSN: 2331-8422Published: Ithaca Cornell University Library, arXiv.org 31.10.2018Published in arXiv.org (31.10.2018)“…The fast marching method is well-known for its worst-case optimal computational complexity in solving the Eikonal equation, and has been employed in numerous…”
Get full text
Paper -
16
A generalized GCD test based affine partitioning algorithm
ISBN: 142447616X, 9781424476169ISSN: 2160-1283Published: IEEE 01.12.2010Published in The 2nd International Conference on Information Science and Engineering (01.12.2010)“… The affine partitioning algorithm put forward by the SUIF compiler team has been proved to be successful in automatic discovery of the loop-level parallelization in programs…”
Get full text
Conference Proceeding -
17
Data reuse and parallelism in hardware compilation
Published: ProQuest Dissertations & Theses 01.01.2008“…This thesis presents a methodology to automatically determine a data memory organisation at compiletime, suitable to exploit data reuse and loop-level parallelization, in order to achieve high…”
Get full text
Dissertation -
18
Adaptive Parallelization and Optimization for the JAMAICA Chip Multi-Processor Architecture
ISBN: 0438589297, 9780438589292Published: ProQuest Dissertations & Theses 01.01.2007“…Chip Multi-Processor (CMP) systems are now very popular. This trend to have multicore and multi-threading makes the system increasingly difficult to target…”
Get full text
Dissertation -
19
Combining data reuse exploitationwith data-level parallelization for FPGA targeted hardware compilation: A geometric programming framework
ISBN: 9781424419609, 1424419603ISSN: 1946-147XPublished: IEEE 01.09.2008Published in 2008 International Conference on Field Programmable Logic and Applications (01.09.2008)“…) exploitation and loop-level parallelization, in the context of FPGA-targeted hardware compilation…”
Get full text
Conference Proceeding -
20
Parallelization and performance optimization of bioinformatics and biomedical applications targeted to advanced computer architectures
ISBN: 9780542227646, 0542227649Published: ProQuest Dissertations & Theses 01.01.2005“…In this dissertation, we focus on three representative applications targeted to advanced computer architectures: parallel Hmmpfam (Hidden Markov Model for…”
Get full text
Dissertation

