Search Results - "Computing methodologies Distributed computing methodologies Distributed programming languages"

Refine Results
  1. 1

    Skywalker: Efficient Alias-Method-Based Graph Sampling and Random Walk on GPUs by Wang, Pengyu, Li, Chao, Wang, Jing, Wang, Taolei, Zhang, Lu, Leng, Jingwen, Chen, Quan, Guo, Minyi

    Published: IEEE 01.09.2021
    “…Graph sampling and random walk operations, capturing the structural properties of graphs, are playing an important role today as we cannot directly adopt…”
    Get full text
    Conference Proceeding
  2. 2

    MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems by Hsia, Samuel, Golden, Alicia, Acun, Bilge, Ardalani, Newsha, DeVito, Zachary, Wei, Gu-Yeon, Brooks, David, Wu, Carole-Jean

    Published: IEEE 29.06.2024
    “…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”
    Get full text
    Conference Proceeding
  3. 3

    AdaGL: Adaptive Learning for Agile Distributed Training of Gigantic GNNs by Zhang, Ruisi, Javaheripi, Mojan, Ghodsi, Zahra, Bleiweiss, Amit, Koushanfar, Farinaz

    Published: IEEE 09.07.2023
    “…Distributed GNN training on contemporary massive and densely connected graphs requires information aggregation from all neighboring nodes, which leads to an…”
    Get full text
    Conference Proceeding
  4. 4

    Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics by Dathathri, Roshan, Gill, Gurbinder, Hoang, Loc, Jatala, Vishwesh, Pingali, Keshav, Nandivada, V. Krishna, Dang, Hoang-Vu, Snir, Marc

    ISSN: 2641-7936
    Published: IEEE 01.09.2019
    “…Distributed graph analytics systems for CPUs, like D-Galois and Gemini, and for GPUs, like D-IrGL and Lux, use a bulk-synchronous parallel (BSP) programming…”
    Get full text
    Conference Proceeding
  5. 5

    Auto-parallelizing stateful distributed streaming applications by Schneider, Scott, Hirzel, Martin, Gedik, Bugra, Wu, Kun-Lung

    Published: ACM 01.09.2012
    “…Streaming applications transform possibly infinite streams of data and often have both high throughput and low latency requirements. They are comprised of…”
    Get full text
    Conference Proceeding
  6. 6

    Legate NumPy: Accelerated and Distributed Array Computing by Bauer, Michael, Garland, Michael

    ISSN: 2167-4337
    Published: ACM 17.11.2019
    “…NumPy is a popular Python library used for performing array-based numerical computations. The canonical implementation of NumPy used by most programmers runs…”
    Get full text
    Conference Proceeding
  7. 7

    Risk and Mitigation of Nondeterminism in Distributed Cyber-Physical Systems by Bateni, Soroush, Lohstroh, Marten, Wong, Hou Seng, Kim, Hokeun, Lin, Shaokai, Menard, Christian, Lee, Edward A.

    ISSN: 2832-6520
    Published: ACM 21.09.2023
    “…Asynchronous frameworks for distributed embedded systems, like ROS and MQTT, are increasingly used in safety-critical applications such as autonomous driving,…”
    Get full text
    Conference Proceeding
  8. 8

    Scalable and Consistent Graph Neural Networks for Distributed Mesh-based Data-driven Modeling by Barwey, Shivam, Balin, Riccardo, Lusch, Bethany, Patel, Saumil, Balakrishnan, Ramesh, Pal, Pinaki, Maulik, Romit, Vishwanath, Venkatram

    Published: IEEE 17.11.2024
    “…This work develops a distributed graph neural network (GNN) methodology for mesh-based modeling applications using a consistent neural message passing layer…”
    Get full text
    Conference Proceeding
  9. 9

    Optimizing Distributed ML Communication with Fused Computation-Collective Operations by Punniyamurthy, Kishore, Hamidouche, Khaled, Beckmann, Bradford M.

    Published: IEEE 17.11.2024
    “…Machine learning models are distributed across multiple nodes using numerous parallelism strategies. The resulting collective communication is often on the…”
    Get full text
    Conference Proceeding
  10. 10

    Stratified sampling for even workload partitioning by Paudel, Jeeva, Amaral, Jose Nelson

    Published: ACM 01.08.2014
    “…This work presents a novel algorithm, Workload Partitioning and Scheduling (WPS), for evenly partitioning the computational workload of large…”
    Get full text
    Conference Proceeding
  11. 11

    Unifying Artifacts and Activities in a Visual Tool for Distributed Software Development Teams by Froehlich, Jon, Dourish, Paul

    ISBN: 9780769521633, 0769521630
    ISSN: 0270-5257
    Published: Washington, DC, USA IEEE Computer Society 23.05.2004
    “…In large projects, software developers struggle with two sources of complexity the complexity of the code itself, and the complexity of of the process of…”
    Get full text
    Conference Proceeding
  12. 12

    Legate Sparse: Distributed Sparse Computing in Python by Yadav, Rohan, Lee, Wonchan, Elibol, Melih, Patti, Taylor Lee, Papadakis, Manolis, Garland, Michael, Aiken, Alex, Kjolstad, Fredrik, Bauer, Michael

    ISSN: 2167-4337
    Published: ACM 11.11.2023
    “…The sparse module of the popular SciPy Python library is widely used across applications in scientific computing, data analysis and machine learning. The…”
    Get full text
    Conference Proceeding
  13. 13

    Accelerating Communications in Federated Applications with Transparent Object Proxies by Pauloski, J. Gregory, Hayot-Sasson, Valerie, Ward, Logan, Hudson, Nathaniel, Sabino, Charlie, Baughman, Matt, Chard, Kyle, Foster, Ian

    ISSN: 2167-4337
    Published: ACM 11.11.2023
    “…Advances in networks, accelerators, and cloud services encourage programmers to reconsider where to compute-such as when fast networks make it cost-effective…”
    Get full text
    Conference Proceeding
  14. 14

    CUDASTF: Bridging the Gap Between CUDA and Task Parallelism by Augonnet, Cedric, Alexandrescu, Andrei, Sidelnik, Albert, Garland, Michael

    Published: IEEE 17.11.2024
    “…Organizing computation as asynchronous tasks with data-driven dependencies is a simple and efficient model for single- and multi-GPU programs. Sequential Task…”
    Get full text
    Conference Proceeding
  15. 15

    Managing Workflow Malleability in Urgent Computing for Earthquake Alerts by Ejarque, Jorge, Monterrubio-Velasco, Marisol, Bhihe, Cedric, Pienkowska, Marta, De La Puente, Josep, Badia, Rosa M.

    Published: IEEE 17.11.2024
    “…When large earthquakes happen, first responders need fast and accurate information regarding their impact. UCIS4EQ is an urgent computing platform that…”
    Get full text
    Conference Proceeding
  16. 16

    Parallelized Multi-Agent Bayesian Optimization in Lava by Snyder, Shay, Gobin, Derek, Clerico, Victoria, Risbud, Sumedh R., Parsa, Maryam

    Published: IEEE 30.07.2024
    “…In parallel with the continuously increasing parameter space dimensionality, search and optimization algorithms should support distributed parameter…”
    Get full text
    Conference Proceeding
  17. 17

    Multi-layer faults in the architectures of mobile, context-aware adaptive applications: a position paper by Sama, Michele, Rosenblum, David S, Wang, Zhimin, Elbaum, Sebastian

    ISBN: 1605580228, 9781605580227
    ISSN: 0270-5257
    Published: 10.05.2008
    “…Five cellphones are sold every second, and there are four times more cellphones than computers, meaning there are some billions of mobile handheld devices in…”
    Get full text
    Journal Article
  18. 18

    Asynchronous Distributed-Memory Parallel Algorithms for Influence Maximization by Singhal, Shubhendra Pal, Hati, Souvadra, Young, Jeffrey, Sarkar, Vivek, Hayashi, Akihiro, Vuduc, Richard

    Published: IEEE 17.11.2024
    “…Influence maximization (IM) is the problem of finding the k most influential nodes in a graph. We propose distributed-memory parallel algorithms for the two…”
    Get full text
    Conference Proceeding
  19. 19

    Enabling Low-Overhead HT-HPC Workflows at Extreme Scale using GNU Parallel by Maheshwari, Ketan, Arndt, William, Karimi, Ahmad Maroof, Yin, Junqi, Suter, Frederic, Johnson, Seth, Da Silva, Rafael Ferreira

    Published: IEEE 17.11.2024
    “…GNU Parallel is a versatile and powerful tool for process parallelization widely used in scientific computing. This paper demonstrates its effective…”
    Get full text
    Conference Proceeding
  20. 20

    Parsl+CWL: Towards Combining the Python and CWL Ecosystems by Karle, Nishchay, Clifford, Ben, Babuji, Yadu, Chard, Ryan, Katz, Daniel S., Chard, Kyle

    Published: IEEE 17.11.2024
    “…The Common Workflow Language (CWL) is a widely adopted language for defining and sharing computational workflows. It is designed to be independent of the…”
    Get full text
    Conference Proceeding