Suchergebnisse - Distributed MultiThreaded Checkpointing DMTCP

  • Treffer 1 - 12 von 12
Treffer weiter einschränken
  1. 1

    Checkpointing Tools in a Supercomputer Center von Savin, G. I., Shabanov, B. M., Fedorov, R. S., Baranov, A. V., Telegin, P. N.

    ISSN: 1995-0802, 1818-9962
    Veröffentlicht: Moscow Pleiades Publishing 01.12.2020
    Veröffentlicht in Lobachevskii journal of mathematics (01.12.2020)
    “… Berkeley Lab Checkpoint/Restart (BLCR), Checkpoint Restore In Userspace (CRIU), and Distributed MultiThreaded Checkpointing (DMTCP) tools are examined …”
    Volltext
    Journal Article
  2. 2

    DMTCP: Transparent checkpointing for cluster computations and the desktop von Ansel, J., Arya, K., Cooperman, G.

    ISBN: 9781424437511, 1424437512
    ISSN: 1530-2075
    Veröffentlicht: IEEE 01.05.2009
    “… DMTCP (distributed multithreaded checkpointing) is a transparent user-level checkpointing package for distributed applications …”
    Volltext
    Tagungsbericht
  3. 3

    Optimizing Checkpoint-Restart Mechanisms for HPC with DMTCP in Containers at NERSC von Timalsina, Madan, Gerhardt, Lisa, Tyler, Nicholas, Blaschke, Johannes P, Arndt, William

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 26.07.2024
    Veröffentlicht in arXiv.org (26.07.2024)
    “… ). It focuses on the use of Distributed MultiThreaded CheckPointing (DMTCP) in various computational settings, including both within and outside of containers …”
    Volltext
    Paper
  4. 4

    Adapting the DMTCP Plugin Model for Checkpointing of Hardware Emulation von Garg, Rohan, Arya, Kapil, Cao, Jiajun, Cooperman, Gene, Evans, Jeff, Garg, Ankit, Rosenberg, Neil A, Suresh, K

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 02.03.2017
    Veröffentlicht in arXiv.org (02.03.2017)
    “… The new plugin model for the upcoming version 3.0 of DMTCP (Distributed MultiThreaded Checkpointing …”
    Volltext
    Paper
  5. 5

    Checkpointing SPAdes for Metagenome Assembly: Transparency versus Performance in Production von Jain, Twinkle, Wang, Jie

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 04.03.2021
    Veröffentlicht in arXiv.org (04.03.2021)
    “… : Distributed MultiThreaded CheckPointing) to long-running production workloads of SPAdes. This work has exposed several bugs and limitations of DMTCP, which were fixed to support the large memory and fragmented intermediate files of SPAdes …”
    Volltext
    Paper
  6. 6

    DMTCP: Transparent Checkpointing for Cluster Computations and the Desktop von Ansel, Jason, Arya, Kapil, Cooperman, Gene

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 24.02.2009
    Veröffentlicht in arXiv.org (24.02.2009)
    “… DMTCP (Distributed MultiThreaded CheckPointing) is a transparent user-level checkpointing package for distributed applications …”
    Volltext
    Paper
  7. 7

    Use of checkpoint-restart for complex HEP software on traditional architectures and Intel MIC von Arya, Kapil, Cooperman, Gene, Dotti, Andrea, Elmer, Peter

    ISSN: 1742-6596, 1742-6588, 1742-6596
    Veröffentlicht: Bristol IOP Publishing 01.01.2014
    Veröffentlicht in Journal of physics. Conference series (01.01.2014)
    “… (Distributed Multithreaded Checkpointing) package. We analyze both single- and multi-threaded applications and test on both standard Intel x86 architectures and on Intel MIC …”
    Volltext
    Journal Article
  8. 8

    Use of checkpoint-restart for complex HEP software on traditional architectures and Intel MIC von Arya, Kapil, Cooperman, Gene, Dotti, Andrea, Elmer, Peter

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 22.01.2014
    Veröffentlicht in arXiv.org (22.01.2014)
    “… (Distributed Multithreaded Checkpointing) package. We analyze both single- and multi-threaded applications and test on both standard Intel x86 architectures and on Intel MIC …”
    Volltext
    Paper
  9. 9

    Performance evaluation of checkpoint/restart techniques: For MPI applications on Amazon cloud von Azeem, Basma Abdel, Helal, Manal

    Veröffentlicht: Faculty of Computers & Information - Cairo Univers 01.12.2014
    “… (Distributed Multithreaded Checkpointing (DMTCP) and Berkeley Lab Checkpoint/Restart library (BLCR …”
    Volltext
    Tagungsbericht
  10. 10

    Be Kind, Rewind: Checkpoint & Restore Capability for Improving Reliability of Large-Scale Semiconductor Design von Ljubuncic, Igor, Rozenfeld, Avikam, Goldis, Andrew, Giri, Ravi

    Veröffentlicht: IEEE 01.09.2014
    “… Intel's chip design run in a large-scale globally distributed environment with 600,000 cores …”
    Volltext
    Tagungsbericht
  11. 11

    Temporal Debugging using URDB von Visan, Ana Maria, Polyakov, Artem, Solanki, Praveen S, Arya, Kapil, Denniston, Tyler, Cooperman, Gene

    ISSN: 2331-8422
    Veröffentlicht: Ithaca Cornell University Library, arXiv.org 27.10.2009
    Veröffentlicht in arXiv.org (27.10.2009)
    “… ) support for today's multi-core architectures; (iii) reversible debugging of multi-process and distributed computations; and (iv …”
    Volltext
    Paper
  12. 12

    Unibus: Aspects of heterogeneity and fault tolerance in cloud computing von Slawinska, Magdalena, Slawinski, Jaroslaw, Sunderam, Vaidy

    ISBN: 9781424465330, 1424465338
    Veröffentlicht: IEEE 01.04.2010
    “… In order to support fault tolerance we use DMTCP (Distributed MultiThreaded CheckPointing …”
    Volltext
    Tagungsbericht