A review on Virtualized Infrastructure Managers with management and orchestration features in NFV architecture
K Kaur, V Mangat, K Kumar - Computer Networks, 2022 - Elsevier
Abstract Nowadays, Network Function Virtualization (NFV) is a growing and powerful
technology in the research community and IT world. Traditional computer networks consist of …
technology in the research community and IT world. Traditional computer networks consist of …
CRUM: Checkpoint-restart support for CUDA's unified memory
Unified Virtual Memory (UVM) was recently introduced with CUDA version 8 and the Pascal
GPU. The older CUDA programming style is akin to older large-memory UNIX applications …
GPU. The older CUDA programming style is akin to older large-memory UNIX applications …
Crac: Checkpoint-restart architecture for cuda with streams and uvm
T Jain, G Cooperman - SC20: International Conference for High …, 2020 - ieeexplore.ieee.org
The share of the top 500 supercomputers with NVIDIA GPUs is now over 25% and continues
to grow. While fault tolerance is a critical issue for supercomputing, there does not currently …
to grow. While fault tolerance is a critical issue for supercomputing, there does not currently …
Eliminating vulnerabilities by disabling unwanted functionality in binary programs
Driven by application diversification and market needs, software systems are integrating
new features rapidly. However, this “feature creep” can compromise software security, as …
new features rapidly. However, this “feature creep” can compromise software security, as …
System-level scalable checkpoint-restart for petascale computing
Fault tolerance for the upcoming exascale generation has long been an area of active
research. One of the components of a fault tolerance strategy is checkpointing. Petascale …
research. One of the components of a fault tolerance strategy is checkpointing. Petascale …
Distributed configuration, authorization and management in the cloud-based internet of things
Network-based deployments within the Internet of Things increasingly rely on the cloud-
controlled federation of individual networks to configure, authorize, and manage devices …
controlled federation of individual networks to configure, authorize, and manage devices …
MANA for MPI: MPI-agnostic network-agnostic transparent checkpointing
Transparently checkpointing MPI for fault tolerance and load balancing is a long-standing
problem in HPC. The problem has been complicated by the need to provide checkpoint …
problem in HPC. The problem has been complicated by the need to provide checkpoint …
A highly reliable metadata service for large-scale distributed file systems
Many massive data processing applications nowadays often need long, continuous, and
uninterrupted data accesses. Distributed file systems are used as the back-end storage to …
uninterrupted data accesses. Distributed file systems are used as the back-end storage to …
MANA-2.0: A future-proof design for transparent checkpointing of MPI at scale
MANA-2.0 is a scalable, future-proof design for transparent checkpointing of MPI-based
computations. Its network transparency (“network-agnostic”) feature ensures that MANA-2.0 …
computations. Its network transparency (“network-agnostic”) feature ensures that MANA-2.0 …
Smart scene management for IoT-based constrained devices using checkpointing
Typical devices of the Internet of Things are usually under-powered, and have limited RAM.
This is due to energy and cost concerns. Yet, IoT applications require increasingly complex …
This is due to energy and cost concerns. Yet, IoT applications require increasingly complex …