Angara interconnect makes GPU-based Desmos supercomputer an efficient tool for molecular dynamics calculations V Stegailov, E Dlinnova, T Ismagilov, M Khalilov, N Kondratyuk, ... The International Journal of High Performance Computing Applications 33 (3 …, 2019 | 62 | 2019 |
Performance analysis of CUDA, OpenACC and OpenMP programming models on TESLA V100 GPU M Khalilov, A Timoveev Journal of Physics: Conference Series 1740 (1), 012056, 2021 | 41 | 2021 |
Early performance evaluation of the hybrid cluster with torus interconnect aimed at molecular-dynamics simulations V Stegailov, A Agarkov, S Biryukov, T Ismagilov, M Khalilov, N Kondratyuk, ... Parallel Processing and Applied Mathematics: 12th International Conference …, 2018 | 22 | 2018 |
Performance of supercomputers based on Angara interconnect and novel AMD CPUs/GPUs A Shamsutdinov, M Khalilov, T Ismagilov, A Piryugin, S Biryukov, ... International Conference on Mathematical Modeling and Supercomputer …, 2020 | 12 | 2020 |
Optimization of MPI-process mapping for clusters with Angara interconnect MR Khalilov, AV Timofeev Lobachevskii Journal of Mathematics 39, 1188-1198, 2018 | 10 | 2018 |
{OSMOSIS}: Enabling {Multi-Tenancy} in Datacenter {SmartNICs} M Khalilov, M Chrapek, S Shen, A Vezzu, T Benz, S Di Girolamo, ... 2024 USENIX Annual Technical Conference (USENIX ATC 24), 247-263, 2024 | 2 | 2024 |
Towards OpenUCX and GPUDirect technology support for the Angara interconnect M Khalilov, A Timofeev, D Polyakov Russian Supercomputing Days, 591-603, 2022 | 1 | 2022 |
Leveraging Interconnect QoS Capabilities for Congestion-Aware MPI Communication M Khalilov, A Slinka, Q Zhang 2021 Workshop on Exascale MPI (ExaMPI), 1-8, 2021 | 1 | 2021 |
Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI M Khalilov, S Di Girolamo, M Chrapek, R Nudelman, G Bloch, T Hoefler arXiv preprint arXiv:2408.13356, 2024 | | 2024 |
Understanding Data Movement in Tightly Coupled Heterogeneous Systems: A Case Study with the Grace Hopper Superchip L Fusco, M Khalilov, M Chrapek, G Chukkapalli, T Schulthess, T Hoefler arXiv preprint arXiv:2408.11556, 2024 | | 2024 |
HEAR: Homomorphically Encrypted Allreduce M Chrapek, M Khalilov, T Hoefler Proceedings of the International Conference for High Performance Computing …, 2023 | | 2023 |
Implementation of OpenUCX framework and GPUDirect technology support for the Angara interconnect MR Khalilov, AV Timofeev Параллельные вычислительные технологии (ПаВТ'2022), 130-130, 2022 | | 2022 |
Evaluating OpenMP, OpenACC and CUDA parallel programming models for the GPU: Performance Analysis M Khalilov, A Timofeev Параллельные вычислительные технологии (ПаВТ'2020), 40-51, 2020 | | 2020 |
Topology-Aware Mapping of MPI Processes for Clusters with Torus Interconnect Angara MR Khalilov, AV Timofeev Параллельные вычислительные технологии (ПаВТ'2018), 395-395, 2018 | | 2018 |
Early evaluation of the hybrid cluster with torus interconnect aimed at cost-effective molecular-dynamics simulations S Vladimir, K Nikolay, KM Ruslanovich, TA Vladimirovich, VS Vecher, ... Lecture Notes in Computer Science, 2017 | | 2017 |