Flattened Clos: Designing high-performance deadlock-free expander data center networks using graph contraction

S Zhao, Q Zhang, P Cao, X Zhang, X Wang… - … USENIX Symposium on …, 2023 - usenix.org
Clos networks have witnessed the successful deployment of RoCE in production data
centers. However, as DCN bandwidth keeps increasing, building Clos networks is becoming …

Flexible silicon photonic architecture for accelerating distributed deep learning

Z Wu, L Yuan Dai, Y Wang, S Wang… - Journal of Optical …, 2024 - opg.optica.org
The increasing size and complexity of deep learning (DL) models have led to the wide
adoption of distributed training methods in datacenters (DCs) and high-performance …

Threshold-based routing-topology co-design for optical data center

P Cao, S Zhao, D Zhang, Z Liu, M Xu… - IEEE/ACM …, 2023 - ieeexplore.ieee.org
Despite the bandwidth scaling limit of electrical switching and the high cost of building Clos
data center networks (DCNs), the adoption of optical DCNs is still limited. There are two …

FIGRET: Fine-Grained Robustness-Enhanced Traffic Engineering

X Liu, S Zhao, Y Cui, X Wang - Proceedings of the ACM SIGCOMM 2024 …, 2024 - dl.acm.org
Traffic Engineering (TE) is critical for improving network performance and reliability. A key
challenge in TE is the management of sudden traffic bursts. Existing TE schemes either do …

Nonblocking conditions for Clos fabrics with non-uniform switch radixes

T Inoue, T Mano, K Anazawa, T Uno - Journal of Optical …, 2024 - opg.optica.org
Datacenter networks (DCNs) evolve over years and so comprise switches
from different generations. Thus, each stage/layer of the Clos fabric may consist of switches …

Orchid: enhancing HPC interconnection networks through infrequent topology reconfiguration

L Qin, H Gu, X Yu, Z Cai, J Liu - Journal of Optical Communications …, 2024 - opg.optica.org
Interconnection networks are key components of high-performance computing (HPC)
systems. As HPC evolves towards the exascale era, providing sufficient bisection bandwidth …

A case for server-scale photonic connectivity

AV Kumar, A Devraj, D Bunandar, R Singh - Proceedings of the 23rd …, 2024 - dl.acm.org
The commoditization of machine learning is fuelling the demand for compute required to
both train large models and infer from them. At the same time, scaling the performance of …

COCSN: A Multi-tiered Cascaded Optical Circuit Switching Network for Data Center

S Li, H Gu, X Yu, H Huang, S Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
A cascaded network represents a classic scaling-out model in traditional electrical switching
networks. Recent proposals have integrated optical circuit switching at specific tiers of these …

Finding Your Niche: An Evolutionary Approach to HPC Topologies

SJ Young, J Suetterlein, J Firoz… - 2023 IEEE High …, 2023 - ieeexplore.ieee.org
Traditional interconnection network design approaches focus on building general network
topologies by optimizing the bisection bandwidth or minimizing the network's diameter to …

Reducing Reconfiguration Time in Hybrid Optical-Electrical Datacenter Networks

S Zhang, S Shan, S Zhao - Proceedings of the 7th Asia-Pacific …, 2023 - dl.acm.org
We study how to reduce the reconfiguration time in hybrid optical-electrical Datacenter
Networks (DCNs). With a layer of Optical Circuit Switches (OCSes), hybrid optical-electrical …