FIGRET: Fine-Grained Robustness-Enhanced Traffic Engineering

X Liu, S Zhao, Y Cui, X Wang - Proceedings of the ACM SIGCOMM 2024 …, 2024 - dl.acm.org
Traffic Engineering (TE) is critical for improving network performance and reliability. A key
challenge in TE is the management of sudden traffic bursts. Existing TE schemes either do …

Nonblocking conditions for Clos fabrics with non-uniform switch radixes

T Inoue, T Mano, K Anazawa, T Uno - Journal of Optical …, 2024 - opg.optica.org
<? TeX 2pc 0pt?> Datacenter networks (DCNs) evolve over years and so comprise switches
from different generations. Thus, each stage/layer of the Clos fabric may consist of switches …

Graph-Driven Insights for Heterogeneous Data Center Networks: Leveraging Converters With a Novel Graph Transformer Approach

X Ji, C Xu, Z Zhang, L Zhong, K Gao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Data centers extensively employ topologies such as CLOS, BCube, and Xpander to manage
their large networks of devices. Data center networks (DCN), which rely on traditional …

Machine Learning-Based Monitoring Trail: On Fast and Accurate Optical Path Failure Localization in All-Optical DCNs

Z Zhao, L Guo, B Wu - Journal of Lightwave Technology, 2024 - ieeexplore.ieee.org
We propose Machine Learning-based Monitoring Trail (mlm-trail) to monitor optical path
failures for large model training. Existing monitoring trail (m-trail) method can only localize …

Advancing Data Center Networks: A Focus on Energy and Cost Efficiency

Z Chkirbene, R Hamila, A Al-Dweik, T Khattab - IEEE Access, 2023 - ieeexplore.ieee.org
Data centers serve as the backbone for cloud computing, enterprise services, and
infrastructure-based offerings. One area of ongoing research in data center networking …

FC+: Near-optimal Deadlock-free Expander Data Center Networks

X Zhang, P Cao, Y Lyu, Q Zhang, S Zhao… - 2023 IEEE Intl Conf …, 2023 - ieeexplore.ieee.org
Expander networks have gained attention as a cost-efficient alternative to expensive Clos
networks in data centers. However, they face challenges with deadlocks caused by the …

Network Load Balancing with Parallel Flowlets for AI Training Clusters

P Cao, W Cheng, S Zhao, Y Xiong - … of the 2024 SIGCOMM Workshop on …, 2024 - dl.acm.org
Unlike traditional data center traffic, AI training traffic primarily consists of large-size flows
that are fewer in number. This characteristic poses a challenge in balancing routing …