RAIL: A Case for Redundant Arrays of Inexpensive Links in Data Center Networks

  • Danyang Zhuo ,
  • Manya Ghobadi ,
  • Ratul Mahajan ,
  • ,
  • Xuan Zou ,
  • Hang Guan ,
  • Arvind Krishnamurthy ,
  • Thomas Anderson

USENIX NSDI 2017 |

While there are many proposals to reduce the cost of data center networks (DCN), little attention has been paid to the role played by the physical links that carry packets. By studying over 300K optical links across many production DCNs, we show that these links are operating quite conservatively relative to the requirements in the IEEE standards. Motivated by this observation, to reduce DCN costs, we propose using transceivers—a key contributor to DCN cost—beyond their currently specified limit. Our experiments with multiple commodity transceivers show that their reach can be “stretched” 1.6 to 4 times their specification. However, with stretching, the performance of 1–5% of the DCN paths can fall below the IEEE standard. We develop RAIL, a system to ensure that in such a network, applications only use paths that meet their performance needs. Our proposal can reduce the network cost by up to 10% for 10Gbps networks and 44% for 40Gbps networks, without affecting the applications’ performance.