Hostname: page-component-89b8bd64d-4ws75 Total loading time: 0 Render date: 2026-05-06T05:26:15.099Z Has data issue: false hasContentIssue false

Reliable multicast using remote direct memory access (RDMA) over a passive optical cross-connect fabric enhanced with wavelength division multiplexing (WDM)

Published online by Cambridge University Press:  23 October 2019

Kin-Wai Leong
Affiliation:
Viscore Technologies Inc, 15, Fitzgerald Road, Ottawa, K2H 9G1, Canada Rockport Networks Inc., 515 Legget Drive, Ottawa, ON K2K 3G4, Canada
Zhilong Li
Affiliation:
Viscore Technologies Inc, 15, Fitzgerald Road, Ottawa, K2H 9G1, Canada
Yunqu Leon Liu*
Affiliation:
Viscore Technologies Inc, 15, Fitzgerald Road, Ottawa, K2H 9G1, Canada
*
Corresponding author: Yunqu Leon Liu Email: leon.liu@viscore.com

Abstract

It has been well studied that reliable multicast enables consistency protocols, including Byzantine Fault Tolerant protocols, for distributed systems. However, no transport-layer reliable multicast is used today due to limitations with existing switch fabrics and transport-layer protocols. In this paper, we introduce a layer-4 (L4) transport based on remote direct memory access (RDMA) datagram to achieve reliable multicast over a shared optical medium. By connecting a cluster of networking nodes using a passive optical cross-connect fabric enhanced with wavelength division multiplexing, all messages are broadcast to all nodes. This mechanism enables consistency in a distributed system to be maintained at a low latency cost. By further utilizing RDMA datagram as the L4 protocol, we have achieved a low-enough message loss-ratio (better than one in 68 billion) to make a simple Negative Acknowledge (NACK)-based L4 multicast practical to deploy. To our knowledge, it is the first multicast architecture able to demonstrate such low message loss-ratio. Furthermore, with this reliable multicast transport, end-to-end latencies of eight microseconds or less (< 8us) have been routinely achieved using an enhanced software RDMA implementation on a variety of commodity 10G Ethernet network adapters.

Information

Type
Industrial Technology Advances
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCSA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-ncsa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
Copyright © The Authors, 2019
Figure 0

Fig. 1. The ratio of $N^{\wedge} N$ to N! as a function of N.

Figure 1

Fig. 2. Optical distributed broadcast-select switch.

Figure 2

Fig. 3. 1:N fan-out crossbar view of ODBSS.

Figure 3

Fig. 4. Software RoCE latency measurement.

Figure 4

Fig. 5. Scale-out in multidimensions.

Figure 5

Fig. 6. The logical view of multicast enhancement.

Figure 6

Fig. 7. The physical implementation view of multicast enhancement.

Figure 7

Fig. 8. Proof-of-concept test-bed setup.

Figure 8

Fig. 9. Classic binomial multicast [5].