HPC Interconnects

Overview

A high performance computational cluster is only as fast as the weakest link. And if that link is the interconnect technology between cluster nodes, you could very well find that you have just deployed the worlds fastest door-stop! The ideal HPC cluster configuration has a balanced specification of processing power, memory capacity & bandwidth and fast inter-node communication channels.

There are two common ways to improve inter-node communication bandwidth:

  • Channel bonding
  • Cluster interconnects

Channel bonding involves combining two or more slower communication channels to build-up performance by load-balancing the data flow. Typically the communication channels consist of Fast or Gigabit Ethernet ports integrated on the system motherboard. This method is extremely cheap as it utilizes ubiquitous Ethernet technology and requires very little additional expertise to configure, deploy and manage. The disadvantage with this method is that while net bandwidth may increase, so would latency. Also some applications may need to be altered to take advantage of the load-balancing nature of a bonded communication channel.

Over the years, as multi-node HPC clusters have taken a precedence over traditional vertical scaling supercomputing technology, a new breed of interconnect options have been developed. These are highly specialized ASICs and adapters that are tuned specifically for the high-bandwidth, low-latency requirements of HPC cluster environments and applications. The idea is that interconnect technology should be able to move large amounts of data quickly enough around 10s or even 100s of nodes, that the cost-performance-scalability ratio of HPC clusters significantly overshadows that of vertical-scaling SMPs.

Highly specialized or standards-based high-speed interconnects augment in-built Gigabit Ethernet technology. If your application message passing bandwidth and latency requirements cannot be met by gigabit Ethernet, the following high-speed interconnects provide excellent alternatives.

Host Channel Adapters

Myricom Myri-10G

Silverstorm 7000

Intel Pro/10GbE

Pathscale Infinipath HTX

Illustration

Myri10G-small

Silverstorm-7000-small

Intel Pro/10GbE small

PathScaleInfiniHTX-small

Technology

10GbE Ethernet & 10G Myrinet

Infiniband 4X

10GbE Ethernet

Infiniband 4X (Hypertransport)

System interface

PCI-Express

PCI-X

PCI-X

HTX

Network interface

10GBase-CX4

IB 4X

10GBase-CX4

IB 4X

Deployment

very high bandwidth, low latency

high bandwidth, medium latency

medium-to-high bandwidth, high latency

high bandwidth, very low latency

Cost ratio

medium

high

low

medium

Compatibility

Xeon & Opteron platforms

Xeon & Opteron platforms

Xeon & Opteron platforms

Opteron platforms

Bandwidth (MB/s)

~1,200

~890

~876

~954

Latency (μs)

2.0

3.8

10.1

1.29