
SC’11 Roundup

November 18, 2011 at 3:53 pm PST

As usual, I’m exhausted — in a good way — at the end of an SC week.  Whew!

Thanks to all who came to see my demo (showing 5.17us NetPIPE MPI latency over Ethernet via Linux VFIO and Cisco’s “Palo” NIC — no, that’s not iWARP and it’s not IBoE, a.k.a. RoCE — see my prior post for a little more info), and thanks to all who came to the Open MPI BOF.  I counted about 100 people at the BOF.  The BOF slides are available, if you missed the actual event.

Brock and I did a [probably incredibly embarrassing] short video spot with Rich Brueckner at the end of the show (another in the RCE-Cast <--> insideHPC crossover series).  The convention announcer guy was literally saying “The show is over; please leave” over the PA while we were recording.  Whenever Rich gets around to posting the video, I think you’ll see why I usually stick to writing.  :-)

Read More »


SC’11 Cisco booth demo: Open MPI over Linux VFIO

November 14, 2011 at 5:06 pm PST

Linux VFIO (Virtual Function IO) is an emerging technology that allows direct access to PCI devices from userspace.  Although primarily designed as a hypervisor-bypass technology for virtualization uses, it can also be used in an HPC context.

Think of it this way: hypervisor bypass is somewhat similar to operating system (OS) bypass.  And OS bypass is a characteristic sought in many HPC low-latency networks these days.
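
To make “direct access from userspace” concrete, here is a minimal sketch of the kind of sequence a VFIO-based process goes through to open a PCI device and map one of its BARs.  The IOMMU group number, the PCI address, and the exact setup steps shown are illustrative assumptions, not the demo’s actual code:

    // Minimal sketch: open a PCI device via VFIO and mmap BAR0 from userspace.
    // Assumes the device is already bound to vfio-pci and sits in IOMMU group
    // 26 (the group number and PCI address are placeholders); no error checks.
    #include <fcntl.h>
    #include <sys/ioctl.h>
    #include <sys/mman.h>
    #include <linux/vfio.h>
    #include <cstdio>
    #include <cstring>

    int main() {
        // The container holds the IOMMU context shared by one or more groups.
        int container = open("/dev/vfio/vfio", O_RDWR);

        // Open the IOMMU group the NIC belongs to, attach it to the container,
        // and enable the type-1 IOMMU backend.
        int group = open("/dev/vfio/26", O_RDWR);
        ioctl(group, VFIO_GROUP_SET_CONTAINER, &container);
        ioctl(container, VFIO_SET_IOMMU, VFIO_TYPE1_IOMMU);

        // Get a file descriptor for the specific PCI device in that group.
        int device = ioctl(group, VFIO_GROUP_GET_DEVICE_FD, "0000:0b:00.0");

        // Ask where BAR0 lives and mmap it; after this, the process can touch
        // the NIC's registers directly, with no system calls in the data path.
        struct vfio_region_info reg;
        memset(&reg, 0, sizeof(reg));
        reg.argsz = sizeof(reg);
        reg.index = VFIO_PCI_BAR0_REGION_INDEX;
        ioctl(device, VFIO_DEVICE_GET_REGION_INFO, &reg);

        void *bar0 = mmap(NULL, reg.size, PROT_READ | PROT_WRITE,
                          MAP_SHARED, device, reg.offset);
        printf("BAR0 mapped at %p (%llu bytes)\n",
               bar0, (unsigned long long) reg.size);
        return 0;
    }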

Drop by the Cisco SC’11 booth (#1317) where we’ll be showing a technology preview demo of Open MPI utilizing Linux VFIO over the Cisco “Palo” family of first-generation hardware virtualized NICs (specifically, the P81E PCI form factor).  VFIO + hardware virtualized NICs allow benefits such as:

  • Low half round trip (HRT) ping-pong latencies over Ethernet via direct access to L2 from userspace (4.88us)
  • Hardware steering of inbound and outbound traffic to individual MPI processes

Let’s dive into these technologies a bit and explain how they benefit MPI.

Read More »


SC’11, Open MPI BOF, and 10 petaflops!

November 6, 2011 at 6:07 am PST

I’m sure most everyone has heard already, but the K supercomputer has been upgraded and now reaches over 10 petaflops.  Wow!

10.51 petaflops, actually, so if you round up, you can say that they “turned it up to 11.”  Ahem.

We’ll actually have Shinji Sumimoto from the K team speaking during the Open MPI BOF at SC’11.  Rolf vandeVaart from NVIDIA will also be discussing their role in Open MPI during the BOF.

We have the 12:15-1:15pm timeslot on Wednesday (room TCC 303); come join us to hear about the present status and future plans for Open MPI.


The MPI C++ Bindings

October 31, 2011 at 6:06 am PST

What a strange position I find myself in: the C++ bindings have become something of a divisive issue in the MPI Forum.  There are basically 3 groups in the Forum:

  1. Those who want to keep the C++ bindings deprecated.  Meaning: do not delete them, but do not add any C++ bindings for new MPI-3 functions.
  2. Those who want to un-deprecate the C++ bindings.  Meaning: add C++ bindings for all new MPI-3 functions.
  3. Those who want to delete the C++ bindings.  Meaning: kill.  Axe.  Demolish.  Remove.  Never speak of them again.

Let me explain.
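
For readers who have never used them, here is a minimal sketch of what the argument is about: the same trivial program written once against the C++ bindings and once against the C bindings (which C++ codes can, of course, call directly).  The USE_CXX_BINDINGS switch is purely illustrative:

    // A minimal sketch: the same program against the MPI C++ bindings and
    // against the C bindings.  USE_CXX_BINDINGS is just an illustrative
    // compile-time switch, not anything defined by the MPI standard.
    #include <mpi.h>
    #include <iostream>

    int main(int argc, char **argv) {
    #ifdef USE_CXX_BINDINGS
        // The (deprecated) MPI-2 C++ bindings: a thin class-based veneer.
        MPI::Init(argc, argv);
        int rank = MPI::COMM_WORLD.Get_rank();
        int size = MPI::COMM_WORLD.Get_size();
        std::cout << "Hello from " << rank << " of " << size << std::endl;
        MPI::Finalize();
    #else
        // The C bindings, which C++ codes can (and usually do) call directly.
        MPI_Init(&argc, &argv);
        int rank, size;
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &size);
        std::cout << "Hello from " << rank << " of " << size << std::endl;
        MPI_Finalize();
    #endif
        return 0;
    }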

Read More »


Shared Receive Queues

October 25, 2011 at 5:00 am PST

In my last post, I talked about the so-called eager RDMA optimization and the tradeoff it makes between resource consumption and latency.

Let’s talk about another optimization: shared receive queues.

Shared receive queues are not a new idea, and they’re certainly not exclusive to MPI implementations.  They let a single receiver service many senders while drawing receive buffers from one common pool, rather than dedicating a set of buffers to each individual sender.
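
To make that concrete, here is an illustrative sketch in InfiniBand verbs terms (a common transport underneath MPI implementations; this is purely an illustration, not Open MPI's actual code): one shared receive queue holds the common pool of receive buffers, and each per-peer queue pair simply points at it.

    // Illustrative sketch using InfiniBand verbs; assumes a protection domain
    // and completion queue already exist, and omits all error handling.
    #include <infiniband/verbs.h>

    // One shared receive queue: the common pool of receive buffers that all
    // incoming connections will draw from.
    ibv_srq *make_shared_rq(ibv_pd *pd) {
        ibv_srq_init_attr attr = {};
        attr.attr.max_wr  = 4096;   // size of the shared buffer pool
        attr.attr.max_sge = 1;
        return ibv_create_srq(pd, &attr);
    }

    // Each per-peer queue pair is created with its srq field pointing at the
    // shared queue, so no per-peer receive buffers need to be posted at all.
    ibv_qp *make_qp(ibv_pd *pd, ibv_cq *cq, ibv_srq *srq) {
        ibv_qp_init_attr attr = {};
        attr.send_cq          = cq;
        attr.recv_cq          = cq;
        attr.srq              = srq;   // the sharing happens here
        attr.qp_type          = IBV_QPT_RC;
        attr.cap.max_send_wr  = 256;
        attr.cap.max_send_sge = 1;
        return ibv_create_qp(pd, &attr);
    }

Receive buffers are then replenished into the shared queue with ibv_post_srq_recv() as they are consumed, regardless of which peer’s traffic consumed them.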

Read More »
