October 18, 2008

FCoE versus PCI Express Multi-Root IOV, and questionable analysis

I heard some interesting noise recently comparing PCI Express Mutli-Root IO Virtualization to FibreChannel over Ethernet.Or in networking parlance PCIe MR-IOV vs FCoE(now there is a mouthful!).

I’ll get to a technical analysis in a bit but wanted to put as a disclaimer I have not read the report.   Why?   Because I don’t deign to pay the price of a small car for the right to read less-than-insightful reports. Additionally, there is not enough value in the seat license to warrant the expenditure, and the analysis should be independent and be analytical.   While I cannot in any way prove a bias in the analysis that says ‘FCoE is a folly’ I would ask that the technical facts of the systems, the vendor support, ability to execute in the current economic climate, and the benefits today to customers be used to evaluate these technologies.  

This is not the case for many of the industry analysts, just a small few.  But those small few unfortunately ruin the bunch.  As many analysts like Drue Reeves at Burton Group, Vernon Turner at IDC, Rob Whitely at Forrester, Andreas Antonopoulos at Nemertes, and Zeus Kerravalla at Yankee Group know we sometimes agree and sometimes disagree with each other, discuss and debate many different opinions, and in the end share a healthy respect for each other because we jointly feel that we are all trying to act in the customers best interest.

So when I read an article about the ‘Folly of FibreChannel over Ethernet’ advocating a technology designed for deployment between a North Bridge and a I/O expansion module to be extended outside of the chassis, over a wire, with a repeater bridge adapter board, to an external device that then houses the network interface modules for all of the servers that may connect into it I am a little circumspect of the analysts intention.  Why you may ask?

- PCIe is a bus that is internal to the server, using physical addressing.  In an increasingly virtual world using a physically addressed bus as the network is a bit odd.
- PCIe was designed to be located inside the server chassis, it was not designed nor was the data link structured to allow for longer-haul transmission.
- These implementations are called ‘Multi-Root IOV’, there is not a standard for these. It is based on the premise the PCIe can be extended outside the server box to another box at the top of the rack that houses the NICs, and other expansion boards.
- PCI is an extremely chatty protocol, very sensitive to latency, and adding any noticeable propagation delay to the transmission will noticeably degrade the performance.  PCI-SIG understood this well when they standardized SR-IOV, Single Root = Single Server.  But these hopeful implementations are for multiple servers. 
- PCIe is not a switching standard, and therefore it does not define how two servers are going to communicate.
- The only way to create a communication path between the two servers is to translate PCIe into Ethernet or FibreChannel, then switch, then go back to PCIe, then go along the wire to the server again.  Sort of purpose defeating since history has shown that stateful and CPU dependent gateways do not garner market traction.
- Not sure how a MR-IOV scenario addresses the need for Fabric-A/Fabric-B models of high availability. 
- And lastly not sure at all how a MR-IOV solution would identify the Virtual Machine and signal the VM identity to the NIC to the switch for policy enforcement.

I do want to add though that if the technical, operational, protocol, and ASIC/SW implementation issues get worked out over the next several years that it usually takes to align these types of forces there may be an interesting play for the two to work together with FCoE coming out of the MR-IOV ToR device.  But until that day, several years in the future at best, especially given the economic times we are facing, I think I will put my bet strongly on the FCoE implementation, especially when coupled with the VN-Link technologies introduced jointly between Cisco and VMWare. And doubly so since EMC and Network Appliance have both announced support for this architecture and product shipping. 

A few proof points-
Network Appliance to Support FCoE and Nexus 5020

Chuck Hollis’s Blog from EMC discussing FCoE and why it is getting serious.

A wonderful Quote from the FCIA press release “The timely development of the FCoE standard by T11 Fibre Channel standards body allows a high degree of compatibility and interoperability for today’s FCoE products.” 

FCoE = Real, shipping, in production, solid vendor ecosystem support, final stages of standards track.

dg (putting on asbestos suit right now, something tells me this is going to be a lively post/comment thread)

Posted by Douglas Gourlay at 05:47PM PST

Tags:

16 Comments

Silvano Gai Oct 18, 2008

I think we are really comparing apples and oranges.

Ethernet is the winning technology in the Data Center.

In 2009 most of the servers will have 10GbE on the motherboard, in 2010 40GbE will start to appear on the market and 2011 will be the year of 100GbE coming to market.

FCoE is the way to carry FC over Ethernet. FCoE is a simple encapsulation of one FC frame over one Ethernet frame.

The FCoE ecosystem includes all the major network, server, virtualization and software vendors.

Just have a look at http://www.fcoe.com to have an idea of the tremendous ecosystem that is pushing FCoE.

The FCoE standard is already in letter ballot and new products are appearing on the market every day.

PCIe is a nice server bus, but it is not a network technology and therefore will never succeed competing with Ethernet.

My 0.2 cents

—Silvano

Peter Christy Oct 18, 2008

Doug is absolutely right in pointing out that market consolidation isn’t in the best interests of Gartner since they thrive on competition and confusion. Both Microsoft and Cisco are bad for Gartner’s business (for the same reason).

We (IRG) view Cisco’s converged data center networking more in a long term view.  Cisco history has been one of converging disparate networking technologies onto a common IP fabric.  In the short term it creates expertise and OpEx advantages for the customer (fewer technologies to understand and manage). In the longer term convergence has always created interesting and fundamentally new opportunities (e.g., the use of VoIP phones as part of an IT system).

Convergence is always expensive and slow, and as such plays into strengths of the market leader (why Gartner won’t like it). In the long term we think converged data center networks will enable much more than just FCoE. Maybe Gartner isn’t able to imagine the possibilities as well .....

Bill Kane Oct 19, 2008

This is a rant by someone who obviously feels threatened. Unfortunately it is also full of errors.

"These implementations are called ‘Multi-Root IOV’, there is not a standard for these."

See http://www.pcisig.com/specifications/iov/multi-root/ for a copy of the standard that does not exist.

Douglas Gourlay Oct 20, 2008

Bill,
good feedback!  I am not the expert by any means on the standards process and status so will withhold comment on that in deference to people who are closer to it than I am but one observation and comment-

We should agree on what we mean when we say standard.  There is some understandable room for interpretation and error and I did use the term in a rather implicit manner.  The page you mention requires a membership to download.  Thus I would argue that this is not an ‘open’ standard, but one that is shared with the members of the unincorporated PCI-SIG entity. I am not sure of the standards process within PCI-SIG, if there is a ratification and publishing of defined and agreed on open standards like IETF has for instance. (there may be, am claiming ignorance on this front- however, I think we can say that there is a difference between de facto and de jure standards as well as different trajectories for technologies even within open and semi-open standards bodies.)

Now being ‘full of errors’ I would respectfully disagree.

shreyas shah Oct 20, 2008

Doug G,

I completely agree with you. here is the article that shows how MRIOV and FCOE are complimentary.

Regards
Shreyas

shreyas shah Oct 20, 2008

Doug,

Were you looking for this article? It shows MRIOV and FCOE in data center convergence and consolidation.

Regards,
Shreyas

Silvano Gai Oct 20, 2008

To compare two standards you can do few simple tests.

The first one I call it the google test:
mr-iov returns about 8,550 entries
FCoE returns about 212,000 entries

The second one is called the Wikipedia test:
Wikipedia says that FCoE is supported by: “Absolute Analysis, BLADE Network Technologies, Broadcom, Brocade, Cisco, EMC, Emulex, Finisar, HP, IBM, Intel, Hitachi Data Systems, Mellanox, NetApp, Nuova, Qlogic and Sun Microsystems.:
It has no entry for mr-iov.

This are not definitive metrics but they give you a good idea of the situation.

The reality is that there is not an ecosystems about mr-iov comparable to the one of FCoE.

Cheers

—Silvano

shreyas shah Oct 21, 2008

mr iov (Space instead of - character) : google test will return 145,000 hits.

FYI: Any networking adaptor can be connected to MRIOV switches including Ethernet, FC, FCOE, IB, Myricom ....

Mark Oct 21, 2008

Didn’t PCI Express Advanced Switching die years ago?  You would really need PCIe Advanced Switching to do anything interesting with PCIe shared I/O.  Otherwise, it is just an bus extender.
A switched PCIe backplane might be mildly interesting in blade server chassis, especially ATCA chassis, which was the original driver for switched PCIe, but IEEE 802.3ap standardized backplane Ethernet, along with enhancements to Ethernet, should provide similar capabilities.

shreyas shah Oct 22, 2008

you would need support of MR protocols on networking adaptors. It is PCIe with header addition to detect which processor is using which part of the adaptor.

ASI was a grand scheme and very high expectations changing the software layers in hosts (People undermined the effort and huge resistance on change:-() and complexity of the implementation killed it.

As you said PCIe is host centric, MR makes it suitable for shared IO. only caveat is it does not support processor to processor communication and that will stay proprietary till PCISIG wakes up and standardizes it….

rohit Nov 6, 2008

Shreyas/Doug

I think there is a role MR/IOV should play in the high density server interconnect world and its overlap with 10GbE or FCoE will depend on additional intelligence that can be signalled or switched across a PCIe switch.

Shreyas - would like to talk to you more about this - pls email me @ if you have some time.

-rohit

Michael Williams Nov 11, 2008

FYI, when prompted for the username/password to download the MR/IOV document, I simply used “user” and “password” and it allowed me to download it.

A would agree with Doug that, although the statement about lacking a standard for MR/IOV, PCIe, whatever, doesn’t change the argument that it’s not designed to be, nor can it compete with a data switching technology like ethernet.  I don’t believe the folks pushing PCIe as opposed to FCoE are arguing that it is, but is seems the argument is where the demarcation is between server I/O and network fabric, specifically as it pertains to consolidated I/O.

IMHO, ethernet is the single fabric, now and into the (forseeable) future.  Just like VoIP turned voice/telephony into “just another app” that rode the Layer 3 IP infrastructure, Data Center Ethernet and the enhancements bundled in simply pave the way for things like Fibre Channel to be “just another app” riding the Layer 2 ethernet infrastructure.  The Gartner folks thinking the primary drive of FCoE is consolidated I/O at the server level are missing the big picture.

Mike

Virtual Keyboard Jan 14, 2009

Both standards have the right to live. IMHO

Mark Feb 21, 2009

Okay, I downloaded the above mentioned MR-IOV specification doc.  And after looking through it, here are my thoughts.

Considering FCoE (or iSCSI) consolidated I/O versus MR-IOV I/O adapter sharing, let’s look at two examples.  How is the end result different in these two cases?

1. A blade server chassis, where each server had an onboard CEE NIC, and the blade chassis has a CEE switch connected to an external CEE network.

2. A blade server chassis, where each server has no onboard NIC, but has an onboard PCI root port, and the blade chassis has a multi-root aware PCI switch along with one or more multi-root aware CEE NICs connected to an external CEE network.

Both provide virtualized, shared I/O to the individual blade servers.

The question is, which is better?

Layering SR-IOV operations at the individual blade server level on top of MR-IOV at the blade chassis level may create a far too complex, multi-layered virtualized PCI environment to manage.

Second, looking at the MR-IOV specification doc, I see topics like switches, congestion management, buffers, performance monitoring ... this is a network!  A managed network, no less.  One word I do not see in the document is “security”.

Third, while SR-IOV is a technology which replaces existing software based adapter sharing mechanisms (NPIV, VMFS, vSwitch, Virtual IPs, etc.), MR-IOV seems like it may be a solution in search of a problem, unless its purpose is to replace a switch with an adapter, or more specifically, replace a many adapter/one switch solution with a one switch/one adapter solution.

Considering the latter, is what is being replaced (I/O adapters, or more specifically, Ethernet NICs) expensive?  Are they underutilized, and therefore well suited for sharing?

Right now, some customers are scared of consolidated I/O because they feel they have too much I/O per server to put both storage and IP traffic onto a single 10Gb pipe.  How would they accept 16 blade servers sharing a few NICs and HBAs, much less converged FCoE adapters?

PCIe MR-IOV looks a lot like NGIO, Future I/O, and the early days of InfiniBand:  A switched I/O bus extension technology allowing sharing of I/O adapters.  InfiniBand failed at this use, primarily because it inserted more complexity into the environment than the solution gained in efficiency.  I do not see how MR-IOV will solve this.

Some may remember a desire to use IB as a blade backplane I/O sharing technology.  IBM’s BladeCenter H actually was designed to support this capability.  It may have made sense in the days of unvirtualized blade servers with two 1GHz cores and 2 GB RAM.  Today, highly virtualized blades with eight 3 GHz cores, and 48 GB RAM (and 16 cores and 96 GB RAM in 2010), need enough dedicated I/O that adapter sharing makes little sense, in light of adding a new, intermediate, single-purpose network.

Tim Burton May 19, 2009

“FCoE is the way to carry FC over Ethernet. FCoE is a simple encapsulation of one FC frame over one Ethernet frame.”

If you are comparing MR-IOV on its ability to augment FC and Ethernet then when compared with FCoE its up against a tough battle.  The key point with MR-IOV is that is can accommodate any protocol that has a card that will fit into a PCIe slot.  So sharing GPUs, h264 accelerator cards, IB HBAs, 10GBe NICs, 8Gb FC and another other future standard.

Also lets not forget some applications can’t currently be virtualized.  I work in the broadcast arena and frequently see racks of 2U servers with a single PCIe SDI video capture card and twin FC HBAs for redundancy (not bandwidth).  If that could be collapsed down into a blade center H with 14 blades, with the external PCIe chassis allowing connection for the capture cards (with repatching in case of server/card failure) and a few shared FC HBAs that would be a good saving on space and power. Not to measure the decrease in switching infrastructure required.

More and more media PCIe products are moving towards extending PCIe outside of the server/workstation case.  Big boards wobbling about with huge ‘donky knobbler’ looms attached or break out boxes (Avid & Blackmagic design for example).  It’s far neater to bring whole shebang out on a PCIe connection to a rack mounted video IO box.

i assume you guys have seen this? - http://www.nextio.com

1 Trackback

Storage Blogs - Storage Monkeys Blogs Nov 17, 2008

FCoE versus MR-IOV…huh? Fibre Channel over Ethernet (FCoE) versus Multi-Root I/O Virtualization (MR-IOV)? Huh? How is it that someone is comparing these two different technologies that were intended to address different problems? Or am I completely missing the boat here? ...

Post a comment

Name:
Email:
URL:

Comments:

Notify me of follow-up comments?

Submit the word you see below:


Post a trackback

Ping this URL to post a trackback:
http://blogs.cisco.com/trackback/6527/rfRzeytf/

More blog posts

Previous post:
WAN Optimization: Available Today from your Service Provider

Next post:
Cisco on Cisco Data Center TechMinute Podcast Series

Recent posts:
July 2009 Archive