The Drawbacks of Backbone Switching Hubs


Switch your Thinking

about

Switching to Switches!



"Vendor vs Customer in Mass Matrix Confusion"






Date: Jan 24, 1995

Written by: Walter Benton



Table of Contents

1. Approved Standards vs De Facto Standards
2. Media Hype! Is it Hype or Not?
3. Switching! What is it?
4. Switching! Good and/or Bad?
5. Your Choice
6. What Kinds of Problems Can Occur?
7. Buffering: How Big is Big Enough?
8. Design Factors
9. The Backbone Syndrome
10. The Front-end Syndrome
11. Solution or Not?
12. Peer-to-Peer vs Client/Server
13. Switching Hub Inter-Connectivity (High-speed)
14. Slow Bridge/Router Performance
15. To Summarize

A word about the author:
Walter Benton is an American with over 17 years of computer experience and 7 years of work with NetWare LANs. Seventeen years ago he specialized in satellite tracking and satellite telecommunications for the U.S. Navy, using 3 GHz microwave equipment in Yokosuka, Japan. He is now a Manager in the Network System Marketing Department of Memorex Telex Japan, where he has worked for the past 5 years.



8. Design Factors (toc)

  1. Place the servers on fast, reliable media such as FDDI. Since 100Base-T and 100VG-AnyLAN have yet to be proven as standards (although both seem likely prospects), FDDI is the only proven 100 Mbit/s standard available.

    The throughput of FDDI (90 Mbit/s) is over twice that of 100Base-T (40 Mbit/s), and it has been proven time and again to be safe enough for mission-critical use. FDDI offers a dual counter-rotating ring, and you can also concentrate your switching hubs into dual-homed FDDI concentrators to rid yourself of the old FDDI problem of the ring failing in two places and being cut into two separate rings.

    FDDI does not impose the maximum-length limits that 100Base-T currently faces. And that is before even mentioning that 100 Mbit/s Ethernet hubs with high-speed "backbone" connectivity, not just high-speed server attachment, are designed using repeater technology, which limits the number of hubs you can inter-connect to expand your network. Not very flexible for networks with large expansion requirements!

  2. If your hub does not offer high-speed connectivity, connect the servers to the hubs with multiple Ethernet segments to increase aggregate throughput to the servers.

    If you are installing a Client/Server network, the server segment(s) will always be the busiest. If you attach the server to multiple segments, the load can then be distributed amongst several segments so that no individual segment is overloaded. In NetWare networks, this requires a special NLM (NetWare Loadable Module) to allow the server to distribute the data evenly over several segments all using the same network number.

    In the United States, several manufacturers have developed switching NICs that fit inside a NetWare server with several ports built into the card. But even these cards are experiencing buffer overflows, and their vendors are now working with Novell on a flow-control method that tells the server to back off when the buffers become full. This back-pressure NLM is still under development, however, and has not yet been accepted by all vendors as a de facto standard, nor by the standards committees as an approved standard. And this addresses only the server side; what about the client side?

  3. When creating a network, you must balance your loads so as not to overflow the switch buffers. Take into consideration the types and locations of your users, and try to balance your loads into workgroups such that each group utilizes approximately the same bandwidth (a sizing sketch follows this list).

    • Word processing and spread-sheet users are usually considered light users. These users create only a minimal burst load on the network when either retrieving or saving files from/to the server's disk. If these users perform no other tasks, their traffic will be trivial and quite a number of these users can be placed on one port.

    • Database users usually require a bit more bandwidth because, unlike word processing or spread-sheet users, they don't pull the complete database into their local workstations; instead, each query and record update travels across the network. Depending on the application they are running, these users can be considered light, medium-light, medium or heavy users.

        Data-entry users who perform occasional data retrieval are usually considered light to medium-light because they input or retrieve only one or a few records at a time. Depending on the number of index files that must be updated for each record, however, the traffic can grow beyond light and push them into the medium-light category.

        Simple report generation usually requires sampling various data and reporting only on certain portions of it; this type of user would therefore be considered medium-light to medium.

        End-of-month reports and complex data reporting require sampling large amounts of data and repetitively subtotaling and totaling various groups of it to attain the desired results. This type of user is considered medium to heavy, depending on the application, but is usually not an everyday user unless a corporation has special requirements for such heavy data processing.

        Programmers and other users who perform a lot of program compiling or re-indexing are usually the heaviest users. They are usually few in number in most corporations, but in software houses and similar application-development environments their numbers can be quite large. These users deserve special consideration.

    • With the recent movement toward ever faster PCs, the number of graphics applications has increased drastically, with average user-created files ranging anywhere from 1 Mbyte to several tens of Mbytes. High-end graphics packages can consume several hundred Mbytes per file.

    • Up-and-coming multimedia applications that use video and sound require the most bandwidth. These users require not just bandwidth but dedicated, real-time bandwidth to achieve full-color, full-screen viewing at 30 frames per second. They will not be able to use 10 Mbit/s Ethernet effectively, even in a switched environment. Even 100 Mbit/s FDDI, let alone the lower effective bandwidth of 100Base-T or 100VG-AnyLAN, will be unable to fully support any large number of multimedia users. Small test environments might work with 100 Mbit/s media, but further testing with the yet-to-be-developed multimedia applications, along with more advanced compression techniques, will decide which of the above, if any, can be used effectively on a large scale.
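
    To make the balancing exercise concrete, here is a minimal sizing sketch in Python. The per-class load figures and the 60% burst-headroom factor are illustrative assumptions, not measurements; substitute the averages you observe on your own LAN.

      # Hypothetical average loads (Mbit/s) for the user classes above.
      # These figures are assumptions for illustration, not measured data.
      USER_LOAD = {
          "word_processing": 0.1,   # light
          "data_entry":      0.3,   # light to medium-light
          "reporting":       0.8,   # medium-light to medium
          "month_end":       2.0,   # medium to heavy
          "programmer":      4.0,   # heavy (compiles, re-indexing)
      }

      PORT_BANDWIDTH = 10.0   # Mbit/s per switched Ethernet port
      HEADROOM       = 0.6    # keep 40% free for bursts (assumed factor)

      def port_load(users):
          """Estimated average load (Mbit/s) of one workgroup."""
          return sum(USER_LOAD[u] for u in users)

      def fits_on_port(users):
          """True if the workgroup's average load leaves burst headroom."""
          return port_load(users) <= PORT_BANDWIDTH * HEADROOM

      group = ["word_processing"] * 12 + ["data_entry"] * 6 + ["reporting"] * 2
      print(port_load(group), fits_on_port(group))   # 4.6 True

    Under these assumptions, a dozen word-processing users, six data-entry users and two report generators share one port comfortably, while two programmers alone would already exceed the same budget.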

    9. The Backbone Syndrome (toc)

      It turns out that switches don't work as quickly as everybody expects them to, especially in backbones. Switches might work well in small front-end installations, but place them where traffic can become enormous (such as a backbone, or even a heavily populated front-end) and the LAN almost stops. When packets overflow the installed buffers, the end result is such a slow-down in performance that the older bridge and router technologies actually work faster. In the front-end, buffer overflows and re-transmissions are nowhere near as catastrophic as in the backbone.

      One fix being debated is "back-pressure", which re-uses Ethernet's standard collision-detection machinery when the buffer becomes full: a collision signal is sent just before the buffers fill completely, and when the transmitting nodes receive it, they all stop sending. If this signal is sent from a switch located on the backbone, it could affect hundreds of users, bringing the overflowed segments to a complete standstill. Even local peer-to-peer traffic that would otherwise never traverse the switch will be affected.
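
      The trade-off is easy to see in a toy model. The following Python sketch treats one congested switch port as a FIFO queue; the buffer depth, arrival rate and drain rate are invented for illustration and do not describe any real product.

        from collections import deque

        BUFFER_FRAMES   = 256    # assumed per-port buffer depth, in frames
        BACKPRESSURE_AT = 0.9    # assert the jam signal at 90% full

        def simulate(arrivals, drains, ticks, backpressure):
            """Count dropped frames (no flow control) and ticks spent
            jamming the whole segment (forced-collision back-pressure)."""
            q, drops, jam_ticks = deque(), 0, 0
            for _ in range(ticks):
                if backpressure and len(q) >= BUFFER_FRAMES * BACKPRESSURE_AT:
                    jam_ticks += 1          # every sender on the segment stops
                else:
                    for _ in range(arrivals):
                        if len(q) < BUFFER_FRAMES:
                            q.append(1)
                        else:
                            drops += 1      # overflow: frame silently lost
                for _ in range(min(drains, len(q))):
                    q.popleft()
            return drops, jam_ticks

        print(simulate(12, 8, 1000, backpressure=False))  # thousands of drops
        print(simulate(12, 8, 1000, backpressure=True))   # no drops, but the
                                                          # segment stalls often

      Without back-pressure, the overloaded port silently drops frames and forces re-transmissions; with it, nothing is lost, but every station on the jammed segment, including purely local peer-to-peer talkers, is repeatedly forbidden to transmit.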

      Until ATM becomes a finalized standard, the only other option is to increase the bandwidth on the backbone. If 10 Mbit/s is currently used, upgrading it to 100 Mbit/s will prove the most effective.

      But back to the story of 100 Mbit/s: if you choose 100Base-T, you will only see a fourfold increase, to 40 Mbit/s of effective utilization, and as test-bed results are beginning to show, the number of hubs and stations capable of successfully using that network will be limited. 100VG-AnyLAN will give you eight (8) times what your current Ethernet LAN offers, but it doesn't seem to be catching on very quickly.

      FDDI, on the other hand, is a bit more expensive, but its test-bed results were in long ago, and it has proven time and again to be a stable medium. It will give you nine (9) times the performance capacity of your current Ethernet LAN and has a proven track record as a successful choice in mission-critical situations. Even for non-critical situations it is a stable medium on which to plan any LAN, and it will still perform properly when that LAN turns mission-critical in the future. That cannot be said for 100Base-T as of today!

    10. The Front-end Syndrome (toc)

      If you think this kind of problem can only happen in the largest of backbone installations, you're completely wrong. It quite often happens in smaller front-end installations as well. In fact, it can happen almost anywhere on the LAN because, with the currently shipping technology, there is no way to signal the attached devices that the buffers are almost full.

      Again, you can decrease the number of users per segment until one user is attached per port. If this still doesn't solve your problem, then you have to use higher bandwidth in the front-end as well. Depending on the throughput needed in the front-end, 100Base-T and perhaps 100VG-AnyLAN will probably prove to be good future technologies, but again, the final results on which to base standards have yet to come in. Here too, FDDI-to-the-desktop is available, and it is available for all platforms, not just the limited platforms for which 100Base-T is shipping. (For example, if you have a powerful Macintosh server that you want to connect directly to 100 Mbit/s cabling, 100Base-T doesn't have any NICs for Macs yet, but FDDI does.)

    11. Solution or Not? (toc)

      Several companies in the U.S. are currently discussing ways to solve this buffer-overflow problem, but there is still quite a bit of debate as to whether large buffers alone can solve it. One group thinks that with large enough buffers the problem won't crop up, but "how large is large enough?" is very debatable, and ever-larger buffers are an expensive, temporary fix. Another group feels that the only way to completely prevent such problems from recurring is to put some form of flow control into effect. But even then, there is no consensus on which type of flow control to implement or how to implement it.

      Other companies are considering creating a new form of flow control that mimics ATM's ABR (Available Bit Rate), but this would require new NICs and/or new flow-control software in place of the currently installed architecture. It would also require the majority of switch vendors to join together in establishing such a procedure. That alone will take several months to decide upon, and then several more months to implement in the equipment. The new equipment must then go back to the test-beds to prove whether the flow control actually works as designed. The complete process will take at least 6 to 8 months, possibly longer. By the time some form of flow control is finally implemented in a way that interoperates across each vendor's equipment, ATM will be about ready for release in usable form!

      Flow control can also be implemented in the transport layer, because the transport layer is connection-oriented; Novell's protocols, TCP/IP and others could therefore implement it there. The industry could define a standard that enables network devices to send a message asking the stations to slow down, and the stations would obey. Such a standard would solve the flow problems for connection-oriented protocols. But what about connectionless protocols?
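
      As a minimal sketch of that transport-layer idea (illustrative only; this is not how NetWare's SPX or TCP actually implement their flow control), a connection-oriented sender could keep a send window and shrink it whenever the network asks it to slow down:

        class WindowedSender:
            """Toy connection-oriented sender: halves its send window on a
            'slow down' message and lets acknowledgements grow it back."""
            def __init__(self, max_window=16):
                self.max_window = max_window
                self.window = max_window    # packets allowed in flight

            def on_slow_down(self):
                # Congestion notice from a queuing device: back off sharply.
                self.window = max(1, self.window // 2)

            def on_ack(self):
                # Each acknowledgement lets the window creep back up.
                self.window = min(self.max_window, self.window + 1)

        sender = WindowedSender()
        sender.on_slow_down(); sender.on_slow_down()
        print(sender.window)   # 4 -- the sender now offers far less load

      A connectionless protocol has no window to shrink, which is exactly why the problem remains open there.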

      Ron Shani (Marketing Director of Fibronics Inc., Israel), an ATM Forum member, said:

      Quote:

        "Even if forced collision flow control is implemented, that kind of behavior violates the IEEE802.3 standards! and it will reduce the network performance, where you wanted to increase it! ..."

        "On the other hand the flow control should be implemented in the LLC level (for being protocol independent). If the 802.2 committee will address the issue, and standardize flow control messages over the LAN, than any network queuing device such as bridge, router or switch will be able to notify the congested network segment to slow down."

      End Quote

    12. Peer-to-Peer vs Client/Server (toc)

      In peer-to-peer switched environments, the problems mentioned above occur less often than in client/server environments. This is because in a peer-to-peer system, two stations communicate with each other while other pairs of stations communicate simultaneously. If each station were on its own port, you would not see the same problems as if you placed 30-50 or more users on one switched port. Regardless of the aggregate throughput of the switching hub, the timing with which data passes through the switch in one direction while data from the other direction queues up, together with the buffer size of the queued port itself (which varies from vendor to vendor), will determine how frequently packets are dropped.

      In client/server switched environments, all users must talk to one server. In such environments, backlogs on the server side, caused by the constant receipt of requests and the replies to those requests, strain the server port, especially if the server is sending at 100 Mbit/s while each client switched port handles only 10 Mbit/s. The server side will offload data faster than the 10 Mbit/s ports can handle it, causing pileups in the 10 Mbit/s switch buffers as well. Having to re-send that data only adds to the severity of the problem.
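
      Simple arithmetic shows how little time such a mismatch leaves. Assuming, purely for illustration, 512 Kbytes of buffering behind the 10 Mbit/s client port:

        # Time until the client-side buffer overflows while a 100 Mbit/s
        # server bursts at a 10 Mbit/s port. The buffer size is an assumption.
        BUFFER_BYTES = 512 * 1024
        inflow  = 100e6 / 8     # bytes/s arriving from the server
        outflow =  10e6 / 8     # bytes/s the client port can drain
        seconds = BUFFER_BYTES / (inflow - outflow)
        print(f"{seconds * 1000:.0f} ms")   # about 47 ms

      A sustained burst fills even half a megabyte of buffering in under a twentieth of a second; only the stop-and-go nature of normal traffic keeps such a port alive at all.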

      As you can see, peer-to-peer environments are many-to-many conversations among relatively few partners at a time, whereas client/server environments funnel everything into an all-to-one conversation.

      Several Unix machines talking to each other would constitute a peer-to-peer environment, whereas a NetWare environment would be classified as a client/server layout. As long as the amount of data is not that great, and the number of users attached to each port is not that large, the chances of dropping packets are not as great. But since Ethernet traffic comes in bursts during file transfers, the timing of bursts from both sides of the switch and the amount of buffering each port can use will again determine how often packets are dropped. As long as the number of dropped packets is small, the delay experienced won't be that great, but in congested situations, such as a large backbone, performance almost comes to a standstill.

      Understanding the above, we can conclude that the best place to implement switches is in the front-end (within workgroups), not in the backbone. Attaching one PC per port is recommended, but very expensive. The average and peak traffic volumes, together with the amount of buffering in the switching hub, define the maximum number of users allowed per port. Determining this maximum is not an easy task, as traffic fluctuates with each user's requirements.

      Should slow-downs be experienced, the only thing to do is to reduce the number of users per port, which means increasing the number of hubs, which brings us to the next problem.

    13. Switching Hub Inter-Connectivity (High-speed) (toc)

      The majority of hubs on the market have at least one high-speed port (high-speed being defined as 100 Mbit/s or faster). Some hubs don't offer high-speed connectivity; others have several high-speed ports. These high-speed ports are usually reserved for server connectivity in a client/server environment, but how do you interconnect three or more hubs together?

      If the hub has only one high-speed port, you will only be able to connect two hubs together into one high-speed backbone. With two hubs interconnected via the only high-speed port on each hub, how will you connect a high-speed server? You need more than one high-speed port per hub to inter-connect several high-speed hubs and servers.

      If you are considering 100Base-T as the high-speed connection, you can, and sooner or later will, experience problems similar to those mentioned above.

      Some hub vendors offer 100 Mbit/s FDDI/CDDI as the high-speed port. FDDI, even though it is shared, has a higher effective utilization rate than 100Base-T. Using FDDI, you can connect quite a number of hubs together into a single concentrated FDDI ring. If the cost of fiber cabling does not appeal to you, then "TP-PMD", just recently approved by the ANSI standards committees but not as well known as the unapproved CDDI, can be used over Category 5 UTP cabling and spans up to 100 meters without any problems.

      FDDI and TP-PMD are the only approved, reliable high-speed standards on the market today. Regardless of how well accepted 100Base-T seems, it is not yet a fully proven technology, although it holds promise of becoming a future standard. 100VG-AnyLAN hasn't taken off yet and, from the looks of it, might not take off. At this point it is hard to say who will win this battle (100Base-T or 100VG-AnyLAN). All we can do is wait and see what the majority of customers choose, just as with the Sony Beta vs. VHS Group battle in the video market several years ago. Technologically, Sony's "Beta" was the higher grade, but "VHS" still won! Who will be the winner in the 100 Mbit/s Ethernet market?

      To say the least, until one or the other drops from the market, it will be a wait-and-see battle. In the meantime, if you need to design and install a high-speed LAN (large or small), FDDI has been around for quite some time now, is stable enough for mission-critical applications, has a proven track record in backbones and is an approved standard. FDDI will be around for at least the next 2-3 years, until ATM replaces it in the backbone. At that time, depending on how far the cost of FDDI/TP-PMD drops, 100Base-T or possibly 100VG-AnyLAN might replace FDDI in the front-end. If the cost difference is minimal, the higher available bandwidth of FDDI, out-performing both 100 Mbit/s Ethernets, might just make it the front-end medium of choice as well. But then again, if ATM-to-the-desk has taken off and is priced similarly to FDDI, ATM might win out instead.

      But before ATM can really do what everyone is waiting for, applications must be developed that can handle the protocol tasks ATM requires (available bit rate, priority setting, virtual path/channel setting, etc.) before it can be used effectively. If you want to run non-ATM applications in an ATM-to-the-desk environment, an ATM NIC and an ATM LAN Emulation program will have to run in the background on your PC, adding a bit of overhead. This is still some time off, because even though the draft for LAN Emulation was accepted by the ATM Forum (Dec '94), that draft must be finalized to ensure compatible LAN Emulation across a distributed multi-vendor LAN/WAN environment. Finalization isn't expected until the June-July 1995 time-frame at the very earliest, and may well be delayed beyond that.

      Ironically, 100 Mbit/s FDDI (a shared medium), which has been put down by these switching-hub vendors, is the safest thing you can build a reliable LAN upon today. Regardless of the price and complexity of routers, and the slow performance you may think today's FDDI bridges/routers offer, they are the only real means of ensuring that your LAN's integrity is upheld.

      One last thing to consider about switches: throughput is usually measured in "pps" (packets per second) or "fps" (frames per second). This will tell you the real speed of the switch. Figures measured under both heavy and light loads will better help you assess the true throughput of the hub you are considering. Try to get independent third-party figures, as the throughput values vendors offer are often misleading.
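
      Frame size matters enormously in such figures, so insist on knowing which size was used. A quick calculation, using the standard Ethernet framing overhead of an 8-byte preamble plus a 12-byte inter-frame gap, shows the range for a single 10 Mbit/s port:

        # Wire-rate pps for 10 Mbit/s Ethernet at various frame sizes.
        PREAMBLE, GAP = 8, 12            # bytes of per-frame overhead
        for frame in (64, 512, 1518):    # minimum, mid-size, maximum frames
            bits = (frame + PREAMBLE + GAP) * 8
            print(frame, "byte frames:", int(10e6 // bits), "pps")
        # 64 byte frames: 14880 pps  <- worst case a port must sustain
        # 512 byte frames: 2349 pps
        # 1518 byte frames: 812 pps

      A vendor quoting one large pps number without stating the frame size has told you very little.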

    14. Slow Bridge/Router Performance (toc)

      Today's bridges and routers seem to have bogged the LAN down, but if you really think about it, most of the problems experienced with routers tapped into an FDDI backbone lie not on the FDDI side so much as in the concentrated, highly complex collapsed-backbone routers being offered today. If you design your LAN around simpler concentrator-attached FDDI bridging equipment, with the same number of ports distributed to the front-end as you would plan for a switch or other collapsed architecture, you will see better performance than the multi-segment, collapsed-backbone single-box systems selling today can deliver with their high-complexity, slow-performing collapsed routers.

      Don't take my word for it: measure the actual traffic being passed across the FDDI backbone yourself, and you will usually find that most FDDI rings run nowhere near their maximum capacity. Here too, the limits lie more in the concentrated routers, with all their sophistication built into one collapsed-backbone chassis. These units have to perform complex routing between all the segments connected on the front-end while also driving 100 Mbit/s high-speed FDDI transmissions on the back-end. You will usually find the performance problems in the front-end routing of these collapsed environments, or in the sophisticated routing scripts executed before data is allowed onto the FDDI backbone, more so than in the backbone itself. FDDI is a BIG PIPE!

      Try placing several smaller (less sophisticated) FDDI routers or bridges, strategically connected into one or more FDDI concentrators, and you will usually experience an increase in performance and a better FDDI utilization rate than with a single high-speed collapsed router.

      I believe in offering systems that are reliable, and I don't particularly care to recommend products when I am not sure whether they will work properly. But recently I've found that many network managers are clinging less and less to the standards that have kept networks running for quite some time now, and moving more and more into a world of unproven paradigms that I prefer not to offer my clients.

    15. To Summarize (toc)

      Switches have their limitations, and not all of them have been discovered yet. Will you allow your office LAN to be a guinea-pig beta site for not-yet-fully-tested equipment? Approved standards offer a real solution. De facto standards always promise you a better world, but you don't necessarily get everything you were looking for; sometimes you get more problems than you bargained for. There are no assurances with non-standardized equipment.

      Implementing switches requires switching design skills that very few people fully understand. Placing switches in the wrong places (i.e. backbones) will bottleneck your LAN even more than it is now. Keep switches in local workgroups (the front-end), with the number of users per port held to a usable amount, so that you don't overflow the buffers.

      Don't go for equipment just because it is boasted as being great or because it comes with data proving it better than another vendor's; once you start believing that, you will find that each vendor's product is always better than everybody else's, and everyone cannot be right. Each installation's needs and environment are different, so common sense must come into play in deciding what is best for your system. A system proven elsewhere is not necessarily best for yours unless the environments are exactly the same, not just similar. Numbers and statistics can be recalculated to look either good or bad, depending on the purpose for which they were created.

      Make sure that you have an upward migration path with the flexibility and scalability that you need. The currently shipping 100Mbit/s Ethernet products have point-to-point length limitations and a restricted number of hubs/users attachable in one network, not to mention only 40 Mbit/s of actually useable bandwidth. The scalability of this type of solution is limited at best.

      FDDI is the best approved technology you've got going for your LAN right now, and you can rest assured that its field test-beds were completed quite a while back. FDDI's maximum utilization is rated at 90%, or 90 Mbit/s; that is over twice the pipe size of 100Base-T's 40 Mbit/s of actual utilization, and 100Base-T is not yet completely field-tested! Even if 100Base-T is half the cost of FDDI, its performance is less than half (40 Mbit/s versus 90 Mbit/s), and FDDI supports most major platforms on the market today, a much wider spectrum than either 100Base-T or 100VG-AnyLAN offers.

      FDDI will also be switchable into 100 Mbit/s ATM in the future, as the ATM Forum and other standards committees have already decided. FDDI is therefore scalable into ATM. What about the other 100 Mbit/s Ethernet solutions? Will they be scalable to ATM?

      Keep high-speed servers on the fastest link possible. At present, 90 Mbit/s FDDI is the fastest, and it is also approved. And since no FDDI-attached station fully supports 90 Mbit/s of throughput, more than one server can usually share the same FDDI port. Today, the fastest PC architecture available supports a maximum of 42 Mbit/s, and even that only under certain conditions; a more realistic Pentium NetWare server gives you at most 20~30 Mbit/s.
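
      A little division, using the article's own estimates purely as an illustration, suggests how far one attachment can stretch:

        # Rough sizing with the figures above: how many realistic Pentium
        # NetWare servers can share one 90 Mbit/s FDDI attachment?
        FDDI_EFFECTIVE = 90     # Mbit/s usable on the ring
        SERVER_PEAK    = 30     # Mbit/s per server, worst observed case
        print(FDDI_EFFECTIVE // SERVER_PEAK, "servers per attachment")   # 3

      Treat three busy servers per attachment as an upper bound; leave room for growth and for their bursts coinciding.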

      Don't choke your LAN even further with not-yet-fully-tested equipment, go for tested and approved products and standards, especially in the backbone!

      In the past, it was simple, either you followed IBM with Token-Ring or DEC and the Unix world with Ethernet. When these standards were created, the manufacturers controlled the market. Today, a large number of smaller companies are entering into the market and each wants a piece for themselves, thus all the new methods. These methods were created to confuse the customer into believing each vendor's story. By dividing up the older simple world of Ethernet or Token-Ring, these smaller vendors stood a chance of gaining a bigger portion of the previously tightly woven market.

      The only problem is that vendors no longer control the markets the way they used to. Market control is decided by the end-users, as was the case in the video market with Sony's "Beta" vs. the "VHS Group": the end-users, not the vendors, chose the winner. This time around, too, the end-user controls the market, so I hope that all of you choose something you won't regret in the future. Your network systems depend on your choice of equipment. Unless you are prepared for the risks involved in beta-site (test-bed) systems, think twice before buying into the new, emerging technologies without first understanding them for what they fully are.

      Some of the technology released on the market today has quite a lot of future potential, but as of today that potential is not yet fully proven! Can you afford to risk your network's livelihood on unapproved equipment and standards? Can you safely take responsibility for choosing a technology that may turn out not to work properly? What kind of setbacks could you and your company suffer should a major flaw crop up, such as buffer overflows on the backbone choking your network to a grinding halt, or some other as-yet-undiscovered problem?

      Shared media is not as bad as it is made out to be. In fact, most simple-task users (word-processing users, small- to medium-sized spreadsheet users, medium-sized database users, etc.) don't require the full 10 Mbit/s that LPP offers, making it a relatively expensive solution. You can usually place at least 10 shared users per 10 Mbit/s port with an average 5~6 Mbit/s of throughput. Of course, depending on each user's average volume of data, that figure might be either overkill or underkill. When slow-downs do occur, monitor whether they are relatively constant and frequent, or just temporary, time-to-time bursts of data.
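
      When you do monitor, even a crude rule of thumb separates those two cases; the thresholds below are assumptions to be tuned against your own LAN:

        # Classify utilization samples: sustained congestion calls for a
        # re-design, while occasional bursts are usually tolerable.
        def classify(samples, busy_level=0.6, busy_share=0.5):
            busy = sum(1 for s in samples if s > busy_level) / len(samples)
            return ("sustained congestion" if busy > busy_share
                    else "occasional bursts")

        print(classify([0.2, 0.9, 0.3, 0.25, 0.8, 0.3]))  # occasional bursts

      Constant, frequent slow-downs mean the port or segment is under-sized; temporary bursts usually just mean Ethernet being Ethernet.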

      Placement of current switching technology is not just plug-and-play; you MUST take into account the averages and peaks of traffic relative to the buffers built into the proposed equipment. Manual flow monitoring from time to time is required: even if you measure once, users' needs change over time (a new application, a new user, a faster NIC, a faster PC, etc.), and that changes the averages and peaks you already measured. Equipment alone will not make a good LAN; good design is the best preventive medicine for a LAN. Creating a flexible structure that is easily expandable within your current framework, and that offers painless, scalable growth without a lot of extra cost, is the most important thing you could invest your money in today. The equipment that you need will reside within that resilient structural design!

      I hope this article has helped put the importance of standards into perspective, and has helped those of you who want to choose what is best for your system rather than simply accept whatever some vendor offers as its best solution. Especially in today's ever-changing market, with all the new protocols and technologies coming forth, somebody needs to step in and clarify the situation.

      Remember, YOU help control the markets today, so make sure that you make the right decision!!!
