Campus Network Design
"Case Study: Designing Campus Network With Management in Mind"
This article was published in LAN Technology in 1989.
Using a star topology to link subnetworks to a single
backbone-in-a-room has given Carnegie Mellon University a flexible
and highly manageable campus network.
by John Leong
John Leong is director of Networking and Communications at
Carnegie Mellon University in Pittsburgh, Penn.
Computing at Carnegie Mellon University underwent significant
changes during the late 1970s and early 1980s. The old centralized
computing environment has given way to a highly distributed one. The
main driving force behind the change is the rapidly declining cost of
computers.
Carnegie Mellon is a relatively small,private university with
approximately 5,500 students and 1,500 faculty and staff members.
Computing has played a significant role in all departments, from the
Computer Science Department and Robotics Institute, to the Drama
School and the Graduate School of Industrial Administration. In the
past, the computing needs of the university were supported by a small
number of very expensive machines. With the exception of the Computer
Science Department, and possibly the Electrical & Computer
Engineering Department, most departments simply could not afford to
purchase their own equipment. This is no longer the case. Today,
there are hundreds of moderately priced minicomputers and work-
stations, as well as thousands of relatively inexpensive IBM PCs and
compatibles and Apple Macintoshes located in buildings all over the
campus. Besides providing the engines for academic, research, and
administrative computing, the computers support a well established and
heavily used electronic mail and bulletin board system for the campus.
While there are significant advantages to distributed computing, it
does make resource and information sharing potentially more difficult.
Unless the evolution to distributed processing is properly managed, it
can quickly lead to fragmentation and chaos. The foundation of a
coherent distributed system is comprehensive data communications
support.
This article gives an overview of the evolution and development of
data communications at Carnegie Mellon University. In it I will
describe the cable plant, networking technology, protocols, LAN
interconnection, network management, and applications we use.
Hopefully, this discussion will help you design a campus network of your
own that is easy to manage.
The Cable Plan
The cabling system is one of the most neglected aspects of data
communications. Typically, cables are installed in an ad-hoc
fashion. But in the long run, a lack of overall planning can result in
a large, unmanageable network. Acknowledging the fundamental need to
use data communications as a utility, Carnegie Mellon committed
significant financial resources to the design and implementation of a
comprehensive cable plant for the campus.
Cable plant installation is an expensive and disruptive undertaking,
so it is desirable not to have to repeat the exercise during the life
of a building. Since networking technology changes every five years
or so, it is important to select a good, generic cabling system rather
than one that is tailored for a currently popular type of network. A
good cabling system should have the ability to support all of today's
important networking technologies. It should also have the potential
to meet the requirements for higher performance networking in the
future.
There are two aspects to our cabling system: inter-building cabling
and intra-building cabling. The intra-building cabling system
implementation is the more costly and complex of the two.
The Inter-Building Cabling System
Carnegie Mellon is located on a relatively compact campus with a
radius of approximately 1 kilometer. With two major exceptions, most
buildings can be interconnected without right-of- way problems.
Coaxial broadband cable and fiber-optic cable are the two main
contenders for the inter-building cabling system.
A coaxial broadband system offers multi-media capability, a large
band-width, and is proven in the field. However, upon close examination,
there are problems with these perceived advantages. Coaxial cable
can indeed be an excellent media for analog video, and it can support
voice and data as well. The typical cable television (CATV) system can
easily carry more than 100 video channels. Broadband systems can also
handle high-quality audio, even in stereo.
A broadband system has a large amount of bandwidth available through
the use of multiple channels. However poor adherence to channel
allocation standards for data usage has been a significant problem.
Equipment from different vendors often use the same or overlapping
frequencies, and frequency agile modems remain expensive.
There are, indeed, years of CATV broadband operational
experience. However, CATV is primarily a rooted-tree network with
a small number of centrally located signal sources. Almost all the nodes
(which are television sets) on the network are receive only stations. There
is not that much operational experience with large scale, two-way
broadband systems where every station has the potential to disrupt or
disable the entire network.
On the whole, my colleagues and I favor a star-shaped, fiber optic
network. From an operational point of view, the star topology is much
easier to maintain than bus or circular ring systems. With star
topology networks, most of the troubleshooting can be handled at one
location --the hub. This certainly beats having to dispatch technicians
to the field for every problem call. Another big advantage of
fiber-optic networks is that they are not affected by lightning. This
is especially important for an inter-building cabling system in
the Pittsburgh area.
Today a lot of development work is being done to increase the data
handling capability of fiber-optic cabling. Digitized voice is the
most commonly used high speed, fiber based service. Long distance
telephone companies use fiber cables to support inter-city services
at rates over 1 gigabit per second. In the data communications industry,
Proteon Inc. has been marketing an 80 megabit-per-second
(Mbit/sec) ring over fiber for almost two years while the 100-Mbit/sec
fiber distributed data interface (FDDI) ring is in an advanced stage of
development. For broadband data systems, the typical higher speed
services operate at 10 Mbits/sec over a 12-megahertz channel.
In the early 1980s, we interconnected most of the campus buildings
with 144 fiber-optic cables. At the recommendation of our local telephone
company, 50-micron cables were used. We soon discovered that the
telecommunications and data communication industries were on different
wavelengths. While the telecommunication industry had standardized on
50-micron cable, the data industry at the time backed the 100-micron cable.
When you connect a small 50- micron cable into a wide 100-micron tap,
only 25 percent of the light gets into the cable. This significant loss at the
connection is quite a problem. We are lucky that our campus is compact.
Hence,
even taking 75 percent loss at launch, most of the links still manage to
function satisfactorily.
From an operational point of view, the star topology is much easier to
maintain than bus or circular ring systems. Over the past few years,
we have increased our fiber plant by installing additional cable. This time
around
we used 62.5-micron fiber, which is the converging standard for both the
tele-communications and data communications industries. Today we have
more than 400 cables connecting all buildings on campus, including
student residences.
The Intra-Building Cabling System
Intra-building cabling system selection and implementation is one of
the least glamorous but potentially most expensive aspects of
networking. A poorly designed and installed cable plant can be a
maintenance nightmare.
We have two main selection criteria for our intra-building cabling
system. First, it must have a star topology. For larger buildings,
multiple interconnected stars is an acceptable configuration. Since
we have conscientiously uncoupled the cable plant selection from a
particular network technology, our second criteria is that the
cabling system must be able to support a variety of network types.
A bus type of network, such as Ethernet, has the advantages that it
is easy to install and has shorter aggregate cable lengths. However,
when the network becomes quite large, a bus network can be
extremely difficult to maintain. Many problems can only be determined
by the low-tech approach of trial and error, un-coupling stations
from the network until the error causing culprit is found.
In a bus network with hundreds of stations spread over several floors,
it can take considerable time to get to every station, even if you
know where they are all located and if none of the rooms housing the
stations are locked. In the case of a star-shaped network, most of
the problem isolation can be done at the hub of the star, typically in
a wiring closet. The effect on problem isolation time can be dramatic.
It can reduce the down time of a network from hours to minutes.
Like many universities, Carnegie Mellon has a heterogeneous
computing environment and similar networking requirements. Installing a
cabling system that will only support one specific networking technology is
simply not acceptable. Given that we had decided on a star topology
system in 1985, the alternatives available for a general purpose
intra-building cabling plant were the AT&T Premises Distribution
System (PDS) and the IBM Cabling System (CS). Both systems call for
an integrated set of voice and data cables. The main difference
between the two is the way in which data is handled. The AT&T PDS
scheme calls for the installation of an additional set of four
voice-grade, unshielded, twisted pairs for data. The IBM CS uses a
set of two higher quality, data-grade, shielded twisted pairs instead.
While it is possible to run relatively high-speed data over voice
pairs, the attenuation typically confines such operations to a very
limited distance. Furthermore, satisfying FCC radiation regulations
can be a problem. On the other hand, shielded twisted-pair
data cable has much better capability, particularly when dealing with
high-speed networks. Since labor accounts for more than half of the
cost of most cable plant installations, the impact of the higher
cost for the IBM cabling material on the overall budget is not that
significant. Moreover, the benefits of a better quality cable can be
greater in the long run. Labor cost is a big factor in an intra-building
wiring project. It is important to note that labor costs vary
considerably across the country, so the "cost per outlet" figure for
such projects should be treated with caution.
With the IBM cabling system, we can handle all the networking
technologies we need to support, including IBM's Token-Ring, Proteon's
ProNet, standard serial lines, IBM 3270 terminals, Apple Computer Inc.'s
AppleTalk, AT&T Starlan, and even full-motion color video. When we first
evaluated the IBM cabling system back in the mid-1980s, we
discovered a serious problem: It is unable to support Ethernet.
Ethernet requires four shielded twisted pairs for its transceiver
(drop) cable, while there are only two data pairs in the IBM cable.
This problem was resolved when SynOptics Communications Inc. was
founded by a group of people from Xerox. SynOptics supplies a network
called LattisNet which turns the IBM cabling system into a star-shaped
Ethernet. Since then, a number of vendors have introduced products
that even support Ethernet over telephone wiring.
We selected the IBM approach, and managed to complete the
rewiring in the early fall of 1987 after 18 months of intense construction.
Over 12,000 outlets have been installed, with almost every room in every
building outfitted with at least one outlet.
A quick overview of our IBM cabling design is as follows. Type 2
cables with both voice and data pairs are used to connect the wall
outlets back to the wiring closets. The voice pairs are punched down
into standard telephone style punch blocks, and the data pairs are
terminated in a patch panel. Typically, there are a number of
strategically located wiring closets in a building. One of those
closets is designated as the main wiring closet for the building, or
the building entrance facility. All the closets are connected to the
main wiring closet by Type 1 cables, which have two data pairs. The
ratio of Type 2 to Type 1 cables is 4:1. The phone pairs are trunked
from the wiring closets to the main closet using the standard cables
with 25, 50, or 100 pairs. The main wiring closet serves both as the
cross connect between the different closets and as the building
entrance facility where the inter-building fiber network terminates.
An important point to emphasize is that the IBM cabling system does
not mean IBM Token-Ring. The cabling system is really a collection
of wires that needs to be patched to the appropriate electronics
inside the wiring closet to form a network. The electronics can be for
Ethernet, IBM Token-Ring, AppleTalk, or other network technologies.
About the time we were completing the rewiring project, Digital
Equipment Corp. (DEC) introduced its own cabling system called
DECconnect. It is essentially the AT&T PDS scheme with the addition of
a 50-ohm (RG-58) coaxial cable for Ethernet and a 75-ohm (RG-59)
coax for video. In that scheme, low-to-moderate speed data is handled
by the voice-grade twisted pairs, while high-speed data service is provided
by the thin Ethernet cable. It is interesting to note that DEC, the main
purveyor
of Ethernet, has also acknowledged that a star-shaped Ethernet is a more
appropriate distribution scheme than the bus topology system.
Upon examination, we think the IBM cabling system is still preferable.
In the DEC cabling system, the high-speed, 50-ohm, RG-58 cable is
rather limiting, and is of questionable use for networks other than
Ethernet. As for the 75-ohm cable, it is questionable whether a broad-
band network will work well with a layout designed for a telephone and
data network. Typically, broadband network architecture has very
different configuration parameters and constraints.
Building LANs on the Cabling System
In general, local area networking technology falls into two classes:
vendor-specific and general-purpose. Vendor-specific networks are
designed for specific computers; one popular example is AppleTalk
for the Macintosh. We tolerate vendor-specific networks and integrate
them into our strategic plan if there is significant demand for the
product. AppleTalk fits this category. In the case of general-purpose
LANs, the network technology must be a public-domain standard.
Proprietary technology is not acceptable for the simple reason that
it would be extremely difficult, both technically and from a business point
of view, for any one vendor to produce controllers for all
the different types of computers purchased by Carnegie Mellon. In
the case of standard networking technology, the onus is on the
equipment vendor to produce the appropriate controller. The two
general-purpose LANs that are heavily used within the university are
Ethernet and IBM Token-Ring.
We have extensive experience with Ethernet, having used this
technology since the early 1980s. This technology is reasonably mature
and has been adopted by virtually all the scientific and engineering
computer vendors. In general, it is a very stable and efficient
network. However, due to the lack of built-in network management
features troubleshooting an Ethernet network can be tricky. As I
mentioned earlier, we have come to the conclusion that the standard
bus type of Ethernet is very difficult to maintain.
In addition, the evolution from Ethernet Version 1 to IEEE 802.3 has
been painful at times. We have many Version 1 transceivers in place,
while equipment coming on-stream conforms to the 802.3 specifications.
The most infuriating problem with incompatible Ethernet equipment is
that it functions in a "flaky" manner rather than failing outright.
We have reconfigured a few of the more important Ethernets into a
star topology using SynOptics' equipment over the IBM cabling system. We
are also supporting thin Ethernet operation over the IBM cabling
system using modified baluns and multi-port thin Ethernet
repeaters from Cabletron and DEC. The use of the star topology is our
strategic direction for Ethernet deployment. Currently, we have more
than 22 Ethernets on campus with more than 60 segments. There are
almost 1,000 workstations and minicomputers on those networks.
As mentioned earlier, we also support AppleTalk. LocalTalk, which is
Apple's cabling system for AppleTalk, uses a daisy-chain type of bus
configuration. To achieve the star configuration we prefer, we used
PhoneNet and the Star Controller from Farallon Computing Inc. Since
the Macintosh is an inexpensive machine, coming up with an
acceptable cost per connection was a big challenge for us. We did a
series of experiments and concluded that each Star Controller port can
be used to support up to three parallel PhoneNET runs over our IBM
cabling system, providing that they are properly terminated In
addition, each of the runs can support a Macintosh and an Apple LaserWriter,
or three Macintoshes. We have been deploying AppleTalk over
the IBM cabling system since April of 1988. By the end of the year, we
had more than 800 nodes attached to over 50 subnets. We anticipate
that the number of nodes will exceed 1,000 in the near future.
We have also been using the IBM 4-Mbit/sec Token-Ring since its
inception, and currently have 39 IBM Token- Rings on the campus supporting
almost 1,300 nodes. Approximately 60 percent of these nodes are IBM
PCs and compatibles; the rest are IBM RT workstations. In general,
the IBM Token-Ring is a well-engineered technology with extensive
network management capabilities that make maintenance relatively simple.
It is not as popular as Ethernet since it has not been around as
long. Furthermore, whereas Ethernet chip sets are available from
multiple vendors Texas Instruments Inc. is currently the only publicly
available source of the token ring chip set, which is also relatively costly.
One frequently asked question concerns the slower data rate of the
IBM 4-Mbit/sec Token-Ring versus the faster 10 Mbit/sec Ethernet. In our
experience the difference is not all that noticeable. If you take into
account the
time required by software for protocol and application handling, the raw data
transfer latency is insignificant. On the other hand, we are mildly interested
in the new 16-Mbit/sec Token-Ring offering, since the difference in price
between the 4-Mbit/sec and the price 16-Mbit/sec products is not that great.
Connecting LAN's With an Inverted Backbone
The multiple LANs are connected to the backbone using the fiber-optic
inter-building cabling system. Both Ethernet and Token-Ring can
operate well over fiber, and we have tested fiber-optic AppleTalk
interfaces with some success. We have an "inverted" backbone, which
means that instead of having the backbone network "reaching out" and
connecting together the various subnets, the subnets "reach in" and
connect to the backbone. This arrangement allows us to have a
physically small backbone located in one room.
The fact that the backbone is in one location, and that every network
appears in that location, makes operation management much easier. We
can, therefore, carry out most of the troubleshooting and
reconfiguration in our data communications room and reduce the occasions
we have to dispatch technicians to the field. We used
Ethernet as the backbone network primarily because we had Ethernet
operational longer than any other network. For the time being,
it is offering us marginally satisfactory service in terms of
bandwidth. However, sooner or later it will be saturated. The issue is
how to prolong the life of your investment, particularly in network
interface cards, and what to do when the crunch comes. An important
consideration in network configuration is the 80/20 rule; if
possible, you should try to configure the network so that 80 percent
of the traffic is confined within the subnets, and only about 20 percent
makes it onto the backbone. We monitor the backbone network load very
carefully. Our current average load is under 25 percent, with peaks
at more than 60 percent. When will the load become unacceptable? That
very much depends on the application profile. For some applications, an
average network load of 30 percent is not even noticeable, while for others
this load may be intolerable.
For our particular network application mix, we plan to take action
when the average traffic load approaches 30 percent. There are two
possible alternatives at that point: change to a higher speed backbone,
or segment the backbone into multiple interconnected networks.
The first approach looks relatively straightforward. However, you
should not consider the raw network bandwidth by itself. It has to
be examined in the context of a networking system, including
components such as subnets and routers. If the stations attached to
the high-speed network, particularly the routers, do not have the
power to handle the higher speed, especially in bursty situations,
the chances of packet loss will increase significantly. Once a packet
is lost, the typical software time-out for recovery will take seconds,
which can seriously damage the effective throughput of the
communicating stations.
Hence, simply increasing backbone bandwidth without looking at the
traffic profile is like trying to squeeze more capacity out of a
saturated highway by increasing the speed limit. If the entrance and
exit systems are not upgraded accordingly, the resulting congestion at those
points can actually make system performance worse
than it was before. Furthermore, upgrading to a different (and often
more expensive) networking technology can mean big bucks since a lot
of network interfaces, software, and even routers, have to be changed.
Because of the economic implications, we are somewhat less
enthusiastic about FDDI than other people. We are looking at ways to
preserve most of our investments while at the same time obtaining
gigabit-per-second level performance. Segmenting the backbone network
into multiple subnets is a reasonably short-term solution. But it
can be tricky and requires careful study of the network traffic pattern.
Choosing Protocols for Internetworking
At Carnegie Mellon, the most popular protocols are Transmission
Control Protocol/Internet Protocol (TCP/IP) and DECnet. XNS (Xerox
Network Systems) is used in a small number of workstations and some IBM
PCs and compatibles. AppleTalk is also popular due to the large number of
Macintoshes. Most communications are carried out within the same family of
machines; for example, most of the traffic from one VMS VAX machine to
another VMS VAX uses DECnet.
Occasionally, these machines need to communicate with different
vendors' computers. A typical application is electronic messaging
and terminal emulation. While transfer of object files between
dissimilar machines is of questionable use, transfer of data and
source files is another matter. In a distributed processing
environment, it is also highly desirable to allow machines from multiple
vendors to access a common set of network services. Thus, in
various scenarios, a machine has to communicate with other machines
that do not implement its "native" protocol.
There are three main approaches to overcoming the protocol
incompatibility problem. First, you can implement most of the
protocols on most of the machines. This requires a tremendous effort
and is not really a viable approach, unless the number of
protocols involved is very small. A second alternative is to use
translation machines. This is easier, since multiple protocols need
only be implemented in these machines, but it still requires a lot of
work. Furthermore, the translation machines will become bottlenecks.
The third and most effective approach is to choose a standard protocol
that each machine can implement over and above its native protocol.
This is similar to using English as the lingua franca. We opted for
this approach.
The standard protocol should not be chosen primarily
for its technical merit. While that is an important consideration,
even more important is how widely available it is. The only two
candidates that match our criteria at this point are IBM's Systems
Network Architecture (SNA) and the Defense Advanced Research
Projects Agency's (DARPA) TCP/IP protocols. Because of IBM's dominance
in the data processing market, virtually every computer vendor has
SNA interconnection capability. We decided not to adopt this
protocol, however, since we have no local user base and little SNA
expertise on campus.
DARPA's TCP/IP, on the other hand, is one of the main protocols used
on campus. It is embedded in Berkeley UNIX, which runs on a wide
variety of machines. Furthermore, it is required by the Department of
Defense for its contractors, which provides a strong incentive for
equipment manufacturers to support the protocol.
Finally, TCP/IP is also the protocol used by the popular Advanced
Research Projects Agency Network (ARPANET) as well as a host of
national and regional research networks such as the National Science
Foundation Network (NSFNET). We found implementation of this protocol
readily available on a wide variety of machines, from the small
Macintoshes and IBM PCs and compatibles to workstations,
minicomputers, mainframes, and even supercomputers. The International
Standards Organization (ISO) set of protocols is a potential
contender. However, the definition of these protocols is just being
completed. Currently, there are few implementations, although this
situation will likely change within the next five years.
For Carnegie Mellon and most academic and research institutions, it
is clear that TCP/IP is the appropriate choice for now. Note that
having a standard protocol does not mean that it is the only permitted
protocol on campus. Therefore, VMS VAXes will continue to communicate
among themselves using DECnet. However, when they wish to
communicate with a processor that does not support DECnet, TCP/IP
will be used.
Network Interconnectlon
With a standard internetwork protocol, we can carry out network
interconnection at level 3, or the network layer, of the Open
Systems Interconnection model. The use of routers for network
interconnection at this level permits us to be independent of the
physical network. A router connects two or more networks, which may
be of the same or different technologies.
Historically, we have had a variety of non-commercial routers in our
networks. All the routers are based on off-the-shelf hardware with
software originally developed by the Carnegie Mellon Computer Science
Department. The exception is the Kinetics Inc. Fastpath-4 gateway
for connecting LocalTalk to Ethernet. It is a commercial product
with software that originated at Stanford University. All in all, we
have the capability to interconnect a variety of networking
technologies. We have also connected some networks using link layer
bridges. Bridges are protocol independent. They are specific,
however, to a particular networking technology. We have used bridges
because we realized that: (1) using a standard instead of native
protocol reduces the capability of the communicating stations, and
(2) there are some campus departments that have a strong
communications requirement that can best be met by using non-standard
protocols. Generally, we prefer routers to bridges since they give
us more network management capabilities as well as a lower cost per
connection.
Currently, all the local area networks on campus are interconnected.
This provides a comprehensive and high-speed data communications
utility that extends to almost every building on the campus. Today,
this campus internetwork has more than 3,000 attached stations.
With the decreasing cost of workstations and personal computers, the
number of attached stations is expected to increase over the next few
years.
Network Management
Creating the campus wide high-speed network was a relatively
complex task. Managing its day-to-day operation as well as managing its
evolution are even more demanding. We created a model that guides our
development of network management tools. In our model, we have
divided network management into two related but distinct categories:
operation management and evolution management.
Operation management deals with "real-time" operational issues
and network policing. Real-time operational issues are problem
determination, location, isolation, and recovery, which are usually
handled by operators working under a fair amount of stress. Once a
problem is discovered, we must be able to isolate it quickly from the
rest of the network. (Half an hour is a good upper limit before users
start getting restless and hostile.)
Policing the network means watching for intentional and accidental
abuse of the network. It is a background monitoring function that
calls for continuous verification of the administrative view of the
network against the real world. Normally, nothing much happens. When
things go seriously wrong, however, the network can deteriorate in a
matter of seconds.
Evolution management deals with planning for the growth of and
changes to the network based on traffic load and traffic patterns. While this
activity has significant strategic importance, it does not have the
time pressures associated with operation management. It is
interesting to note that operation management is an around-the-clock
activity handled by moderately paid technicians, with support from
a more highly skilled engineering staff. Evolution management, on
the other hand, is typically carried out by experienced network
planners.
In order to provide reliable service 24 hours a day, seven days a
week, to take into account the spectrum of knowledge of any given
user. Hence generic tools for collecting statistics may be helpful for
long-term network planning or producing research papers, but they
are of little use to the operations staff.
We have been exploring the use of expert systems for network
management, particularly operation management. This type of system
should reduce the demand on the support engineers, as well as provide a
valuable training tool for new operators. Using the model as a
guideline, we have acquired or built a substantial repertoire of
network management tools. Judging by the number of strange problems
that mysteriously "disappear," we still have a way to go.
Furthermore, our good tools must be available to the operations staff.
Tools must be designed tools need to be better integrated. A key aspect of
this effort is the implementation of the Simple Network Management
Protocol (SNMP) on all network components, including workstations.
Using SNMP will allow all devices to report exceptional conditions,
and also provide network managers with some control over their
operation. Managing a data communications network is far more
complex than managing traditional telecommunications systems. In
the telecommunications world, the types of equipment attached to the
network are usually limited, and their functions are relatively
restricted and well-defined.
When managing data communications networks, you not only have
to be well acquainted with the ins and outs of the networking hardware, you
also need to understand communications protocols, and sometimes even
implementations and quirks of the software running in the workstations.
It is often very difficult to distinguish a real network problem from a
problem induced by bad software.
Conclusion
We have laid the foundation for a comprehensive, heterogeneous,
and high-speed data communications utility for the Carnegie Mellon
campus. Currently, we have more than 3,000 nodes on the network, and
this number is expected to increase in the coming years. The challenge
now is to manage the growth of the network, to exploit developing
technologies and, most importantly, to provide a quality of service
comparable at least to those found in the telecommunications world.
Reprinted with permission of LAN Technology. Copyright 1989 by M&T
Publishing, Inc. All rights reserved.
|