Network Working Group Smoot Carl-Mitchell
Request for Comments: 1027 Texas Internet Consulting
John S. Quarterman
Texas Internet Consulting
October 1987
Using ARP to Implement Transparent Subnet Gateways
Status of this Memo
This RFCdescribes the use of the Ethernet Address Resolution
Protocol (ARP) by subnet gateways to permit hosts on the connected
subnets to communicate without being aware of the existence of
subnets, using the technique of "Proxy ARP" [6]. It is based on
RFC-950 [1], RFC-922 [2], and RFC-826 [3] and is a restricted subset
of the mechanism of RFC-925 [4]. Distribution of this memo is
unlimited.
Acknowledgment
The work described in this memo was performed while the authors were
employed by the Computer Sciences Department of the University of
Texas at Austin.
IntrodUCtion
The purpose of this memo is to describe in detail the implementation
of transparent subnet ARP gateways using the technique of Proxy ARP.
The intent is to document this widely used technique.
1. Motivation
The Ethernet at the University of Texas at Austin is a large
installation connecting over ten buildings. It currently has more
than one hundred hosts connected to it [5]. The size of the
Ethernet and the amount of traffic it handles prohibit tying it
together by use of repeaters. The use of subnets provided an
attractive alternative for separating the network into smaller
distinct units.
This is exactly the situation for which Internet subnets as
described in RFC-950 are intended. Unfortunately, many vendors had
not yet implemented subnets, and it was not practical to modify the
more than half a dozen different operating systems running on hosts
on the local networks.
Therefore a method for hiding the existence of subnets from hosts
was highly desirable. Since all the local area networks supported
ARP, an ARP-based method (commonly known as "Proxy ARP" or the "ARP
hack") was chosen. In this memo, whenever the term "subnet" occurs
the "RFC-950 subnet method" is assumed.
2. Design
2.1 Basic method
On a network that supports ARP, when host A (the source) broadcasts
an ARP request for the network address corresponding to the IP
address of host B (the target), host B will recognize the IP address
as its own and will send a point-to-point ARP reply. Host A keeps
the IP-to-network-address mapping found in the reply in a local
cache and uses it for later communication with host B.
If hosts A and B are on different physical networks, host B will not
receive the ARP broadcast request from host A and cannot respond to
it. However, if the physical network of host A is connected by a
gateway to the physical network of host B, the gateway will see the
ARP request from host A. Assuming that subnet numbers are made to
correspond to physical networks, the gateway can also tell that the
request is for a host that is on a different physical network from
the requesting host. The gateway can then respond for host B,
saying that the network address for host B is that of the gateway
itself. Host A will see this reply, cache it, and send future IP
packets for host B to the gateway. The gateway will forward such
packets to host B by the usual IP routing mechanisms. The gateway
is acting as an agent for host B, which is why this technique is
called "Proxy ARP"; we will refer to this as a transparent subnet
gateway or ARP subnet gateway.
When host B replies to traffic from host A, the same algorithm
happens in reverse: the gateway connected to the network of host B
answers the request for the network address of host A, and host B
then sends IP packets for host A to gateway. The physical networks
of host A and B need not be connected to the same gateway. All that
is necessary is that the networks be reachable from the gateway.
With this approach, all ARP subnet handling is done in the ARP
subnet gateways. No changes to the normal ARP protocol or routing
need to be made to the source and target hosts. From the host point
of view, there are no subnets, and their physical networks are
simply one big IP network. If a host has an implementation of
subnets, its network masks must be set to cover only the IP network
number, excluding the subnet bits, for the system to work properly.
2.2 Routing
As part of the implementation of subnets, it is eXPected that the
elements of routing tables will include network numbers including
both the IP network number and the subnet bits, as specified by the
subnet mask, where appropriate. When an ARP request is seen, the
ARP subnet gateway can determine whether it knows a route to the
target host by looking in the ordinary routing table. If attempts
to reach foreign IP networks are eliminated early (see Sanity Checks
below), only a request for an address on the local IP network will
reach this point. We will assume that the same network mask applies
to every subnet of the same IP network. The network mask of the
network interface on which the ARP request arrived can then be
applied to the target IP address to produce the network part to be
looked up in the routing table.
In 4.3BSD (and probably in other operating systems), a default route
is possible. This default route specifies an address to forward a
packet to when no other route is found. The default route must not
be used when checking for a route to the target host of an ARP
request. If the default route were used, the check would always
succeed. But the host specified by the default route is unlikely to
know about subnet routing (since it is usually an Internet gateway),
and thus packets sent to it will probably be lost. This special
case in the routing lookup method is the only implementation change
needed to the routing mechanism.
If the network interfaces on which the request was received and
through which the route to the target passes are the same, the
gateway must not reply. In this case, either the target host is on
the same physical network as the gateway (and thus the host should
reply for itself), or this gateway is not on the most direct path to
the desired network, i.e., there is another gateway on the same
physical network that is on a more direct path and the other gateway
should respond.
RFC-925 [4] describes a general mechanism for dynamic subnet routing
using Proxy ARP and routing caches in the gateways. Our technique
is restricted subset of RFC-925, in which we use static subnet
routes which are determined administratively. As a result, our
transparent subnet gateways require no new network routing table
entries nor ARP cache entries; the only tables which are affected
are the ARP caches in the host.
In our implementation, routing loops are prevented by proper
administration of the subnet routing tables in the gateways.
2.3 Multiple gateways
The simplest subnet organization to administer is a tree structure,
which cannot have loops. However, it may be desirable for
reliability or traffic accommodation to have more than one gateway
(or path) between two physical networks. ARP subnet gateways may be
used in such a situation: a requesting host will use the first ARP
response it receives, even if more than one gateway supplies one.
This may even provide a rudimentary load balancing service, since if
two gateways are otherwise similar, the one most lightly loaded is
the more likely to reply first.
More complex mechanisms could be built in the form of gateway-to-
gateway protocols, and will no douBT become necessary in networks
with large numbers of subnets and gateways, in the same way that
gateway-to-gateway protocols are generally necessary among IP
gateways.
2.4 Sanity checks
Care must be taken by the network and gateway administrators to keep
the network masks the same on all the subnet gateway machines. The
most common error is to set the network mask on a host without a
subnet implementation to include the subnet number. This causes the
host to fail to attempt to send packets to hosts not on its local
subnet. Adjusting its routing tables will not help, since it will
not know how to route to subnets.
If the IP networks of the source and target hosts of an ARP request
are different, an ARP subnet gateway implementation should not
reply. This is to prevent the ARP subnet gateway from being used to
reach foreign IP networks and thus possibly bypass security checks
provided by IP gateways.
An ARP subnet gateway implementation must not reply if the physical
networks of the source and target of an ARP request are the same.
In this case, either the target host is presumably either on the
same physical network as the source host and can answer for itself,
or the target host lies in the same direction from the gateway as
does the source host, and an ARP reply from the would cause a loop.
An ARP request for a broadcast address must elicit no reply,
regardless of the source address or physical networks involved. If
the gateway were to respond with an ARP reply in this situation, it
would be inviting the original source to send actual traffic to a
broadcast address. This could result in the "Chernobyl effect"
wherein every host on the network replies to such traffic, causing
network "meltdown".
2.5 Multiple logical subnets per physical network
The most straightforward way to assign subnet numbers is one to one
with physical networks. There are, however, circumstances in which
multiple logical subnets per physical network are quite useful. One
of the more common is when it is planned that a group of
workstations will be put on their own physical network but the
gateway to the new physical network needs to be tested first. (A
repeater might be used when the gateway was not usable). If a rule
of one subnet per physical network is enforced, the addresses of the
workstations must be changed every time the gateway is tested. If
they may be assigned addresses using a new subnet number while they
are still on the old physical network, no further address changes
are needed.
To permit multiple subnets per physical network, an ARP subnet
gateway must use the physical network interface, not the subnet
number to determine when to reply to an ARP request. That is, it
should send a proxy ARP reply only when the source network interface
differs from the target network interface. In addition, appropriate
routing table entries for these "phantom" subnets must be added to
the subnet gateway routing tables.
2.6 Broadcast addresses
There are two kinds of IP broadcast addresses: main IP directed
network broadcast and subnet broadcast. An IP network broadcast
address consists of the network number plus a well-known value in
the rest (local part) of the address. An IP subnet broadcast is
similar, except both the IP network number and the subnet number
bits are included. RFC-922 standardized the use of all ones in the
local part, but there were two conventions in use before that: all
ones and all zeros. For example, 4.2BSD used all zeros, and 4.3BSD
uses all ones. Thus there are four kinds of IP directed broadcast
addresses still currently in use on many networks.
With transparent subnetting a subnet gateway must not issue an IP
broadcast using the subnet broadcast address, e.g., 128.83.138.255.
Hosts on the physical network that receive the broadcast will not
understand such an address as a broadcast address, since they will
not have subnets enabled (or will not have subnet implementations).
In fact, 4.2BSD hosts (with or without subnet implementations) will
instead treat an address with all ones in the local part as a
specific host address and try to forward the packet. Since there is
no such target host, there will be no entry in the forwarding host's
ARP tables and it will generate an ARP request for the target host.
This presents the scenario (actually observed) of a 4.3BSD gateway
running the rwho program, which broadcasts a packet once a minute,
causing every 4.2BSD host on the local physical network to generate
an ARP request at the same time. The same problem occurs with any
subnet broadcast address, whether the local part is all zeros or all
ones.
Thus a subnet gateway in a network with hosts that do not understand
subnets must take care not to use subnet broadcast addresses:
instead it must use the IP network directed broadcast address
instead.
Finally, since many hosts running out-of-date software will still be
using (and expecting) old-style all-zeros IP network broadcast
addresses, the gateway must send its broadcast addresses out in that
form, e.g., 128.83.0.0. It might be safe to also send a duplicate
packet with all ones in the local part, e.g., 128.83.255.255. It is
not clear whether the local network broadcast address of all ones,
255.255.255.255, will cause ill effects, but it is very likely that
it will not be recognized by many hosts that are running older
software.
3. Implementation in 4.3BSD
Subnet gateways using ARP have been implemented by a number of
different people. The particular method described in this memo was
first implemented in 4.2BSD on top of retrofitted beta-test 4.3BSD
subnet code, and has since been reimplemented as an add-on to the
distributed 4.3BSD sources. The latter implementation is described
here.
Most of the new kernel code for the subnet ARP gatewaying function
is in the generic Ethernet interface module, netinet/if_ether.c. It
consists of eight lines in in_arpinput that perform a couple of
quick checks (to ensure that the facility is enabled on the source
interface and that the source and target addresses are on different
subnets), call a new routine, if_subarp, for further checks, and
then build the ARP response if all checks succeed. This code is
only reached when an ARP request is received, and does nothing if
the facility is not enabled on the source interface. Thus
performance of the gateway should be very little degraded by this
addition. (Performance of the requesting host should also be
similar to the latter case, as the only difference there is between
efficiency of the ARP cache and of the routing tables).
The routine if_subarp (about sixty lines) ensures that the source
and target addresses are on the same IP network and that the target
address is none of the four kinds of directed broadcast address. It
then attempts to find a path to the target either by finding a
network interface with the desired subnet or by looking in the
routing tables. Even if a network interface is found that leads to
the target, for a reply to be sent the ARP gateway must be enabled
on that interface and the target and source interfaces must be
different.
The file netinet/route.c has a static routing entry structure
definition added, and modifications of about eight lines are made to
the main routing table lookup routine, rtalloc, to recognize a
pointer to that structure (when passed by if_subarp) as a direction
to not use the default route in this routing check. The processor
priority level (critical section protection) around the inner
routing lookup check is changed to a higher value, as the routine
may now be called from network interface interrupts as well as from
the internal software interrupts that drive processing of IP and
other high level protocols. This raised processor priority could
conceivably slow the whole kernel somewhat if there are many routing
checks, but since the critical section is fast, the effect should be
small.
A key kernel modification is about fifteen lines added to the
routine ip_output in netinet/ip_output.c. It changes subnet
broadcast addresses in packets originating at the gateway to IP
network broadcast addresses so that hosts without subnet code (or
with their network masks set to ignore subnets) will recognize them
as broadcast addresses. This section of code is only used if the
ARP gateway is turned on for the outgoing interface, and only
affects subnet broadcast addresses.
A new routine, in_mainnetof, of about fifteen lines, is added to
netinet/in.c to return the IP network number (without subnet number)
from an IP address. It is called from if_subarp and ip_output.
Two kernel parameter files have one line added to each: net/if.h
has a definition of a bit in the network interface structure to
indicate whether subnet ARP gateways are enabled, and netinet/in.h
refers to in_mainnetof.
In addition to these approximately 110 lines of kernel source
additions, there is one user-level modification. The source to the
command ifconfig, which is used to set addresses and network masks
of network interfaces, has four lines added to allow it to turn the
subnet ARP gateway facility on or off, for each interface. This is
documented in eleven new lines in the manual entry for that command.
4. Availability
The 4.3BSD implementation is currently available by anonymous FTP
(login anonymous, passWord guest) from sally.utexas.edu as
pub/subarp, which is a 4.3BSD "diff -c" listing from the 4.3BSD
sources that were distributed in September 1986.
This implementation was not included in the 4.3BSD distribution
proper because U.C. Berkeley CSRG thought that that would reduce the
incentive for vendors to implement subnets per RFC-950. The authors
concur. Nonetheless, there are circumstances in which the use of
transparent subnet ARP gateways is indispensable.
References
1. Mogul, J., and J. Postel, "Internet Standard Subnetting
Procedure", RFC-950, Stanford University and USC/Information
Sciences Institute, August 1985.
2. Mogul, J., "Broadcasting Internet Datagrams in the Presence of
Subnets", RFC-922, Computer Science Department, Stanford
University, October 1984.
3. Plummer, D., "An Ethernet Address Resolution Protocol or
Converting Network Protocol Addresses to 48-bit Ethernet
Addresses for Transmission on Ethernet Hardware", RFC-826,
Symbolics, November 1982.
4. Postel, J., "Multi-LAN Address Resolution", RFC-925,
USC/Information Sciences Institute, October 1984.
5. Carl-Mitchell, S., and J. S. Quarterman, "Nameservers in a Campus
Domain", SIGCUE Outlook, Vol.19, No.1/2, pp.78-88, ACM SIG
Computer Uses in Education, P.O. Box 64145, Baltimore, MD 21264,
Spring/Summer 1986.
6. Braden, R., and J. Postel, "Requirements for Internet Gateways",
RFC-1009, USC/Information Sciences Institute, June 1987.