RFC708 - Elements of a Distributed Programming System

Network Working GroupJames E. White

Request for Comments: 708Augmentation Research Center

Elements of a Distributed Programming System

January 5, 1976

James E. White

Augmentation Research Center

Stanford Research Institute

Menlo Park, California 94025

(415) 326-6200 X2960

This paper suggests some extensions to the simple Procedure Call Protocol

described in a previous paper (27197). By eXPanding the procedure call

model and standardizing other common forms of inter-process interaction,

sUCh extensions would provide the applications programmer with an even

more powerful distributed programming system.

The work reported here was supported by the Advanced Research Projects

Agency of the Department of Defense, and by the Rome Air Development

Center of the Air Force.

This paper will be submitted to publication in the Journal of Computer

Languages.

Network Working GroupJames E. White

Request for Comments: 708Elements of a Distributed Programming System

INTRODUCTION

In a companion paper [i], the author proposes a simple protocol and

software framework that would facilitate the construction of distributed

systems within a resource-sharing computer network by enabling distant

processes to communicate with one another at the procedure call level.

Although of great utility even in its present form, this rudimentary

"distributed programming system (DPS)" supports only the most fundamental

ASPects of remote procedure calling. In particular, it permits the

caller to identify the remote procedure to be called, supply the

necessary arguments, determine the outcome of the procedure, and recover

its results. The present paper extends this simple procedure call model

and standardizes other common forms of process interaction to provide

a more powerful and comprehensive distributed programming system. The

particular extensions proposed in this paper serve hopefully to reveal the

DPS concept's potential, and are offered not as dogma but rather as

stimulus for further research.

The first section of this paper summarizes the basic distributed

programming system derived in [1]. The second section describes the

general strategy to be followed in extending it. The third and longest

section identifies and explores some of the aspects of process interaction

that are sufficiently common to warrant standardization, and suggests

methods for incorporating them in the DPS model.

REVIEWING THE BASIC SYSTEM

The distributed programming system derived in [1] assumes the existence

of and is built upon a network-wide "inter-process communication (IPC)"

facility. As depicted in Figure 1, DPS consists of a high-level model of

computer processes and a simple, application-independent "procedure

call protocol (PCP)" that implements the model by regulating the dialog

between two processes interconnected by means of an IPC communication

"channel." DPS is implemented by an installation-provided "run-time

environment (RTE)," which is link loaded with (or otherwise made

available to) each applications program.

-1-

Network Working Group James E. White

Requests for Comments: 708Elements of a Distributed Programming System

Reviewing the Basic System

The Model

The procedure call model (hereafter termed the Model) views a process as a

collection of remotely callable subroutines or "procedures." Each procedure

is invoked by name, can be supplied a list of arguments, and returns to its

caller both a boolean outcome, indicating whether it succeeded or failed,

and a list of results. The Model permits the process at either end of the

IPC channel to invoke procedures in its neighbor, and further permits a

process to accept two or more procedure calls for concurrent execution.

The arguments and results of procedures are modeled from a small set of

primitive "data types," listed below:

LIST: A list is an ordered sequence of N data objects called

"elements" (here and throughout these descriptions, N is

confined to the range [0, 2**15-1]). A LIST may contain

other LISTs as elements, and can therefore be employed to

construct arbitrarily complex, composite arguments or results.

CHARSTR: A character string is an ordered sequence of N ASCII

characters, and conveniently models a variety of textual

entities, from short user names to whole paragraphs of text.

BITSTR: A bit string is an ordered sequence of N bits and,

therefore, provides a means for representing arbitrary

binary data (for example, the contents of a Word of memory).

INTEGER: An integer is a fixed-point number in the range

[-2**31, 2**31-1], and conveniently models various kinds of

numerical data, including time intervals, distances, and so on.

INDEX: An index is an integer in the range [1, 2**15-1]. As

its name and value range suggest, an INDEX can be used to

address a particular bit of character within a string, or

element within a list. Furthermore, many of the protocol

extensions to be proposed in this paper will employ INDEXES as

handles for objects within the DPS environment (for example,

processes and channels).

BOOLEAN: A boolean represents a single bit of information

and has either the value true or false.

EMPTY: An empty is a valueless place holder within a LIST of

parameter list.

-2-

Network Working Group James E. White

Requests for Comments: 708Elements of a Distributed Programming System

Reviewing the Basic System

The Protocol

The procedure call protocol (hereafter terms the Protocol), which

implements the Model, defines a "transmission format" (like those suggested

in Appendix A) for each of the seven data types listed above, and

requires that parameters be encoded in that format whenever they are

transported between processes.

The Protocol also specified the inter-process messages by which remote

procedures are invoked. These messages can be described symbolically as

follows:

message-type=CALL[tid] procedure-name arguments

message-type=RETURN tid outcome results

The first message invokes the procedure whose NAME is specified using the

ARGUMENTS provided. The second is returned in eventual response to the

first and reports the OUTCOME and RESULTS of the completed procedure.

Whenever OUTCOME indicates that a procedure has failed, the procedure's

RESULTS are required to be an error number and diagnostic message, the

former to help the invoking program determine what to do next, the

latter for possible presentation to the user. The presence of an

optional "transaction identifier (TID)" in the CALL message constitutes

a request by the caller for an acknowledging RETURN message echoing the

identifier.

Although data types and their transmission formats serve primarily as

vehicles for representing the arguments and results of remote procedures,

they can just as readily and effectively be employed to represent the

messages by which those parameters are transmitted. The Protocol,

therefore, represents each of the two messages described above as a PCP

data object, namely, a LIST whose first element is an INDEX message

type. The following concise statement of the Protocol results:

LIST (CALL, tid, procedure, arguments)

INDEX=1 [INDEX] CHARSTR LIST

LIST (RETURN, tid, outcome, results)

INDEX=2 INDEX BOOLEAN LIST

Here and in subsequent protocol descriptions, elements enclosed in square

brackets are optional (that is, may be EMPTY). The RESULTS of an

unsuccessful procedure would be represented as follows:

LIST (error, diagnostic)

INDEX CHARSTR

-3-

Network Working Group James E. White

Requests for Comments: 708Elements of a Distributed Programming System

Reviewing the Basic System

The Run-Time Environment

The run-time environment (hereafter termed the environment) interfaces the

applications program to a remote process via an IPC channel. In doing so,

it provides the applications program with a collection of "primitives,"

implemented either as subroutines or system calls, that the applications

program can employ to manipulate the remote process to which the channel

connects it. The environment implements these primitives by sending

and receiving various protocol messages via the channel.

In its present rudimentary form, the Protocol enables the environment to

make a single, remote procedure calling primitive like the following

available to the applications program:

CALLPROCEDURE (procedure, arguments -> outcome, results)

CHARSTR LIST BOOLEAN LIST

This primitive invokes the indicated remote PROCEDURE using the ARGUMENTS

provided and returns its OUTCOME and RESULTS. While this primitive

blocks the invoking applications program until the remote procedure

returns, a variant that simply initiates the call and allows the

applications program to collect the outcome and results in a second

operation can also be provided.

Since the interface between the environment and the applications program

is machine- and possibly even language-dependent, environment-provided

primitives can only be described in this paper symbolically. Although

PCP data types provide a convenient vehicle for describing their

arguments and results are therefore used for that purpose above and

throughout the paper, such parameters will normally be transmitted

between the environment and the applications program in some internal

format.

BOOTSTRAPPING THE NEW PROTOCOL FUNCTIONS

Since the Protocol already provides a mechanism for invoking arbitrary

remote procedures, the Model extensions to be proposed in this paper

will be implemented whenever possible as procedures, rather than as

additional messages. Unlike applications procedures, these special

"system procedures" will be called and implemented by run-time environments,

rather than by the applications programs they serve. Although inAccessible

to the remote applications program via the normal environment-provided

remote procedure calling primitive, system procedures will enable the

environment to implement and offer new primitives to its applications

program.

-4-

Network Working Group James E. White

Requests for Comments: 708Elements of a Distributed Programming System

Bootstrapping the New Protocol Functions

The calling sequences of many of these new primitives will closely

correspond to those of the remote system procedures by which they are

implemented. Other primitives will be more complex and require for their

implementation calls to several system procedures, possibly in different

processes. Besides describing the Protocol additions required by various

Model extensions proposed, the author will, throughout this paper, suggest

calling sequences for the new primitives that become available to the

applications program.

-5-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

SOME POSSIBLE EXTENSIONS TO THE MODEL

1. Creating Remote Processes

Before a program in one machine can use resources in another, it must either

create a new process in the remote machine, or gain access to an existing

one. In either case, the local process must establish an IPC channel to a

resident dispatching process within the remote system, specify the program

to be started or contacted. and identify itself so that its access to the

program can be established and billing carried out. After these preliminary

steps have been accomplished, the requested process assumes responsibility

for the IPC channel and substantive communication begins.

The manner in which the environment carries out the above scenario is

largely dictated by the IPC facility upon which the distributed system is

based. If the IPC facility itself provides single primitive that

accomplishes the entire task, then the environment need only invoke that

primitive. If, on the other hand, it only provides a mechanism by which

the environment can establish a channel to the remote dispatcher, as is

the case within the ARPA computer Network (the ARPANET), then the Protocol

itself must contain provisions for naming the program to be run and

presenting the required credential.

Adding to the Protocol the following system procedure enables the local

environment to provide the remote dispatcher with the necessary information

in this latter case:

INIPROCESS (program, credential)

CHARSTR LIST (user, password, account)

CHARSTR CHARSTR CHARSTR

Its arguments include the name of the applications PROGRAM to be run; and

the USER name, PASSWORD, and ACCOUNT of the local user to whom its use is

to be billed.

This new procedure effectively adds to the Model the notion of "creation," and enables the environment to offer the following primitives

to its applications program:

CRTPROCESS (computer, program, credential -> ph)

CHARSTR CHARSTR (as above) INDEX

DELPROCESS (ph)

INDEX

-6-

Network Working GroupJames E. White

Request for Comments: 708 Elements of a Distributed Programming System

Some Possible Extensions to the Model

Creating Remote Processes

The first primitive creates a new process or establishes contact with an

existing one by first creating a channel to the dispatcher within the

indicated COMPUTER and then invoking the remote system procedure INIPROCESS

with the specified PROGRAM name and CREDENTIALS as arguments. The primitive

returns a "process handle PH" by which the applications program can refer to

the newly created process in subsequent dialog with the local environment

by the IPC facility, an index into a table within the environment, or anything

else the environment's implementor may find convenient.

The second primitive "deletes" the previously created process whose handle

PH is specified by simply deleting the IPC channel to the remote process and

reclaiming any internal table space that may have been allocated to the

process.

2. Introducing Processes to One Another

The simplest distributed systems begin with a single process that creates,

via the CRTPROCESS primitive described above, one or more "inferior"

processes whose resources it requires. Some or all of these inferiors may

in turn require other remote resources and so create interiors of their

own. This creative activity can proceed, in principle, to arbitrary depth.

The distributed system is thus a tree structure whose nodes are processes

and whose branches are IPC channels.

Although a distributed system can include an arbitrarily large number of

processes, each process is cognizant of only the process that created it

and those it itself creates, that is, its parent and sons. The radius

within which a process can access the resources of the tree is thus

artificially small. This limited sharing range, which prevents the

convenient implementation of many distributed systems, can be overcome

by extending the Model to permit an arbitrarily complex network of

communication paths to be superimposed upon the process tree.

One of the many ways by which the Protocol can provide for such communication

paths is to permit one process to "introduce" and thereby make known to one

another any two processes it itself knows (for example, two of its sons,

or its parent and son). Once introduced, the two processes would be able

to invoke one another's procedures with the same freedom the introducing

process enjoys. They could also introduce one another to other processes,

and so create even longer communication paths.

-7-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Introducing Processes to One Another

2.1 Introductions Within a Homogeneous Environment

Provided one remains within a "homogeneous environment" (that is, the domain

of a single IPC facility), the introduction of two processes requires little

more than the formation of an IPC channel between them. Adding to the

Protocol the following system procedures, which manipulate IPC "ports,"

enables the run-time environment of the process performing the introduction

to negotiate such a channel:

ALOPORT (-> ph, COMPUTER, PORT)

INDEX CHARSTR any

CNNPORT (ph, computer, port)

INDEX CHARSTR any

DCNPORT (ph)

INDEX

The detailed calling sequences for these procedures are dictated by the IPC

facility that underlies the distributed system. Those above are therefore

only representative of what may be required within any particular network,

but are only slightly less complicated than those required, for example,

within the ARPANET.

To create the channel, the introducing process' run-time environment

allocates a PORT in each target process via ALOPORT, and then instructs

each process via CNNPORT to connect its port to the other's via the IPC

facility. The process handle PH returned by ALOPORT serves as a handle

both initially for the allocated port, and then later for the process to

which the attached channel provides access. To "separate" the two processes,

the introducing process' environment need only invoke the DCNPORT procedure

in each process, thereby dissolving the channel, releasing the associated

ports, and deallocating the process handles.

Armed with these three new system procedures, the environment can provide

the following new primitives to its applications program:

ITDPROCESS (ph1, ph2 -> ph12, PH21, ih)

INDEX INDEX INDEX INDEX INDEX

SEPPROCESS (ih)

INDEX

-8-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Introducing Process to One Another

The first primitive introduces the two processes whose handles PH1 and PH2

are specified. Each handle may designate either a son, in which case the

handle is one returned by CRTPROCESS; the parent process, for which a

special handle (for example, 1) must always be defined; or a previously

introduced process, in which case the handle is one oBTained in a previous

invocation of ITDPROCESS.

ITDPROCESS returns handles PH12 and PH21 by which the two processes will

know one another, as well as an "introduction handle IH" that the applications

program can later employ to separate the two processes via SEPPROCESS. The

applications program initiating the introduction assumes responsibility for

communicating to each introduced applications program its handle for the

other.

2.2 Introductions Within a Heterogeneous Environment

While their interconnection via an IPC channel is sufficient to introduce

two processes to one another, in a heterogeneous environment the creation

of such a channel is impossible. Suppose, as depicted in Figure 2, that

processes P1 and P2 (in computers C1 and C2, respectively) are interconnected

within a distributed system by means of a network IPC facility. Assume

further that P2 attaches to the system another process P3 in a minicomputer

M that although attached to C2 is not formally a part of the network. With

this configuration, it is impossible for P2 to introduce processes P1 and P3

to one another by simply establishing an IPC channel between them, since

they are not within the domain of a single IPC facility.

One way of overcoming this problem is to extend the Model to embrace the

notion of a composite or "logical channel" composed of two or more physical

(that is, IPC) channels. A message transmitted by process P1 via the logical

channel to Pn (n=3 in the example above) would be relayed over successive

physical channels by the environments of intermediate processes P2 through

Pn-1. Although more expensive than physical channels, since each message

must traverse at least two physical channels and be handled by all the

environments along the way, logical channels would nevertheless enable

processes that could not otherwise do so to access one another's resources.

Since the relaying of messages is a responsibility of the environment, the

applications program need never be aware of it.

-9-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Introducing Processes to One Another

As depicted in Figure 3, a logical channel would consist of table entries

maintained by the environment of each process P1 through Pn, plus the

environment to forward messages that arrive with a "routing code" addressing

the local table entry. Each table entry would contain process handles for

the two adjacent processes, as well as the routing code recognized by each.

To communicate a message to its distant neighbor, the source process (say

P1) would transmit it via its IPC channel to P2, with a routing code

addressing the appropriate table entry within P2. Upon receipt of the

message, P2 would locate its table entry via the routing code, update the

message with the routing code recognized by P3, and forward the message

to P3. Eventually the message would reach its final destination, Pn.

Adding to the Protocol the following system procedures enables the

environment to construct a logical channel like that described above:

CRTROUTE (mycode, oldcode -> code, ph)

INDEX [INDEX] INDEX INDEX

DELROUTE (yourcode)

INDEX

The simplest logical channel (n=3) is created by P2, which invokes CRTROUTE

in both P1 and P3, specifying in each case the routing code MYCODE it has

assigned to its segment of the logical channel, and receiving in return

the routing CODES and process handles PHs assigned by the two processes.

OLDCODE is not required in this simple case and is therefore EMPTY.

More complicated logical channels (n>3) are required when one or both

of the processes to be introduced is already linked, by a logical channel,

to the process performing the introduction. In such cases, a portion of

the new channel to be constructed must replicate the existing channel, and

hence the routing code OLDCODE for the table entry that represents that

channel within the target process is specified as an additional argument

of the system procedure. The target process must call CRTROUTE recursively

in the adjacent process to replicate the rest of the model channel.

-10-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Introducing Processes to One Another

The process Pi that creates a logical channel assumes responsibility for

insuring that it is eventually dismantled. It deletes the logical channel

by invoking DELROUTE in Pi-1 and Pi+1, each of which propagates the call

toward its end of the channel.

3. Controlling Access to Local Resources

The process introduction primitive proposed above effectively permits

access to a process to be transmitted from one process to another. Any

process P2 that already possesses a handle to a process P1 can obtain a

handle for use by a third process P3. Once P1 and P3 have been introduced,

P3 can freely call procedures in P1 (and vice versa).

Although a process can, by aborting the ALOPORT system procedure, prevent

its introduction to another process and so restrict the set of processes

that gain access to it, finer access controls may sometimes be required.

A process may, for example, house two separate resources, one of which

is to be made available only to its parent (for example), and the other

to any process to which the parent introduces it. Before such a strategy

can be conveniently implemented, the Model must be extended to permit

access controls to be independently applied to individual resources within

a single process.

Although a single procedure can be considered a resource, it is more practical and convenient to conceive of larger, composite resources

consisting of a number of related procedures. A simple data base

management module containing procedures for creating, deleting, assigning

values to, reading, and searching for data objects exemplifies such

composite resources. Although each procedure is useless in isolating, the

whole family of procedures provides a meaningful service. Such "package"

of logically related procedures might thus be the most reasonable object

of the finer access controls to be defined.

Access controls can be applied to packages by requiring that a process

first "open" and obtain a handle for a remote package before it may call

any of the procedures it contains. When the process attempts to open

the package, its right to do so can be verified and the attempt aborted if

necessary. Challenging the open attempt would, of course, be less expensive

than challenging every procedure call. The opening of a package would also

provide a convenient time for package-dependent state information to be

initialized.

-11-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Controlling Access to Local Resources

Adding to the Protocol the following pair of system procedures enables the

environment to open and close packages within another process. For

efficiency, these procedures manipulate an arbitrary number of packages

in a single transaction.

OPNPACKAGE (packages -> pkhs)

LISTofCHARSTRs LISTofINDEXs

CLSPACKAGE (pkhs)

(as above)

The first procedure opens and returns "package handles PKHS" for the

specified PACKAGES; the second closes one or more packages and releases

the handles PKHS previously obtained for them.

Besides incorporating these two new system procedures, the Protocol must

further require that a package handle accompany the procedure name in every

CALL message (an EMPTY handle perhaps designating a system procedure). Note

that this requirement has the side effect of making the package the domain

within which procedure names must be unique.

The system procedures described above enable the environment to make

available to its applications program, primitives that have calling

sequences similar to those of the corresponding system procedures but

which accept the process handle of the target process as an additional

argument. Their implementation requires only that the environment

identify the remote process from its internal tables and invoke OPNPACKAGE

or CLSPACKAGE in that process.

4. Standardizing Access to Global Variables

Conventional systems often maintain global "variables" that can be accessed

by modules throughout the system. Such variables are typically manipulated

using primitives of the form:

(1) Return the current value of V.

(2) Replace the current contents of V with a new value.

These primitives are either provided as language constructs or implemented

by specialized procedures. The former approach encourages uniform

treatment of all variables within the system.

Those distributed systems that maintain remotely-accessible variables must

also select a strategy for implementing the required access primitives.

While such primitives can, of course, be implemented as specialized

-12-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Standardizing Access to Global Variables

applications procedures, adding to the Protocol the following new system

procedures insures a uniform run-time access mechanism:

RDVARIABLE (pkh, variable -> value)

INDEX CHARSTR any

WRVARIABLE (pkh, variable, value)

INDEX CHARSTR any

These procedures effectively define variables as named data objects modeled

from PCP data types, and suggest that they be clustered in packages with

related procedures. The system procedures return and specify, respectively,

the VALUE of the VARIABLE whose name and package handle PKH are specified.

These new procedures enable the environment to make available its applications

program, primitives that have calling sequences similar to those of the

corresponding system procedures but which accept the process handle of the

target process as an additional argument. These primitives provide a basis

upon which a suitably modified compiler can reestablish the compile-time

uniformity that characterizes the manipulation of variables in conventional

programming environments. Their implementation requires only that the local

environment identify the remote process from its internal tables and invoke

RDVARIABLE or WRVARIABLE in that process.

Most variables will restrict the range of data types and values that may be

assigned to them; some may even be read-only. But because they are modeled

using PCP data types, their values can, in principle, be arbitrarily complex

(for example, a LIST of LISTS) and the programmer may sometimes wish to

manipulate only a single element of the variable (or, if the element is

itself a LIST, just one of its elements; and so on, to arbitrary depth).

Adding the following argument to their calling sequences extends the system

procedures proposed above to optionally manipulate a single element of a

variable's composite value:

substructure

(LISTofINDEXs)

At successive levels of the value's tree structure, the INDEX of the desired

element is identified; the resulting list of indices identifies the

SUBSTRUCTURE whose value is to be returned or replaced.

-13-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Routing Parameters Between Procedures

5. Routing Parameters Between Procedures

In conventional programming systems, the results of procedures are used in a

variety of ways, depending upon the context of the calls made upon them. A

result may, for example:

(1) Provide the basis for a branch decision within the calling

program.

(2) Become an argument to a subsequent procedure call.

(3) Be ignored and thus effectively discarded.

At run-time, the knowledge of a result's intended use usually lies solely

within the calling program, which examines the results, passes it to a

second procedure, or ignores it as it chooses.

In a distributed system, the transportation of results from callee to caller,

carried out by means of one of more inter-process messages, can be an

expensive operation, especially when the results are large. Data movement

can be reduced in Cases 2 and 3 above by extending the Model to permit the

intended disposition of each procedure result to be made known in advance

to the callee's environment. In Case 2, provided both callees reside

within the same process, the result can be held at its source and later

locally supplied to the next procedure. In Case 3, the result can be

discarded at its source (perhaps not even computed), rather than sent and

discarded at its destination.

5.1 Specifying Parameters Indirectly

Variables offer potential for the eliminating the inefficiencies involved in

Case 2 above by providing a place within the callees' process where results

generated by one procedure can be held until required by another. The

Protocol can be extended to permit variables to be used in this way by

allowing the caller of any procedure to include optional "argument- and

result-list mask" like the following as additional parameters of the CALL

message:

parameter list mask

[LIST variable, ...)]

[CHARSTR]

A parameter list mask would permit each parameter to be transmitted either

directly, via the parameter list, or indirectly via a VARIABLE within the

callee's process. Thus each element of the mask specifies how the callee's

-14-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Routing Parameters Between Procedures

environment is to obtain or dispose of the corresponding parameter. To supply

the result of one procedure as an argument to another, the caller need only

then appropriately set corresponding elements of the result and argument

list masks in the first and second calls, respectively. The result list

mask should be ignored if the procedure fails, and the error number and

diagnostic message returned directly to the caller.

5.2 Providing Scratch Variables for Parameter Routing

Although each applications program could provide variables for use as described

above, a more economical approach is to extend the Model to permit special

"scratch variables," maintained by the environment without assistance from

its applications program, to be created and deleted as necessary at run-time.

Adding to the Protocol the following pair of system procedures enables the

local environment to create and delete such variables in a remote process:

CRTVARIABLE (variable, value)

CHARSTR any

DELVARIABLE (variable)

CHARSTR

These procedures create and delete the specified VARIABLE, respectively.

CRTVARIABLE also assigns an initial VALUE to the newly-created variable.

These new procedures enable the environment to make available to its

applications program, primitives that have calling sequences similar to

those of the corresponding system procedures but which accept the process

handle of the target process as an additional argument. Their implementation

required only that the environment identify the remote process from its

internal tables and invoke CRTVARIABLE or DELVARIABLE in that process.

5.3 Discarding Results

The inefficiencies that result in Case 3 above are conveniently eliminated

by allowing the caller to identify via the result list mask (for example,

via a zero-length CHARSTR) that a result will be ignored and therefore need

not be returned to the caller.

-15-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Supporting a Richer Spectrum of Control Transfers

6. Supporting a Richer Spectrum of Control Transfers

As currently defined by the Model, a procedure call is a simple two-stage

dialog in which the caller first describes the operation it wishes performed

and the callee, after performing the operation, reports its outcome.

Although this simple dialog form is sufficient to conveniently implement

a large class of distributed systems, more complex forms are sometimes

required. The Model can be extended to admit a variety of more powerful

dialog forms, of which the four described below are examples.

6.1 Transferring Control Between Caller and Callee

Many conventional programming systems permit caller and callee to exchange

control any number of times before the callee returns. Such "coroutine

linkages" provide a means, for example, by which the callee can obtain

help with a problem that it has encountered or deliver the results of one

suboperation and obtain the arguments for the next.

Adding to the Protocol the following system procedure, whose invocation

relinquishes control of another, previously initiated procedure, enables

the environment to effect a coroutine linkage between caller and callee:

TAKEPROCEDURE (tid, yourtid, parameters)

INDEX BOOLEAN LIST

Its arguments include the identifier TID of the affected transaction, an

indication YOURTID of from whose name space the identifier was assigned

(that is, whether the process relinquishing control is the caller or callee),

and PARAMETERS provided by the procedure surrendering control. By exploiting

an existing provision of the Protocol (that is, by declining acknowledgment

of its calls to TAKEPROCEDURE) the invoking environment can effect the

control transfer with a single inter-process message.

The addition of this new procedure to the Protocol enables the environment

to provide the following new primitive to its applications program:

LINKPROCEDURE (tid, arguments -> outcome, results)

INDEX LIST [BOOLEAN] LIST

-16-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Supporting a Richer Spectrum of Control Transfers

This primitive assumes that the CALLPROCEDURE primitive is also modified to

return the pertinent transaction identifier should the callee initiate a

coroutine linkage rather than return. Invocation of LINKPROCEDURE then

continues the dialog by supplying ARGUMENTS and returning control to the remote

procedure, and then awaiting the next transfer of control and the RESULTS that

accompany it. If the remote procedure then returns, rather than initiating

another coroutine linkage, the primitive reports its OUTCOME and invalidates

the transaction identifier.

While this primitive blocks the applications program until the remoter

procedure relinquishes control, a variant that simply initiates the coroutine

linkage and allows the applications program to collect the outcome and

results in a second operation can also be provided.

6.2 Signaling the Caller/Callee

A monolog is often more appropriate than the dialog initiated by a coroutine

linkage. The caller or callee might wish, for example, to report an event it

has detected or send large parameters piecemeal to minimize buffering

requirements. Since no return parameters are required in such cases, the

initiating procedure need only "signal" its partner, while retaining control

of the call.

Adding to the Protocol the following system procedure extends the Model to

support signals and enables the environment to transmit parameters to or

from another, previously initiated procedure without relinquishing control

of the call:

SGNLPROCEDURE (tid, yourtid, parameters)

INDEX BOOLEAN LIST

Like the TAKEPROCEDURE procedure already described, its arguments include

the identifier TID of the affected transaction, an indication YOURTID of

from whose name space the identifier was assigned, and the PARAMETERS

themselves.

This new procedure enables the environment to make available to its

applications program a primitive that has a calling sequence similar to that

of the system procedure but which does not require YOURTID as an argument.

Its implementation requires only that the environment identify the remote

process via its internal tables and invoke SGNLPROCEDURE in that process.

-17-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Some Possible Extensions to the Model

Supporting a Richer Spectrum of Control Transfers

By requesting the acknowledgment of each call to SGNLPROCEDURE and, if

necessary, delaying subsequent calls affecting the same transaction until

the acknowledgment arrives, the invoking environment effects a crude form of

flow control and so prevents the remote process' buffers from being overrun.

6.3 Soliciting Help from Superiors

As in conventional programming systems, remotely callable procedures within

a distributed system will sometimes call upon others to carry out portions

of their task. Each procedure along the "thread of control" resulting from

such nested calls is, in a sense, responsible to not only its immediate caller

but also to all those procedures that lie above it along the control thread.

To properly discharge its responsibilities, a procedure must sometimes

communicate with these "superiors."

Occasionally a procedure reaches a point in its execution beyond which it

cannot proceed without external assistance. It might, for example, require

additional resources or further direction from the human user upon whose

behalf it is executing. Before reaching this impasse, the procedure may

have invested considerable real and/or processing time that will be lost

if it aborts.

Adding to the Protocol the following system procedure minimizes such

inefficiencies by enabling the environment to solicit help from a callee's

superiors:

HELPPROCEDURE (tid, number, information -> solution)

INDEX INDEX any any

Its arguments include the identifier TID of the affected transaction (the

direction of the control transfer being implicit in this case), a NUMBER

identifying the problem encountered, and arbitrary supplementary

INFORMATION.

The primitive that this new procedure enables the environment to provide

its applications program has an identical calling sequence. Its implementation

requires only that the environment identify the remote process from its

internal tables and invoke HELPPROCEDURE in that process.

The search for help begins with invocation of HELPPROCEDURE in the caller's

environment. If the caller understands the problem (that is, recognizes

-18-

Network Working Group James E. White

Request for Comments: 708 Elements of a Distributed Programming Model

Supporting a Richer Spectrum of Control Transfers

its number) and is able to solve it, HELPPROCEDURE will simply return whatever

SOLUTION information the caller provides. Otherwise, HELPPROCEDURE must give

the next superior an opportunity to respond by calling itself recursively in

that process. The search terminates as soon as a superior responds positively

or when the end of the control thread is reached. In the latter case, each of

the nested HELPPROCEDURE procedures returns unsuccessfully to indicate to its

caller that the search failed.

6.4 Reporting an Event to Superiors

A procedure sometimes witnesses or causes an event of which its superiors

should be made aware (for example, the start or completion of some major

step in the procedure's execution). Adding to the Protocol the following

system procedure enables the environment to notify a callee's superiors of an

arbitrary event:

NOTEPROCEDURE (tid, number, information)

INDEX INDEX any

Like HELPPROCEDURE, its arguments include the identifier TID of the

transaction it affects, a NUMBER identifying the event being reports, and

arbitrary supplementary INFORMATION.

The primitive that this new procedure enables the environment to provide its

applications program has an identical calling sequence. Its implementation

requires only that the environment identify the remote process from its

internal tables and invoke NOTEPROCEDURE in that process.

By requesting acknowledgment of each call to NOTEPROCEDURE and, if necessary,

delaying subsequent calls that affect that transaction until the acknowledgment

arrives, the invoking environment effects a crude form of flow control and so

prevents the remote process' buffers from being overrun.

Notification of the procedure's superiors begins with invocation of

NOTEPROCEDURE in the caller's process and works its way recursively up the

thread of control until the top is reached.

-19-

Network Working GroupJames E. White

Request for Comments: 708 Elements of a Distributed Programming System

Some Possible Extensions to the Model

Aborting Executing Procedures

7. Aborting Executing Procedures

Conventional systems that accept commands from the user sometimes permit him

to cancel an executing command issued inadvertently or with erroneous

parameters, or one for whose completion he cannot wait. This ability is

particularly important when the command (for example, one that compiles a

source file) has a significant execution time. In a distributed system, the

execution of such a command may involve the invocation of one or more remote

procedures. Its cancellation, therefore, requires the abortion of any

outstanding remote procedure calls.

Adding to the Protocol the following system procedure provides the basis

for a command cancellation facility by enabling the environment to abort

another, previously invoked procedure:

ABRTPROCEDURE (tid)

INDEX

Its sole argument is the identified TID of the transaction it affects.

The primitive that this new procedure enables the environment to make

available to the applications program has an identical calling sequence.

Its implementation requires only that the local environment identify the

remote process from its internal tables and invoke ABRTPROCEDURE in that

process.

-20-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Conclusions

CONCLUSION

The EXPANDED Protocol and Model that result from the extensions proposed in

the present paper are summarized in Appendixes B and C, respectively.

Needless to say, many additional forms and aspects of process interaction,

of which Appendix D suggests a few, remain to be explored. Nevertheless,

the primitives already made available by the run-time environment provide

the applications programmer with a powerful and coherent set of tools for

constructing distributed systems.

-21-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Acknowledgments

ACKNOWLEDGMENTS

Many individuals within both SRI's Augmentation Research Center (ARC) and the

larger ARPANET community have contributed their time and ideas to the

development of the Protocol and Model described in this and its companion

paper. The contributions of the following individuals are expressly

acknowledged: Dick Watson, Jon Postel, Charles Irby, Ken Victor, Dave Maynard,

Larry Garlick of ARC; and Bob Thomas and Rick Schantz of Bolt, Beranek and

Newman, Inc.

ARC has been working toward a high-level framework for network-based

distributed systems for a number of years now [2]. The particular Protocol

and Model result from research begun by ARC in July of 1974. This research

included developing the Model; designing and documenting, and implementing

a prototype run-time environment for a particular machine [4, 5], specifically

a PDP-10 running the Tenex operating system developed by Bolt, Beranek and

Newman, Inc. [6]. Three design iterations were carried out during a 12-month

period and the resulting specification implemented for Tenex. The Tenex RTE

provides a superset of the capabilities proposed in this paper.

The work reported here was supported by the Advanced Research Project Agency

of the Department of Defense, and by the Rome Air Development Center of the

Air Force.

-22-

Network Working GroupJames E. White

Request for Comments: 708 Elements of a Distributed Programming System

Appendix A: Transmission Formats for PCP Data Objects

APPENDIX A

TRANSMISSION FORMATS FOR PCP DATA OBJECTS

Data objects must be encoded in a standard transmission format before they can

be sent from one process to another via the Protocol. An effective strategy

is to define several formats and select the most appropriate one at run-time,

adding to the Protocol a mechanism for format negotiation. Format negotiation

would be another responsibility of the environment and could thus be made

completely invisible to the applications program.

Suggested below are two transmission formats. The first is a 36-bit binary

format for use between 36-bit machines, the second an 8-bit binary, "universal"

format for use between dissimilar machines. Data objects are fully typed in

each format to enable the environment to automatically decode and internalize

incoming parameters should it be desired to provide this service to the

applications program.

PCPB36, For Use Between 36-Bit Machines

Bits 0-13 Unused (zero)

Bits 14-17 Data type

EMPTY =1 INTEGER=4 LIST=7

BOOLEAN=2 BITSTR =5

INDEX -3 CHARSTR=6

Bits 18-20 Unused (zero)

Bits 21-35 Value or length N

EMPTY unused (zero)

BOOLEAN 14 zero-bits + 1-bit value (TRUE=1/FALSE=0)

INDEX unsigned value

INTEGER unused (zero)

BITSTR unsigned bit count N

CHARSTR unsigned character count N

LIST unsigned element count N

Bits 36- Value

EMPTY unused (nonexistent)

BOOLEAN unused (nonexistent)

INDEX unused (nonexistent)

INTEGER two's complement full-word value

BITSTR bit string + zero padding to word boundary

CHARSTR ASCII string + zero padding to word boundary

LIST element data objects

-23-

Network Working James E. White

Request for Comments: 708Elements of a Distributed Programming System

Appendix A: Transmission Formats for PCP Data Objects

PCPB8, For Use Between Dissimilar Machines

Byte 0 Data type

EMPTY =1 INTEGER=4 LIST=7

BOOLEAN=2 BITSTR =5

INDEX =3 CHARSTR=6

Bytes 1- Value

EMPTY unused (nonexistent)

BOOLEAN 7 zero-bits + 1-bit value (TRUE=1/FALSE=0

INDEX 2 byte unsigned value

INTEGER 4-type two's complement value

BITSTR 2-byte unsigned bit count N + bit string

+ zero padding to byte boundary

CHARSTR 2-byte unsigned character count N + ASCII string

LIST 2-byte element count N + element data objects

-24-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Appendix B:The Expanded Procedure Call Protocol

APPENDIX B

THE EXPANDED PROCEDURE CALL PROTOCOL

The Protocol that results from the extensions proposed in this paper is

summarized below. The reader should note the concise syntactic description

made possible by the underlying notion of PCP data types.

Parameter list masks have been included not only as additional parameters

of the CALL message, as proposed in the paper, but as arguments of the

TAKEPROCEDURE and SGNLPROCEDURE system procedures as well. Throughout the

Protocol description, "MASK" is shorthand for:

[LIST (variable [CHARSTR], ...)]

Messages

LIST (route INDEX, opcode INDEX CALL=1, tid [INDEX],

pkh [INDEX], procedure CHARSTR, arguments LIST,

argumentlistmask MASK, resultlistmask MASK)

LIST (route INDEX, opcode INDEX RETURN=2, tid INDEX,

outcome BOOLEAN, results LIST)

If OUTCOME is FALSE

RESULTS is LIST (error INDEX, diagnostic CHARSTR)

Process-Related System Procedures

INIPROCESS (program CHARSTR,

credentials LIST (error CHARSTR, password CHARSTR,

account CHARSTR))

ALOPORT (-> ph INDEX, computer CHARSTR, port)

CNNPORT (ph INDEX, computer CHARSTR, port)

DCNPORT (ph INDEX)

CRTROUTE (mycode INDEX, oldcode [INDEX]

-> code INDEX, ph INDEX)

DELROUTE (yourcode INDEX)

Package-Related System Procedures

OPNPACKAGE (packages LISTofCHARSTRs -> pkhs LISTofINDEXs)

CLSPACKAGE (pkhs LISTofINDEXs)

-25-

Network Working Group James E. White

Request for Comments: 708Elements of a Distributed Programming System

Appendix B: The Expanded Procedure Call Protocol

Variable-Related System Procedures

CRTVARIABLE (variable CHARSTR, value)

DELVARIABLE (variable CHARSTR)

RDVARIABLE (pkh INDEX, variable CHARSTR,

substructure [LISTofINDEXs] -> value)

Procedure-Related System Procedures

TAKEPROCEDURE (tid INDEX, yourtid BOOLEAN, parameters LIST,

argumentlistmask MASK, resultlistmask MASK)

SGNLPROCEDURE (tid INDEX, yourtid BOOLEAN, parameters LIST,

parameterlistmask MASK)

HELPPROCEDURE (tid INDEX, number INDEX, information -> solution)

NOTEPROCEDURE (tid INDEX, number INDEX, information)

ABRTPROCEDURE (tid INDEX)

-26-

Network Working GroupJames E. White

Request for Comments: 708 Elements of a Distributed Programming

Appendix C: Summary of RTE Primitives

APPENDIX C

SUMMARY OF RTE PRIMITIVES

The DPS primitives made available to the applications program as a result of

the Model extensions proposed in this paper are summarized below.

Collectively, they provide the applications programmer with a powerful

and coherent set of tools for constructing distributed systems. Some of

the primitives (for example, CRTPROCESS and DELPROCESS) are necessary elements

for a "network operating system (NOS)," into which DPS may itself one day

evolve.

CRTPROCESS (computer, program, credentials -> PH)

DELPROCESS (ph)

ITDPROCESS (ph1, ph2 -> ph12, ph21, ih)

SEPPROCESS (ih)

Packages

OPNPACKAGE (ph, packages -> pkhs)

CLSPACKAGE (ph, pkhs)

Variables

CRTVARIABLE (ph, variable, value)

DELVARIABLE (ph, variable)

RDVARIABLE (ph, pkh, variable, substructure -> value)

WRTVARIABLE (ph, pkh, variable, substructure, value)

Procedures

CALLPROCEDURE (ph, pkh, procedure, arguments, argumentlistmask,

resultlistmask, -> outcome, results, tid)

LINKPROCEDURE (tid, arguments, argumentlistmask,

resultlistmask, -> outcome, results)

SGNLPROCEDURE (tid, parameters, parameterlistmask)

HELPPROCEDURE (tid, number, information -> solution)

NOTEPROCEDURE (tid, number, information)

ABRTPROCEDURE (tid)

-27-

Network Working Group Elements of a Distributed Programming System

Request for Comments: 708Appendix D: Additional Areas for Investigation

APPENDIX D

ADDITIONAL AREAS FOR INVESTIGATION

Although the expanded distributed programming system developed in this paper

and summarized in the previous appendix is already very powerful, many

additional aspects of process interaction remain, of course, to be explored.

Among the additional facilities that the Protocol must eventually enable the

environment to provide are mechanisms for:

(1) Queuing procedure calls for long periods of time (for

example, days).

(2) Broadcasting requests to groups of processes.

(3) Subcontracting work to other processes (without remaining

a middleman).

(4) Supporting brief or infrequent inter-process exchanges

with minimal startup overhead.

(5) Recovering from and restarting after system errors.

-28-

Network Working GroupElements of a Distributed Programming System

Request for Comments: 708 References

REFERENCES

1. White, J. E., "A High-Level Framework for Network-Based Resource Sharing,"

submitted for publication in the AFIPS Conference Proceedings of the 1976

National Computer Conference.

2. Watson, R. W., Some Thoughts on System Design to Facilitate Resource

Sharing, ARPA Network Working Group Request for Comments 592, Augmentation

Research Center, Stanford Research Institute, Menlo Park, California,

November 20, 1973 (SRI-ARC Catalog Item 20391).

3. White, J. E., DPS-10 Version 2.5 Implementor's Guide, Augmentation

Research Center, Stanford Research Institute, Menlo Park, California,

August 15, 1975 (SRI-ARC Catalog Item 26282).

4. White, J. E., DPS-10 Version 2.5 Programmer's Guide, Augmentation Research

Center, Stanford Research Institute, Menlo Park, California, August 13,

1975 (SRI-ARC Catalog Item 26271).

5. White, J. E., DPS-10 Version 2.5 Source Code, Augmentation Research

Center, Stanford Research Institute, Menlo Park, California, August 13,

1975 (SRI-ARC Catalog Item 26267).

6. Bobrow, D. G., Burchfiel, J. D., Murphy, D. L., Tomlinson, R. S., "TENEX,

a paged Time Sharing System for the PDP-10," Communications of the ACM,

Vol. 15, No. 3, pp. 135-143, March 1972.

-29-

Network Working GroupElements of a Distributed Programming System

Request for Comments: 708 Figure List

FIGURE LIST

Fig. 1Interfacing distant applications programs via their run-time

environments and an IPC channel.

Fig. 2Two processes that can only be introduced via a logical channel.

Fig. 3 A logical channel.