[Sis-ams] 'reconnect' timer?
Scott Burleigh
Scott.Burleigh at jpl.nasa.gov
Mon Aug 25 13:56:42 EDT 2008
Ray, Timothy J. (GSFC-583.0) wrote:
>
> Dear WG members,
>
>
>
> 4.2.7.4.2 specifies that a node shall (in response to imputing the
> death of its registrar) query the configuration server to determine
> the network location of the restarted registrar, and then send a
> 'reconnect' message to the new registrar (and the registrar shall
> respond with either a 'reconnected' or 'you-are-dead'). Good so far.
> But what if the node does not receive a response from the registrar?
> I don't think our spec covers this situation.
>
>
>
> In my opinion, AMS should require the node to start a timer when it
> sends the 'reconnect' message. AMS should also specify the action to
> be taken if the timer expires.
>
Tim, good point; the spec is not as clear on this as it should be. The
intent is that:
a) The immediate response of a node to the imputed death of a
registrar should be simply to note that the registrar is no longer
running, nothing more. (The same is true, really, of the effect on a
registrar of the imputed death of a configuration server.)
b) At the expiration of any heartbeat period, the node is supposed to
send a heartbeat to the registrar. If it knows that the registrar is
running (at a known network location), great. If not, the node is
supposed to try to determine the new location of the restarted
registrar, by interrogating the configuration server, and then -- if
that attempt was successful -- send a reconnect message to the restarted
registrar at its new location.
c) If there's no response (reconnected or you_are_dead) to the
reconnect message, no problem. At the expiration of the next heartbeat
period, the node will do step (b) again and this will result in (at
most) transmission of another reconnect.
That is, the reconnect cycle is "soft state"; in effect, the heartbeat
cycle functions as the timer on reconnect messages.
I like this approach, as it is pretty simple and at this point I think
we want to avoid introducing any more complexity (such as another timer)
into the spec if we can possibly help it. But what I have written here
is really hard to infer from the current text of 4.2.7, so I need to do
some serious reworking of this text. I'll try to do that this week,
unless we don't agree that this approach is okay. What do you think?
Scott
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ccsds.org/pipermail/sis-ams/attachments/20080825/67f8edfd/attachment.htm
More information about the Sis-ams
mailing list