[Pvfs2-developers] Re: RFC: Config file overhaul

Sam Lang Tue, 12 Sep 2006 16:18:14 -0700


On Sep 12, 2006, at 5:07 PM, Murali Vilayannur wrote:

Hey guys,
Sam and I discussed the option of overhauling the configuration file and
server setup options today and here is a summary of what we had come up
with.. (Sam, please correct me if I missed out anything or misunderstood
something)

Well-known problems with the current setup are
a) the need for 2 config files (a global one and a per-server config file)
and the implicit reliance on a savvy admin to keep these consistent and
synchronized.
b) Not having a clean way to be able to do a SIGHUP or equivalent to get
restart functionality on servers.

We were thinking of going about solving this in a 3 step process.
1) Eliminate the need for a per-server config file and pass the
server host id as a command line parameter. The fs.conf file will have a
new Defaults tag for the storage space in case it is the same on all
servers (which is usually the case) like the log file pathname.
Since the fs.conf aliases section maps aliases to host ids, servers can
use that to obtain the aliases and thence the metahandle, datahandle
ranges.
If each server has a different storage space path, we could have a new tag
<StorageSpace>
localhost1 /tmp/path1
localhost2 /tmp/path2
...
</StorageSpace>
and so on..
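
A minimal sketch of how a server could resolve its storage path under this
proposal, assuming the per-host block looks exactly like the example above
and that a single default path comes from the Defaults section. The function
names `parse_storage_space` and `storage_path_for` are hypothetical and this
is not the existing PVFS2 config parser.

```python
import re


def parse_storage_space(conf_text):
    """Return {host_key: path} from a <StorageSpace> ... </StorageSpace> block."""
    match = re.search(r"<StorageSpace>(.*?)</StorageSpace>", conf_text, re.DOTALL)
    paths = {}
    if match is None:
        return paths
    for line in match.group(1).splitlines():
        fields = line.split()
        if len(fields) == 2:  # skips blank lines and "..." placeholders
            host_key, path = fields
            paths[host_key] = path
    return paths


def storage_path_for(host_key, conf_text, default_path):
    """Per-host entry wins; otherwise fall back to the (assumed) default."""
    return parse_storage_space(conf_text).get(host_key, default_path)


EXAMPLE = """
<StorageSpace>
localhost1 /tmp/path1
localhost2 /tmp/path2
</StorageSpace>
"""
```

A server started with `localhost2` on its command line would call
`storage_path_for("localhost2", EXAMPLE, default_path)` and ignore the
default, while a host with no entry would get the default path.
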
NOTE: If people feel that passing an address to the servers is in some
sense similar to having the good old config file and does not really solve
the config problems, we could probably
just listen on the 0.0.0.0 address and any ephemeral port, and register
the ephemeral port with the portmapper. More on this issue a little
later.
2) Now, we eliminate the need for having a synchronized fs.conf file by
designating a single metadata server (similar to what Julian had written
earlier today) as the sole authority of the config files for a particular
file system and having it push the config files to all the servers.
When this metadata server starts up, it parses through all the hostid's
with the exception of itself and then connects to all the remaining
servers associated with that fsid and does a putconfig request (a new
request type).
Question is, how does this server talk to the remaining servers to push
the config requests? We could just simply use tcp for this communication
and have the putconfig act like an implicit SIGHUP which will cause all
servers to restart and listen on the appropriate interfaces subsequent to
the putconfig.
As regards the port numbers to do the putconfig, we could query the
portmapper or dedicate a range of port numbers on each host for running
the remaining servers initially.
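
A rough sketch of the root server's push step, assuming a hypothetical
`send_putconfig(host)` callable that returns True once that host has
acknowledged. The transport (tcp, portmapper lookup) is deliberately left
out, and the `max_rounds` cap is my own addition so the sketch terminates;
the proposal itself just says to keep retrying.

```python
def push_config(host_ids, self_id, send_putconfig, max_rounds=10):
    """Push the config to every host except ourselves; return hosts still pending."""
    pending = [h for h in host_ids if h != self_id]
    for _ in range(max_rounds):
        # Keep only the hosts that have not acknowledged this round.
        pending = [h for h in pending if not send_putconfig(h)]
        if not pending:
            break
    return pending
```

An empty return value means every remaining server acknowledged the
putconfig and the root server can start waiting for regular requests.
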

So what we have is as follows

<Root Server>                 <Other servers>
                            start by providing either the actual bmi url
                            hostid or any ephemeral port number as cmdline
                            parameter.

                            setup the sm's etc and wait for a putconfig()
                            All other requests will be Nack'ed

parse the fs.conf
files and issue a putconfig()
to all the remaining servers.
And until all the remaining
servers acknowledge, continue
doing this.

Start waiting for regular
reqs to show up

It is now enough for
SIGHUP  to be sent to
the RootServer. All other
servers will ignore
SIGHUP. The root server
will issue a putconfig()
to all servers.
                            as soon as we get a putconfig, act like it was
                            a SIGHUP, parse the config files and restart
                            implicitly.
                            Now start servicing new requests..

                            Any subsequent putconfig() requests will again
                            act like a SIGHUP (wait for current reqs to be
                            drained, new reqs NACked) and restart.
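
The right-hand column above can be read as a small state machine. Here is a
toy model of it, assuming string request names and an in-memory server
object. The class and its return values are hypothetical, not existing
PVFS2 code, and the draining of in-flight requests is only noted in a
comment.

```python
class NonRootServer:
    """Toy model of a non-root server's request handling in the proposal."""

    def __init__(self):
        self.config = None   # nothing to serve until the first putconfig()
        self.restarts = 0

    def handle(self, req_type, payload=None):
        if req_type == "putconfig":
            # Treated like a SIGHUP: a real server would first drain
            # in-flight requests and NACK new ones, then reparse and restart.
            self.config = payload
            self.restarts += 1
            return "ACK"
        if self.config is None:
            return "NACK"    # not configured yet: refuse everything else
        return "OK"          # configured: service the request
```
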

NOTE: each server maintains a global in-memory session identifier that is
propagated by the putconfig() from the root-server and this identifier
is part of every client servreq structure. If it matches with the server
id, the requests are processed, else the clients are forced to do a
getconfig().
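
To make the session identifier check concrete, a small sketch under the
assumption that the identifier is a plain integer carried in each request.
`ServReq` here is a stand-in dataclass, not the real servreq structure, and
the string results are placeholders for whatever the server would actually
return.

```python
from dataclasses import dataclass


@dataclass
class ServReq:
    session_id: int   # copied by the client from its last getconfig()
    op: str


def dispatch(server_session_id, req):
    """Process the request only if the client's session id is current."""
    if req.session_id != server_session_id:
        return "getconfig"   # stale client: force it to refetch the config
    return "processed"
```

After a putconfig() bumps the server's identifier, a client still holding
the old value gets "getconfig" back and has to refresh before retrying.
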

I still don't think we have addressed all issues here.. For instance,
should (non-root) server restarts implicitly cause putconfig's or
getconfig's? Should the session id be on db on the root servers (and non
root servers) or is it sufficient to be in memory?

I felt this topic requires some discussion and brain storming before
prototyping and hence this longish email.

To continue the discussion here that I had offline with Murali, what
about having servers send an initial hello message to the master
server, so that they don't have to figure out what port to listen
on? The master can send that server's HostID, wait for a ready
response, and then push the config file to that endpoint.
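
A sketch of the master's side of that exchange, assuming the messages are
simple tagged tuples and that the server's incoming messages arrive in
order. The message names "hello", "hostid", "ready" and "putconfig" and the
function `master_replies` are placeholders for illustration, not an existing
wire format.

```python
def master_replies(incoming, host_id, config_text):
    """Return the master's replies for one server's handshake, in order."""
    expected = [
        ("hello", ("hostid", host_id)),          # server announces itself
        ("ready", ("putconfig", config_text)),   # server is ready for config
    ]
    replies = []
    for got, (want, reply) in zip(incoming, expected):
        if got != want:
            break   # out-of-order message: stop the handshake here
        replies.append(reply)
    return replies
```

The point of the design is that the server initiates the connection, so the
master never needs to know which port the server is listening on.
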


-sam

Comments and suggestions welcome!
thanks,
Murali


_______________________________________________
Pvfs2-developers mailing list
Pvfs2-developers@beowulf-underground.org
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers
