[ptp-dev] Some Changes I Need to Implement

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]

[ptp-dev] Some Changes I Need to Implement

From: Nathan DeBardeleben <ndebard@xxxxxxxx>
Date: Thu, 08 Jun 2006 08:24:31 -0600
Delivered-to: ptp-dev@xxxxxxxxxxx
List-archive: <http://eclipse.org/pipermail/ptp-dev>
List-help: <mailto:ptp-dev-request@eclipse.org?subject=help>
List-subscribe: <https://dev.eclipse.org/mailman/listinfo/ptp-dev>, <mailto:ptp-dev-request@eclipse.org?subject=subscribe>
List-unsubscribe: <https://dev.eclipse.org/mailman/listinfo/ptp-dev>, <mailto:ptp-dev-request@eclipse.org?subject=unsubscribe>
User-agent: Thunderbird 1.5.0.4 (Macintosh/20060530)

So I have a few things I need to change to the base code and with theresource manager changes going in I think we might need to work outexactly how to do this in the "new system".

Firstly - I need to change the model (method, not model object :)) thatinterfaces to the runtime subsystem so that instead of the Eclipse-sideasking for information about the runtime-system (such as node status,process status, job status) these are instead triggered by the runtimesubsystem when appropriate. An example would probably help.

Currently, when we start up we send a Startup() message to the runtimesystem. It does so. Then we ask the runtime system how many nodes itknows about. It returns a number. Then, for each node we ask it, insequence, what it knows about those nodes (up/down, node name, generalattributes, etc). Similarly, when a job starts, we get a jobID back andthen we go back to the runtime system and ask it for information fromthat jobID - including, in sequence, attributes related to each process.

OK, that's the old way. The new way I want to do it better keeps up ourevent-driven methodology. In particular, the runtime subsystem willtell us information about the nodes, jobs, processes it knows about witha general 'discovery' message. So here's an example of how the newmodel will work:

We send a Startup() message. Then we just sit there. At some point,the runtime subsystem sends back information abuot the system. "Hey, Iknow of these machines, with these nodes, and these are the attributesof these nodes". I'll have to change some of the UI code so that it ismore flexible to having partial information (for instance, when we knowthere are 256 machines, I will create those components and they can bedisplayed - but we might not yet know anything about the status of thesemachines, that will be coming in asynchronously). Similarly, when westart a job up we'll get back a jobID, and then we'll get back someinformation about the processes related to that job as well - but,again, asynchronously.

Greg and I talked about this and we like this a lot better. It'llsimplify the model considerably and make implementing the runtimecomponent for some runtime systems that are less advanced than OMPIeasier. Not only will I remove the use of these functions(getNodeAttributes, getProcessAttributes, etc) but I'll straight upremove the functions all together.

Any thoughts or concerns, Randy in particular, about how this interactswith the RM? Should it at all?

I'll put my other thoughts in a separate email so they can be handledseparately.


--
-- Nathan
Correspondence
---------------------------------------------------------------------
Nathan DeBardeleben, Ph.D.
Los Alamos National Laboratory
Parallel Tools Team
High Performance Computing Environments
phone: 505-667-3428
email: ndebard@xxxxxxxx
---------------------------------------------------------------------

Follow-Ups:
- Re: [ptp-dev] Some Changes I Need to Implement
  - From: Randy M. Roberts

Prev by Date: [ptp-dev] easyeclipse
Next by Date: [ptp-dev] Attributes
Previous by thread: [ptp-dev] easyeclipse
Next by thread: Re: [ptp-dev] Some Changes I Need to Implement
Index(es):
- Date
- Thread

Breadcrumbs