Re: [ptp-dev] Questions about PTP SDM debugger

From: Greg Watson <g.watson@xxxxxxxxxxxx>
Date: Mon, 25 Aug 2008 16:18:03 -0400


On Aug 25, 2008, at 11:00 AM, Dave Wootton wrote:

Greg
Some additional questions
1) It looks like I don't pass the name of the application executable as a parameter on the top level SDM instance since the top level instance isn't
directly invoking the SDM instances required for individual tasks.

No, this isn't necessary. The debugger protocol supplies the executable name and the application arguments.

2) What are the invocation parameters of the individual SDM? I'm sort of guessing I need the hostname and port of the top SDM, the pathname of the application and any parameters the application requires. I'm guessing then
the individual SDM starts, starts a debugger instance and the debugger
instance starts the application instance.

The master sdm should be invoked as 'sdm --host=address --port=port --debugger=gdb-mi --numprocs=n', where address is the address of the machine running Eclipse and port is a port number assigned by PTP. The servers will be started with something like 'mpirun sdm -debugger=gdb-mi --numprocs=n'.
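
For illustration only, the two command lines above can be assembled as follows. This is a minimal Python sketch, not part of PTP: the function names, the example address 192.0.2.10 and the port 12345 are placeholders, and the double-dash spelling of --debugger for the servers is an assumption (the message writes it with a single dash).

```python
import shlex


def master_sdm_command(host, port, numprocs, debugger="gdb-mi"):
    """Build the argument list for the master sdm, using the flags quoted above."""
    return [
        "sdm",
        f"--host={host}",          # address of the machine running Eclipse
        f"--port={port}",          # port number assigned by PTP
        f"--debugger={debugger}",
        f"--numprocs={numprocs}",
    ]


def server_sdm_command(numprocs, debugger="gdb-mi"):
    """Build the argument list for the sdm servers started through mpirun.

    Assumption: the servers take --debugger with two dashes, like the master.
    """
    return ["mpirun", "sdm", f"--debugger={debugger}", f"--numprocs={numprocs}"]


if __name__ == "__main__":
    # Placeholder values; the real host and port come from PTP at launch time.
    print(shlex.join(master_sdm_command("192.0.2.10", 12345, 4)))
    print(shlex.join(server_sdm_command(4)))
```

Neither command names the application executable, since the debugger protocol supplies it.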

3) Is the routing file on a node a list of all tasks in the application or
only the tasks running on that node?


A list of all tasks.


4) How does the routing file get loaded onto each individual node?

At the moment it is assumed there is a shared filesystem. This requirement will be removed in a later version, and the sdms themselves will be used to propagate the routing file.

5) How does each individual SDM know how to connect back to the top SDM if
the top SDM host/port is not a parameter?

Connections propagate up the tree (starting from the master). Each sdm knows the index of its children (computed as a binomial tree), so it just attempts to connect to its children using the address/port obtained from the routing file.
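
To make the child computation concrete, here is a minimal Python sketch of one common binomial-tree numbering, with the master at index 0. It is an assumption that the sdm uses this exact convention; the function names are hypothetical and the actual sdm code may number nodes differently.

```python
def binomial_children(index, size):
    """Return the child indices of `index` in a binomial tree of `size` nodes.

    Convention assumed here: node 0 is the root, and the children of node i
    are i + 2^k for every power of two greater than i, while the result
    stays below `size`.
    """
    children = []
    bit = 1
    # Move past the highest set bit of `index`.
    while bit <= index:
        bit <<= 1
    # Each larger power of two yields one child, until we run out of nodes.
    while index + bit < size:
        children.append(index + bit)
        bit <<= 1
    return children


def binomial_parent(index):
    """Return the parent of `index` (None for the root) under the same convention."""
    if index == 0:
        return None
    # Clearing the highest set bit gives the parent.
    return index - (1 << (index.bit_length() - 1))


if __name__ == "__main__":
    for node in range(8):
        print(node, binomial_children(node, 8), binomial_parent(node))
```

With 8 nodes, node 0 would connect to nodes 1, 2 and 4, node 1 to nodes 3 and 5, and so on, each looking up the child's address/port in the routing file.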

6) If the individual SDM is passed the host/port that it connects to the
top SDM, how do I find out what that top level SDM port is?

There is no easy way to do this at the moment, since it is generated internally and passed to the submitJob command as an argument. The easiest way would be to print out the arguments to the submitJob command either in the Java side of the RM or in your proxy.

I think I understand how this is supposed to work, and it seems reasonable for the case where the user specifies a host list file. In the case where
we use LoadLeveler to allocate nodes, I'm not sure how this will work
since we have no way of knowing what nodes are allocated until the poe job
(the SDMs) starts.

The SDMs do nothing until they get the routing file. Would it be possible to launch the SDMs, get the node information from LL, then create the routing file? This is how the new OMPI RM works.
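
As a rough sketch of that last step, the routing file could be generated once the node list is known. Everything about the layout below is hypothetical (a count line followed by one "index host port" line per task, with ports assigned sequentially from a base port); the only facts taken from this thread are that the file lists all tasks and gives an address/port for each. The host names are placeholders.

```python
def format_routing_table(hosts, base_port):
    """Render a routing table for all tasks as text.

    `hosts` holds one hostname per task, in task-index order, as reported by
    the scheduler after the SDMs have been launched.
    Hypothetical layout: first line is the task count, then one
    "index host port" line per task. The real sdm format may differ.
    """
    lines = [str(len(hosts))]
    for index, host in enumerate(hosts):
        # Assumption: each task listens on base_port + its index.
        lines.append(f"{index} {host} {base_port + index}")
    return "\n".join(lines) + "\n"


def write_routing_file(path, hosts, base_port):
    """Write the table to `path`, which must be on the shared filesystem."""
    with open(path, "w") as handle:
        handle.write(format_routing_table(hosts, base_port))


if __name__ == "__main__":
    # Placeholder node names standing in for whatever LoadLeveler allocated.
    print(format_routing_table(["node01", "node01", "node02"], 5000), end="")
```

Because the SDMs wait for the file, it can be written after the job has started and the allocation is known.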


Greg
