Skip to main content

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]
[ptp-dev] PTP enhancements to support LoadLeveler

We have some success now with seeing job and node status from LoadLeveler 
in the PTP views. There are some enhancements we think would be useful to 
LoadLeveler and other resource managers as well as some questions

Suggestions

1) LoadLeveler has a number of statuses for a job and node beyond the 
simple running and exited that I am using for PE. We originally talked 
about adding additional icons or different colored icons to represent 
those statuses, but we think there are too many for this to be a good 
idea. We're thinking that if the job and machine view could be enhanced to 
allow flyover/tooltip help for each icon in the view. That text could be 
used to display additional status. We are also thinking this status would 
be sent from the proxy as an additional, optional, attribute to node and 
job change events. Each time job or node status changed we would send 
across a new event. The tooltip text would present whatever thye last 
additional status for that node or job was. It might make sense to extend 
this to machine and task objects as well, but we don't have a need for 
these to support LoadLeveler.
2) LoadLeveler can provide additional, detailed information on a submitted 
job, such as resources allocated to a job, reason why a job is not 
running, etc that we think would be useful to the user. Our thinking here 
is that the user could right click on an entry in the job view and get a 
popup menu witt 'detailed status' as one choice. Selecting that results in 
a command to the proxy requesting status. The proxy sends back an event 
with the detailed status. The GUI proceses that event and opens a popup, 
scrollable, window displaying that status (or maybe put this info in a new 
PTP view)
3) It looks like the job view uses task indices starting at 0. LoadLeveler 
identifies the application tasks in a parallel application using their 
task rank 0..n-1 and uses -1 for the master (poe) process of the 
application as well as -1 for the sole task of a serial application. Is it 
possible to modify the definition of task index to allow -1 (and maybe an 
arbitary lower linit)

Questions
1) Is the PID attribute to a process mandatory? This information is not 
provided to us by LoadLeveler, so we would like to omit it, or use '-1' as 
the pid if none is provided. We could also live with a requirement that we 
send pid 0, but that woulod be a bit inconsistent with task index where we 
have -1 as a task index.

I also have a couple bugs which I will log in bugzilla when I get a chance

Dave


Back to the top