Re: [ptp-dev] Questions related to batch job submission

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]

Re: [ptp-dev] Questions related to batch job submission

From: Greg Watson <g.watson@xxxxxxxxxxxx>
Date: Wed, 20 Jun 2007 09:24:03 -0600
Delivered-to: ptp-dev@xxxxxxxxxxx
List-archive: <https://dev.eclipse.org/mailman/listinfo/ptp-dev>
List-help: <mailto:ptp-dev-request@eclipse.org?subject=help>
List-subscribe: <https://dev.eclipse.org/mailman/listinfo/ptp-dev>, <mailto:ptp-dev-request@eclipse.org?subject=subscribe>
List-unsubscribe: <https://dev.eclipse.org/mailman/listinfo/ptp-dev>, <mailto:ptp-dev-request@eclipse.org?subject=unsubscribe>


On Jun 20, 2007, at 8:40 AM, Dave Wootton wrote:

Greg
We weren't planning to reimplement any LL function in PTP. A LLuser would
submit a multi-step job as a single command file containing the
specification statements to define the multi-step job. Ourimplementation
for PTP would follow that model.
The question we had was related to the 'jobid' that the proxy fillsin in
each new job event it sends to the front end. There's a comment in
proxy_event.c that states that new jobs created in response to asubmitcommand must fill in jobid with the jobid passed to the proxy inthe jobcommand. This works fine when the job command file specifies asingle jobstep since when we query LL to get the job info, we only have asingle new
job event.
In the case of a multi-step job, there will be a single submitcommand,but when LL processes the job submission, it can result in multiplenewjobs, where the number cannot be determined by parsing the commandfile(only LL knows the correct number of steps after it has parsed thecommand
file as part of the submit process).
In the case of a multi-step job, as we generate new job events foreachjob step of the just submitted job, what do we use for the jobid ineach
event? It seems like we would generate the first event using the jobid
from the submit command and -1 for the jobid of the remainder ofthe new
job events, but we're not sure that's right.



Sorry, I misunderstood your question.

The RM keeps track of each job submission, and compares the jobsubmission ID attribute against any new job event that it receives.When it finds a new job event with matching job submission IDattribute it completes the submission, then throws away the ID. Ifthe RM receives a new job event without a job submission ID attribute(or one that has already been completed), then it simply adds the jobto the model.

The upshot of this is that you need return the job submission IDattribute on a single new job event in order to complete the submitcommand. You can then send unsolicited new job events with this samejob submission ID attribute and they will just be added to the model.In fact, I suggest you do this since the job submission ID attributecould then be used to show which jobs belong to the multi-step job inthe UI. Since both LSF and LL have these multi-step jobs, this wouldbe a nice feature to have.

Also, we have figured out how to recognize new jobs as they areinitiallysubmitted, so we are able to provide an immediate indication to theuser
that the job was submitted.


Cool.

Greg

References:
- Re: [ptp-dev] Questions related to batch job submission
  - From: Dave Wootton

Prev by Date: Re: [ptp-dev] Problem with creating enumerated attribute definitions?
Next by Date: Re: [ptp-dev] Problem with creating enumerated attribute definitions?
Previous by thread: Re: [ptp-dev] Questions related to batch job submission
Next by thread: Re: [ptp-dev] Questions related to batch job submission
Index(es):
- Date
- Thread

Breadcrumbs