Re: [ptp-dev] Questions related to batch job submission

Some more thoughts on 1). LSF seems to have something called a 'job array' (http://hpc.ilri.cgiar.org/documents/admin_6.0/G_jobarrays.html), which sounds like what you're talking about in that you can set up dependencies between jobs, etc. The whole thing is submitted using a single submit command. It would seem fairly straightforward to add an LSF-specific launch configuration page that lets you build up this array, then sends it as a bunch of attributes on the submission command. One new job event would be sent for the 'array' job, returning the submission ID; then new job events for each of the individual jobs in the array could also be sent. The LSF proxy could then distinguish commands sent to the 'array' job ID from those sent to the individual job IDs.
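To make the idea concrete, here is a minimal sketch of the event sequence such a proxy might emit for an array submission: one event for the 'array' job echoing the front end's jobid, then one per individual job. All event and field names here are invented for illustration; they are not the actual PTP proxy protocol.

```python
# Hypothetical sketch only -- event names, field names, and ID formats
# are illustrative assumptions, not the real PTP proxy wire protocol.

def array_submission_events(frontend_jobid, step_count):
    """Yield (event, attrs) pairs: one 'array' job event plus one
    event per individual job in the array."""
    array_id = "lsf_array_1234"  # stand-in for a proxy-generated array job ID
    # First event represents the whole array and echoes the frontend jobid.
    yield ("NEW_JOB", {"jobid": frontend_jobid, "proxy_id": array_id})
    # Then one event per individual job, tagged with its parent array ID
    # so the proxy can route later commands to array vs. individual jobs.
    for i in range(step_count):
        yield ("NEW_JOB", {"jobid": frontend_jobid,
                           "proxy_id": f"{array_id}[{i}]",
                           "parent": array_id})

events = list(array_submission_events("fe_job_7", 3))  # 1 array + 3 jobs
```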

Seems like something similar could work for LL.

Greg

On Jun 18, 2007, at 1:38 PM, Dave Wootton wrote:

Hi
We were discussing details of batch job submission through PTP and had some
questions about expected PTP behavior and how we should implement our
support.

1) The user can submit a job which contains a set of job steps. From our
perspective, each job step behaves as if it were a separate job, although
there may be dependency and conditional execution specifications that
require job step 1 to complete before job step 2 can begin, or that allow
job step 2 to run only if job step 1 completed successfully, etc. The job
submission/job command file that specifies the individual job steps is a
single file that will be passed to the proxy in a single run command.

Current PTP behavior is that the run command includes a jobid that is
generated by the front end and passed to the proxy. The proxy responds to
the run command with an event containing that jobid as well as the
proxy-generated identifier for that job. This works for a single job, or
for the first step of a multi-step job.
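The handshake described above can be sketched roughly as follows. Function names, the ID formats, and the scheduler identifier are assumptions made up for this example, not PTP's real API.

```python
# Rough sketch of the current single-job handshake, with invented names:
# the front end generates a jobid, sends it in the run command, and the
# proxy answers with an event carrying both that jobid and its own ID.

import itertools

_frontend_counter = itertools.count(1)

def frontend_run_command(executable):
    """Front end builds a run command with a locally generated jobid."""
    jobid = f"fe_{next(_frontend_counter)}"
    return {"cmd": "RUN", "jobid": jobid, "exec": executable}

def proxy_handle_run(cmd):
    """Proxy submits the job, then echoes the frontend jobid together
    with the scheduler-assigned identifier in a NEW_JOB event."""
    proxy_id = "sched_4711"  # stand-in for a scheduler-assigned job ID
    return {"event": "NEW_JOB", "jobid": cmd["jobid"], "proxy_id": proxy_id}
```

The key point is the correlation: the event carries the front end's jobid back, so the front end can match the event to the run command it issued.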

How should multi-step jobs be handled?
- Should the PTP front end build a list or array of job steps at the
time the job is submitted, and use the same jobid for each of those steps?
- Should the front end generate a unique jobid for each step that is then
passed across in the run command, maybe as an array of jobids, with the
proxy generating a new job event for each step using the corresponding
jobid?
- Should the proxy just use the passed jobid for the first step and use
-1 as the jobid for all subsequent steps, since the front end doesn't
know about the additional job steps?
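For what it's worth, the second option above might look something like this on the proxy side: the run command carries an array of jobids, and the proxy pairs each step with the jobid at the same index. All names here are invented for illustration.

```python
# Sketch of the "array of jobids" option, with made-up names and fields.

def proxy_handle_multistep_run(cmd):
    """Emit one NEW_JOB event per job step, pairing each step with the
    frontend-supplied jobid at the same index."""
    events = []
    for step_index, jobid in enumerate(cmd["jobids"]):
        events.append({"event": "NEW_JOB",
                       "jobid": jobid,
                       "proxy_id": f"step_{step_index}"})
    return events

cmd = {"cmd": "RUN", "jobids": ["fe_1a", "fe_1b", "fe_1c"]}
events = proxy_handle_multistep_run(cmd)  # one event per step
```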

2) When we submit a job, the job may not appear on any job queue for a
while, possibly several minutes. We won't have some job-related
information, such as the cluster (machine name) where the job was queued,
until the job appears on the job queue. If we delay our event response to
the run command until we have the required information, does that cause
problems, such as blocking any additional jobs from being queued until
the event notification from the first run command is received? Does the
front end have problems tracking multiple 'in process' run commands
active at the same time?
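One way a front end could cope with delayed events without blocking is to keep outstanding submissions in a map keyed by jobid and match events back to them whenever they arrive. This is only a sketch of that bookkeeping, with invented names; it does not claim to reflect how PTP actually tracks jobs.

```python
# Sketch of non-blocking tracking of in-flight run commands, keyed by
# the frontend jobid. Names and states are assumptions for illustration.

class PendingJobs:
    def __init__(self):
        self.pending = {}

    def submitted(self, jobid):
        """Record a run command whose NEW_JOB event hasn't arrived yet.
        Submitting more jobs never blocks on earlier events."""
        self.pending[jobid] = {"state": "SUBMITTED"}

    def on_new_job_event(self, event):
        """Match a (possibly long-delayed) event back to its run command
        by jobid; returns None if the jobid is unknown."""
        job = self.pending.pop(event["jobid"], None)
        if job is not None:
            job["state"] = "QUEUED"
            job["proxy_id"] = event["proxy_id"]
        return job
```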

Thanks
Dave
_______________________________________________
ptp-dev mailing list
ptp-dev@xxxxxxxxxxx
https://dev.eclipse.org/mailman/listinfo/ptp-dev



