bapp
Displays information about application profile configuration.
Synopsis
bapp [-alloc] [-l | -w] [application_profile ...]Description
Displays information about application profiles that are configured in the lsb.applications file.
Returns application name, task statistics, and job state statistics for all application profiles.
In the LSF multicluster capability, returns the information about all application profiles in the local cluster.
Returns job slot statistics if the -alloc option is used.
CPU time is normalized.
Options
- -alloc
- Shows counters for slots in RUN, SSUSP, USUSP, and RSV state. The slot allocation is different depending on whether the job is an exclusive job or not.
- -w
Wide format. Fields are displayed without truncation.
- -l
Long format with additional information.
Displays the following additional information about the application profile:- Description
- Profile characteristics and statistics
- Parameters
- Resource usage limits
- Associated commands
- Binding policy
- NICE value
- Job controls
- Pending time limits
- Eligible pending time limits
- (As of Fix Pack 4) APS priority factors
- application_profile ...
- Displays information about the specified application profile.
- -h
- Prints command usage to stderr and exits.
- -V
- Prints product release version to stderr and exits.
Default output format
Displays the following fields:
- APPLICATION_NAME
- The name of the application profile. Application profiles are named to correspond to the type of application that usually runs within them.
- NJOBS
- The
total number of tasks for all jobs that are currently held in the application profile. The total
includes tasks in pending, running, and suspended jobs.
If the -alloc option is used, total is the sum of the counters for RUN, SSUSP, USUSP, and RSV states.
- PEND
- The number of tasks for all pending jobs in the application profile. If used with the -alloc option, output is 0, because pending jobs do not have slot allocation.
- RUN
- The number of tasks for all running jobs in the application profile. If the -alloc option is used, the total is allocated slots for the jobs in the application profile.
- SUSP
- The number of tasks for all suspended jobs in the application profile. If the -alloc option is used, the total is allocated slots for the jobs in the application profile.
Long output format with the -l option
In addition to the default output fields, the -l option displays the following fields:
- Description
- A description of the typical use of the application profile.
- PARAMETERS/STATISTICS
-
- SSUSP
- The number of tasks for all jobs in the application profile that are suspended by LSF because of load levels or run windows.
- USUSP
- The number of tasks for all jobs in the application profile that are suspended by the job submitter or by the LSF administrator.
- RSV
- The number of tasks that reserve slots for pending jobs in the application profile.
- ENV_VARS
- The name-value pairs that are defined by the application-specific environment variables.
- Per-job resource usage limits
- The soft resource usage limits that are imposed on the jobs that are associated with the
application profile. These limits are imposed on a per-job and a per-process basis. The following per-job limits are supported:
- CPULIMIT
- The maximum CPU time a job can use, in minutes, relative to the CPU factor of the named host. The CPULIMIT is scaled by the CPU factor of the execution host so that jobs are allowed more time on slower hosts.
- MEMLIMIT
- The maximum running set size (RSS) of a process.
By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).
- MEMLIMIT_TYPE
- A memory limit is the maximum amount of memory a job is allowed to consume. Jobs that exceed the level are killed. You can specify different types of memory limits to enforce, based on PROCESS, TASK, or JOB (or any combination of the three).
- PROCESSLIMIT
- The maximum number of concurrent processes that are allocated to a job.
- SWAPLIMIT
- The swap space limit that a job can use.
By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).
- TASKLIMIT
- The maximum number of tasks that are allocated to a job.
- THREADLIMIT
- THREADLIMITThe maximum number of concurrent threads that are allocated to a job.
- Per-process resource usage limits
- The following UNIX per-process resource limits are supported:
- CORELIMIT
- The maximum size of a core file.
By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).
- DATALIMIT
- The maximum size of the data segment of a process, in KB. This limit restricts the amount of memory a process can allocate.
- FILELIMIT
- The maximum file size a process can create, in KB.
- RUNLIMIT
- The maximum wall clock time a process can use, in minutes. RUNLIMIT is scaled by the CPU factor of the execution host.
- STACKLIMIT
- The maximum size of the stack segment of a process. This limit restricts the amount of memory a
process can use for local variables or recursive function calls.
By default, the limit is shown in KB. Use the LSF_UNIT_FOR_LIMITS parameter in the lsf.conf file to specify a larger unit for display (MB, GB, TB, PB, or EB).
- BIND_JOB
- The processor binding policy for sequential and parallel job processes enabled in the
application profile. Displays one of the following values: NONE,
BALANCE, PACK,
ANY, USER, or
USER_CPU_LIST.
bapp -l app1 APPLICATION NAME: app1 -- test processor binding options ... PARAMETERS: BIND_JOB: ANY
For compatibility with an earlier versions, the bapp -l command displays Y or N if the BIND_JOB parameter is defined with those values in the application profile.
- CHKPNT_DIR
- The checkpoint directory, if automatic checkpointing is enabled for the application profile.
- CHKPNT_INITPERIOD
- The initial checkpoint period in minutes. The periodic checkpoint does not happen until the initial period elapses.
- CHKPNT_PERIOD
- The checkpoint period in minutes. The running job is check-pointed automatically every checkpoint period.
- CHKPNT_METHOD
- The checkpoint method.
- MIG
- The migration threshold in minutes. A value of 0 (zero) specifies
that a suspended job is to be migrated immediately.
Where a host migration threshold is also specified, and is lower than the job value, the host value is used.
- PRE_EXEC
- The job-based pre-execution command for the application profile. The PRE_EXEC command runs on the execution host before the job that is associated with the application profile is dispatched to the execution host (or to the first host selected for a parallel batch job).
- POST_EXEC
- The job-based post-execution command for the application profile. The POST_EXEC command runs on the execution host after the job finishes.
- HOST_PRE_EXEC
- The host-based pre-execution command for the application profile. The HOST_PRE_EXEC command runs on all execution hosts before the job that is associated with the application profile is dispatched to the execution hosts. If a job-based pre-execution PRE_EXEC command was defined at the queue-level/application-level/job-level, the HOST_PRE_EXEC command runs before the PRE_EXEC command of any level. The host-based pre-execution command cannot be run on Windows systems.
- HOST_POST_EXEC
- The host-based post-execution command for the application profile. The HOST_POST_EXEC command runs on the execution hosts after the job finishes. If a job-based post-execution POST_EXEC command was defined at the queue-level/application-level/job-level, the HOST_POST_EXEC command runs after the POST_EXEC command of any level. The host-based post-execution command cannot be run on Windows systems.
- LOCAL_MAX_PREEXEC_RETRY_ACTION
- The action to take on a job when the number of times to attempt its pre-execution command on the local cluster (the value of the LOCAL_MAX_PREEXEC_RETRY parameter) is reached.
- JOB_INCLUDE_POSTPROC
- If the JOB_INCLUDE_POSTPROC=Y parameter is defined, post-execution processing of the job is included as part of the job.
- JOB_POSTPROC_TIMEOUT
- Timeout in minutes for job post-execution processing. If post-execution processing takes longer than the timeout, the sbatchd daemon reports that post-execution failed (POST_ERR status). On UNIX, it kills the process group of the job's post-execution processes. On Windows, only the parent process of the pre-execution command is killed when the timeout expires, the child processes of the pre-execution command are not killed.
- REQUEUE_EXIT_VALUES
- Jobs that exit with these values are automatically re-queued.
- RES_REQ
- Resource requirements of the application profile. Only the hosts that satisfy these resource requirements can be used by the application profile.
- JOB_STARTER
- An executable file that runs immediately before the batch job, taking the batch job file as an input argument. All jobs that are submitted to the application profile are run through the job starter, which is used to create a specific execution environment before LSF processes the jobs themselves.
- RERUNNABLE
- If the RERUNNABLE field displays yes, jobs in the application profile are automatically restarted or rerun if the execution host becomes unavailable. However, a job in the application profile is not restarted if you use bmod to remove the rerunnable option from the job.
- RESUME_CONTROL
- The configured actions for the resume job control.
The configured actions are displayed in the format [action_type, command] where action_type is RESUME.
- SUSPEND_CONTROL
- The configured actions for the suspend job control.
The configured actions are displayed in the format [action_type, command] where action_type is SUSPEND.
- TERMINATE_CONTROL
- The configured actions for terminate job control.
The configured actions are displayed in the format [action_type, command] where action_type is TERMINATE.
- NO_PREEMPT_INTERVAL
- The configured uninterrupted running time (minutes) that must pass before a pre-emptable job can be preempted.
- MAX_TOTAL_TIME_PREEMPT
- The configured maximum total preemption time (minutes) after which preemption cannot take place.
- NICE
- The relative scheduling priority at which jobs from the application run.
- Current working directory (CWD) information
-
- JOB_CWD
- The current working directory for the job in the application profile. The path can be absolute or relative to the submission directory, and includes dynamic patterns.
- JOB_CWD_TTL
- The time to live for the current working directory for a job. LSF cleans the created CWD after a job finishes based on the TTL value.
- JOB_SIZE_LIST
- A list of job sizes (number of tasks) allowed on this application, including the default job size that is assigned if the job submission does not request a job size. Configured in the lsb.applications file.
- PEND_TIME_LIMIT
- The pending time limit for a job in the application profile. If a job remains pending for longer than this specified time limit, IBM® Spectrum LSF RTM (LSF RTM) triggers the alarm and other actions. Configured in the lsb.applications file.
- ELIGIBLE_PEND_TIME_LIMIT
- The eligible pending time limit for a job in the application profile. If a job remains in an eligible pending state for longer than this specified time limit, IBM Spectrum LSF RTM (LSF RTM) triggers the alarm and other actions. Configured in the lsb.applications file.
- PRIORITY
- The APS priority factor for the application profile. PRIORITY is defined by the PRIORITY parameter in the configuration file lsb.applications.
See also
lsb.applications, lsb.queues, bsub, bjobs, badmin, mbatchd