The lsb.acct file is the batch job log file of LSF. The master batch daemon (see mbatchd(8)) generates a record for each job completion or failure. The record is appended to the job log file lsb.acct.
The file is located in LSB_SHAREDIR/<clustername>/logdir, where LSB_SHAREDIR must be defined in lsf.conf(5) and <clustername> is the name of the LSF cluster, as returned by lsid(1). See mbatchd(8) for the description of LSB_SHAREDIR.
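The location rule above can be sketched in a few lines of Python. This is only an illustration: the function name, the share directory, and the cluster name are hypothetical, and in practice LSB_SHAREDIR would be read from lsf.conf and the cluster name taken from lsid.

```python
import posixpath


def lsb_acct_path(lsb_sharedir: str, cluster_name: str) -> str:
    """Build LSB_SHAREDIR/<clustername>/logdir/lsb.acct."""
    return posixpath.join(lsb_sharedir, cluster_name, "logdir", "lsb.acct")


# Hypothetical values, used only to show the resulting layout.
print(lsb_acct_path("/usr/share/lsf/work", "cluster1"))
```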
lsb.acct Structure
The job log file is an ASCII file with one record per line. The fields of a record are separated by blanks. If the value of a field is unavailable, "" is logged for a character string, 0 for a time or number, and -1 for resource usage.
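A minimal sketch of splitting one record into fields follows. It assumes that character strings are enclosed in double quotes and that a quote inside a string is written as two quotes; neither detail is stated above, so treat both as assumptions. The function name and the sample line are made up for illustration.

```python
import re

# Assumption: a field is either a double-quoted string (with an embedded
# quote written as "") or a run of non-blank characters.
_TOKEN = re.compile(r'"((?:[^"]|"")*)"|(\S+)')


def split_record(line: str) -> list:
    """Split one lsb.acct record line into a list of field strings."""
    fields = []
    for match in _TOKEN.finditer(line):
        quoted, bare = match.groups()
        if quoted is not None:
            fields.append(quoted.replace('""', '"'))
        else:
            fields.append(bare)
    return fields


# Made-up sample line, not taken from a real log.
sample_line = '"JOB_FINISH" "6.0" 1038942015 101 "" 0'
print(split_record(sample_line))
```

All values come back as strings; converting numeric fields with int() or float() is left to the caller, because the type depends on the position of the field in the record.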
Records and fields
The first string of an event record indicates its type. The following types of events are recorded:
JOB_FINISH
A job has finished.
If LSF_HPC_EXTENSIONS="SHORT_EVENTFILE" is specified in lsf.conf, older daemons and commands (pre-LSF Version 6.0) cannot recognize the lsb.acct file format.
The fields in order of occurrence are:
Event type (%s)
Which is "JOB_FINISH"
Version Number (%s)
Version number of the log file format
Event Time (%d)
Time the event was logged (in seconds since the epoch)
jobId (%d)
ID for the job
userId (%d)
UNIX user ID of the submitter
options (%d)
Bit flags for job processing
numProcessors (%d)
Number of processors initially requested for execution
submitTime (%d)
Job submission time
beginTime (%d)
Job start time - the job should be started at or after this time
termTime (%d)
Job termination deadline - the job should be terminated by this time
startTime (%d)
Job dispatch time - time job was dispatched for execution
userName (%s)
User name of the submitter
queue (%s)
Name of the job queue to which the job was submitted
resReq (%s)
Resource requirement specified by the user
dependCond (%s)
Job dependency condition specified by the user
preExecCmd (%s)
Pre-execution command specified by the user
fromHost (%s)
Submission host name
cwd (%s)
Current working directory
inFile (%s)
Input file name
outFile (%s)
Output file name
errFile (%s)
Error output file name
jobFile (%s)
Job script file name
numAskedHosts (%d)
Number of host names to which job dispatching will be limited
askedHosts (%s)
List of host names to which job dispatching will be limited (%s for each); nothing is logged to the record for this value if the preceding field (numAskedHosts) is 0. If there is more than one host name, each additional host name is logged in its own field
numExHosts (%d)
Number of processors used for execution
execHosts (%s)
List of execution host names (%s for each); nothing is logged to the record for this value if the preceding field (numExHosts) is 0
jStatus (%d)
Job status. The number 32 represents EXIT, 64 represents DONE
hostFactor (%f)
CPU factor of the first execution host
jobName (%s)
Job name
command (%s)
Complete batch job command specified by the user
lsfRusage
The following fields contain resource usage information for the job. If the value of a field is unavailable (because the job was aborted or because of differences among operating systems), -1 is logged. Times are measured in seconds, and sizes are measured in KBytes.
ru_utime (%f)
User time used
ru_stime (%f)
System time used
ru_maxrss (%d)
Maximum shared text size
ru_ixrss (%d)
Integral of the shared text size over time (in kilobyte seconds)
ru_ismrss (%d)
Integral of the shared memory size over time (valid only on Ultrix)
ru_idrss (%d)
Integral of the unshared data size over time
ru_isrss (%d)
Integral of the unshared stack size over time
ru_minflt (%d)
Number of page reclaims
ru_majflt (%d)
Number of page faults
ru_nswap (%d)
Number of times the process was swapped out
ru_inblock (%d)
Number of block input operations
ru_oublock (%d)
Number of block output operations
ru_ioch (%d)
Number of characters read and written (valid only on HP-UX)
ru_msgsnd (%d)
Number of System V IPC messages sent
ru_msgrcv (%d)
Number of messages received
ru_nsignals (%d)
Number of signals received
ru_nvcsw (%d)
Number of voluntary context switches
ru_nivcsw (%d)
Number of involuntary context switches
ru_exutime (%d)
Exact user time used (valid only on ConvexOS)
mailUser (%s)
Name of the user to whom job related mail was sent
projectName (%s)
Project name
exitStatus (%d)
UNIX exit status of the job
maxNumProcessors (%d)
Maximum number of processors specified for the job
loginShell (%s)
Login shell used for the job
timeEvent (%s)
Time event string for the job - JobScheduler only
idx (%d)
Job array index
maxRMem (%d)
Maximum resident memory usage in KBytes of all processes in the job
maxRSwap (%d)
Maximum virtual memory usage in KBytes of all processes in the job
inFileSpool (%s)
Spool input file
commandSpool (%s)
Spool command file
rsvId (%s)
Advance reservation ID; for example, "user2#0"
additionalInfo (%s)
Placement information of HPC jobs
exitInfo (%d)
Job termination reason, see <lsbatch/lsbatch.h>
warningAction (%s)
Job warning action
warningTimePeriod (%d)
Job warning time period in seconds
chargedSAAP (%s)
SAAP charged to a job
sla (%s)
SLA service class name under which the job runs
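The field order above, including the two variable-length host lists, can be walked with a small reader. The sketch below is only an illustration under stated assumptions: it takes a record that has already been split into field strings, it stops after the command field (the lsfRusage block and later fields are not handled), and the function name and all sample values are made up.

```python
from datetime import datetime, timezone

JOB_STATUS = {32: "EXIT", 64: "DONE"}  # values documented for jStatus

# Fixed-position fields that follow the event type and version number.
INT_FIELDS = ["eventTime", "jobId", "userId", "options", "numProcessors",
              "submitTime", "beginTime", "termTime", "startTime"]
STR_FIELDS = ["userName", "queue", "resReq", "dependCond", "preExecCmd",
              "fromHost", "cwd", "inFile", "outFile", "errFile", "jobFile"]


def parse_job_finish_head(fields):
    """Read a JOB_FINISH record from the event type through the command."""
    it = iter(fields)
    rec = {"eventType": next(it), "version": next(it)}
    for name in INT_FIELDS:
        rec[name] = int(next(it))
    for name in STR_FIELDS:
        rec[name] = next(it)
    # Each count is followed by exactly that many host name fields.
    num_asked = int(next(it))
    rec["askedHosts"] = [next(it) for _ in range(num_asked)]
    num_exec = int(next(it))
    rec["execHosts"] = [next(it) for _ in range(num_exec)]
    status = int(next(it))
    rec["jStatus"] = JOB_STATUS.get(status, status)
    rec["hostFactor"] = float(next(it))
    rec["jobName"] = next(it)
    rec["command"] = next(it)
    return rec


# Made-up field values in the documented order.
sample = ["JOB_FINISH", "6.0", "1038942015", "101", "30408", "0", "1",
          "1038941000", "0", "0", "1038941100", "user2", "normal", "", "",
          "", "hostA", "/home/user2", "", "", "", "1038941000.101",
          "0", "1", "hostB", "64", "1.0", "sleep", "sleep 60"]
rec = parse_job_finish_head(sample)
logged = datetime.fromtimestamp(rec["eventTime"], tz=timezone.utc)
print(rec["jobId"], rec["jStatus"], rec["execHosts"], logged.isoformat())
```

Reading the counts first and then consuming that many host names keeps the later fields aligned even when a list is empty.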
EVENT_ADRSV_FINISH
An advance reservation has expired. The fields in order of occurrence are:
Event type (%s)
Which is "EVENT_ADRSV_FINISH"
Version Number (%s)
Version number of the log file format
Event Logging Time (%d)
Time the event was logged (in seconds since the epoch); for example, "1038942015"
Reservation Creation Time (%d)
Time the advance reservation was created (in seconds since the epoch); for example, "1038938898"
Reservation Type (%d)
Type of advance reservation request:
- User reservation (RSV_OPTION_USER, defined as 0x001)
- User group reservation (RSV_OPTION_GROUP, defined as 0x002)
- System reservation (RSV_OPTION_SYSTEM, defined as 0x004)
- Recurring reservation (RSV_OPTION_RECUR, defined as 0x008)
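Because the values listed above are distinct bits, a logged reservation type can be read as a combination of them. A minimal sketch of decoding such a value, assuming the constants have exactly the values listed; the function name and the short labels are hypothetical:

```python
RSV_OPTION_USER = 0x001
RSV_OPTION_GROUP = 0x002
RSV_OPTION_SYSTEM = 0x004
RSV_OPTION_RECUR = 0x008

_FLAGS = [
    (RSV_OPTION_USER, "user"),
    (RSV_OPTION_GROUP, "group"),
    (RSV_OPTION_SYSTEM, "system"),
    (RSV_OPTION_RECUR, "recurring"),
]


def describe_reservation_type(value: int) -> list:
    """Return the label of every flag bit that is set in value."""
    return [label for flag, label in _FLAGS if value & flag]


print(describe_reservation_type(9))  # 9 == 0x008 | 0x001
```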
For example, "9" is a recurring reservation created for a user.
Creator ID (%d)
UNIX user ID of the reservation creator; for example, "30408"
Reservation ID (rsvId %s)
For example, "user2#0"
User Name (%s)
User name of the reservation user; for example, "user2"
Time Window (%s)
Time window of the reservation:
- One-time reservation in seconds since the epoch; for example, "1033761000-1033761600"
- Recurring reservation; for example, "17:50-18:00"
Creator Name (%s)
User name of the reservation creator; for example, "user1"
Duration (%d)
Duration of the reservation, in hours, minutes, seconds; for example, "600" is 6 hours, 0 minutes, 0 seconds
Number of Resources (%d)
Number of reserved resource pairs in the resource list; for example, "2" indicates 2 resource pairs ("hostA 1 hostB 1")
Host Name (%s)
Reservation host name; for example, "hostA"
Number of CPUs (%d)
Number of reserved CPUs; for example, "1"
SEE ALSO
Related topics
lsb.events(5), lsb.params(5), lsf.conf(5), mbatchd(8), bacct(1), brsvadd(8), brsvs(1), bsub(1), lsid(1)
Files
$LSB_SHAREDIR/<cluster_name>/logdir/lsb.acct
Date Modified: February 24, 2004
Platform Computing: www.platform.com
Platform Support: support@platform.com
Platform Information Development: doc@platform.com
Copyright © 1994-2004 Platform Computing Corporation. All rights reserved.