Page History

Turn Off History

This is a collection of various magic numbers in Condor- and Grid-related software. These are mostly extracted from the source files for the software in question.

Originally written by Alan De Smet. Expanded by Jaime Frey.

Condor

Universe

JobUniverse in job ClassAds

0 Min A placeholder, not a universe
1 Standard Single process relinked jobs
2 Pipe A placeholder, no longer used
3 Linda A placeholder, no longer used
4 PVM Parallel Virtual Machine apps
5 Vanilla Single process non-relinked jobs
6 PVMD PVM daemon process
7 Scheduler A job run under the schedd
8 MPI Message Passing Interface jobs
9 Grid / Globus Jobs managed by condor_gridmanager (V6.6: always Globus, V6.7: grid_type=gt2, gt3, gt4, condor, oracle, nordugrid...)
10 Java Jobs for the Java Virtual Machine
11 Parallel Generalized parallel jobs
12 Local A job run under the schedd using a starter (advanced form of Scheduler)
13 Max A placeholder, not a universe

Job Status

JobStatus in job ClassAds

0 Unexpanded U
1 Idle I
2 Running R
3 Removed X
4 Completed C
5 Held H
6 Submission_err E

Notification

JobNotification in job ClassAds

0 Never
1 Always
2 Complete
3 Error

Shadow exit status

(Source: h/exit.h)

Value Name Description
4 JOB_EXCEPTION The job exited with an exception
44 DPRINTF_ERROR There is a fatal error with dprintf()
100 JOB_EXITED The job exited (not killed)
101 JOB_CKPTED The job was checkpointed
102 JOB_KILLED The job was killed
103 JOB_COREDUMPED The job was killed and a core file produced
105 JOB_NO_MEM Not enough memory to start the shadow
106 JOB_SHADOW_USAGE incorrect arguments to condor_shadow
107 JOB_NOT_CKPTED The job was kicked off without a checkpoint
107 JOB_SHOULD_REQUEUE (!) We define this to the same number, since we want the same behavior. However, "JOB_NOT_CKPTED" doesn't mean much if we're not a standard universe job. The effect of this exit code is that we want the job to be put back in the job queue and run again.
108 JOB_NOT_STARTED Can't connect to startd or request refused
109 JOB_BAD_STATUS Job status != RUNNING on startup
110 JOB_EXEC_FAILED Exec failed for some reason other than ENOMEM
111 JOB_NO_CKPT_FILE There is no checkpoint file (lost)
112 JOB_SHOULD_HOLD The job should be put on hold
113 JOB_SHOULD_REMOVE The job should be removed

Job Hold Reason Codes

(Source: src/condor_c++_util/condor_holdcodes.h)

Value Reason SubCode Contents
0 Unspecified N/A
1 UserRequest N/A
2 GlobusGramError GRAM error code
3 JobPolicy N/A
4 CorruptedCredential N/A
5 JobPolicyUndefined N/A
6 FailedToCreateProcess Unix errno
7 UnableToOpenOutput Unix errno
8 UnableToOpenInput Unix errno
9 UnableToOpenOutputStream Unix errno
10 UnableToOpenInputStream Unix errno
11 InvalidTransferAck N/A
12 DownloadFileError Unix errno
13 UploadFileError Unix errno
14 IwdError Unix errno
15 SubmittedOnHold N/A
16 SpoolingInput N/A

Log event codes

Submit 0
Execute 1
Executable error 2
Checkpointed 3
Job evicted 4
Job terminated 5
Image size 6
Shadow exception 7
Generic 8
Job aborted 9
Job suspended 10
Job unsuspended 11
Job held 12
Job released 13
Node execute 14
Node terminated 15
Post script terminated 16
Globus submit 17
Globus submit failed 18
Globus resource up 19
Globus resource down 20
Remote error 21
Job disconnected 22
Job reconnected 23
Job reconnect failed 24
Grid resource up 25
Grid resource down 26
Grid submit 27

{subseciton: Starters and Shadows}

Shadow Starter Universe
jim jim PVM
V6 V5 Standard
V6.1 V6.1 Everything else

Queue-Management Commands

Number Action
10001 InitializeConnection
10002 NewCluster
10003 NewProc
10004 DestroyCluster
10005 DestroyProc
10006 SetAttribute
10007 CloseConnection
10008 GetAttributeFloat
10009 GetAttributeInt
10010 GetAttributeString
10011 GetAttributeExpr
10012 DeleteAttribute
10013 GetNextJob
10014 FirstAttribute
10015 NextAttribute
10016 DestroyClusterByConstraint
10017 SendSpoolFile
10018 GetJobAd
10019 GetJobByConstraint
10020 GetNextJobByConstraint
10021 SetAttributeByConstraint
10022 InitializeReadOnlyConnection
10023 BeginTransaction
10024 AbortTransaction
10025 SetTimerAttribute

ClassAd Log Commands

Number Action
101 NewClassAd
102 DestroyClassAd
103 SetAttribute
104 DeleteAttribute
105 BeginTransaction
106 EndTransaction

DaemonCore Commands and Signals

From condor_includes/condor_commands.h
Name Number
DC_RAISESIGNAL 60000
DC_PROCESSEXIT 60001
DC_CONFIG_PERSIST 60002
DC_CONFIG_RUNTIME 60003
DC_RECONFIG 60004
DC_OFF_GRACEFUL 60005
DC_OFF_FAST 60006
DC_CONFIG_VAL 60007
DC_CHILDALIVE 60008
DC_SERVICEWAITPIDS 60009
DC_AUTHENTICATE 60010
DC_NOP 60011
DC_RECONFIG_FULL 60012
DC_FETCH_LOG 60013
DC_INVALIDATE_KEY 60014
DC_OFF_PEACEFUL 60015
DC_SET_PEACEFUL_SHUTDOWN 60016
DC_TIME_OFFSET 60017

CEDAR Authentication Methods

From condor_includes/condor_auth.h and condor_io/condor_secman.C
Name Number
CLAIMTOBE 2
FS 4
FS_REMOTE 8
NTSSPI 16
GSI 32
KERBEROS 64
ANONYMOUS 128
SSL 256
PASSWORD 512

Globus

TODO

Misc

TODO