HTCondor
Universe
JobUniverse in job ClassAds
0 | Min | A placeholder, not a universe |
1 | Standard | Single process relinked jobs |
2 | Pipe | A placeholder, no longer used |
3 | Linda | A placeholder, no longer used |
4 | PVM | Parallel Virtual Machine apps, no longer used |
5 | Vanilla | Single process non-relinked jobs |
6 | PVMD | PVM daemon process, no longer used |
7 | Scheduler | A job run under the schedd |
8 | MPI | Message Passing Interface jobs |
9 | Grid / Globus | Jobs managed by condor_gridmanager (V6.6: always Globus, V6.7: grid_type=gt2, gt3, gt4, condor, oracle, nordugrid...) |
10 | Java | Jobs for the Java Virtual Machine |
11 | Parallel | Generalized parallel jobs |
12 | Local | A job run under the schedd using a starter (advanced form of Scheduler) |
13 | VM | A job that is a virtual machine image |
14 | Max | A placeholder, not a universe |
Job Status
JobStatus in job ClassAds
0 | Unexpanded | U |
1 | Idle | I |
2 | Running | R |
3 | Removed | X |
4 | Completed | C |
5 | Held | H |
6 | Transferring_Output | > |
7 | Suspended | S |
Slot Activity
0 | None |
1 | Idle |
2 | Busy |
3 | Retiring |
4 | Vacating |
5 | Suspended |
6 | Benchmarking |
7 | Killing |
DAGMan Node Return Values
0 through 255 | job exited with corresponding return value |
-1 through -32 | job killed by corresponding (non-negative) signal number |
-1001 | job whose batch system submission fails |
-1002 | job externally removed from the batch system queue (by something other than condor_dagman) |
Notification
JobNotification in job ClassAds
0 | Never |
1 | Always |
2 | Complete |
3 | Error |
Shadow exit status
(Source: h/exit.h)
Value | Name | Description |
4 | JOB_EXCEPTION | The job exited with an exception |
44 | DPRINTF_ERROR | There is a fatal error with dprintf() |
100 | JOB_EXITED | The job exited (not killed) |
101 | JOB_CKPTED | The job was checkpointed |
102 | JOB_KILLED | The job was killed |
103 | JOB_COREDUMPED | The job was killed and a core file produced |
105 | JOB_NO_MEM | Not enough memory to start the shadow |
106 | JOB_SHADOW_USAGE | incorrect arguments to condor_shadow |
107 | JOB_NOT_CKPTED | The job was kicked off without a checkpoint |
107 | JOB_SHOULD_REQUEUE | (!) We define this to the same number, since we want the same behavior. However, "JOB_NOT_CKPTED" doesn't mean much if we're not a standard universe job. The effect of this exit code is that we want the job to be put back in the job queue and run again. |
108 | JOB_NOT_STARTED | Can't connect to startd or request refused |
109 | JOB_BAD_STATUS | Job status != RUNNING on startup |
110 | JOB_EXEC_FAILED | Exec failed for some reason other than ENOMEM |
111 | JOB_NO_CKPT_FILE | There is no checkpoint file (lost) |
112 | JOB_SHOULD_HOLD | The job should be put on hold |
113 | JOB_SHOULD_REMOVE | The job should be removed |
114 | JOB_MISSED_DEFERRAL_TIME | The job missed its deferred execution time |
115 | JOB_EXITED_AND_CLAIM_CLOSING | Same as JOB_EXITED, but also inform schedd that the claim will not accept any more jobs |
116 | JOB_RECONNECT_FAILED | Shadow was started in reconnect mode, and yet failed to reconnect to the starter |
Other exit status
Value | Name | Description |
-1073740777 | ACCESS_VIOLATION | Windows, AKA as C0000005 |
44 | DPRINTF_ERROR | There is a fatal error with dprintf() |
Job Hold Reason Codes
(Source: src/condor_c++_util/condor_holdcodes.h)
Value | Reason | SubCode Contents |
0 | Unspecified | N/A |
1 | UserRequest | N/A |
2 | GlobusGramError | GRAM error code |
3 | JobPolicy | N/A |
4 | CorruptedCredential | N/A |
5 | JobPolicyUndefined | N/A |
6 | FailedToCreateProcess | Unix errno |
7 | UnableToOpenOutput | Unix errno |
8 | UnableToOpenInput | Unix errno |
9 | UnableToOpenOutputStream | Unix errno |
10 | UnableToOpenInputStream | Unix errno |
11 | InvalidTransferAck | N/A |
12 | DownloadFileError | Unix errno |
13 | UploadFileError | Unix errno |
14 | IwdError | Unix errno |
15 | SubmittedOnHold | N/A |
16 | SpoolingInput | N/A |
17 | JobShadowMismatch | N/A |
18 | InvalidTransferGoAhead | N/A |
19 | HookPrepareJobFailure | Hook process exit code (or exit signal, if negative) |
20 | MissedDeferredExecutionTime | N/A |
21 | StartdHeldJob | WANT_HOLD_SUBCODE parameter |
22 | UnableToInitUserLog | N/A |
23 | FailedToAccessUserAccount | N/A |
24 | NoCompatibleShadow | N/A |
25 | InvalidCronSettings | N/A |
26 | SystemPolicy | N/A |
27 | SystemPolicyUndefined | N/A |
28 | GlexecChownSandboxToUser | N/A |
29 | PrivsepChownSandboxToUser | N/A |
30 | GlexecChownSandboxToCondor | N/A |
31 | PrivsepChownSandboxToCondor | N/A |
32 | MaxTransferInputSizeExceeded | N/A |
33 | MaxTransferOutputSizeExceeded | N/A |
34 | JobOutOfResources | N/A |
35 | InvalidDockerImage | N/A |
36 | FailedToCheckpoint | N/A |
37 | EC2UserError | N/A |
38 | EC2InternalError | N/A |
39 | EC2AdminError | N/A |
40 | EC2ConnectionProblem | N/A |
41 | EC2ServerError | N/A |
42 | EC2InstancePotentiallyLostError | N/A |
43 | PreScriptFailed | N/A |
44 | PostScriptFailed | N/A |
Log event codes
Submit | 0 |
Execute | 1 |
Executable error | 2 |
Checkpointed | 3 |
Job evicted | 4 |
Job terminated | 5 |
Image size | 6 |
Shadow exception | 7 |
Generic | 8 |
Job aborted | 9 |
Job suspended | 10 |
Job unsuspended | 11 |
Job held | 12 |
Job released | 13 |
Node execute | 14 |
Node terminated | 15 |
Post script terminated | 16 |
Globus submit | 17 |
Globus submit failed | 18 |
Globus resource up | 19 |
Globus resource down | 20 |
Remote error | 21 |
Job disconnected | 22 |
Job reconnected | 23 |
Job reconnect failed | 24 |
Grid resource up | 25 |
Grid resource down | 26 |
Grid submit | 27 |
Job Ad Information | 28 |
Job Status Unknown | 29 |
Job Status Known | 30 |
Job Stage In | 31 |
Job Stage Out | 32 |
Queue-Management Commands
Number | Action |
10001 | InitializeConnection |
10002 | NewCluster |
10003 | NewProc |
10004 | DestroyCluster |
10005 | DestroyProc |
10006 | SetAttribute |
10007 | CloseConnection |
10008 | GetAttributeFloat |
10009 | GetAttributeInt |
10010 | GetAttributeString |
10011 | GetAttributeExpr |
10012 | DeleteAttribute |
10013 | GetNextJob |
10014 | FirstAttribute |
10015 | NextAttribute |
10016 | DestroyClusterByConstraint |
10017 | SendSpoolFile |
10018 | GetJobAd |
10019 | GetJobByConstraint |
10020 | GetNextJobByConstraint |
10021 | SetAttributeByConstraint |
10022 | InitializeReadOnlyConnection |
10023 | BeginTransaction |
10024 | AbortTransaction |
10025 | SetTimerAttribute |
10026 | GetAllJobsByConstraint |
10027 | SetAttribute2 |
10028 | CloseSocket |
10029 | SendSpoolFileIfNeeded |
ClassAd Log Commands
Number | Action |
101 | NewClassAd |
102 | DestroyClassAd |
103 | SetAttribute |
104 | DeleteAttribute |
105 | BeginTransaction |
106 | EndTransaction |
107 | LogHistoricalSequenceNumber |
DaemonCore Commands and Signals
From condor_includes/condor_commands.h
Name | Number |
DC_RAISESIGNAL | 60000 |
DC_PROCESSEXIT | 60001 |
DC_CONFIG_PERSIST | 60002 |
DC_CONFIG_RUNTIME | 60003 |
DC_RECONFIG | 60004 |
DC_OFF_GRACEFUL | 60005 |
DC_OFF_FAST | 60006 |
DC_CONFIG_VAL | 60007 |
DC_CHILDALIVE | 60008 |
DC_SERVICEWAITPIDS | 60009 |
DC_AUTHENTICATE | 60010 |
DC_NOP | 60011 |
DC_RECONFIG_FULL | 60012 |
DC_FETCH_LOG | 60013 |
DC_INVALIDATE_KEY | 60014 |
DC_OFF_PEACEFUL | 60015 |
DC_SET_PEACEFUL_SHUTDOWN | 60016 |
DC_TIME_OFFSET | 60017 |
DC_PURGE_LOG | 60018 |
CEDAR Authentication Methods
From condor_includes/condor_auth.h and condor_io/condor_secman.C
Name | Number |
CLAIMTOBE | 2 |
FS | 4 |
FS_REMOTE | 8 |
NTSSPI | 16 |
GSI | 32 |
KERBEROS | 64 |
ANONYMOUS | 128 |
SSL | 256 |
PASSWORD | 512 |
Globus
GRAM Error Codes
From Globus 2.2.4, globus_gram_protocol-5.0/globus_gram_protocol_constants.h and globus_gram_protocol_error.c
Value | GLOBUS_GRAM_PROTOCOL_ERROR_... | Error String |
0 |
|
Success |
1 | PARAMETER_NOT_SUPPORTED | one of the RSL parameters is not supported |
2 | INVALID_REQUEST | the RSL length is greater than the maximum allowed |
3 | NO_RESOURCES | an I/O operation failed |
4 | BAD_DIRECTORY | jobmanager unable to set default to the directory requested |
5 | EXECUTABLE_NOT_FOUND | the executable does not exist |
6 | INSUFFICIENT_FUNDS | of an unused INSUFFICIENT_FUNDS |
7 | AUTHORIZATION | authentication with the remote server failed |
8 | USER_CANCELLED | the user cancelled the job |
9 | SYSTEM_CANCELLED | the system cancelled the job |
10 | PROTOCOL_FAILED | data transfer to the server failed |
11 | STDIN_NOT_FOUND | the stdin file does not exist |
12 | CONNECTION_FAILED | the connection to the server failed (check host and port) |
13 | INVALID_MAXTIME | the provided RSL 'maxtime' value is not an integer |
14 | INVALID_COUNT | the provided RSL 'count' value is not an integer |
15 | NULL_SPECIFICATION_TREE | the job manager received an invalid RSL |
16 | JM_FAILED_ALLOW_ATTACH | the job manager failed in allowing others to make contact |
17 | JOB_EXECUTION_FAILED | the job failed when the job manager attempted to run it |
18 | INVALID_PARADYN | an invalid paradyn was specified |
19 | INVALID_JOBTYPE | the provided RSL 'jobtype' value is invalid |
20 | INVALID_GRAM_MYJOB | the provided RSL 'myjob' value is invalid |
21 | BAD_SCRIPT_ARG_FILE | the job manager failed to locate an internal script argument file |
22 | ARG_FILE_CREATION_FAILED | the job manager failed to create an internal script argument file |
23 | INVALID_JOBSTATE | the job manager detected an invalid job state |
24 | INVALID_SCRIPT_REPLY | the job manager detected an invalid script response |
25 | INVALID_SCRIPT_STATUS | the job manager detected an invalid script status |
26 | JOBTYPE_NOT_SUPPORTED | the provided RSL 'jobtype' value is not supported by this job manager |
27 | UNIMPLEMENTED | unused ERROR_UNIMPLEMENTED |
28 | TEMP_SCRIPT_FILE_FAILED | the job manager failed to create an internal script submission file |
29 | USER_PROXY_NOT_FOUND | the job manager cannot find the user proxy |
30 | OPENING_USER_PROXY | the job manager failed to open the user proxy |
31 | JOB_CANCEL_FAILED | the job manager failed to cancel the job as requested |
32 | MALLOC_FAILED | system memory allocation failed |
33 | DUCT_INIT_FAILED | the interprocess job communication initialization failed |
34 | DUCT_LSP_FAILED | the interprocess job communication setup failed |
35 | INVALID_HOST_COUNT | the provided RSL 'host count' value is invalid |
36 | UNSUPPORTED_PARAMETER | one of the provided RSL parameters is unsupported |
37 | INVALID_QUEUE | the provided RSL 'queue' parameter is invalid |
38 | INVALID_PROJECT | the provided RSL 'project' parameter is invalid |
39 | RSL_EVALUATION_FAILED | the provided RSL string includes variables that could not be identified |
40 | BAD_RSL_ENVIRONMENT | the provided RSL 'environment' parameter is invalid |
41 | DRYRUN | the provided RSL 'dryrun' parameter is invalid |
42 | ZERO_LENGTH_RSL | the provided RSL is invalid (an empty string) |
43 | STAGING_EXECUTABLE | the job manager failed to stage the executable |
44 | STAGING_STDIN | the job manager failed to stage the stdin file |
45 | INVALID_JOB_MANAGER_TYPE | the requested job manager type is invalid |
46 | BAD_ARGUMENTS | the provided RSL 'arguments' parameter is invalid |
47 | GATEKEEPER_MISCONFIGURED | the gatekeeper failed to run the job manager |
48 | BAD_RSL | the provided RSL could not be properly parsed |
49 | VERSION_MISMATCH | there is a version mismatch between GRAM components |
50 | RSL_ARGUMENTS | the provided RSL 'arguments' parameter is invalid |
51 | RSL_COUNT | the provided RSL 'count' parameter is invalid |
52 | RSL_DIRECTORY | the provided RSL 'directory' parameter is invalid |
53 | RSL_DRYRUN | the provided RSL 'dryrun' parameter is invalid |
54 | RSL_ENVIRONMENT | the provided RSL 'environment' parameter is invalid |
55 | RSL_EXECUTABLE | the provided RSL 'executable' parameter is invalid |
56 | RSL_HOST_COUNT | the provided RSL 'host_count' parameter is invalid |
57 | RSL_JOBTYPE | the provided RSL 'jobtype' parameter is invalid |
58 | RSL_MAXTIME | the provided RSL 'maxtime' parameter is invalid |
59 | RSL_MYJOB | the provided RSL 'myjob' parameter is invalid |
60 | RSL_PARADYN | the provided RSL 'paradyn' parameter is invalid |
61 | RSL_PROJECT | the provided RSL 'project' parameter is invalid |
62 | RSL_QUEUE | the provided RSL 'queue' parameter is invalid |
63 | RSL_STDERR | the provided RSL 'stderr' parameter is invalid |
64 | RSL_STDIN | the provided RSL 'stdin' parameter is invalid |
65 | RSL_STDOUT | the provided RSL 'stdout' parameter is invalid |
66 | OPENING_JOBMANAGER_SCRIPT | the job manager failed to locate an internal script |
67 | CREATING_PIPE | the job manager failed on the system call pipe() |
68 | FCNTL_FAILED | the job manager failed on the system call fcntl() |
69 | STDOUT_FILENAME_FAILED | the job manager failed to create the temporary stdout filename |
70 | STDERR_FILENAME_FAILED | the job manager failed to create the temporary stderr filename |
71 | FORKING_EXECUTABLE | the job manager failed on the system call fork() |
72 | EXECUTABLE_PERMISSIONS | the executable file permissions do not allow execution |
73 | OPENING_STDOUT | the job manager failed to open stdout |
74 | OPENING_STDERR | the job manager failed to open stderr |
75 | OPENING_CACHE_USER_PROXY | the cache file could not be opened in order to relocate the user proxy |
76 | OPENING_CACHE | cannot access cache files in ~/.globus/.gass_cache, check permissions, quota, and disk space |
77 | INSERTING_CLIENT_CONTACT | the job manager failed to insert the contact in the client contact list |
78 | CLIENT_CONTACT_NOT_FOUND | the contact was not found in the job manager's client contact list |
79 | CONTACTING_JOB_MANAGER | connecting to the job manager failed. Possible reasons: job terminated, invalid job contact, network problems, ... |
80 | INVALID_JOB_CONTACT | the syntax of the job contact is invalid |
81 | UNDEFINED_EXE | the executable parameter in the RSL is undefined |
82 | CONDOR_ARCH | the job manager service is misconfigured. HTCondor arch undefined |
83 | CONDOR_OS | the job manager service is misconfigured. HTCondor os undefined |
84 | RSL_MIN_MEMORY | the provided RSL 'min_memory' parameter is invalid |
85 | RSL_MAX_MEMORY | the provided RSL 'max_memory' parameter is invalid |
86 | INVALID_MIN_MEMORY | the RSL 'min_memory' value is not zero or greater |
87 | INVALID_MAX_MEMORY | the RSL 'max_memory' value is not zero or greater |
88 | HTTP_FRAME_FAILED | the creation of a HTTP message failed |
89 | HTTP_UNFRAME_FAILED | parsing incoming HTTP message failed |
90 | HTTP_PACK_FAILED | the packing of information into a HTTP message failed |
91 | HTTP_UNPACK_FAILED | an incoming HTTP message did not contain the expected information |
92 | INVALID_JOB_QUERY | the job manager does not support the service that the client requested |
93 | SERVICE_NOT_FOUND | the gatekeeper failed to find the requested service |
94 | JOB_QUERY_DENIAL | the jobmanager does not accept any new requests (shutting down) |
95 | CALLBACK_NOT_FOUND | the client failed to close the listener associated with the callback URL |
96 | BAD_GATEKEEPER_CONTACT | the gatekeeper contact cannot be parsed |
97 | POE_NOT_FOUND | the job manager could not find the 'poe' command |
98 | MPIRUN_NOT_FOUND | the job manager could not find the 'mpirun' command |
99 | RSL_START_TIME | the provided RSL 'start_time' parameter is invalid |
100 | RSL_RESERVATION_HANDLE | the provided RSL 'reservation_handle' parameter is invalid |
101 | RSL_MAX_WALL_TIME | the provided RSL 'max_wall_time' parameter is invalid |
102 | INVALID_MAX_WALL_TIME | the RSL 'max_wall_time' value is not zero or greater |
103 | RSL_MAX_CPU_TIME | the provided RSL 'max_cpu_time' parameter is invalid |
104 | INVALID_MAX_CPU_TIME | the RSL 'max_cpu_time' value is not zero or greater |
105 | JM_SCRIPT_NOT_FOUND | the job manager is misconfigured, a scheduler script is missing |
106 | JM_SCRIPT_PERMISSIONS | the job manager is misconfigured, a scheduler script has invalid permissions |
107 | SIGNALING_JOB | the job manager failed to signal the job |
108 | UNKNOWN_SIGNAL_TYPE | the job manager did not recognize/support the signal type |
109 | GETTING_JOBID | the job manager failed to get the job id from the local scheduler |
110 | WAITING_FOR_COMMIT | the job manager is waiting for a commit signal |
111 | COMMIT_TIMED_OUT | the job manager timed out while waiting for a commit signal |
112 | RSL_SAVE_STATE | the provided RSL 'save_state' parameter is invalid |
113 | RSL_RESTART | the provided RSL 'restart' parameter is invalid |
114 | RSL_TWO_PHASE_COMMIT | the provided RSL 'two_phase' parameter is invalid |
115 | INVALID_TWO_PHASE_COMMIT | the RSL 'two_phase' value is not zero or greater |
116 | RSL_STDOUT_POSITION | the provided RSL 'stdout_position' parameter is invalid |
117 | INVALID_STDOUT_POSITION | the RSL 'stdout_position' value is not zero or greater |
118 | RSL_STDERR_POSITION | the provided RSL 'stderr_position' parameter is invalid |
119 | INVALID_STDERR_POSITION | the RSL 'stderr_position' value is not zero or greater |
120 | RESTART_FAILED | the job manager restart attempt failed |
121 | NO_STATE_FILE | the job state file doesn't exist |
122 | READING_STATE_FILE | could not read the job state file |
123 | WRITING_STATE_FILE | could not write the job state file |
124 | OLD_JM_ALIVE | old job manager is still alive |
125 | TTL_EXPIRED | job manager state file TTL expired |
126 | SUBMIT_UNKNOWN | it is unknown if the job was submitted |
127 | RSL_REMOTE_IO_URL | the provided RSL 'remote_io_url' parameter is invalid |
128 | WRITING_REMOTE_IO_URL | could not write the remote io url file |
129 | STDIO_SIZE | the standard output/error size is different |
130 | JM_STOPPED | the job manager was sent a stop signal (job is still running) |
131 | USER_PROXY_EXPIRED | the user proxy expired (job is still running) |
132 | JOB_UNSUBMITTED | the job was not submitted by original jobmanager |
133 | INVALID_COMMIT | the job manager is not waiting for that commit signal |
134 | RSL_SCHEDULER_SPECIFIC | the provided RSL scheduler specific parameter is invalid |
135 | STAGE_IN_FAILED | the job manager could not stage in a file |
136 | INVALID_SCRATCH | the scratch directory could not be created |
137 | RSL_CACHE | the provided 'gass_cache' parameter is invalid |
138 | INVALID_SUBMIT_ATTRIBUTE | the RSL contains attributes which are not valid for job submission |
139 | INVALID_STDIO_UPDATE_ATTRIBUTE | the RSL contains attributes which are not valid for stdio update |
140 | INVALID_RESTART_ATTRIBUTE | the RSL contains attributes which are not valid for job restart |
141 | RSL_FILE_STAGE_IN | the provided RSL 'file_stage_in' parameter is invalid |
142 | RSL_FILE_STAGE_IN_SHARED | the provided RSL 'file_stage_in_shared' parameter is invalid |
143 | RSL_FILE_STAGE_OUT | the provided RSL 'file_stage_out' parameter is invalid |
144 | RSL_GASS_CACHE | the provided RSL 'gass_cache' parameter is invalid |
145 | RSL_FILE_CLEANUP | the provided RSL 'file_cleanup' parameter is invalid |
146 | RSL_SCRATCH | the provided RSL 'scratch_dir' parameter is invalid |
147 | INVALID_SCHEDULER_SPECIFIC | the provided scheduler-specific RSL parameter is invalid |
148 | UNDEFINED_ATTRIBUTE | a required RSL attribute was not defined in the RSL spec |
149 | INVALID_CACHE | the gass_cache attribute points to an invalid cache directory |
150 | INVALID_SAVE_STATE | the provided RSL 'save_state' parameter has an invalid value |
151 | OPENING_VALIDATION_FILE | the job manager could not open the RSL attribute validation file |
152 | READING_VALIDATION_FILE | the job manager could not read the RSL attribute validation file |
153 | RSL_PROXY_TIMEOUT | the provided RSL 'proxy_timeout' is invalid |
154 | INVALID_PROXY_TIMEOUT | the RSL 'proxy_timeout' value is not greater than zero |
155 | STAGE_OUT_FAILED | the job manager could not stage out a file |
156 | JOB_CONTACT_NOT_FOUND | the job contact string does not match any which the job manager is handling |
157 | DELEGATION_FAILED | proxy delegation failed |
158 | LOCKING_STATE_LOCK_FILE | the job manager could not lock the state lock file |
159 | INVALID_ATTR | an invalid globus_io_clientattr_t was used. |
160 | NULL_PARAMETER | an null parameter was passed to the gram library |
161 | STILL_STREAMING | the job manager is still streaming output |
162 | AUTHORIZATION_DENIED | the authorization system denied the request |
163 | AUTHORIZATION_SYSTEM_FAILURE | the authorization system reported a failure |
164 | AUTHORIZATION_DENIED_JOB_ID | the authorization system denied the request - invalid job id |
165 | AUTHORIZATION_DENIED_EXECUTABLE | the authorization system denied the request - not authorized to run the specified executable |
166 | RSL_USER_NAME | the provided RSL 'user_name' parameter is invalid. |
167 | INVALID_USER_NAME | the job is not running in the account named by the 'user_name' parameter. |
168 | LAST |
GRAM Job States
Value | GLOBUS_GRAM_PROTOCOL_ JOB_STATE_... | Description |
1 | PENDING | The job is waiting for resources to become available to run. |
2 | ACTIVE | The job has received resources and the application is executing. |
4 | FAILED | The job terminated before completion because an error, user-triggered cancel, or system-triggered cancel. |
8 | DONE | The job completed successfully |
16 | SUSPENDED | The job has been suspended. Resources which were allocated for this job may have been released due to some scheduler-specific reason. |
32 | UNSUBMITTED | The job has not been submitted to the scheduler yet, pending the reception of the GLOBUS_GRAM_PROTOCOL_JOB_SIGNAL_COMMIT_REQUEST signal from a client. |
64 | STAGE_IN | The job manager is staging in files to run the job. |
128 | STAGE_OUT | The job manager is staging out files generated by the job. |
0xFFFFF | ALL | A mask of all job states. |
GRAM Signals
globus_gram_protocol_job_signal_t GRAM Signals
Value | GLOBUS_GRAM_PROTOCOL_ JOB_SIGNAL_... | Description |
1 | CANCEL | Cancel a job |
2 | SUSPEND | Suspend a job |
3 | RESUME | Resume a previously suspended job |
4 | PRIORITY | Change the priority of a job |
5 | COMMIT_REQUEST | Signal the job manager to commence with a job submission if the job request was accompanied by the (two_state=yes) RSL attribute. |
6 | COMMIT_EXTEND | Signal the job manager to wait an additional number of seconds (specified by an integer value string as the signal's argument) before timing out a two-phase job commit. |
7 | STDIO_UPDATE | Signal the job manager to change the way it is currently handling standard output and/or standard error. The argument for this signal is an RSL containing new stdout, stderr, stdout_position, stderr_position, or remote_io_url relations. |
8 | STDIO_SIZE | Signal the job manager to verify that streamed I/O has been completely received. The argument to this signal contains the number of bytes of stdout and stderr received, seperated by a space. The reply to this signal will be a SUCCESS message if these matched the amount sent by the job manager. Otherwise, an error reply indicating GLOBUS_GRAM_PROTOCOL_ERROR_STDIO_SIZE is returned. If standard output and standard error are merged, only one number should be sent as an argument to this signal. An argument of -1 for either stream size indicates that the client is not interested in the size of that stream. |
9 | STOP_MANAGER | Signal the job manager to stop managing the current job and terminate. The job continues to run as normal. The job manager will send a state change callback with the job status being FAILED and the error GLOBUS_GRAM_PROTOCOL_ERROR_JM_STOPPED. |
10 | COMMIT_END | Signal the job manager to clean up after the completion of the job if the job RSL contained the (two-phase = yes) relation. |
Well Known Ports used by HTCondor and/or Globus
HTCondor negotiator | 9614 | obsolete, dynamic in 6.7.x |
HTCondor collector | 9618 | officially registered with IANA! |
globus GT2 gatekeeper | 2119 | |
globus gridftp | 2811 | also may be used by HTCondor-G |
HTCondor checkpoint server store | 5651 | |
HTCondor checkpoint server restore | 5652 | |
HTCondor checkpoint server service | 5653 | |
HTCondor checkpoint server replicate | 5654 | |
globus GT4 web services | 8443 | |
GCB local | 65430 | mostly obsolete, superseded by CCB |
GCB public | 65432 | mostly obsolete, superseded by CCB |
Misc Exit codes
0xc0000139 -1073741511 | Possibly missing libraries or missing functions in libraries on Windows. Try running from the command line to see if you get any errors. |