Things to mention: -*: condor_hold actually kills the DAGMan process; condor_release starts a new process (but the same Condor ID) +*: condor_hold actually kills the DAGMan process; condor_release starts a new process (but the same HTCondor ID) *: lock file (including Joe's Unique PID thing) *: re-reading node job userlogs *: FD problems on wide DAGs (throttles don't help us in recovery mode)