Page History

Turn Off History

How to ban a machine from executing jobs

Known to work with Condor version: 7.0

Suppose jobs are mysteriously failing on a particular machine and you suspect some kind of hardware problem (e.g. memory corruption). You can turn Condor off until the problem is solved, but just so no mistakes are made, it may also a good idea to block the machine from joining the Condor pool, in case Condor gets restarted prematurely.

Here is what you should put in the condor configuration visible to the collector:

# 2008-04-27: badmachine has a hardware problem, so I am temporarily blacklisting it
HOSTDENY_WRITE = $(HOSTDENY_WRITE) badmachine.domain.name

Then issue

condor_reconfig -full
The existing machine ad will take some time to expire (~15 minutes), so restart the collector if you are impatient.