{section: Installing and Maintaining HTCondor-CE} *NOTE:* If you are installing an HTCondor-CE for the Open Science Grid, consult the {link: https://opensciencegrid.org/docs/compute-element/install-htcondor-ce/ OSG-specific installation guide} The HTCondor-CE software is a job gateway for a grid Compute Element (CE). As such, HTCondor-CE is the entry point for jobs coming from the grid - it handles authorization and delegation of jobs to your local batch system. See the {link: https://opensciencegrid.org/docs/compute-element/htcondor-ce-overview/ OSG HTCondor-CE Overview} for a much more detailed introduction. Use this page to learn how to install, configure, run, test, and troubleshoot HTCondor-CE from the {link: http://research.cs.wisc.edu/htcondor/instructions/ HTCondor Yum repositories}. {section: Before Starting} Before starting the installation process, consider the following points (consulting the Reference section below as needed): *: *User IDs:* If they do not exist already, the installation will create the Linux user =condor= (UID 4716) *: *SSL certificate:* The HTCondor-CE service uses a host certificate at =/etc/grid-security/hostcert.pem= and an accompanying key at =/etc/grid-security/hostkey.pem= *: *DNS entries:* Forward and reverse DNS must resolve for the HTCondor-CE host *: *Network ports:* The pilot factories must be able to contact your HTCondor-CE service on port 9619 (TCP) *: *Submit host:* HTCondor-CE should be installed on a host that already has the ability to submit jobs into your local cluster running supported batch system software (HTCondor, LSF, PBS/Torque, SGE, Slurm) *: *File Systems:* Non-HTCondor batch systems require a shared file system between the HTCondor-CE host and the batch system worker nodes. There are some one-time (per host) steps to prepare in advance: *: Ensure the host has a supported operating system (Red Hat Enterprise Linux variant version 6 or 7) *: Obtain root access to the host *: Prepare the EPEL and {link: http://research.cs.wisc.edu/htcondor/instructions/ HTCondor} Yum repositories *: Install CA certificates and VO data into =/etc/grid-security= {section: Installing HTCondor-CE} 1: Install the appropriate HTCondor-CE package for your batch system: | *If your batch system is...* | *Then run the following command...* | | HTCondor | =yum install htcondor-ce-condor= | | LSF | =yum install htcondor-ce-lsf= | | PBS/Torque | =yum install htcondor-ce-pbs= | | SGE | =yum install htcondor-ce-sge= | | Slurm | =yum install htcondor-ce-slurm= | 2: Install the certificate revocation list updater available from EPEL: {code} # yum install fetch-crl {endcode} {section: Configuring HTCondor-CE} {subsection: Configuring Authorization} To authorize the local HTCondor-CE daemons and job submission from external users and VOs, edit the contents of =/etc/condor-ce/condor_mapfile= by following the comments. *NOTE:* =condor_mapfile.rpmnew= files may be generated upon HTCondor-CE version updates that should be merged into =condor_mapfile=. {subsection: Configuring Non-HTCondor Batch Systems} HTCondor-CE requires a shared filesystem between the CE and the worker nodes to support file transfer for non-HTCondor batch systems. See the {link: https://opensciencegrid.org/docs/compute-element/install-htcondor-ce/#batch-systems-other-than-htcondor documentation here} for details. HTCondor-CE uses the Batch Language ASCII Helper Protocol (BLAHP) to submit and track jobs to non-HTCondor batch systems. To work with the HTCondor-CE, modify =/usr/libexec/condor/glite/etc/batch_gahp.config= using the following steps: 1: Disable BLAHP handling of certificate proxies: {code} blah_disable_wn_proxy_renewal=yes blah_delegate_renewed_proxies=no blah_disable_limited_proxy=yes {endcode} 2: *(Optional)* If your batch system tools are installed in a non-standard location (i.e., outside of =/usr/bin/=), set the corresponding =*_binpath= variable to the directory containing your batch system tools: | *If your batch system is...* | *Then change the following configuration variable...* | | LSF | =lsf_binpath= | | PBS/Torque | =pbs_binpath= | | SGE | =sge_binpath= | | Slurm | =slurm_binpath= | {subsection: Optional Configuration} If your site needs to report to BDII, consult the {link: https://github.com/opensciencegrid/htcondor-ce/tree/master/contrib/bdii#htcondor-ce-bdii-provider HTCondor-CE BDII Provider} documentation. See {link: https://opensciencegrid.org/docs/compute-element/install-htcondor-ce/#optional-configuration this section of the OSG installation guide} for additional optional HTCondor-CE configurations. Note that the "Accounting with multiple CEs or local user jobs" and "HTCondor-CE monitoring web interface" sections are both OSG specific and can be safely ignored. {section: Using HTCondor-CE} See {link: https://opensciencegrid.org/docs/compute-element/install-htcondor-ce/#using-htcondor-ce this section of the OSG installation guide} for how to use HTCondor-CE, including starting and stopping of services. Note that Gratia is accounting software specific to the OSG and can be safely ignored. {section: Validating HTCondor-CE} See {link: https://opensciencegrid.org/docs/compute-element/install-htcondor-ce/#validating-htcondor-ce this section of the OSG installation guide} for steps to validate your HTCondor-CE. {section: Troubleshooting HTCondor-CE} See {link: https://opensciencegrid.org/docs/compute-element/troubleshoot-htcondor-ce this OSG guide} for troubleshooting tips and strategies. {section: Getting Help} For assistance with HTCondor-CE, please see the {link: http://research.cs.wisc.edu/htcondor/htcondor-support/ HTCondor support page}. {section: References} *: {link: https://opensciencegrid.org/docs/compute-element/htcondor-ce-overview/ HTCondor-CE Overview} *: {link: https://opensciencegrid.org/docs/compute-element/install-htcondor-ce/ OSG HTCondor-CE Installation Guide} *: {link: https://opensciencegrid.org/docs/compute-element/job-router-recipes/ Job Router Configuration Guide} *: {link: https://github.com/opensciencegrid/htcondor-ce/tree/master/contrib/bdii#htcondor-ce-bdii-provider HTCondor-CE BDII Provider} *: {link: https://opensciencegrid.org/docs/compute-element/troubleshoot-htcondor-ce/ HTCondor-CE Troubleshooting Guide}