How To: Guides for users and Maryland T3 admins.

Help: Links and emails for further info.

Configuration: technical layout of the cluster, primarily for admins.

Log: Has been moved to a Google page, accessible only to admins.

Notes, To Do, & Sandbox

If you arrived at this page by Google search, odds are this page won't help you, it's mostly 'notes to self'. Try the admin guide. Also, this page is depreciated, as the To Do list was moved offsite to be available for sysadmins only.

Notes

The next shoot-node/cluster-kickstart will:

(last updated November 4, 2011)

To Do:

(last updated September 30, 2013 )

Sandbox

Holding condor jobs

To hold all the jobs running on the condor batch system, as root from the HN:

condor_status -schedd
For all nodes listed as the scheduler for running or idle jobs (e.g. compute-x-y):
ssh compute-x-y
condor_hold -name compute-x-y -all

To resume jobs:

condor_status -schedd
For all nodes listed as the scheduler for held jobs (e.g. compute-x-y):
ssh compute-x-y
condor_release -name compute-x-y -all

There must be an easier way to do this, but I don't know what it is! cluster-fork "condor_hold -all" will only hold jobs submitted by the root user.