Skip to content

IceCube dataset management system

License

Notifications You must be signed in to change notification settings

WIPACrepo/iceprod

Repository files navigation

IceProd

IceProd is a Python framework for distributed management of batch jobs. It runs as a layer on top of other batch middleware, such as HTCondor, and can pool together resources from different batch systems. The primary purpose is to coordinate and administer many large sets of jobs at once, keeping a history of the entire job lifecycle.

See also: Aartsen, Mark G., et al. "The IceProd framework: Distributed data processing for the IceCube neutrino observatory." Journal of parallel and distributed computing 75 (2015): 198-211.

Note:

For IceCube users with CVMFS access, IceProd is already installed. To load the environment execute:

/cvmfs/icecube.wisc.edu/iceprod/latest/env-shell.sh

or:

eval `/cvmfs/icecube.wisc.edu/iceprod/latest/setup.sh`

depending on whether you want to get a new shell or load the variables into the current shell.

Installation

Platforms:

IceProd should run on any Unix-like platform, although only Linux has been extensively tested and can be recommented for production deployment.

Prerequisites:

Listed here are any packages outside pip:

  • Python 3.7+
  • MongoDB 3.6+ (for the REST API)
  • nginx (for ssl offloading and better security)
  • globus (for data transfer)

Installation:

From the latest release:

Get the tarball link from https://github.com/WIPACrepo/iceprod/releases/latest

Then install like:

pip install https://github.com/WIPACrepo/iceprod/archive/v2.0.0.tar.gz

Installing from master:

If you must install the dev version from master, do:

pip install --upgrade git+git://github.com/WIPACrepo/iceprod.git#egg=iceprod