Embarrasingly distributed parallel loop on clusters of computers¶

The goal of this library is to ease the creation, launch and management of embarassingly parallel jobs on supercomputer such as SLURM. Some basic primitives (pure python NO-SQL database) to work in distributed memory architecture are provided.

Aims: simple, pure python

If you want to parallelize your python jobs in shared memory architecture, I advise you to use joblib.

Getting the latest code¶

To get the latest code using git, simply type:

git clone git://github.com/clusterlib/clusterlib.git

If you don’t have git installed, you can download a zip or tarball of the latest code: https://github.com/clusterlib/clusterlib/archive/master.zip

Installing¶

As any Python packages, to install clusterlib, simply do:

python setup.py install

in the source code directory.

How to contribute?¶

To contribute to clusterlib, first create a github account. Then you can fork the clusterlib repository. Once this is done, you can make clone of your fork, make your changes and whenever you are happy, send us a pull request to the main repository.

Running the test suite¶

To run the test suite, you need nosetests and the coverage modules. Run the test suite using:

nosetests

from the root of the project.

Documentation¶

For making the documentation, Sphinx==1.2.2 and sphinx-bootstrap-theme==0.4.0 are needed. Then, you can do

make doc

To update the documentation on http://clusterlib.github.io/clusterlib/, simply do

make gh-pages