This shows you the differences between two versions of the page.
| wiki:user_guide [2022/05/06 14:41] phegde | wiki:user_guide [2022/05/28 18:18] (current) cnr-guest [Job preparation ans submission] | ||
|---|---|---|---|
| Line 101: | Line 101: | ||
| Complete documentation is available at '' | Complete documentation is available at '' | ||
| - | SLUR is an open source software system for cluster management; it is highly scalable and integrates fault-tolerance and job scheduling mechanisms. | + | SLURM is an open source software system for cluster management; it is highly scalable and integrates fault-tolerance and job scheduling mechanisms. |
| ==== SLURM basic concepts ==== | ==== SLURM basic concepts ==== | ||
| Line 327: | Line 327: | ||
| #SBATCH --nodes=[nnodes] | #SBATCH --nodes=[nnodes] | ||
| #SBATCH --ntasks-per-node=[ntasks per node] #number of cores per node | #SBATCH --ntasks-per-node=[ntasks per node] #number of cores per node | ||
| - | #SBATCH --gres=gpu: | + | #SBATCH --gres=gpu: |
| === Example of parallel jobs submission === | === Example of parallel jobs submission === | ||
| Line 339: | Line 339: | ||
| #SBATCH --ntasks-per-node=[ntasks per node] #number of cores per node | #SBATCH --ntasks-per-node=[ntasks per node] #number of cores per node | ||
| #SBATCH --gres=gpu: | #SBATCH --gres=gpu: | ||
| - | NPROC= | + | NPROC=[nprocesses] |
| tmpstring=tmp | tmpstring=tmp | ||
| Line 358: | Line 358: | ||
| * Parallelization can be implemented within the Python code itself. For example, the evaluation of a function for different variable values can be done in parallel. Python offers many packages to parallelize a given process. The most basic among them is [[https:// | * Parallelization can be implemented within the Python code itself. For example, the evaluation of a function for different variable values can be done in parallel. Python offers many packages to parallelize a given process. The most basic among them is [[https:// | ||
| - | * The keras module | + | * The keras and Pytorch modules |
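
The remark above about evaluating a function for different variable values in parallel can be illustrated with a minimal sketch. It assumes the standard-library ''multiprocessing'' module, which may or may not be the package the guide links to. The names ''evaluate'' and ''parallel_evaluate'' and the default of 2 worker processes are hypothetical and chosen only for illustration; in a real job the worker count would normally match the cores requested from SLURM.

```python
from multiprocessing import Pool


def evaluate(x):
    """Hypothetical function to evaluate; stands in for any CPU-bound computation."""
    return x * x


def parallel_evaluate(values, nproc=2):
    """Apply `evaluate` to every value, using up to `nproc` worker processes.

    `nproc` is an assumption of this sketch: set it to the number of cores
    requested for the job rather than relying on the default.
    """
    values = list(values)
    if nproc <= 1:
        # Serial fallback: no worker processes are started
        return [evaluate(v) for v in values]
    with Pool(processes=nproc) as pool:
        # Pool.map splits the inputs among the workers and
        # returns the results in the same order as the inputs
        return pool.map(evaluate, values)


if __name__ == "__main__":
    # The guard is required so that worker processes can import this file safely
    print(parallel_evaluate(range(8), nproc=2))
```

Since the results come back in input order, the parallel call can replace a plain loop over the same values without further changes to the surrounding code.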