This shows you the differences between two versions of the page.
wiki:uso_sist_en [2022/09/01 10:37] phegde |
wiki:uso_sist_en [2022/10/07 19:14] (current) cnr-guest [Available file systems] |
||
---|---|---|---|
Line 73: | Line 73: | ||
In-depth documentation on Lustre is available online, at the link: '' | In-depth documentation on Lustre is available online, at the link: '' | ||
+ | |||
+ | ''/ | ||
+ | new scratch area shared among UI and computation nodes (available from 07/ | ||
==== Job preparation and submission ==== | ==== Job preparation and submission ==== | ||
+ | |||
+ | === Premise: new job management rules active from 9/10/2022 === | ||
+ | |||
+ | To improve the use of resources, the job management rules have been changed. | ||
+ | |||
+ | * New usage policies based on // fairshare // mechanisms have been implemented \\ | ||
+ | * New queues for job submissions have been defined | ||
+ | - ** sequential ** queue: | ||
+ | * accepts only sequential jobs with at most 1 task, | ||
+ | * that do not use GP-GPUs, | ||
+ | * with no more than 128 jobs running on the queue at the same time | ||
+ | * and a maximum execution time limit of 1 week | ||
+ | - ** parallel ** queue: | ||
+ | * accepts only parallel jobs with task number greater than 1 and less than 1580, | ||
+ | * that use no more than 64 GP-GPUs | ||
+ | * and maximum execution time limit of 1 week | ||
+ | - ** gpus ** queue: | ||
+ | * only accepts jobs that use no more than 64 GP-GPUs, | ||
+ | * with task number less than 1580 | ||
+ | * and maximum execution time limit of 1 week | ||
+ | - ** hparallel ** queue: | ||
+ | * accepts only parallel jobs with task number greater than 1580 and less than 3160, | ||
+ | * that make use of at least 64 GP-GPUs | ||
+ | * and maximum execution time limit of 1 day | ||
+ | |||
+ | From 9 October 2022 the current queue will be disabled and only the queues defined here will be active; the queue must be selected explicitly at submission. For example, to submit a job to the ** parallel ** queue, run \\ | ||
+ | |||
+ | $ srun -p parallel <MORE OPTIONS> <COMMAND NAME> | ||
+ | |||
+ | If the job does not comply with the rules of the queue used, it will be terminated. | ||
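Since a job that breaks the rules of its queue is terminated, it can help to check the request before submitting. The following is a minimal sketch, not part of SLURM or of the site tooling: `pick_queue` is a hypothetical helper that maps a task count and a GP-GPU count to one of the queue names listed above. It takes the stated limits literally (for example, "less than 1580" excludes 1580), assumes that a job requesting GP-GPUs and fewer than 1580 tasks should go to ** gpus ** rather than ** parallel **, and does not check the execution time limit or the 128-job cap of the ** sequential ** queue.

```shell
#!/bin/bash
# pick_queue NTASKS NGPUS
# Hypothetical helper: prints the name of a queue whose rules (as listed
# above) the request satisfies, or "none" if no queue matches.
pick_queue() {
  local ntasks=$1 ngpus=$2

  if [ "$ntasks" -le 1 ] && [ "$ngpus" -eq 0 ]; then
    # sequential: at most 1 task, no GP-GPUs
    echo sequential
  elif [ "$ntasks" -gt 1580 ] && [ "$ntasks" -lt 3160 ] && [ "$ngpus" -ge 64 ]; then
    # hparallel: more than 1580 and fewer than 3160 tasks, at least 64 GP-GPUs
    echo hparallel
  elif [ "$ngpus" -gt 0 ] && [ "$ngpus" -le 64 ] && [ "$ntasks" -lt 1580 ]; then
    # gpus: GP-GPU jobs with fewer than 1580 tasks
    # (assumption: preferred over "parallel" whenever GP-GPUs are requested)
    echo gpus
  elif [ "$ntasks" -gt 1 ] && [ "$ntasks" -lt 1580 ] && [ "$ngpus" -le 64 ]; then
    # parallel: more than 1 and fewer than 1580 tasks
    echo parallel
  else
    echo none
    return 1
  fi
}
```

With this helper, a 4-task job without GP-GPUs could be submitted as `srun -p "$(pick_queue 4 0)" <MORE OPTIONS> <COMMAND NAME>`, which selects the ** parallel ** queue.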
+ | |||
+ | === Use of resources === | ||
The SLURM resource manager is installed on the system to manage the cluster resources. | The SLURM resource manager is installed on the system to manage the cluster resources. | ||
Line 366: | Line 401: | ||
* Set up the MATLAB environment by using the command '' | * Set up the MATLAB environment by using the command '' | ||
* MATLAB version R2022a can be accessed using the command '' | * MATLAB version R2022a can be accessed using the command '' | ||
+ | |||
== Configuration and execution == | == Configuration and execution == | ||