Difference between revisions of "Abel use old"

From mn/bio/cees-bioinf
Latest revision as of 16:07, 22 November 2017

This wiki has moved! This page is kept here merely for archival reasons.

Visit the new wiki here: https://github.com/uio-cees/hpc/wiki

Introduction

We have been given a large allocation on Abel for computational work. This page explains how to get access and start using the resources. All use of Abel needs to draw CPU hours from an allocation.

Mailing list

If you are not already subscribed, sign up for the appropriate mailing lists. We use them to distribute information on the use of the CEES HPC resources, both our own nodes and the CPU allocation on Abel. See the main wiki page, then come back here.


Getting access to CPU hours

See the "Getting access" section of the Main Page.

Using Abel

Interactive login

See also here.

ssh abel.uio.no

Getting a single CPU for 11 hours:

qlogin --account nn9244k --nodes 1 --ntasks-per-node 1

The same, for 24 hours:

qlogin --account nn9244k --nodes 1 --ntasks-per-node 1 --time 24:00:00


NOTE: you are sharing the node with others, so do not use more CPUs than you asked for.

NOTE: a pipeline of Unix commands may use one CPU per command:

grep something somefile | sort | uniq -c

This may use three CPUs!
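One way to stay within the number of CPUs you asked for is to cap how many processes run at once. The following is only a sketch: the helper name `run_limited` is made up for illustration, it relies on the `-P` option of `xargs` (available in GNU and BSD `xargs`), and it assumes that `SLURM_NTASKS_PER_NODE` is set inside your session, falling back to 1 if it is not.

```shell
# Hypothetical helper: run one command per input line,
# never more than max_procs of them at the same time.
run_limited() {
    local max_procs="$1"
    shift
    xargs -n 1 -P "$max_procs" "$@"
}

# Example use (commented out): gzip all .txt files, at most as many at once
# as the tasks requested. SLURM_NTASKS_PER_NODE is an assumption here;
# the fallback of 1 keeps the command safe if it is unset.
# printf '%s\n' *.txt | run_limited "${SLURM_NTASKS_PER_NODE:-1}" gzip
```

Note that each command started this way should itself be single-threaded; a pipeline inside each task would again use more than one CPU.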


Getting a whole node with 16 CPUs and 64 GB RAM:

qlogin --account nn9244k --nodes 1 --ntasks-per-node 16 --time 24:00:00

Even though each node has 16 CPUs, hyperthreading lets you run up to 32 processes simultaneously.


You have a large work area available as well:

echo $SCRATCH
cd $SCRATCH

Using

squeue -u your username

will tell you the job ID; the work area is then

/work/jobID.d

NOTE: all data in this area is deleted once you log out.


Quitting:

logout (or ctrl-d)

Temporary, fast access disk space on Abel

From the Abel newsletter #3:

Update on Abel scratch file-system usage
While a job runs, it has access to a temporary scratch directory on the shared file system /work. The directory is individual for each job, is automatically created, and is deleted when the job finishes (or gets requeued). There is no backup of this directory. The name of the directory is stored in the environment variable $SCRATCH, which is set within the job script. If your job is I/O intensive, we strongly recommend copying its work files to $SCRATCH and running the program there.
Sometimes, one needs to use a file for several jobs, or have it available some time after the job finishes. To accommodate this need, we have now created a directory /work/users/$USER for each user, where $USER is the user's user name. The purpose of the directory is to stage files that are needed by more than one job. Files in this directory are automatically deleted after a certain time (currently 45 days). There is no backup of files in /work/users/.
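The recommendation above (copy the work files to $SCRATCH, run there, and save what you need before the job ends) could look roughly like the sketch below. The function name `run_in_scratch`, the file name `result.txt`, and the `sort | uniq -c` step standing in for a real program are all placeholders. Outside a job, where $SCRATCH is not set, the sketch falls back to a temporary directory so it can be tried on any machine.

```shell
# Hypothetical helper: stage an input file into the job's scratch directory,
# do the work there, and copy the result to a permanent location.
run_in_scratch() {
    local input="$1"    # file to process
    local outdir="$2"   # where the result should end up
    local workdir
    # $SCRATCH is set inside a job; the mktemp fallback is only for local testing.
    workdir="${SCRATCH:-$(mktemp -d)}"

    cp "$input" "$workdir/" || return 1
    (
        cd "$workdir" || exit 1
        # Placeholder for the real, I/O-intensive program.
        sort "$(basename "$input")" | uniq -c > result.txt
    ) || return 1

    # $SCRATCH is deleted when the job finishes, so copy results out first.
    mkdir -p "$outdir" && cp "$workdir/result.txt" "$outdir/"
}
```

Inside a job script this would be called as, for example, `run_in_scratch "$HOME/data/input.txt" "$HOME/results"`, where both paths are again only examples. Files needed by several jobs belong in /work/users/$USER instead.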

SLURM

SLURM scripts

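Information coming; until then, see the Abel user guide on job scripts: http://www.uio.no/english/services/it/research/hpc/abel/help/user-guide/job-scripts.html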

SLURM tips

Jobs in our queue

squeue -A nn9244k

Listing your jobs

squeue -u username

Information on your job

scontrol show job JOBID

Cancel your job

scancel JOBID

Cancel all your jobs

scancel -u USERNAME
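The commands above can be combined with ordinary shell tools. As a small sketch, the helper below (the name `count_states` is invented for this example) summarises how many jobs are in each state. It expects one state name per line on standard input, which `squeue` produces with the `-h` (no header) and `-o "%T"` (state only) options.

```shell
# Hypothetical helper: count how often each job state occurs.
# Reads one state per line from standard input.
count_states() {
    sort | uniq -c
}

# Example use on the cluster (commented out, needs squeue):
# squeue -h -u "$USER" -o "%T" | count_states
```

The output has one line per state, with the number of jobs first.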

Running SLURM script as a shell script when not submitted through SLURM

Add these lines at the beginning of your SLURM script, but after the "#SBATCH" instructions:

if [ -n "$SLURM_JOB_ID" ]; then
    # running in a slurm job
    source /cluster/bin/jobsetup
fi

Now run the script as

source script.slurm
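Put together, a complete script using this guard might look like the sketch below. The "#SBATCH" values (job name, time limit, memory) are illustrative only and should be adapted to your job, and the `run_mode` variable is added here just to show which branch was taken. The test uses `${SLURM_JOB_ID:-}` rather than `$SLURM_JOB_ID` so that it also works in shells that treat unset variables as errors.

```shell
#!/bin/bash
#SBATCH --job-name=example
#SBATCH --account=nn9244k
#SBATCH --time=01:00:00
#SBATCH --mem-per-cpu=2G
#SBATCH --ntasks=1

if [ -n "${SLURM_JOB_ID:-}" ]; then
    # running in a slurm job
    source /cluster/bin/jobsetup
    run_mode="slurm job $SLURM_JOB_ID"
else
    # sourced or run directly from the shell: skip the job setup
    run_mode="plain shell"
fi

echo "Running as: $run_mode"
# ... the actual commands of the job go here ...
```

When submitted with `sbatch`, the "#SBATCH" lines are read by SLURM and the job setup is sourced; when sourced from the command line, those lines are plain comments and the setup is skipped.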