Fats : New Users : Basic Information for new users
NOTE: This is a template of an email sent to new users when
their accounts on fats is setup.
Please, make the proper adjustments when you read the text, e.g.,
userid
should be read as your actual login name to fats.
Dear New_User_Name
Your account on fats has been setup.
Your login name is userid
.
Your temporary password was sent to you on a separate email.
Please, keep this message for future reference.
All users of fats should abide by Columbia University Computer and Network Use Policy.
I urge you to read it at:
http://www.columbia.edu/cu/policy/
Please, login to fats ASAP and change your password
(using the passwd
command).
Please, let me know when you change your password.
(But don't email the password to me!)
I need to propagate it to the cluster nodes and to fats' file server,
so that you can access them.
To login to fats please use ssh
:
ssh fats
or
ssh -l userid fats.ldeo.columbia.edu
Your home directory at /home/fats/userid has a disk quota of 4GB (soft limit), with a hard limit of 4.5GB and a grace period of 7 days. In addition, you have a directory at /data[1,2]/fats/userid, with a disk quota of 20GB (soft limit), and a hard limit of 25GB and a grace period of 7 days. Moreover, you have a directory at /scratch[1,2]/fats/userid which is not limited by quota. Please, use disk space frugally, and remove big old files often.
Besides the regular Linux gcc, we have the PGI compilers, currently version 5.2-4. Hence, for compilation of serial code please use the the commands: pgf77, pgf90, pgcc, and pgCC, and for compilation of parallel code with MPI, please use the commands: mpif77, mpif90, mpicc, and mpiCC, for Fortran77, Fortran90, C, and C++, respectively.
PLEASE NOTE:
When you login to fats, you are actually using the
cluster's control workstation (a.k.a. front end
).
The purpose of this workstation is to provide to users a platform
for code development. In addition, it is used as a file server
and for system administration. In order to allow these
three main tasks to be done properly, we ask users please to
send executables to the cluster nodes, and not to run them on
the control workstation.
The cluster nodes, on the other hand, are reserved for
running your jobs only.
Jobs are sent to the nodes only through the PBS queue system.
The command used to submit these jobs is qsub
.
If your PBS script is called my_script.pbs
, you can submit it by using:
qsub my_script.pbs
More information about PBS can be found in the manual pages
of qsub
(man qsub
),
and on the references therein.
You can monitor jobs on the PBS queue system using the
utilities xpbs
and xpbsmon
.
To make it easy, the script xp
invokes both utilities.
Here is an example of a PBS script for a **serial** job,
where the executable is called my_serial_program
:
#!/bin/csh
#PBS -o my_serial_job.o
#PBS -e my_serial_job.e
#PBS -l nodes=1:athlon:ppn=1
#PBS -q production@fats.ldeo.columbia.edu
#PBS -N my_serial_job
cd $PBS_O_WORKDIR
./my_serial_program
And here is an example of a PBS script for a **parallel** job using MPI
on 4 nodes (8 processors), where the executable is called
my_mpi_program
:
#!/bin/csh
#PBS -o my_mpi_job.o
#PBS -e my_mpi_job.e
#PBS -l nodes=4:athlon:ppn=2
#PBS -q production@fats.ldeo.columbia.edu
#PBS -N my_mpi_job
cd $PBS_O_WORKDIR
mpirun -v -np 8 ./my_mpi_program
The examples above assume that the executables and the PBS
scripts are located on the
same directory.
The examples also assume that you launch the
qsub
command from this directory.
On the examples above, both jobs are sent to the production
queue.
Code that is already debugged should be sent to the production
queue.
You can also use the checkout
queue,
if you just want a short run to check things out.
Those jobs should use our nodes with dual Athlon (1.2GHz) processors,
as specified on the PBS scripts above.
For code testing and debugging we have the development
queue.
The development
queue uses our eight older/slower dual Pentium-III nodes.
If you wish to use the development
queue,
please make the following changes
on the scripts above:
athlon -> p3
production -> development
Please, note:
- Each node has 1GB of memory.
- The three file systems mentioned above, where you have directories, are exported/automounted and accessible from all nodes.
-
Details about the queues (maximum resources, maximum time, etc)
can be obtained from
the utilities
xpbs
andxpbsmon
, by highlighting the queue name and clicking on the buttondetails
.
If you are already familiar to PGI compilers, MPI compilation, PBS, etc, please forgive me for being so repetitive.
Don't hesitate to contact me if you need further assistance.
Welcome to fats!
Gus Correa
for fats' Sys. Admin.
Last updated on by Gus Correa.