Useful Links and Info for Oxford ATLAS users logo

Hello TWikiGuest and welcome to the Oxford ATLAS "useful stuff" page.

The concept of this page is that many PhD students and ATLAS Staff have spend many days finding solutions to what seem like minor issues. These problems detract from the actual time spent doing physics and making the world a better place so we have decided to gather the solutions to common problems together in one place so you can search for an existing solution. This means that if your problem is not listed below and you find a solution PLEASE add it to this page so that when someone else comes across the same issue, your work can benefit them.

If you can't find the solution to a problem here, just ask one of the current students or staff as someone has probably seen the problem before and most of us are nice people.

For most problems the ATLAS Twiki Portal is a good starting point, and the ATLAS software tutorial Twiki has up-to-date 'getting started' information.

We have created this page without any expandable areas to enable you to search for any problem or solution your are concerned with. It may also be useful to check out the pdf version of this page (button at top-right of page) and search there.

Coding

Many new PhD students are well versed in particle physics theory (and there are courses in first year to help with this) but have not had much experience with programming before starting the PhD. If this is the case, don't worry. You are in the same boat as most students I know. I strongly recommend taking some time as early as possible to learn a bit about bash, python and C++ as these are the main ATLAS languages. Remember that whatever language you use, good code practise has been designed for a reason - from my own experience you should design every piece of code thoughtfully as many programs that start as "just a quick script" can quickly exploded into very complex code frameworks and you will wish you had followed the rules from the beginning.

Suggested Tutorials

Language Link
Linux http://www.ee.surrey.ac.uk/Teaching/Unix/
Bash http://tldp.org/HOWTO/Bash-Prog-Intro-HOWTO.html
http://wiki.bash-hackers.org
Python https://docs.python.org/2/tutorial/
C++ http://www.cplusplus.com/doc/tutorial/
ROOT http://root.cern.ch/root/html/tutorials/

Bash

Bash is a shell language which is what you are using when you open a terminal and type "ls" or "echo ...". It is great for navigation and file organisation but is fairly low level so most complex tasks are best avoided in bash. There are many tutorials available online and as always with computing the best idea is to dive in and try things - you have to really know what you are doing before you can break the Oxford system so don't be afraid to try things but be sensible and especially be careful with removing files ("rm" command).

(Regarding the 'rm' command: One suggestion is to put an alias in your .bashrc file. This file is run every single time you login, straight away. If you add this line at the bottom:

alias rm="rm -i"
Then from now on, when you type "rm", you will, instead be using "rm -i" and this means Linux will ask you if you really want to delete the given file. any entry other than 'y' or 'yes' will leave the file untouched. My own graduate career was set back by a week due to misapplication of "rm *.*" prior to my adding this alias. It would have been a lot longer than a week but thankfully, there were automatic weekly backups.... )

Python

Python is still a scripting language like bash and is not compiled so is great for performing complex file management or string manipulation. It is much higher level than bash or even C++ so commands are much more intuitive. There are many modules that can be loaded into python but the most important for us is probably PyRoot which allows use of the C++ ROOT classes in python. Be warned that whilst python is easy to use and learn, it has limits such as lack of type-saftey and loops are significantly slower than in C++.

That said, it is my recommended way to learn to use ROOT and just a few lines can make histograms. Set up the relevant environment:

source /cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase/user/atlasLocalSetup.sh
localSetupROOT

A nice intro to using TTrees can be found here. The example script below requires an input ROOT file with a TTree called "physics". If you can download your own, go for it but, if you have not learned about "dq2" yet, you can borrow my example ROOT file at "/home/henderson/scripts/OxfordTwiki/MyRootfile.root".

And make a pyroot script called MyScript.py:

#! /usr/bin/env python
import ROOT
ROOT.gROOT.SetBatch(1)
myRootFile = ROOT.TFile("/home/henderson/scripts/OxfordTwiki/MyRootFile.root", 'r')
myRootTree = myRootFile.Get("physics")
myHistogram = ROOT.TH1D("MyHisto", "Histogram of Jet pT above with lepton trigger", 100, 0, 500 )
for event in myRootTree:
   if not event.EF_e24vhi_medium1: continue
   for jet in xrange(len(event.jet_AntiKt4LCTopo_pt)):
      myHistogram.Fill( event.jet_AntiKt4LCTopo_pt[ jet ] * 0.001 )
      pass
   pass
c1 = ROOT.TCanvas()
c1.SetLogy()
myHistogram.Draw("hist")
c1.Print("MyHist.pdf")

Modify the permissions and make executable:

chmod +x MyScript.py
./MyScript.py

Or simply run in an interactive python shell by typing "python" and copying/pasting the above into the terminal (one line at a time is required for the loop). You can exit python by pressing 'control' and 'd' together.

C++

C++ is the real work-horse language of ATLAS with the vast majority of code written in it. It is fast, efficient and well documented, although not always as intuitive as python. If you have never worked with object-orientated code before then do some reading on the subject and try to design your code around classes as it dramatically increases readability for others.

ROOT

ROOT is the standard ATLAS/CERN tool for histogramming and storing data. It is a collection of C++ classes such as TTrees and TFiles that allow the user to manipulate the data and make plots. There are many posts on the ROOT forum where you can ask the developers questions or check where others may have had similar issues. Beware of memory leaks especially in PyROOT - I have found that the memory management systems of python and ROOT fight and sometimes objects become deleted when you don't expect it. Try the following line if this happens:

ROOT.SetOwnership( <OBJECT NAME>, False )

xAOD Analysis

Jon: ATLAS will be using a new data storage method in Run 2 called xAODs. This comes with its own EDM which is integrated into RootCore's EventLoop package. I've been having a look at how to set up an analysis to run over an xAOD and I'll put any things I've discovered during this here - any questions or comments please let me know. I don't propose to summarise the xAOD tutorial here as it is fairly clear but if there is demand for that I can.

Useful xAOD links

Here are the main links that you'll need for reference when setting up an xAOD analysis: xAOD Software Tutorial, EventLoop twiki page, Offline software tutorial indico page

If you come across issues with EDM code doing something strange or if you need to check what you can do with a particular xAOD object then you can find their code in svn.

Common source of memory leaks

Root is notoriously bad for memory leaks and EventLoop is no exception. This can produce some very annoying problems as the effects aren't necessarily noticeable on smaller test samples but can break your job on a larger data set. The worst of these that I've come across is connected to making shallow copies of xAOD containers. As described in the tutorials when you create a shallow copy of a container you retain ownership of this object so it must be deleted. The best way to do this is to record the created containers into a TStore object. Most of the time you will want to create, process and delete these objects in each execute() loop - in this case there is an appropriate object that EventLoop creates for you accessible by

xAOD::TStore* store = wk()->xaodStore();

You can then record a container to the store using

if(!store->record(container, "containerName")) return EL::StatusCode::FAILURE;

This store is cleared automatically for you after each iteration of the execute() loop. Not deleting a shallow copy you've made is quite a simple mistake to spot - the danger is that a lot of xAOD tools (e.g. SUSYTools, JetCalibTools, etc) will create shallow copies without telling you. It is still your responsibility to delete these. A good policy to follow seems to be that any pointer you declare should be recorded to either an output stream or this TStore (unless you have good reason not to...). A further comment is that unless you have a very good reason xAOD objects should be collected into the relevant containers (e.g. xAOD::Jet objects should be stored in an xAOD::JetContainer object). This helps ensure that they're correctly removed from memory.

NB: Using the TStore will probably require you to include

#include "xAODRootAccess/TStore.h";
somewhere in your code.

Missing ET

As the MET term has to be rebuilt it isn't as simple to retrieve as many of the other objects in the xAOD. Typically it will require the use of a specialised tool. There is a set of slides in the offline tutorial. Many analysis tools will contain a MET rebuilder tool (for example the SUSYTools package contains such a tool).

Setting up your work environment

Oxford has a great IT website of helpful advice and links (printing, email etc.) that can be found here.

There are two real options for computing where you can work:

1 - The Oxford system has interactive machines (called pplxint8 and pplxint9) which run scientific linux 6 (SLC6). There are two older machines (pplxint5/6) which run SLC5 but you should not need these. These machines have many processors and a large amount of RAM to test all your memory leaks on. You can log in from any linux terminal using

 ssh -X OXFORD_USERNAME@pplxint8.physics.ox.ac.uk 
The "-X" sets up X11 which means you can view graphics if required rather than simply the test on the terminal. There is also a batch system which is explained below.

2 - LXPLUS is the CERN central SLC6 computing service. All CERN users are able to access this service so it can be a great way to run code that someone else has written in exactly the same environment as them or exchange files with people. Again, you can log in from any terminal with

ssh -X CERN_USERNAME@lxplus.cern.ch
I find it to be quite slow so I would recommend the Oxford system over lxplus.

The centrally maintained "ATLAS Software Tutorial" is a fantastic place to learn about the tools needed for analysis Link and I strongly recommend attending the tutorial week as soon as possible as it becomes hard to change your code after you have started to develop a framework.

NEW June 2014 The standard ATLAS tool for Run 2 analysis is planned to be the RootCore package AnalysisBase. The plan is that this will apply calibrations to all the physics objects from the combined performance (CP) tools so that the user does not have to worry about vastly over complicated tools (looking at you JetETMiss). Check out the tutorial here.

If you are working on lxplus, you can increase your storage space by going to here and clicking "increase quota".

Registering with CERN and ATLAS

If you arrived from a different institution and/or a different research group, you'll need to register with either CERN or ATLAS. The latter is especially useful, since many of the reading materials and software packages simply won't be available unless you have registered. The link below provides useful information for various registration scenarios: http://atlassec.web.cern.ch/atlassec/Registration.htm. The general procedure for two scenarios follows two steps in 'I plan to come to CERN and work for ATLAS!':

For most 1st year PhD students (who aren't physically working at CERN yet), you only need to email (or fax) ATLAS secretariat with your ATLAS registration form filled and signed by either Tony Weidberg or Ian Shipsey. If you are registering for the first time, also attach a copy of your id/passport in the email. (Step 2)

If you are starting your work on site, talk to Tony Weidberg to start your PRT. Shortly after you'll receive an email with link to your registration form. Fill it in, and ask Sue Geddes for a signed copy of home institute declaration form. (Step 1)

Useful CVMFS addresses

It can be useful to install code locally but if you are in a rush or if you like to know everything was compiled by an expert, you can use the cvmfs installations of most programs. This allows you to use lots of code versions which are pre-compiled on your architecture so please choose wisely. The addresses below will probably be out of date but should give you an idea of where to look.

Code Address
HepMC /cvmfs/atlas.cern.ch/repo/sw/software/x86_64-slc6-gcc48-opt/19.1.1/sw/lcg/external/HepMC/2.06.08/x86_64-slc6-gcc48-opt/
LhaPdf /cvmfs/atlas.cern.ch/repo/sw/software/x86_64-slc6-gcc48-opt/19.1.1/sw/lcg/external/MCGenerators_lcgcmt67b/lhapdf6/6.1.3/x86_64-slc6-gcc48-opt/
LhaPdf Sets /cvmfs/sft.cern.ch/lcg/external/lhapdfsets/current/

If using LHAPDF, it may be useful to setup the PDF sets to the environment variable:

export LHAPDF_DATA_PATH=/cvmfs/sft.cern.ch/lcg/external/lhapdfsets/current/

ATLAS Tools

Oxford does have versions of most software installed locally but working with on the same code analysers from many institutions means it is beneficial to have everyone working from the same setup and using the same versions. There are some easy commands to setup the necessary tools.

export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase
source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh

This loads the commands that are needed to setup the tools you may wish to use from a central CERN-based script. Cvmfs (Cern Virtual Machine File System) Link is available at Oxford but links to servers at CERN just as the "/afs/cern.ch/" file system does. For example, if you wanted to setup ROOT, just type

localSetupROOT
after the 'source' line above. You can specify a specific version of ROOT or gcc (c compiler) if you wish (check out all the options by typing "localSetupROOT --help") but the command will try to automatically pick one that works for your current environment. ROOT will now be setup for your terminal.

There are many more tools you can load in the same manner, type "localSetup" and hit "tab" twice to see all the options.

ATHENA

Athena is the official ATLAS environment - although I have always found overly complicated and avoid it as much as possible. You need to use it to generate Monte-Carlo events in the official ATLAS manner or to use certain ATLAS tools. Check out this link for some top tips (more for developing athena than running). It can be setup using:

export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase
source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh
asetup 17.2.10.3,slc5 --64
The long number "17.2..." defines the Athena release you wish to setup. There is a list of all the available releases here and change-log documentation. For pplxint8 or 9 we have to add the ",slc5" as Athena does not like the SLC6 environment just yet and the "--64" forces the release to be in 64 bit mode.

You can now run commands like "Generate_try.py" or "Reco_trf.py" interactively. This is the procedure for testing generation of events before sending jobs to the grid.

Grid Certificate/ATLAS VO Membership

This can be a nightmare if you don't have a guide, please read through all the below before starting and if you have any problems just email/ask someone as we have all had to go through this right of passage at some point.

The grid allows massively parallel jobs to be run on many sites around the world and can be an extremely powerful tool if you learn to use it. To submit jobs or download datasets you must have a valid grid certificate.

Certificate

To apply/renew your certificate, use the UK e-Science CertWizard. This is a relatively simple java wizard to guide you through the application. You are requested to enter at least 2 different passwords throughout this process - personally (despite it being bad password practice) I recommend using the same password at all points here because it is often not clear which password is being requested.

Once your certificate has been approved (it usually takes a day or so), you can export it from the CertWizard (as a .p12 file) and import into you internet browser (Firefox: Edit -> Preferences -> Advanced -> Certificates -> Import) or move it to pplxint/lxplus. The pplxint procedure is out line below:

ssh USERNAME@pplxint8.physics.ox.ac.uk
mkdir ~/.globus
cd ~/.globus
< Copy the .p12 file into the globus directory >
openssl pkcs12 -in <YOUR CERT>.p12 -clcerts -nokeys -out $HOME/.globus/usercert.pem
openssl pkcs12 -in <YOUR CERT>.p12 -nocerts -out $HOME/.globus/userkey.pem
chmod 400 userkey.pem
chmod 444 usercert.pem

You should now be able to log into the grid setup and submit jobs / download datasets. Test with:

export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase
source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh
localSetupDQ2Client --skipConfirm
localSetupPandaClient --noAthenaCheck
voms-proxy-init -voms atlas -valid 96:00:00
export DQ2_LOCAL_SITE_ID=UKI-SOUTHGRID-OX-HEP_SCRATCHDISK

VO Membership

You will also need to be a member of the ATLAS Virtual Organisation (VO), click here for details. During the application you must selected a representative - nobody know why or who these people are - just try a few until one is accepted.

I have not found a centralised walkthrough for the UK for this step but use the below links to understand the steps in ATLAS VO registration. There are 3 registration phases

  • Phase 1 -> Becoming a candidate - simple, just enter name and email
  • Phase 2 -> Becoming an applicant - choose which Roles you wish to have, just choose:
    • /atlas
    • /atlas/uk
    • /astlas/lcg1
    • The other options are for software managers only
  • Phase 3 -> Becoming a member - Just wait for an email and tick a box

Helpful VO Website Link
ATLAS France walkthrough (don't sign up for /atlas/fr) https://atlas-france.in2p3.fr/cgi-bin/twiki/bin/view/Atlas/HowToAtlasVO
RHIC guide to ATLAS VO (My Favourite) https://www.racf.bnl.gov/docs/howto/grid/voatlas
Santa Maria (Chile) Guide http://atlas.fis.utfsm.cl/atlas/howto.grid.VO.html

The whole procedure can take a week to setup as you first have to apply for the certificate and then apply for the VO membership with the certificate before you can do anything.

Checking out software

Many software packages in ATLAS are available for download using either 'svn' or 'cmt'. Svn is a general purpose version control coda that allows users to store code remotely and download specific versions or tags. Cmt Link (Configuration Management Tool) is a similar utility but is more focused on package based codes. On the interactive pplxint Oxford systems your user name may not match your CERN user name and this can cause issues when trying to download using both these tools. This can be overcome using the below commands. Your user name on lxplus should always match your CERN user name so you should not need any changes if working on lxplus.

You can check your current user name with

echo $USER

If this matches your CERN user name then you should not have a problem, otherwise follow the steps below.

SVN user name change

Say you wish to check out (download) the awesome ATLAS VBF W framework, simply use:

svn co svn+ssh://CERN_USERNAME@svn.cern.ch/reps/atlasphys/Physics/StandardModel/ElectroWeak/Development/VBFW/Software/Framework/trunk

And replace CERN_USERNAME with your CERN user name. If your current user name matches your CERN user name then the "CERN_USERNAME@" can be omitted.

USEFUL To avoid having to repeat entering your password a great many times when using svn, check out the advice at this link. The main point being you can simply enter your password once per terminal session using:

 kinit CERN_USERNAME@CERN.CH 
where the capitalisation of "CERN.CH" matters.

CMT user name change

Thanks to Todd Huffman for battling through this procedure and finding out this information. In this case we must alter a log file but this only has to be done once. Open (or create) a text file called "~/.ssh/config" and make sure it looks like this:
host *.cern.ch
user <cern user name>  (in my case this would be thuffman)
ForwardX11 yes
GSSAPITrustDns yes
CheckHostIP no
GSSAPIDelegateCredentials yes

I think that it is the first two lines that are key, the rest is just to make Kerberos sharing and ssh work more transparently between Oxford and CERN.

Kinit <cernusername>@CERN.CH
aklog
Now go to your working directory on pplxint and setup Athena as above. We must now change the environment variable "SVNUSR", on a bash shell (echo $SHELL to test) type
export SVNUSR=‘svn+ssh://svn.cern.ch/reps/atlas-<cernusername>’
Or on a csh shell:
setenv SVNUSR ‘svn+ssh://svn.cern.ch/reps/atlas-<cernusername>’
Now you can download your ATHENA packages to pplxint, this next will get the ATHENA helloworld basic user analysis package

cmt co –r UserAnalysis-00-15-12 PhysicsAnalysis/AnalysisCommon/UserAnalysis
At this point you can drill down into the "PhysicsAnalysis/AnalysisCommon/UserAnalysis/cmt" directory and type "gmake" and it should actually compile! Once compiled you do
../run
get_files –jo HelloWorldOptions.py
athena.py HelloWorldOptions.py

Passwordless cern svn

You can setup the system to not require your password to be entered every time you wish to check out/in a package. See this link for details Link.

Data Disk Organisation

UPDATE In July/August 2014 all data disk files were transferred to a new faster server but I believe the user interface was maintained so the below is still correct.

For historical reasons there are 3 Oxford ATLAS data disks located at:

/data/atlas/atlasdata/
/data/atlas/atlasdata2/
/data/atlas/atlasdata3/
But they are all identical to the user and you can simply pick your favorite number and make a directory for yourself on there (I chose /data/atlas/atlasdata/henderson/). We have a central directory where we try to store all the large datasets that are downloaded in order to try to avoid duplication of datasets across different analysis groups. Please download your datasets to
/data/atlas/atlasdata3/D3PD/
And make them readable by everyone ("chmod -R a+r MyNewD3PDFolder/").

ATLAS Formats

Due to the large number of formats ATLAS data or MC goes through it can be confusing exactly what sort of data you are looking at. An incredibly useful but little known about twiki is the Explanation of SMWZ branches twiki - it is by no way complete but the best reference I have found at documenting what the branch names actually mean. An overview of the event data model for ATLAS can be found here.

Monte Carlo

ATLAS Monte Carlo is generated from event generators like Sherpa or Pythia as single events and then merged with minimum bias events to simulate pile-up. Minimum bias refers to the boring, low-pT events that are the most common events to see in ATLAS so there is no bias put on the selection (no trigger). The table below shows the progression of the MC files before we can access them. The main transformation (ATLAS term for moving one data format to another) is known as "Reco_trf" or "RecoTf", details can be found here.

MC Format Name Notes
Evgen or EVNT Output of "Generate_try.py", named as "X.pool.root", pure Monte Carlo - detector independent, can access all of unshowered, parton, hadron (/particle) level
Hits or simul GEANT4 used to simulate ATLAS detector, voltages output
Digi Digitisation of the Hits data
RECO Reconstruction of the voltages into physics loose objects
ESD Event Summary Data - not usually permanently stored
AOD Analysis Object Data - Hits reconstructed into objects and collections (jets, electrons etc), simplified calibrations applied
D3PD/NTUP Derived Physics Data/N-Tuple - Slimmed and re-organised AODs - used for most physics analyses

The various steps to making a D3PD are done in as parallel a way as possible so you will find many datasets with "merge" in the name where multiple files have been added together. You can use a few website to check on the status of MC production:

Website Explanation Link
AMI Holds information on all complete (some nearly complete) datasets - cross-sections, production PDFs, tags etc. https://atlas-ami.cern.ch/AMI/
Sometimes the above is down for maintainence but this seems to work, click "AMI dataset browser" then "Simple search" https://atlas-france.in2p3.fr/cgi-bin/twiki/bin/view/Atlas/DataAccess
MC Production Status Check on the status of current production samples http://www-f9.ijs.si/atlpy/atlprod/prodtask/
Panda D3PD Status You can request a D3PD to be produced from an AOD via the relevant email lists, this page shows the status of D3PD production, change the p1328 tag as needed http://panda.cern.ch/server/pandamon/query/?mode=taskquery&qPriority=p1328&qProjectMode=atlas&qTASEARCH=TASK&qMore=&qsubmit=QuerySubmit]

You may wish to identify exactly which version of a MC generator created a sample you are using (eg Sherpa 1.4.1 or 1.4.3??). NEW Check out this great page by Jose Garcia to find which generator versions are available in which athena versions.

This is usually done by searching AMI (see above) for the EVNT sample that your D3PD/AOD derives from. E.g. if I wanted to find out which Sherpa version was used for mc12_8TeV.129915.Sherpa_CT10_Wenu2JetsEW1JetQCD15GeV_min_n_tchannels.NTUP_SMWZ... I would search AMI for "mc12_8TeV 129915 EVNT" and click on "details".

This being ATLAS, sometimes the above method does not work. In this case, follow the above and find the Athena tag that the EVNT file was created with and then follow the below advice from MC guru Claire Gwenlan:

Set up the release:

asetup <ATHENA VERSION FOUND IN EVNT AMI PAGE>
and do:
cmt show versions External/<MC GENERATOR>
which will give you the generator tag. E.g. for the 129915 sample above:
$ asetup 17.2.6.2
$ cmt show versions External/Sherpa 
External/Sherpa Sherpa-01-04-01-00 /cvmfs/atlas.cern.ch/repo/sw/software/i686-slc5-gcc43-opt/17.2.6/AtlasSimulation/17.2.6
in this case, the version is 1.4.1

We can tell this just from the name since Sherpa tags are nicely numbered according to the version. Some of our External packages are not numbered so nicely and in this case you'd have to look in svn, and actually look at the requirements file associated with that tag, to be sure which version:

https://svnweb.cern.ch/trac/atlasoff/browser/External/Sherpa/tags/Sherpa-01-04-01-00/cmt/requirements

Data formats

Data goes through similar transformation steps as the above.

Data Format Name Notes
RDO Raw Data Objects
SFO Sub-farm Output
ESD as for MC
AOD
D3PD

Batch System

The Oxford batch system allows users to send high intensity jobs to dedicated cores where they can run without interruption from interactive issues and consume resources without annoying other users. There are 3 main quesues which differ depending on the allowed time for a job to complete, these are called "normal", "short" (12 hours) and "veryshort" (2 hours), although names differ on SLC5 (pplxint5 and 6) and SLC6 (pplxint8 and 9) machines. Submitting to a shorter time limit queue will increase the priority of your job. To submit to a particular queue, simply add the time limit you wish to the qsub command with the "-l cput=

qsub -l cput=1:59:59 MyScript.sh

This will submit to the "veryshort" queue. There is a dedicated testing queue which can only hold 2 jobs from any one user at a time but is great for debugging batch jobs and is usually very empty so jobs tend to run immediately. To use the testing queue:

qsub -q testing MyScript.sh

The queues can only hold so many jobs, so if you submit more than ~ 350 jobs at once, they will overflow into the "input" queue but will be moved back into the correct queue when space becomes available. There is a special queue for dataset downloads explained below.

It is often handy to see how many jobs you have running and the status of the queues, I have written a script akin to "top" which can be used to update the status of your jobs to your terminal. You can run it or copy it and make any changes you fancy from "/home/henderson/scripts/qTop". Please feel free to take any other code that looks interesting from this folder.

Downloading Datasets

In order to download a local copy of a dataset to the Oxford system we can use the dq2-get command. This can be used interactively (directly on pplxint) but dq2-get tends to use multiple threads and can cause the interactive machines to slow down for other users. Therefore, there is a dedicated "datatransfer" queue on the batch system which is specifically setup to do "dq2-get"s.

RUCIO NOTE ( The current (July 2014) recommendation is to use "dq2" to move datasets around but soon we will have to move to "rucio" which can be used now if you prefer not to have to re-learn all this in early 2015. Check out the rucio webpage. )

To setup the grid environment (this is done interactively) we have to "source" a script containing the following lines. NOTE when you "source" a script ("source script.sh" or equivalently ". script.sh") the changes that are made to the environment (variable changes etc.) are maintained back into the shell, this is NOT the case if you simple run a script ("./script.sh").

#!/bin/bash
export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase
source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh
localSetupDQ2Client --skipConfirm
localSetupPandaClient --noAthenaCheck
voms-proxy-init -voms atlas -valid 96:00:00
export DQ2_LOCAL_SITE_ID=UKI-SOUTHGRID-OX-HEP_SCRATCHDISK

With the above lines "sourced" we have a working grid certificate and can now interactively download a dataset using:

dq2-get -V -D -f *.root* data12_8TeV.periodA.physics_Egamma.PhysCont.NTUP_SMWZ.grp14_v01_p1328_p1329/
Look at all the options with "dq2-get --help". "-V" skips the validation part of the download which I have only ever found to double the download time and be of no benefit. "-D" puts the downloads in the current directory and "-f ∗.root∗" will only download "root" files. NOTE the two "∗"s are important as the grid can place integers after the ".root" if retried jobs are involved.

As we don't want to download interactively, we run a script to do this download on the batch system. To do this, we need to first create a proxy and put it somewhere the batch system can see it, by copying to the home directory (don't worry, it is only readable by you):

#!/bin/bash
export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase
source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh
localSetupEmi
voms-proxy-init -voms atlas -valid 96:00:00
cp $X509_USER_PROXY /home/${USER}/myProxy
NB: you must source atlasLocalSetup.sh and do localSetupSomething ('Something' can be DQ2Client, RucioClients, etc; Emi is a 'bare minumum') to set up grid middleware before generating a proxy, otherwise you can run into seemingly mysterious problems later.

Now this is done, we can reuse the above setup script, except we need to tell the job where to look for our proxy. Create the following in setup_dq2_cluster.sh:

#!/bin/bash
export ATLAS_LOCAL_ROOT_BASE=/cvmfs/atlas.cern.ch/repo/ATLASLocalRootBase
source ${ATLAS_LOCAL_ROOT_BASE}/user/atlasLocalSetup.sh
localSetupDQ2Client --skipConfirm
localSetupPandaClient --noAthenaCheck
export X509_USER_PROXY=/home/${USER}/myProxy
export DQ2_LOCAL_SITE_ID=UKI-SOUTHGRID-OX-HEP_SCRATCHDISK

Now create a script called "BatchDq2.sh" containing:

#PBS -q datatransfer
source <path_to>/setup_dq2_cluster.sh
cd <LOCATION YOU WISH TO DOWNLOAD FILES TO>
dq2-get -V -f *.root* data12_8TeV.periodA.physics_Egamma.PhysCont.NTUP_SMWZ.grp14_v01_p1328_p1329/
dq2-get -V -f *.root* data12_8TeV.periodB.physics_Egamma.PhysCont.NTUP_SMWZ.grp14_v01_p1328_p1329/

PLEASE NOTE The "PBS" line tells qsub to send this job to the "datatransfer" queue, this is equivalent to calling "qsub -q datatransfer BatchDq2.sh". In the above we are downloading two datasets but we can submit two jobs instead that download a single dataset each to speed up the procedure. To submit and run the job, simply call:

qsub BatchDq2.sh

Note: If we ran this script interactively, we would have to change the run permissions ("chmod +x BatchDq2.sh") but this is not necessary if we are submitting it to the batch system - although there is no harm in it.

Old setup, now not necessary (although both afs and cvmfs are temperamental so if the above fails, try the below): The below is a bit of a work around as I have not found a cleaner way to do this but it does work. The batch system will not setup the grid environment as we did interactively above so we have to use the "/afs/" grid setup and copy our grid certificate to somewhere that is accessible by both the interactive and batch systems. First, copy the certificate to your home directory as above. Then create a script called "BatchDq2.sh" containing:

#PBS -q datatransfer
export X509_USER_PROXY=/home/${USER}/myProxy
source /afs/cern.ch/atlas/offline/external/GRID/ddm/DQ2Clients/setup.sh; 
export DQ2_LOCAL_SITE_ID=UKI-SOUTHGRID-OX-HEP_SCRATCHDISK;
cd <LOCATION YOU WISH TO DOWNLOAD FILES TO>
dq2-get -V -f *.root* data12_8TeV.periodA.physics_Egamma.PhysCont.NTUP_SMWZ.grp14_v01_p1328_p1329/
dq2-get -V -f *.root* data12_8TeV.periodB.physics_Egamma.PhysCont.NTUP_SMWZ.grp14_v01_p1328_p1329/

Oxford Disk Usage

A web page has been setup to show the disk usage of different folders on the Oxford data disk. It can be quite slow as there is a lot of data to assemble but it is a great tool for seeing how many terrabytes the SUSY group is currently taking up.

http://dataatlas.physics.ox.ac.uk/

Please contact someone in the group for the log in details.

Personal Webspace

The pplxint machines are setup to organise a web space for each user automatically. Create a folder named "public_html" in your home area and anything you put inside there will be viewable from your web space at

http://www-pnp.physics.ox.ac.uk/~<USER NAME>/
Where is your Oxford user name (eg henderson for me). This can be especially useful for sharing plots or pdfs with others or simply viewing graphics on a web browser instead of having to open up GUIs over a ssh connection.

Please remember that much of what we work on should not be publically viewable until approved so look into ".htaccess" and ".htpasswd" for protection of these folders Link

Vidyo (Internet meetings)

CERN meetings are officially setup to use the Vidyo Desktop software. This should be easy to download and install when you first try to join a meeting but can also be downloaded on a linux machine with:

sudo apt-get install vidyodesktop

However, there are a number of reported cases of Vidyo failing on linux after 5 minutes and the audio dying (I believe this is a PulseAudio issue). This can be resolved by re-opening Vidyo but this is a real pain. In this case, try the alternative software of linphone. I have found this to work very well but, as you are using a phone-bridge to connect you cannot see who else is in the meeting or broadcast / view slides from Vidyo. More details can be found here.

PLEASE NOTE: For linphone - you must use your cern username as your linphone username, your display name can be anything you wish. An example meeting address (For EWK weekly) is "9281020*123546@vidyogw.cern.ch".

Misc. / Computing / at CERN

Computing

Oxford has a good IT website of helpful advice and links (printing, email etc.) that can be found here.

For obvious reasons CERN are fairly sensitive about access to their networks. For this reason when you arrive at CERN the first time you may find it tricky to get online. Click here to access the CERN webpage to register a new device (laptop, phone, whatever) on the CERN network.

To connect to a printer at CERN, use the webpage: LINK and click to "Display and install printers" to get details of the closest printers to you.

The lxplus system sometimes has trouble with permissions - if you have a file that you would like someone else to be able to access and you have properly setup the permissions they may need to log into the same lxplus node as you have changed the permissions on.

If you are using lxplus or the "work"-space (eg /afs/cern.ch/work/j/jhenders/") you can increase your storage space quota and default shell (bash v zsh) using LINK and clicking through "Services", "AFS Workspaces" and "Settings". This is also a very useful way to see your current usage of you quota both for your lxplus "home" and "work" directories.

Craig: If you are using lxplus and for some reason you need to run an interactive session for more than 25 hours and need to run things from afs you need to update your afs/kerberos token. See this website for details Link.

Evgen

If you are generating some MC and need to know the latest MCXXJobOpts.tar.gz options, check out this link, or simply type:

more /cvmfs/atlas.cern.ch/repo/sw/Generators/MC12JobOptions/tag

Check out this great link for a more detailed walk-through: link

LaTeX

If you haven't already made the jump to using LaTeX, now is the time to do so! All ATLAS documentation is written collaboratively in this way - necessitating the sharing only of text files, not bloated word documents or similar. On pplxint, your favourite latex compiler (latex, pdflatex, xelatex, etc) will work 'out of the box'. If you have issues with missing packages, try:

module load texlive
or add it to your .bashrc.

For a handy table-making package, which takes an input tex file and returns a pdf and png cropped to size, try:

alias make_tab='/data/atlas/atlasdata/kalderon/shared/latex/make_table.sh'
make_tab filename.tex

Passwordless log-in for lxplus on mac OS X

It's not necessary to type in your password every time you wish to login to the lxplus machines from your local machine. Note this has been tested on El Capitan 10.11.6. To enable password-less login, simply add the lines:
Host lxplus.cern.ch lxplus
  User <CERN USERNAME>
  GSSAPITrustDns yes
  GSSAPIAuthentication yes
  GSSAPIDelegateCredentials yes
  ForwardX11Timeout 168h
UserKnownHostsFile ~/.ssh/known_hosts
to the file
~/.ssh/config 
You may have to create this file if it does not already exist.

It's not truly password-less, but this will allow you to login by only typing your password once per day using the command:

kinit <cern username>@CERN.CH
This can save you a lot of strain typing out your password many times every day.

For more information, see the "checking out software" section above.

ToDo Add Sections on/Requests

  • Slimming/Skimming
  • grid jobs - prun and pathena commands
  • rivet
  • ilumicalc and GRL generation
  • General twiki links
  • Lhef 2 hepmc, links to explain formats

My Personal Preferences

  • Show tool-tip topic info on mouse-over of WikiWord links, on or off:
    • Set LINKTOOLTIPINFO = off

  • More preferences TWiki has system wide preferences settings defined in TWikiPreferences. You can customize preferences settings to your needs: To overload a system setting, (1) do a "raw view" on TWikiPreferences, (2) copy a Set VARIABLE = value bullet, (3) do a "raw edit" of your user profile page, (4) add the bullet to the bullet list above, and (5) customize the value as needed. Make sure the settings render as real bullets (in "raw edit", a bullet requires 3 or 6 spaces before the asterisk).

-- JamesHenderson - 08 May 2014

Edit | Attach | Watch | Print version | History: r45 < r44 < r43 < r42 < r41 | Backlinks | Raw View | WYSIWYG | More topic actions
Topic revision: r45 - 2019-10-23 - SiyuanYan
 
    • Cern Search Icon Cern Search
    • TWiki Search Icon TWiki Search
    • Google Search Icon Google Search

    Main All webs login

This site is powered by the TWiki collaboration platform Powered by PerlCopyright &© 2008-2024 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
or Ideas, requests, problems regarding TWiki? use Discourse or Send feedback