Profiling Code on the Cluster
The cluster has a special dedicated partition for benchmarking, called bench. It currently contains four machines, 533,533,534 and 535, which are configured for profiling.
Before using it, you need to ask to be added to the special benchmarking account. Once you have done this, you can get access to the partition via:
- ssh -X maestro
Note -X , to get access to graphical user interface. Once you are on maestro-submit
- srun -p bench --qos bench -A bench --pty --x11 bash
We have a licence to a highly advanced profiler, Intel Vtune
- module load icc/xe_2019
- amplxe-gui
If for some reason you do not want to use the GUI version, there are other tools available:
Code Block (text)
amplxe-androidreg.sh amplxe-perf amplxe-python-execamplxe-cl amplxe-perf-sync amplxe-runssamplxe-feedback amplxe-pgo-report.sh amplxe-runss.pyamplxe-gui amplxe-python amplxe-self-checker.sh
here is the overview of Intel Vtune:
https://www.youtube.com/watch?v=Vl19kyt-cVI
or read the full documentation here:
https://software.intel.com/en-us/vtune-amplifier-help
Ask us if you need any help (ask-hpc@pasteur.fr)
for admins:
echo 1 > /proc/sys/kernel/perf_event_paranoid