hpcleuven / vscdocumentation Goto Github PK
View Code? Open in Web Editor NEWVSC (Vlaams Supercomputing Center) user documentation.
Home Page: https://docs.vscentrum.be/
License: GNU General Public License v3.0
VSC (Vlaams Supercomputing Center) user documentation.
Home Page: https://docs.vscentrum.be/
License: GNU General Public License v3.0
Do we need to add documentation for archive storage?
Hi @backelj @RVerschoren (and others from UA):
I would like to clean up / improve the Slurm-related pages. One thing that catches the eye is that (probably for historic reasons) there is quite a bit of UAntwerpen-specific info on the following pages:
https://docs.vscentrum.be/jobs/job_submission.html
https://docs.vscentrum.be/jobs/job_types.html
https://docs.vscentrum.be/jobs/job_advanced.html
I think it would be better to move such info to somewhere in UAntwerp Tier-2 Infrastructure, possibly as part of 2.1.5. UAntwerp-specific software instructions :: Slurm workload manager. What do you think? If you're in favor but in lack of time, I can open a PR for this.
Add documentation about how to use beeond on Tier-1 and KU Leuven Tier-2 systems
When installing miniconda3, the installer offer to modify the user's .bashrc
. In the past, it simply added a modification of PATH
, now it defines a conda
function.
Somehow, this causes issues when users try to log in on NX nodes, so a cautionary note should be added to the documentation.
Job dependencies can be useful for certain workflows.
A mini how-to and a reference to the official documentation could be useful.
Dear GJB
The link pointing to the request form for introduction credits for KU Leuven user is either broken, or is not properly set up, because it leads to a blank page: https://vlaams-supercomputing-centrum-vscdocumentation.readthedocs-hosted.com/en/latest/leuven/credits.html#id1
Br
Ehsan
A link to a PDF document that served as a quickstart to Tier-1 was sent to new Tier-1 users.
This document is out of date, too extensive, and should be available on the documentation site, rather than as a static document.
At KU Leuven, TurboVNC is no longer used. I've removed all references to KU Leuven from pages on which it was mentioned.
I note that UAntwerp has its own page on its visualization nodes.
Do we still need to keep the following page?
./access/turbovnc_start_guide.rst
Some documentation on submitting and managing job arrays would be useful.
Currently, a generic, non-existant host name is used in both the screenshot and the text of the FileZilla documentation page.
Since this confuses users, it should be reworded similarly to what was done for PuTTY.
Additionally, the WinSCP documentation should be checked to see whether it has similar issues.
In superdome if the user wants to use a ratio of mem/core higher than the default it is needed to specify the memory feature in the -L command. If this is not done when the job tries to use more than the default memory it will start swapping. In the manual of pbs in the numa section it is explicitily mention this behaviour:
So, for example to submit of a job using 7 cores and 100GB per core the jobs should be submitted as follows:
qsub -l partition=superdome -q qsuperdome - L tasks=1:lprocs=7:place=numanode=1:memory=700gb
The documentation on the credits system mentions gquote, the utility to estimate the credits required to run a job.
gquote has not been updated in quite a while, and can not deal with jobs that use parts of nodes (nor regular, nor GPU).
Either gquote has to be updated, or we remove it from the documentation.
Opinions?
Although suggested by the documentation, it is nowhere stated explicitly that the order in a job script matters.
There seems to be no link to the cloud documentation from the general documentation.
Can that be added? @boegel ? @alvarosimon ? https://hpcugent.github.io/vsc_user_docs/cloud/
It would be useful to have a pointer to screen/tmux with an emphasis on session persistence to mitigate poor-quality network issues.
Since there are many good tutorials, there is no need for full-blown documentation, simply to point out these tools exist.
Since Cerebro is decommissioned, there is no information available on MPT, SGI's (now HPE) MPI implementation.
It would be useful to add some notes on MPT to the page on superdome.
The worker documentation page has a remark on job arrays that is no longer accurate. This needs reformulation.
The page should also prominently refer to the official worker documentation.
The content should be reviewed with respect to current worker features.
current sitemap.xml only has a reference to the main page
We have quite a number of pages on SVN, do we keep them or retire them? I suspect that those who use know how to, and that we hardly encourage new users to use subversion.
Since we mention this page in the VSC user-day presentation, the first paragraph of https://docs.vscentrum.be/jobs/basic_linux_usage.html should be updated
Some of the information on https://docs.vscentrum.be/software/intel_toolchain.html only applies to the classic compilers like icc
. It should be updated for the modern compilers like icx
.
Do we need the following page?
https://vlaams-supercomputing-centrum-vscdocumentation.readthedocs-hosted.com/en/latest/access/setting_up_a_ssh_proxy_with_putty.html#ssh-proxy-with-putty
It was written back in the old days when muk was behind a firewall. Do we still have systems for which you need to set up a proxy using a login node?
The problem is that this page leads to no end of confusion from users who think they need to do this (and obviously don't), so I would be very happy if we could delete it.
The bar at the top of the page contains a number of useful (or not so much) icons:
The procedure for installing Anaconda on the NX nodes explicitly refers to thinking. Is it still up-to-date?
Hey all
I would like to bring the fact into your attention that the development
branch has fallen quite behind the master
branch, which to my knowledge opposes the original idea. A simple git diff master..development
shows quite sensitive differences between the two which calls for some work to bring them back in sync.
So, one possibility is to retire the existing development branch, and then
master
and merging to the master
.master
branch can be protected from direct merges, by enforcing reviews.What do you guys think?
Ehsan
Users tend to send screenshot of terminals, rather than simply copy/paste the text.
This makes the support staff's life harder, since, e.g., path names must be retyped, rather than simply copy/pasted from the helpdesk ticket.
There are some things that look off in the thinking_centos7.png figure of the hardware on ThinKing (shown at the bottom of this VsCdocs page):
I think the legend on the right is mostly fine, except that those two Quadro K5200 nodes are not there anymore?
The map colors don't seem to agree with the legend. According to pbsnodes
info,
Probably also good to double-check the FDR/QDR infiniband network rectangle.
At the moment, there is an unofficial FAQ page on the documentation site. Whenever I come across a page that answers a FAQ, I add a link on that page.
It is "unofficial" in the sense that there is no link to it from other pages.
Since I know that opinions on FAQ pages are divided, opinions as to its fate are invited.
The MFA page explains how to use NoMachine with MFA, but on the NoMachine page itself (https://docs.vscentrum.be/en/latest/access/nx_start_guide.html#nx-start-guide) there is no mention of MFA. Could you revise that page (and probably all other pages explaining how to connect to the cluster) so that there are references to the main MFA page?
The first and second example may confuse the user, some clarification is required.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.