Coder Social home page Coder Social logo

hpcleuven / vscdocumentation Goto Github PK

View Code? Open in Web Editor NEW
1.0 21.0 29.0 47.1 MB

VSC (Vlaams Supercomputing Center) user documentation.

Home Page: https://docs.vscentrum.be/

License: GNU General Public License v3.0

HTML 97.40% Shell 0.07% Python 2.06% Makefile 0.08% Batchfile 0.08% CSS 0.31%

vscdocumentation's Introduction

VscDocumentation

Repository of the VSC documentation website.

How to contribute?

Improving the documentation is quite simple, all documents can be edited with a simple text editor. You standard workflow to contribute changes is:

  1. Clone this repository
  2. Make corrections/improvements to any document
  3. Push you changed to a new branch
  4. Open a pull request to the master branch of this repo

โš ๏ธ Follow our style guide to edit the VSC documentation.

What is the status of improvements/fixes?

Feel free to open issues to get fixes or improvements on the agenda. To get an overview of work that is planned or in progress, check out the project overview.

Workflow

Prerequisites

You will need to clone the repository, i.e.,

$ git clone [email protected]:hpcleuven/VscDocumentation.git

Your life will be substantially easier if you can preview your changes locally. For this you'll need to install the required packages in a self-contained environment such as a Python virtual environment (venv) or Conda. The packages are listed in the requirements.txt file in this repo.

Such environments should preferably use the same Python version as the one specified in the .readthedocs.yaml file. Note that older Python versions (<= v3.7 at the time of writing) may not be able to install the requirements.

Python venv

This is an example setup for a Python virtual environment:

$ python -m venv /path/to/new/venv
$ source /path/to/new/venv/bin/activate
$ cd VscDocumentation
$ python -m pip install -r requirements.txt

Conda

This is an example setup for a Conda environment:

$ conda create -n vscdocs python=3.11
$ source activate vscdocs
$ pip install -r requirements.txt

Downloads and installation instructions for Miniconda can be found on conda's website.

Creating a feature branch

Do not make changes in the master or development branches directly. Create your own feature/bugfix branch as needed.

$ cd VscDocumentation
$ git checkout master
$ git checkout -b feature/new_stuff

Running a local sphinx server

The repository contains a make file that has a target to run the sphinx server. The latter will monitor the source directory for changes, and serve the documentation to a web browser that is opened automatically.

$ make web

Edit content

You can now edit the content to your heart's content, making commits to your feature branch as you go. You can push your feature branch to the Github repository whenever you like.

$ git add source/some_documentation_file.rst
$ git commit -m "some new stuff added to VSC docs"
$ git push origin feature/new_stuff

Pull request

When you are done, create a pull request to the master branch of this repository on GitHub.

Changes to the documentation require the positive review from the documentation maintainers of your pull request. Pay attention to the review in case there would be remarks to address.

Once the pull request has been merged, the branch will be deleted from GitHub. At this point, all that is left to do is delete your local feature branch.

$ git fetch origin
$ git checkout master
$ git pull origin master
$ git branch -d feature/new_stuff

vscdocumentation's People

Contributors

alvarosimon avatar avapirev avatar backelj avatar boegel avatar dependabot[bot] avatar epauwels avatar gjbex avatar hajgato avatar itkovian avatar jefscheepers avatar jooghe avatar klust avatar kwaegema avatar lexming avatar maximevdb avatar molden avatar moravveji avatar mselwa avatar mstfdkmn avatar rverschoren avatar sbecuwe avatar smoors avatar stdweird avatar stefaanverwimp avatar stevenvdb avatar tvandenberghe avatar w-lampaert avatar wdpypere avatar willebell avatar woutervanassche avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

vscdocumentation's Issues

Retiring the Development branch and revisiting the workflow

Hey all

I would like to bring the fact into your attention that the development branch has fallen quite behind the master branch, which to my knowledge opposes the original idea. A simple git diff master..development shows quite sensitive differences between the two which calls for some work to bring them back in sync.

So, one possibility is to retire the existing development branch, and then

  • either create one from the existing master, and keep our current workflow (and really practice it)
  • or we follow the normal practice of branching off from master and merging to the master.
    However, the master branch can be protected from direct merges, by enforcing reviews.

What do you guys think?
Ehsan

Title bar/header

The bar at the top of the page contains a number of useful (or not so much) icons:

  • Does the one to Github add value? We don't have much material there that is public/interesting
  • Can we replace the Twitter icon with the X icon?
  • Can we add a LinkedIn icon since that is our main social media channel?
  • Can we add an icon for the VSC website?

SVN documentation

We have quite a number of pages on SVN, do we keep them or retire them? I suspect that those who use know how to, and that we hardly encourage new users to use subversion.

ThinKing hardware nodemap errors

There are some things that look off in the thinking_centos7.png figure of the hardware on ThinKing (shown at the bottom of this VsCdocs page):

  • I think the legend on the right is mostly fine, except that those two Quadro K5200 nodes are not there anymore?

  • The map colors don't seem to agree with the legend. According to pbsnodes info,

    • the r10 racks should be red
    • the r11 and r12 racks should be salmon (that intermediate color I mean)
    • the r04i00, r04i01, r05i00 and r05i01 don't exist (anymore?)
    • the r04i02, r05i02 and r08i00 racks should be yellow (except for those debug nodes in brown)
  • Probably also good to double-check the FDR/QDR infiniband network rectangle.

Suggestion to use `cpus-per-gpu` instead of `ntasks` in slurm docs

In the documentation all examples use ntasks or n to specify the number of CPUs needed per GPU. This generally works fine, but external tools (such as submitit ) have a specific interpretation of ntasks, which can lead to issues. It might be better to explicitly use the cpus-per-gpu slurm option in the examples to avoid such issues. The options both work identically in my tests requesting GPUs on the debug node and on wice.

Separation of site-specific Slurm info (especially UAntwerpen)

Hi @backelj @RVerschoren (and others from UA):

I would like to clean up / improve the Slurm-related pages. One thing that catches the eye is that (probably for historic reasons) there is quite a bit of UAntwerpen-specific info on the following pages:

https://docs.vscentrum.be/jobs/job_submission.html
https://docs.vscentrum.be/jobs/job_types.html
https://docs.vscentrum.be/jobs/job_advanced.html

I think it would be better to move such info to somewhere in UAntwerp Tier-2 Infrastructure, possibly as part of 2.1.5. UAntwerp-specific software instructions :: Slurm workload manager. What do you think? If you're in favor but in lack of time, I can open a PR for this.

gquote

The documentation on the credits system mentions gquote, the utility to estimate the credits required to run a job.

gquote has not been updated in quite a while, and can not deal with jobs that use parts of nodes (nor regular, nor GPU).

Either gquote has to be updated, or we remove it from the documentation.

Opinions?

TurboVNC

At KU Leuven, TurboVNC is no longer used. I've removed all references to KU Leuven from pages on which it was mentioned.

I note that UAntwerp has its own page on its visualization nodes.

Do we still need to keep the following page?
./access/turbovnc_start_guide.rst

Revise worker documentation, vis a vis job arrays

The worker documentation page has a remark on job arrays that is no longer accurate. This needs reformulation.

The page should also prominently refer to the official worker documentation.

The content should be reviewed with respect to current worker features.

Proxy configuration page

Do we need the following page?
https://vlaams-supercomputing-centrum-vscdocumentation.readthedocs-hosted.com/en/latest/access/setting_up_a_ssh_proxy_with_putty.html#ssh-proxy-with-putty

It was written back in the old days when muk was behind a firewall. Do we still have systems for which you need to set up a proxy using a login node?

The problem is that this page leads to no end of confusion from users who think they need to do this (and obviously don't), so I would be very happy if we could delete it.

Add a note on using screen/tmux

It would be useful to have a pointer to screen/tmux with an emphasis on session persistence to mitigate poor-quality network issues.

Since there are many good tutorials, there is no need for full-blown documentation, simply to point out these tools exist.

Tier-1 quickstart

A link to a PDF document that served as a quickstart to Tier-1 was sent to new Tier-1 users.

This document is out of date, too extensive, and should be available on the documentation site, rather than as a static document.

Add some info on MPT for superdome

Since Cerebro is decommissioned, there is no information available on MPT, SGI's (now HPE) MPI implementation.

It would be useful to add some notes on MPT to the page on superdome.

add need to request memory for superdome with the -L flag

In superdome if the user wants to use a ratio of mem/core higher than the default it is needed to specify the memory feature in the -L command. If this is not done when the job tries to use more than the default memory it will start swapping. In the manual of pbs in the numa section it is explicitily mention this behaviour:

http://docs.adaptivecomputing.com/torque/6-1-3/adminGuide/torque.htm#topics/NUMA/-Lresource.htm#options

So, for example to submit of a job using 7 cores and 100GB per core the jobs should be submitted as follows:

qsub -l partition=superdome -q qsuperdome - L tasks=1:lprocs=7:place=numanode=1:memory=700gb

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.