How do I get help with the RIS Compute Service?Anchor id1 id1
Where can I find RIS Compute Documentation?Anchor id2 id2
Where are the RIS computing services physically located?Anchor id3 id3
How do I Obtain an account?Anchor id4 id4
How do I change my password?Anchor id5 id5
How do I log into the Compute environment?Anchor id6 id6
How do I make it so I can log in without having to use my password?Anchor id7 id7
How do I launch jobs in the HPC environment?Anchor id8 id8
How should I name my files and directories?Anchor id9 id9
Is there a way to summarize usage statistics?Anchor id10 id10
What does it mean to have a “Compute Condo(minium)”Anchor id11 id11
Are there general access computing resources?Anchor id12 id12
What is the difference between the general and the general-interactive queues?Anchor id13 id13
What does the Compute Service price include?Anchor id14 id14
How much space is in myAnchor id15 id15 $HOME
directory?
Why is this limited to 10G? Can I have more?Anchor id16 id16
How do I see what is using up all of myAnchor id17 id17 $HOME
space?
Why am I getting a Disk I/O error?Anchor id18 id18
How much space is in my Storage Allocation?Anchor id18id19id18 id19
How do I share files in my storage with colleagues?Anchor id19id20id19 id20
What’s the best way for me to transfer data?Anchor id20id21id20 id21
How do I request more resources for my job?Anchor id21id22id21 id22
Does RIS offer Docker containers or a repository for them?Anchor id22id23id22 id23
Why can’t I connect to my noVNC image?Anchor id23id24id23 id24
Software Debugging PolicyAnchor id24id25id24 id25
...
User $HOME directories are intended to allow space for users to make use of the compute cluster, with the knowledge that the Storage Cluster is where data and software will be stored. This is accessed via /storage1, and the Scratch Cluster is accessed via /scratch1.
The $HOME directory is required for the Compute Platform(s) to function for users and software often rely on it.
Policy dictates that you be limited to 10G of $HOME space.
The $HOME directory is NOT backed up and important data should NOT be stored here. Anything you wish to be backed up should be placed in /storage1, this includes scripts.
How do I see what is using up all of my $HOME
space?
You can use the following command to list out the top 10 (or any number if you replace the 10) files or directories using the most space in your $HOME directory.
Make sure the following command is run from your $HOME directory.
Code Block | ||||
---|---|---|---|---|
| ||||
du -hsx .[^.]* * 2>/dev/null | sort -rh | head -10 |
Expected example output.
Code Block | ||||
---|---|---|---|---|
| ||||
800M .vscode-server
140M .local
95M work
68M .cache
41M .lsbatch
24M .nv
21M .matlab
20M .npm
20M .config
15M ondemand |
Why am I getting a Disk I/O error?
This error typically refers to the ability of the job to write a file to a directory.
The most common source of the error is a user’s home directory being full.
-
- If you encounter this error, please follow the steps below.
Use the methods described in the home directory space section section to determine if the home directory is at cap.
Remove or move files from the home directory to reduce usage.
Attempt to run the job again.
If the problem persists, submit a ticket to the service desk: https://ris.wustl.edu/support/service-desk/
How much space is in my Storage Allocation?
-
- The Compute Service is connected to the Storage Service via POSIX filesystem mounts.
The batch (execution) nodes and condos are connected via cache.
The client and interactive nodes are connected directly.
The Storage Service provides the SMB interface at smb://storage1.ris.wustl.edu/${STORAGE_ALLOCATION}.
The Compute Service provides a POSIX interface via the filesystem path /storage1/fs1/${STORAGE_ALLOCATION}.
You can observe available space via SMB mounts with a df command on the mounting workstation, or by the mmlsquota command while logged into the compute system.
...
Again, under the Block Limits section, the ‘blocks’ portion is how much you have consumed. The Compute Service uses a caching interface to access the data. Read more about how this affects usage and quota here: cache interfaces
How do I share files in my storage with colleagues?
You can request access be granted to your colleagues through our ticketing system.
You can also use collections within Globus to share specific folders or files with colleagues. This method is the suggested method when it comes to colleagues outside of WashU. You can find more information about using this feature here:
What’s the best way for me to transfer data?
The first method we recommend is to use SMB mounts. You can find more information about connecting at the following link.
Our suggested method of transferring data if SMB is not an option is to make use of Globus. You can use Globus in multiple ways. There are links to our Globus documentation below.
How do I request more resources for my job?
Requesting more resources for your job means using options that are part of the bsub command. You can find out more information about the bsub options at the following link.
Be aware that if the software you use requires special options in order to use these resources, you will need to include those options in your software command as well.
Does RIS offer Docker containers or a repository for them?
RIS offers RIS hosted and controlled Docker images. You can find them here.
RIS also offers a list of vetted applications where we do not control the Docker image nor host it. You can find that list here.
You can request help building a Docker image if you are having trouble via our ticketing system.
Software that is used frequently is taken into consideration when creating RIS hosted and controlled Docker images.
We currently do not have a public repository for users to host their own images in.
Why can’t I connect to my noVNC image?
-
- The first reason this could be happening, is port conflicts.
If your job lands on a node that has a job already using the port you are attempting to, you will not be able to connect.
You can attempt to launch your job on a new node, or you can change the port you’re using and launch the job again.
-
- The second reason this could be happening, is that some department based VPNs are not part of the trusted network that will allow this.
Please see our VPN information for which VPNs we recommend.
-
- If you wish to avoid dealing with ports for GUI based software, you can check out what software we have available through Open on Demand.
-
- You can also use port fowarding to get around the second reason for being unable to connect.
Software Debugging Policy
We strive to provide help with software debugging and support to the best of our abilities and time. With that being said, there may be times when we cannot solve an issue related to a specific piece of software or script that is not supported by RIS. In those cases, we will attempt to provide a solution to the problem, but we cannot guarantee that the solution will be successful. We recommend reading this section for more help debugging your software as well as for guidance on software development best practices.