Associations & Quality of Service (QOS)
Associations
Oscar uses associations to control job submissions from users. An association refers to a combination of four factors: Cluster, Account, User, and Partition. For a user to submit jobs to a partition, an association for the user and partition is required in Oscar.
To view a table of association data for a specific user (thegrouch in the example), enter the following command in Oscar:
If thegrouch has an exploratory account, you should see output similar to this:
Note that the first four columns correspond to the four factors that form an association. Each row of the table corresponds to a unique association (i.e., a unique combination of Cluster, Account, User, and Partition values). Each association is assigned a Quality of Service (see QOS section below for more details).
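A table with the columns described above can be requested from Slurm's accounting database with sacctmgr. The following is a minimal sketch, not necessarily the exact command intended by this page: the helper name show_assoc and the chosen column list are assumptions made for illustration.

```shell
# Hypothetical helper: list the associations recorded for one user.
# The format= list selects the four association factors plus the
# QOS and GrpTRESRunMins columns discussed in this section.
show_assoc() {
    sacctmgr show associations user="$1" \
        format=Cluster,Account,User,Partition,QOS,GrpTRESRunMins
}

# Usage on Oscar (requires Slurm's sacctmgr to be on the PATH):
#   show_assoc thegrouch
```

Wrapping the call in a function only keeps the user name in one place; the sacctmgr line can also be typed directly at the prompt.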
Some associations have a value for GrpTRESRunMins. This value limits the total number of Trackable RESource (TRES) minutes that can be used by jobs running with this association at any given time. The cpu=110000 for the association with the batch partition indicates that all of the jobs running with this association may together account for at most 110,000 core-minutes. If this limit is reached, new jobs will be delayed until other jobs have completed and freed up resources.
Example of GrpTRESRunMins Limit
Here is an example file that incurs a significant core-minute cost:
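A minimal sketch of a batch script with this kind of cost, assuming the 30-core, 90-hour request on the batch partition that is analyzed later in this section. The job name and the placeholder workload are hypothetical.

```shell
#!/bin/bash
# Hypothetical job name
#SBATCH -J too_many_cpu_minutes
# Submit to the batch partition
#SBATCH -p batch
# Request 30 tasks (one core each by default)
#SBATCH -n 30
# Request a 90-hour time limit
#SBATCH -t 90:00:00

# Placeholder workload; a real job would launch its program here.
echo "Starting placeholder workload"
```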
If this file is named too_many_cpu_minutes.sh, a user with thegrouch's QOS might experience something like this:
Note that the REASON the job is pending and not yet running is AssocGrpCPURunMinutesLimit. This is because the job requests 30 cores for 90 hours, which is more than the oscar/default/thegrouch/batch association allows (30 cores * 90 hours * 60 minutes/hour = 162,000 core-minutes > 110,000 core-minutes). In fact, this job could be pending indefinitely, so it would be a good idea for thegrouch to run scancel 12345678 and make a less demanding job request (or use an association that allows for that amount of resources).
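The arithmetic above can be checked before submitting a job. The sketch below is only an illustration: the function names core_minutes and fits_limit are hypothetical, and the 110,000 limit is the cpu=110000 value from the example association.

```shell
# Core-minute cost of a request: cores * hours * 60 minutes/hour.
core_minutes() {
    echo $(( $1 * $2 * 60 ))
}

# Succeeds (exit status 0) when cores * hours * 60 is within the limit.
# Arguments: cores, hours, limit in core-minutes.
fits_limit() {
    [ "$(core_minutes "$1" "$2")" -le "$3" ]
}

limit=110000   # cpu=110000 from the example association

core_minutes 30 90                      # prints 162000
if fits_limit 30 90 "$limit"; then
    echo "fits"
else
    echo "exceeds limit"                # this branch runs
fi

# Longest whole number of hours that 30 cores can request under the limit.
echo $(( limit / (30 * 60) ))           # prints 61
```

Under these assumptions, a 30-core request would need a time limit of 61 hours or less (30 * 61 * 60 = 109,800 core-minutes) to fit within the association on its own.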
Account Quality of Service (QoS) and Resources
myaccount - To list the QoS & Resources
The myaccount command lets users review the resources associated with their accounts, including limits such as Max Resources Per User and Max Jobs Submit Per User.