跳到主要内容

LBG Utility Frequently Asked Questions

When using Lebesgue Utility for the first time, you need to specify a project. Please make sure that the current project ID is correct.

View all projects:

lbg project ls

Modify the default project:

lbg project switch <project>

2. When submitting jobs using Utility, how to set the maximum running time for jobs?

Add max_run_time in the --file -i json file, the unit is minutes. The job will be automatically terminated and the files will be recycled after exceeding the maximum running time.

3. Why do jobs abnormally interrupt? How to auto-retry when submitting jobs via LBG Utility?

During the computing node operation, there is a low probability of encountering abnormal issues, such as machine reclaiming or physical machine failure.

In this case, your job will be interrupted. The job results will still be retrieved rieved and can be downloaded from the job details page.

If you want the system to auto-retry after a job is abnormally interrupted, you can make the following settings in job.json:

  • If your computing software supports resuming calculations, you can:

    • Set the command to support resuming calculations, For example, using if else statements to determine whether there is a checkpoint file, and then executing different commands for calculations.

    • Set the maxRescheduleTimes value to the number of retries you can accept. After a job interruption, the system will automatically retry resuming the calculations. An example of setting 2 retries is shown below:

      maxRescheduleTimes: 2
  • If your computing software does not support resuming calculations.

    • you only need to set the maxRescheduleTimes value to the number of retries you can accept. After a job interruption, the system will perform a complete job recalculation. An example of setting 2 retries is shown below:
      maxRescheduleTimes: 2