Job Frequently Asked Questions
1. Job Problem
1.1. Missing or incorrectly filled in image in job.json_ Address field。
Image in job.json
image_Address
is a required field, so you only need to fill in image when submitting the task_ Address is sufficient。【note】:
image_address needs to be filled in with the complete image address rather than the image name. In Bohrium, images are divided into public and private images. You can check the image address that needs to be filled in the image center。
Example:ubuntu:22.04-py3.10-intel2022
The mirror address is:registry.dp.tech/dptech/ubuntu:22.04-py3.10-intel2022
1.2. Task result file returned to Bohrium management node with exception, causing node to be unable to boot up
Please first check if the default location of the program return result file executed by the current task is the system disk. It is the disk size you selected when booting up, with a maximum of 100g used to store system software packages. If it is full, it will cause the task to fail, and even the management node cannot boot up. You need to adjust the program script yourself and resubmit it。
If the issue still cannot be resolved, Please contact WeChat Work at 17710231129 for technical support.
1.3. How to automatically download the result file to the specified directory using the Lebesgue Utility。
Please use the - r parameter in the
bohr job submit
, which can automatically download the result file to the specified directory[Note]: Data can only be automatically downloaded to the data drive. Please specify the path after the parameter - r as/data
The specific beginning can be referred to https://bohrium-doc.dp.tech/docs/bohrctl/job/#%E6%8F%90%E4%BA%A4submit
2. Product Problem
2.1. There are differences in task submission efficiency between Bohrium models with the same specifications
- Due to the algorithm itself having a fluctuation of about 30%, and the Bohrium backend's computing power scheduling center dynamically allocates resource vendors based on scheduling rules, different vendors may have slight differences in resources, resulting in inconsistent task efficiency。
2.2. Task resource scheduling takes a long time
- During the task submission process, there may be issues with the vendor caching custom images to the server cluster or insufficient computing resources. Please be patient and wait. If you need to know more about the task details, please contact WeChat Work at 17710231129 for help with your query.
2.3. The selected model in the script does not match the actual computer model displayed in the task center。
- The cloud factory will adjust the available computing resources based on the current inventory, which can lead to a shortage of computing power and result in task submission failure. To solve this situation, Bohrium adopts an intelligent scheduling strategy to search and match computing resources that meet user needs across the entire network. We will try our best to ensure that factors such as model, number of cores, and memory are consistent or similar, and provide you with the most favorable price to obtain the required computing resources. If you have any other questions, please contact our technical support via WeChat Work at 17710231129 to obtain further assistance and solutions.
2.4. The task progress always hovers between the two stages of resource scheduling and environmental preparation
- This situation is usually caused by platform resources being rescheduled, which may be due to vendors caching custom images or insufficient resources. You can contact technical support via WeChat Work at 17710231129 to obtain further assistance and solutions.
2.5. There is no output prompt after submitting the bohr job submit task
- After executing the 'bohr job submit' command, bohr starts compressing all files in the specified folder after the - p parameter to submit the task. Please check the - p/ Is there a large hidden file in the folder that causes the file compression time to be too long。
2.6. According to the Bohrium price calculator, the required model is available, however, bohr submitted a task report stating that the model is not available
- When encountering such a situation using the Bohrium price calculator, it indicates that the model resources are scarce. It is recommended to try selecting other available models。