This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| servst [2025/07/09 13:51] – zhangyk | servst [2025/07/17 04:34] (current) – zhangyk | ||
|---|---|---|---|
| Line 2: | Line 2: | ||
| # servst | # servst | ||
| - | [Gitlab-servst](https:// | + | [Gitlab](https:// |
| - | ## Description | + | ## 1. Description |
| - | `servst` is a collection of tools designed to inspect the utilization status of | + | `servst` is primarily |
| - | each server. | + | status of each server. Then I add support for MD tasks, to show them or kill |
| + | them conveniently and thoroughly. | ||
| - `gpust` and `cpust` provide status for servers | - `gpust` and `cpust` provide status for servers | ||
| - | - `gpurf` and `gpurf` refresh status information for all or specific servers | + | - `gpurf` and `cpurf` refresh status information for all or specific servers |
| + | - `lsmd` | ||
| + | - `killmd` kill target MD tasks | ||
| - | ## Installation | + | ## 2. Installation |
| There is no need for installation. The commands are available as global aliases. | There is no need for installation. The commands are available as global aliases. | ||
| - | HOWEVER, password-free configuration is recommended | + | HOWEVER, password-free configuration is recommended |
| + | and `lsmds` | ||
| - | ## Usage | + | ## 3. Usage |
| - | ### For gpust and cpust: | + | ### 3.1 For gpust and cpust: |
| These commands can be used on any server to get GPU or CPU status of any server. | These commands can be used on any server to get GPU or CPU status of any server. | ||
| Line 41: | Line 45: | ||
| purple | purple | ||
| red | red | ||
| - | white | ||
| yellow | yellow | ||
| ``` | ``` | ||
| Line 63: | Line 66: | ||
| purple | purple | ||
| violet | violet | ||
| - | white | ||
| yellow | yellow | ||
| ``` | ``` | ||
| - | ### For gpurf and cpurf: | + | ### 3.2 For gpurf and cpurf: |
| 1. `gpurf` updates the status of all servers. Without password-free configuration, | 1. `gpurf` updates the status of all servers. Without password-free configuration, | ||
| Line 74: | Line 76: | ||
| 2. `gpurf yellow` updates the status of merely the wanted server `yellow`. You | 2. `gpurf yellow` updates the status of merely the wanted server `yellow`. You | ||
| - | must either have password-free access or provide the password when prompted. The | + | must either have password-free access or provide the password |
| - | statuses of the other servers will remain as previously displayed. The column | + | `rainbow` |
| - | `Last Updated` indicates updating time of each server. | + | previously displayed. The column `Last Updated` indicates updating time of each |
| + | server. | ||
| ``` | ``` | ||
| Line 99: | Line 102: | ||
| ``` | ``` | ||
| + | ### 3.3 For lsmd and lsmds | ||
| + | |||
| + | 1. `lsmd` will get MD tasks' status in the current server. `lsmds` will get the | ||
| + | statuses for several servers. | ||
| + | |||
| + | 2. `lsmds purple` will fetch the information of the wanted server `purple`. | ||
| + | |||
| + | ``` | ||
| + | purple | ||
| + | / | ||
| + | / | ||
| + | / | ||
| + | yellow | ||
| + | |||
| + | red | ||
| + | / | ||
| + | blue | ||
| + | / | ||
| + | / | ||
| + | orange | ||
| + | / | ||
| + | / | ||
| + | indigo | ||
| + | / | ||
| + | / | ||
| + | gold | ||
| + | / | ||
| + | green | ||
| + | / | ||
| + | ``` | ||
| + | |||
| + | ### 3.4 For killmd | ||
| + | |||
| + | Guide can be called by typing `killmd -h`. | ||
| + | ``` | ||
| + | usage: killmd.py [-h] [-a] [-p PID] [-g GPU] | ||
| + | |||
| + | Kill series of md tasks instantly. | ||
| + | |||
| + | options: | ||
| + | -h, --help | ||
| + | -a, --all Kill all md tasks | ||
| + | -p PID, --pid PID Process ID to kill | ||
| + | -g GPU, --gpu GPU GPU id | ||
| + | ``` | ||
| + | |||
| + | - `killmd -a` will kill all md tasks in the current server. | ||
| + | - `killmd -p 12345` will kill the task with pid `12345`, as well as related tasks | ||
| + | - `killmd -g 0` will kill tasks runing on `GPU 0` | ||
| + | |||
| + | ## 4. Acknowledgement | ||
| + | |||
| + | Thanks to Prof. Xue and Zhewei Qiu. I optimized their script | ||
| + | `checkamber.py` to get `lsmd.py`. | ||
| </ | </ | ||