OS Monitoring Metrics Supported by ECSs with the Agent Installed
Description
OS monitoring provides system-level, proactive, and fine-grained monitoring. It requires the Agent to be installed on the ECSs to be monitored. This section describes OS monitoring metrics reported to Cloud Eye.
OS monitoring supports metrics about the CPU, CPU load, memory, disk, disk I/O, file system, GPU, NIC, NTP, and TCP.
After the Agent is installed, you can view monitoring metrics of ECSs running different OSs. Monitoring data is collected every 1 minute.
Namespace
AGT.ECS
OS Metrics: CPU
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
cpu_usage |
(Agent) CPU Usage |
CPU usage of the monitored object
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_idle |
(Agent) Idle CPU Usage |
Percentage of time that CPU is idle
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_user |
(Agent) User Space CPU Usage |
Percentage of time that the CPU is used by user space
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_system |
(Agent) Kernel Space CPU Usage |
Percentage of time that the CPU is used by kernel space
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_other |
(Agent) Other Process CPU Usage |
Percentage of time that the CPU is used by other processes
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_nice |
(Agent) Nice Process CPU Usage |
Percentage of time that the CPU is in user mode with low-priority processes which can easily be interrupted by higher-priority processes
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_iowait |
(Agent) iowait Process CPU Usage |
Percentage of time that the CPU is waiting for I/O operations to complete
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_irq |
(Agent) CPU Interrupt Time |
Percentage of time that the CPU is servicing interrupts
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_softirq |
(Agent) CPU Software Interrupt Time |
Percentage of time that the CPU is servicing software interrupts
|
0-100 |
% |
N/A |
ECS |
1 minute |
OS Metric: CPU Load
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
load_average1 |
(Agent) 1-Minute Load Average |
CPU load averaged from the last 1 minute Linux: Obtain the metric value from the number of logic CPUs in load1/ in file /proc/loadavg. Run the top command to check the load1 value. |
≥ 0 |
N/A |
N/A |
ECS |
1 minute |
load_average5 |
(Agent) 5-Minute Load Average |
CPU load averaged from the last 5 minutes Linux: Obtain the metric value from the number of logic CPUs in load5/ in file /proc/loadavg. Run the top command to check the load5 value. |
≥ 0 |
N/A |
N/A |
ECS |
1 minute |
load_average15 |
(Agent) 15-Minute Load Average |
CPU load averaged from the last 15 minutes Linux: Obtain the metric value from the number of logic CPUs in load15/ in file /proc/loadavg. Run the top command to check the load15 value. |
≥ 0 |
N/A |
N/A |
ECS |
1 minute |

The Windows OS does not support the CPU load metrics.
OS Metric: Memory
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
mem_available |
(Agent) Available Memory |
Amount of memory that is available and can be given instantly to processes
|
≥0 |
GB |
N/A |
ECS |
1 minute |
mem_usedPercent |
(Agent) Memory Usage |
Memory usage of the monitored object
|
0-100 |
% |
N/A |
ECS |
1 minute |
mem_free |
(Agent) Idle Memory |
Amount of memory that is not being used
|
≥0 |
GB |
N/A |
ECS |
1 minute |
mem_buffers |
(Agent) Buffer |
Amount of memory that is being used for buffers
|
≥0 |
GB |
N/A |
ECS |
1 minute |
mem_cached |
(Agent) Cache |
Amount of memory that is being used for file caches
|
≥0 |
GB |
N/A |
ECS |
1 minute |
total_open_files |
(Agent) Total File Handles |
Total handles used by all processes
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
OS Metric: Disk

- Currently, only physical disks are monitored. The NFS-attached disks cannot be monitored.
- By default, Docker-related mount points are shielded. The prefix of the mount point is as follows:
/var/lib/docker;/mnt/paas/kubernetes;/var/lib/mesos
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
disk_free |
(Agent) Available Disk Space |
Free space on the disks
|
≥0 |
GB |
N/A |
ECS - Mount point |
1 minute |
disk_total |
(Agent) Disk Storage Capacity |
Total space on the disks, including used and free
|
≥0 |
GB |
N/A |
ECS - Mount point |
1 minute |
disk_used |
(Agent) Used Disk Space |
Used space on the disks
|
≥0 |
GB |
N/A |
ECS - Mount point |
1 minute |
disk_usedPercent |
(Agent) Disk Usage |
Percentage of total disk space that is used, which is calculated as follows: Disk Usage = Used Disk Space/Disk Storage Capacity
|
0-100 |
% |
N/A |
ECS - Mount point |
1 minute |
OS Metric: Disk I/O
OS Metric: File System
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
disk_fs_rwstate |
(Agent) File System Read/Write Status |
Read and write status of the mounted file system of the monitored object. Value: 0 (read and write) or 1 (read only) Linux: Check file system information in the fourth column in file /proc/mounts. |
|
N/A |
N/A |
ECS - Mount point |
1 |
disk_inodesTotal |
(Agent) Disk inode Total |
Total number of index nodes on the disk Linux: Run the df -i command to check the value in the Inodes column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), periods (.), and swung dashes (~). |
≥ 0 |
Count |
N/A |
ECS - Mount point |
1 minute |
disk_inodesUsed |
(Agent) Total inode Used |
Number of used index nodes on the disk Linux: Run the df -i command to check the value in the IUsed column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), periods (.), and swung dashes (~). |
≥ 0 |
Count |
N/A |
ECS - Mount point |
1 minute |
disk_inodesUsedPercent |
(Agent) Percentage of Total inode Used |
Number of used index nodes on the disk Linux: Run the df -i command to check the value in the IUse% column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), periods (.), and swung dashes (~). |
0-100 |
% |
N/A |
ECS - Mount point |
1 minute |

The Windows OS does not support the file system metrics.
OS Metric: NIC
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
net_bitRecv |
(Agent) Outbound Bandwidth |
Number of bits sent by this NIC per second
|
≥ 0 |
bit/s |
1024(IEC) |
ECS |
1 minute |
net_bitSent |
(Agent) Inbound Bandwidth |
Number of bits received by this NIC per second
|
≥ 0 |
bit/s |
1024(IEC) |
ECS |
1 minute |
net_packetRecv |
(Agent) NIC Packet Receive Rate |
Number of packets received by this NIC per second
|
≥ 0 |
Counts/s |
N/A |
ECS |
1 minute |
net_packetSent |
(Agent) NIC Packet Send Rate |
Number of packets sent by this NIC per second
|
≥ 0 |
Counts/s |
N/A |
ECS |
1 minute |
net_errin |
(Agent) Receive Error Rate |
Percentage of receive errors detected by this NIC per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
net_errout |
(Agent) Transmit Error Rate |
Percentage of transmit errors detected by this NIC per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
net_dropin |
(Agent) Received Packet Drop Rate |
Percentage of packets received by this NIC which were dropped per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
net_dropout |
(Agent) Transmitted Packet Drop Rate |
Percentage of packets transmitted by this NIC which were dropped per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
OS Metric: NTP
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
ntp_offset |
(Agent) NTP Offset |
NTP offset of the monitored object
|
≥ 0 |
ms |
N/A |
ECS |
1 minute |
OS Metric: TCP
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
net_tcp_total |
(Agent) Total TCP Connections |
Total number of TCP connections in all states
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_established |
(Agent) TCP ESTABLISHED Connection |
Number of TCP connections in ESTABLISHED state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_sys_sent |
(Agent) TCP SYS_SENT Connections |
Number of TCP connections that are being requested by the client
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_sys_recv |
(Agent) TCP SYS_RECV Connections |
Number of pending TCP connections received by the server
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_fin_wait1 |
(Agent) TCP FIN_WAIT1 Connections |
Number of TCP connections waiting for ACK packets when the connections are being actively closed by the client
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_fin_wait2 |
(Agent) TCP FIN_WAIT2 Connections |
Number of TCP connections in the FIN_WAIT2 state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_time_wait |
(Agent) TCP TIME_WAIT Connections |
Number of TCP connections in TIME_WAIT state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_close |
(Agent) TCP CLOSE Connections |
Number of closed TCP connections
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_close_wait |
(Agent) TCP CLOSE_WAIT Connections |
Number of TCP connections in CLOSE_WAIT TCP state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_last_ack |
(Agent) TCP LAST_ACK Connections |
Number of TCP connections waiting for ACK packets when the connections are being passively closed by the client
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_listen |
(Agent) TCP LISTEN Connections |
Number of TCP connections in the LISTEN state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_closing |
(Agent) TCP CLOSING Connections |
Number of TCP connections to be automatically closed by the server and the client at the same time
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_retrans |
(Agent) TCP Retransmission Rate |
Percentage of packets that are resent
|
0-100 |
% |
N/A |
ECS |
1 minute |
OS Metric: GPU
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
gpu_status |
GPU Health Status |
Overall measurement of the GPU health
|
|
N/A |
N/A |
|
1 minute |
gpu_usage_encoder |
Encoding Usage |
Encoding capability usage of the GPU
|
0-100 |
% |
N/A |
|
1 minute |
gpu_usage_decoder |
Decoding Usage |
Decoding capability usage of the GPU
|
0-100 |
% |
N/A |
|
1 minute |
gpu_volatile_correctable |
Volatile Correctable ECC Errors |
Number of correctable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_volatile_uncorrectable |
Volatile Uncorrectable ECC Errors |
Number of uncorrectable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_aggregate_correctable |
Aggregate Correctable ECC Errors |
Aggregate correctable ECC errors on the GPU
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_aggregate_uncorrectable |
Aggregate Uncorrectable ECC Errors |
Aggregate uncorrectable ECC Errors on the GPU
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_retired_page_single_bit |
Retired Page Single Bit Errors |
Number of retired page single bit errors, which indicates the number of single-bit pages blocked by the graphics card
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_retired_page_double_bit |
Retired Page Double Bit Errors |
Number of retired page double bit errors, which indicates the number of double-bit pages blocked by the graphics card
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_performance_state |
(Agent) Performance Status |
GPU performance of the monitored object
|
P0-P15, P32
|
N/A |
N/A |
|
1 minute |
gpu_usage_mem |
(Agent) GPU Memory Usage |
GPU memory usage of the monitored object
|
0-100 |
% |
N/A |
|
1 minute |
gpu_usage_gpu |
(Agent) GPU Usage |
GPU usage of the monitored object
|
0-100 |
% |
N/A |
|
1 minute |
gpu_free_mem |
GPU Free Memory |
Free Memory on the GPU
|
≥ 0 |
MB |
N/A |
|
1 minute |
gpu_graphics_clocks |
GPU Graphics Clocks |
Current Graphics Clocks on the GPU
|
≥ 0 |
MHz |
N/A |
|
1 minute |
gpu_mem_clocks |
GPU Memory Clocks |
Current Memory Clocks on the GPU
|
≥ 0 |
MHz |
N/A |
|
1 minute |
gpu_power_draw |
GPU Draw Power |
Draw Power on the GPU
|
NA |
W |
N/A |
|
1 minute |
gpu_rx_throughput_pci |
GPU PCI Inbound Bandwidth |
Current PCI Rx Throughput on the GPU
|
≥ 0 |
MByte/s |
N/A |
|
1 minute |
gpu_sm_clocks |
GPU SM Clocks |
Current SM Clocks on the GPU
|
≥ 0 |
MHz |
N/A |
|
1 minute |
gpu_temperature |
GPU Temperature |
Current Temperature on the GPU
|
≥ 0 |
°C |
N/A |
|
1 minute |
gpu_tx_throughput_pci |
GPU PCI Tx Throughput |
Current PCI Tx Throughput on the GPU
|
≥ 0 |
MByte/s |
N/A |
|
1 minute |
gpu_used_mem |
GPU Used Memory |
Memory Used on the GPU
|
≥ 0 |
MB |
N/A |
|
1 minute |
gpu_video_clocks |
GPU Video Clocks |
Current Video Clocks on the GPU
|
≥ 0 |
MHz |
N/A |
|
1 minute |
OS Metrics: NPU
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
npu_device_health |
NPU Device Health |
Overall measurement of the NPU health Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
|
N/A |
N/A |
|
1 minute |
npu_util_rate_mem |
NPU Memory Usage |
The memory usage of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
0-100 |
% |
N/A |
|
1 minute |
npu_util_rate_ai_core |
NPU AI Core Usage |
The AI core usage of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
0-100 |
% |
N/A |
|
1 minute |
npu_util_rate_ai_cpu |
NPU AI CPU Usage |
The AI CPU usage of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
0-100 |
% |
N/A |
|
1 minute |
npu_util_rate_ctrl_cpu |
NPU Control CPU Usage |
The CPU control usage of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
0-100 |
% |
N/A |
|
1 minute |
npu_util_rate_mem_bandwidth |
NPU Memory Bandwidth Usage |
The memory bandwidth usage of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
0-100 |
% |
N/A |
|
1 minute |
npu_freq_mem |
NPU Memory Frequency |
Clock frequency of the NPU memory Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
MHz |
N/A |
|
1 minute |
npu_freq_ai_core |
NPU AI Core Frequency |
Clock frequency of the NPU's AI core Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
MHz |
N/A |
|
1 minute |
npu_usage_mem |
Used NPU Memory |
Used size of the NPU memory Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
MB |
N/A |
|
1 minute |
npu_sbe |
NPU Single-Bit Errors |
Numbers of single-bit errors of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
count |
N/A |
|
1 minute |
npu_dbe |
NPU Double-Bit Errors |
Numbers of double-bit errors of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
count |
N/A |
|
1 minute |
npu_power |
NPU Power |
The power of the NPU (Rated power displayed for 310 only, actual power displayed for other cards) Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
W |
N/A |
|
1 minute |
npu_temperature |
NPU Temperature |
Current temperature of the NPU Linux: Obtain the metric value from the libdcmi.so library file of the NPU card. |
≥ 0 |
°C |
N/A |
|
1 minute |

The Windows OS does not support NPU metrics.
OS Metrics: DAVP
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
davp_device_health |
DAVP Device Health |
An overall measurement of the DAVP health Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
|
N/A |
N/A |
|
1 minute |
davp_util_rate_mem |
DAVP Memory Usage |
The memory usage of the DAVP Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
0-100 |
% |
N/A |
|
1 minute |
davp_usage_mem |
Used DAVP Memory |
Used size of the DAVP memory Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
≥ 0 |
MB |
N/A |
|
1 minute |
davp_util_rate_ai_core |
DAVP AI Core Usage |
The AI core usage of the DAVP Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
0-100 |
% |
N/A |
|
1 minute |
davp_util_rate_vdsp_core |
DAVP VDSP Core Usage |
The VDSP core usage of the DAVP Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
0-100 |
% |
N/A |
|
1 minute |
davp_util_rate_enc_core |
DAVP Encoding Core Usage |
The encoding core usage of the DAVP Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
0-100 |
% |
N/A |
|
1 minute |
davp_util_rate_dec_core |
DAVP Decoding Core Usage |
The decoding core usage of the DAVP Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
0-100 |
% |
N/A |
|
1 minute |
davp_sysc_temperature |
DAVP System Module Temperature |
The temperature of DAVP system module Linux: Obtain the metric value from the libvaml.so library file in the VAtools tool of the DAVP card. |
≥ 0 |
°C |
N/A |
|
1 minute |

The Windows OS does not support DAVP metrics.
Dimensions
Dimension |
Key |
Value |
---|---|---|
ECS |
instance_id |
Specifies the ECS ID. |
ECS - Disk |
disk |
Specifies the disks attached to an ECS. |
ECS - Mount point |
mount_point |
Specifies the mount point of a disk. |
ECS - GPU |
gpu |
Specifies the graphics card of an ECS. |
ECS - NPU |
npu |
Specifies the NPU graphics card of an NPU-based ECS. |
ECS - DAVP |
davp |
Specifies the DaoCloud DAVP1 video acceleration card of a DAVP-based ECS. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot