domino_admin_toolkit.checks.test_node_ephemeral_storage module
- pydantic model domino_admin_toolkit.checks.test_node_ephemeral_storage.NodeEphemeralStorageAnalyzer
Bases:
AnalyzerBaseValidates node ephemeral storage usage is within acceptable thresholds.
- Fields:
- analyze(data)
Evaluate a single node’s ephemeral storage usage against the configured threshold.
- Return type:
- name: ClassVar[str] = 'NodeEphemeralStorageAnalyzer'
- domino_admin_toolkit.checks.test_node_ephemeral_storage.node_ephemeral_storage_data(prometheus_client_v2)
Collect node ephemeral storage usage from Prometheus.
- Return type:
- domino_admin_toolkit.checks.test_node_ephemeral_storage.test_node_ephemeral_storage(node_ephemeral_storage_data, runner)
Description: Checks actual ephemeral storage usage on cluster nodes. Failure Conditions: Any node root filesystem usage exceeds 80%. Troubleshooting Steps:
Identify affected nodes from the table (sorted worst-first)
Check what’s filling the disk: ssh to node, run du -sh /* 2>/dev/null | sort -rh | head
Common causes: container log accumulation, image layer cache, coredumps
- Resolution Steps:
Prune unused container images: crictl rmi –prune
Rotate or truncate large logs in /var/log
If node pool is ‘compute’, nodes are ephemeral — cordon and replace
Required Permissions: Node SSH access, cluster admin