domino_admin_toolkit.checks.test_pods_mem_cpu_sizing module

domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.color_RED(val)
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.color_YELLOW(val)
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.color_df(df, color_RED, color_YELLOW, col_name)
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.convert_to_float(df)
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.get_cpu_queries()
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.mem_queries()
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.prom_data(prometheus_client, queries)
domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.test_pods_cpu_sizing()
Description:

Retrieve the pod cpu usage and compare against it’s limits and requests.

  • CPU REQUESTS(cores) - Average CPU resource requests by container.

  • CPU LIMITS(cores) - Average CPU resource limits by container.

  • CPU ACTUAL(cores) - Average actual CPU usage by container over the last hour.

  • CPU THROTTLING % - Percentage of CPU throttling by container over the last hour.

  • CPU OVERCOMMIT RATIO - Ratio of CPU limits to requests by container.

  • ACTUAL CPU vs REQUESTS % - Percentage of actual CPU usage vs requests by container over the last hour.

  • ACTUAL CPU vs LIMITS % - Percentage of actual CPU usage vs limits by container over the last hour.

Result:

Tag those containers that are higher than a PASS_THRESHOLD% for CPU Provide CPU table, also columns for actual requests, limits, their overcommit ratios (limits/requests), the percentage of time that a container’s CPU usage was throttled due to cgroup CPU throttling.

domino_admin_toolkit.checks.test_pods_mem_cpu_sizing.test_pods_memory_sizing(prometheus_client, mem_queries)
Description:

Retrieve the pod memory current usage and compare against it’s limits and requests. This will help identify pods that are close to their limit or get OOM killed. This will also help identify pods that have requests and limits that are far apart, and the memory usage is high over requests.

  • MEM REQUESTS (MB) - Average memory resource requests by container, in MB.

  • MEM LIMITS (MB) - Average memory resource limits by container, in MB.

  • MEM RSS (MB) - Average memory RSS (Resident Set Size) by container, in MB.

  • MEM WORKING SET (MB) - Average actual working set memory usage by container, in MB.

  • MEM OVERCOMMIT RATIO - Ratio of memory limits to requests by container.

  • MEM WORKING SET vs REQUESTS % - Percentage of actual memory usage vs requests over the last hour.

  • MEM WORKING SET vs LIMITS % - Percentage of actual memory usage vs limits over the last hour.

  • RESTARTS - Total number of container restarts over the last hour.

  • OOMs - Displays the number of OOM (Out Of Memory) events over the last hour if found.

Result:

Tag those containers that are higher than a PASS_THRESHOLD% for memory. Provide memory table, also columns for actual requests and limits and their overcommit ratios (limits/requests)