domino_admin_toolkit.checks.info.test_dmm_spark module
DMM Spark Executor Inspection (informational)
Data source: Spark worker HTTP UI (spark3-worker-0.spark3-worker-headless.<compute>,
port 8081) — both the JSON endpoint (/json/) and the HTML root page,
which is parsed with lxml to extract executor state.
Question answered: “What Spark executors are currently running or finished on the DMM Spark worker, and what app/user owns each?”
- What this check does NOT cover:
Whether the spark3-worker pod is present and healthy — see test_dmm_pods_list.
Whether DMM jobs are stuck — see test_dmm_ingestion_jobs_status.
Whether the Redis queues are backing up — see test_dmm_redis_queues.
This check remains info-only: it queries an HTTP UI rather than a structured data source, and its output is best read alongside the asserting DMM checks to understand whether a stuck-in-processing ingestion job is due to Spark executor starvation.
- domino_admin_toolkit.checks.info.test_dmm_spark.job_states(is_running, html_scrape)
- Description:
Returns the spark job states for either running or finished jobs
- domino_admin_toolkit.checks.info.test_dmm_spark.print_worker_executor_jobs(worker_data, html_scrape, is_running)
- Description:
Logs executor data for running and finished jobs
- domino_admin_toolkit.checks.info.test_dmm_spark.test_dmm_spark_finished_executors(k8s_client, spark3_worker_exists)
- Description:
Print executor jobs that have stopped running (successfully or not)
- Result:
Print data about finished spark jobs
- domino_admin_toolkit.checks.info.test_dmm_spark.test_dmm_spark_running_executors(k8s_client, spark3_worker_exists)
- Description:
Print executor jobs that are either currently or pending running
- Result:
Print spark running jobs