test_dataset_health module

test_dataset_health.test_dataset_latest_snapshot_health()
Description:

Identify legacy dataset entity if latest snapshot in it doesn’t exist in legacy snapshots collection

Result:

Identified legacy dataset entities with invalid snapshot entity

test_dataset_health.test_dataset_project_health()
Description:

Identify legacy dataset entity if it points to a nonexistent project

Result:

Identified legacy dataset entities with invalid project information

test_dataset_health.test_read_write_dataset_health()
Description:

Identify read-write datasets where the ID doesn’t exist in the legacy dataset_collection

Result:

Identified read-write dataset entities with invalid ID information

test_dataset_health.test_scratch_space_health()
Description:

Identify scratch spaces that point to nonexistent projects and users

Result:

Identified scratch space entities with invalid project and user information

test_dataset_health.test_snapshot_dataset_health()
Description:

Identify legacy snapshot entries that have collectionId that don’t exist in legacy dataset collection

Result:

Identified legacy snapshot entities with invalid parent dataset