Move autoremove builds logic to a scheduled job #2468
base: master
Conversation
We've noticed some large production sites struggle to delete large batches of builds in a performant manner. To help address this issue, this commit introduces a new configuration option to control the number of builds that CDash attempts to remove in a single pass. This commit also reduces the default batch size from 100 to 10.
It would be good to verify that this doesn't cause any performance degradation on a "production-sized" database.
I'll have more code comments later once we agree on the big-picture design.
# How many builds should CDash try to remove per iteration?
# Consider tweaking this value if you notice the autoremove queries
# are slow to execute.
#AUTOREMOVE_BUILDS_BATCH_SIZE=10
Instead of adding another configuration option, what do you think about adding logic to automatically adjust the batch size, keeping the running time of each batch under a set limit while maximizing the deletion rate within that limit?
I like the idea of figuring out the batch size automatically rather than making the CDash admin figure it out via trial & error.
After thinking about this further, I wonder if we should even bother with batching in the first place. If we simply delete one at a time, deletion rate will probably be suboptimal, but we are practically guaranteed that the deletion process won't hang.
If each delete is guaranteed to be fast, we can delete for a maximum period of 5 minutes, and schedule the job to run every 10 minutes. This 5-minutes-on-5-minutes-off approach gets rid of the large monolithic job and keeps cleanup tasks out of the way of submission processing.
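A rough sketch of what this time-boxed, one-at-a-time loop might look like. The helper names here (`findNextExpiredBuildId`, `deleteBuildById`) are placeholders for illustration, not CDash's actual API:

```php
// Hypothetical sketch of a time-boxed, one-at-a-time autoremove loop.
// findNextExpiredBuildId() and deleteBuildById() are placeholder names.
$deadline = time() + 5 * 60; // hard stop after five minutes
while (time() < $deadline) {
    $buildId = findNextExpiredBuildId();
    if ($buildId === null) {
        break; // nothing left to clean up
    }
    deleteBuildById($buildId); // each individual delete is small and fast
}
```

Scheduling this to run every 10 minutes gives the 5-minutes-on / 5-minutes-off behavior described above, since the loop itself never exceeds the five-minute deadline.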
One-at-a-time is fine with me! Practically speaking, that's the patch I've resorted to applying on large production instances that hang when attempting to delete 100 builds in a single pass.
/** Delete unused rows */
private static function delete_unused_rows(string $table, string $field, string $targettable, string $selectfield = 'id'): void
{
    DB::delete("DELETE FROM $table WHERE $field NOT IN (SELECT $selectfield AS $field FROM $targettable)");
}
Should this delete in batches too? You could add a LIMIT clause to the end, then iterate until no more rows are deleted (DB::delete returns the number of rows deleted).
It turns out Postgres does not support DELETE FROM ... LIMIT, so we would need to do something more clever here to allow for deleting in batches.
You could try DELETE FROM ... WHERE ... IN (SELECT ... LIMIT ...) instead. I'm not sure whether the optimizer would be smart enough to handle it efficiently, but it's probably worth a shot.
Wouldn't that result in data loss? The inner SELECT is telling us which rows to keep. If we put a LIMIT 1 (e.g.) there, the outer DELETE would then remove all but one record from the table, right?
Oops, I mistyped the pseudo-query; see the edited comment above.
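Spelled out, the corrected pattern would look something like the following. The angle-bracket names stand in for delete_unused_rows's parameters, and the batch size of 1000 is an arbitrary example; whether Postgres plans this efficiently would still need to be measured:

```sql
-- Hedged sketch: batched variant of the delete_unused_rows query.
-- The inner SELECT picks a limited batch of orphaned rows to DELETE,
-- so no rows that should be kept are ever targeted.
-- Run in a loop until the delete affects 0 rows.
DELETE FROM <table>
WHERE id IN (
    SELECT t.id
    FROM <table> t
    WHERE t.<field> NOT IN (SELECT <selectfield> FROM <targettable>)
    LIMIT 1000
);
```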
# Skip the autoremove builds step if more than this number of submissions
# are waiting to be parsed.
#AUTOREMOVE_BUILDS_SKIP_THRESHOLD=100
Instead of a new configuration option, what do you think about using queue priorities to automatically prioritize submission jobs when they exist? That way, the queueing system will handle cleanup tasks whenever submission activity is low.
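In Laravel (which CDash is built on), queue priority is typically expressed by listing queues in order on the worker command. A sketch, with hypothetical queue names:

```shell
# Hypothetical queue names: the worker drains `submissions` first and
# only picks up `cleanup` jobs when no submission jobs are waiting.
php artisan queue:work --queue=submissions,cleanup
```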
Sounds like a good idea to me!
This commit extracts the automatic removal of old builds from CDash's submission parsing logic. Instead, old builds will now be periodically cleaned up as a scheduled job.
The autoremove functionality has been decoupled from the submission parsing workflow, so this test is now obsolete.
Delete custom queries from removeBuilds to detect when shared records are no longer needed. Instead, run the unconditional db:cleanup command at the end of the scheduled autoremove task.

While writing this commit, the following tables were already handled by db:cleanup:
- buildfailuredetails
- configure
- configureerror
- coveragefile
- test2image

The following tables represent potentially shared data that wasn't already handled by db:cleanup:
- note
- buildupdate
- testoutput
- updatefile
- image
- uploadfile

The following tables were found to already have cascade-on-delete foreign keys, so their explicit DELETE logic was deemed safe to remove:
- build2uploadfile
- dynamicanalysisdefect
- label2dynamicanalysis
- label2buildfailure
Use the artisan command or scheduled task instead.
Attempting to clean these CDash tables is redundant because they are already using cascading deletions.
Decouple the cleanup of old builds from the submission parsing step