Unless I'm mistaken (or misconfigured) -- which is entirely possible -- collector will accumulate uncompressed data in out/ indefinitely, long after it's been archived in archive/ and will no longer be modified.
This takes up a lot of disk space and it'd be nice to
a) get confirmation I can remove data from out/ than is older than N months (2? 3?)
b) have it deleted automagically (or at least with a config setting)
Designs
Child items ...
Show closed items
Linked items 0
Link issues together to show that they're related.
Learn more.
You're right. It's safe to delete descriptors that are at least N = 2 months old. In fact, on the main CollecTor instance I usually delete descriptors that are at least N = 1.5 months old. But I do this manually, because of https://xkcd.com/1205/ and https://xkcd.com/1319/, and more importantly because I'm afraid of messing up with scripts and accidentally deleting all descriptors.
I admit that these reasons don't fully apply anymore with three CollecTor instances running. Would you submit a patch to delete descriptors older than N = 2 months, possibly even with a configurable N?
You're right. It's safe to delete descriptors that are at least N = 2 months old. In fact, on the main CollecTor instance I usually delete descriptors that are at least N = 1.5 months old. But I do this manually, because of https://xkcd.com/1205/ and https://xkcd.com/1319/, and more importantly because I'm afraid of messing up with scripts and accidentally deleting all descriptors.
I think you accidentally pasted the wrong link :-)
The description is here.
I admit that these reasons don't fully apply anymore with three CollecTor instances running. Would you submit a patch to delete descriptors older than N = 2 months, possibly even with a configurable N?
And if you have ideas for making that script even less copy-and-pasty, please feel free to tweak it!
One of the planned improvements is to integrate all scripted maintenance into java and get rid of scripting.
We're planning and working on these steps already, see legacy/trac#20518 (moved), legacy/trac#20546 (moved). So, please check the back first to avoid duplicate work. Setting parent to legacy/trac#20546 (moved).
I have a patch that I'm currently running on corsicum a.k.a. collector2.tp.o. (Even if it breaks files in the out/ directory there, that's not a big deal, because we only care about recent descriptors on that instance anyway.)