The last deployment of bridgestrap was on March 20th, so it doesn't match that date. But the uptime says 8 days, which does match April 16. It looks like bridgestrap has stopped providing collector metrics since the last reboot of polyanthum. It's weird that we didn't see this problem right after deploying it. Could it be that I copied the binary onto the server and didn't restart the service?
I believe the problem comes from me forgetting to add copytruncate to the [logrotate.conf]( bridgestrap-admin@dc4eec33), so logrotate is now emptying the collector file. We didn't notice this before the server reboot because the logrotate configuration was not applied until then.
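For anyone following along, here is a rough Python sketch of why a missing copytruncate could leave the file at the expected path empty. It is not bridgestrap's code or the actual logrotate configuration; the file name `metrics.log` and the helper functions are made up for illustration, and it assumes a POSIX system where the writer keeps its file open across rotation.

```python
import os
import shutil
import tempfile


def rotate_by_rename(path, writer):
    """Mimic rotation without copytruncate: move the file aside, create an empty one."""
    rotated = path + ".1"
    os.rename(path, rotated)
    open(path, "w").close()  # fresh, empty file at the original path
    # The writer still holds the old file descriptor, so this lands in `rotated`.
    writer.write("after rotation\n")
    writer.flush()
    return rotated


def rotate_by_copytruncate(path, writer):
    """Mimic rotation with copytruncate: copy the contents, then truncate in place."""
    rotated = path + ".1"
    shutil.copyfile(path, rotated)
    os.truncate(path, 0)
    # Same file as before, so the writer's descriptor still points at `path`.
    writer.write("after rotation\n")
    writer.flush()
    return rotated


def demo(rotate):
    """Write, rotate, write again, and return what a reader sees at the original path."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "metrics.log")  # hypothetical file name
        with open(path, "a") as writer:  # append mode, as a long-running service might use
            writer.write("before rotation\n")
            writer.flush()
            rotate(path, writer)
        with open(path) as reader:
            return reader.read()


if __name__ == "__main__":
    print(repr(demo(rotate_by_rename)))
    print(repr(demo(rotate_by_copytruncate)))
```

With rename-style rotation, anything reading the original path sees an empty file while the service keeps writing to the rotated one. With copy-and-truncate, new writes stay visible at the original path.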
I'm checking with TPA on how to apply this configuration; I'm not sure I have the rights to do so.
So, bridgestrap is doing its job, and it is sending the data to collector properly.
You might reasonably reply "ok fine but my bridge is still marked as down, something's wrong" -- and you are right. The next step is to wonder why the relay-search page chooses to label you as Down.
Well, I apologise. I don't have deep knowledge of how the backend works or how each system is connected to the others; I just wanted to notify you of the (related?) issue.
Relay search doesn't always show me as down, though. At random times it shows me as up, with the correct uptime.
Then, suddenly, it shows me as down with a random downtime.