Something about the backlog for s390x builders (you can see current builds with koji list-tasks --arch s390x | grep buildArch | grep -v FREE, but I'm not sure how to get that into Zabbix).
The scraper load one might also need to include kojipkgs01/02 load and koji01/02 load? Or that could be another one.
Perhaps some ‘all proxies’ ones? load on all proxies, interface on all proxies… in order to see where a heavy load / request storm is coming from.
Perhaps a dashboard showing all the openshift apps requests/bw used. That would allow seeing easily when there’s a bunch of requests for one service.
Any other ones people can think of? @gwmngilfen thoughts?
So most of these seem sensible. I’d love input from folks on what plots would work for them in their areas of expertise.
On the s390 one, where does that run? On koji, I assume…? We could add an item to those hosts to report queue data back to Zabbix, and then add it to a dashboard…
Ah, if it’s on the builder, that’s trickier, as we don’t run the agent on the builders. Maybe we should revisit that policy? It could give us some useful data, I guess… but we probably want fewer alerts for them.
If we stick to getting it from koji, then we have a bunch of options in front of us for getting the relevant data into Zabbix.
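One of those options could be pushing the value with zabbix_sender into a trapper item. The following is only a rough sketch under stated assumptions: the server name, host name, and item key (koji.s390x.waiting) are hypothetical placeholders, and the item would have to exist in Zabbix as a "Zabbix trapper" type item for the value to be accepted.

```python
import subprocess


def build_sender_command(server, host, key, value):
    """Build the argv for zabbix_sender.

    -z: Zabbix server or proxy to send to
    -s: host name as configured in Zabbix
    -k: item key (must be a trapper item on that host)
    -o: value to send
    """
    return ["zabbix_sender", "-z", server, "-s", host, "-k", key, "-o", str(value)]


def send_value(server, host, key, value):
    """Run zabbix_sender and report whether it exited successfully."""
    result = subprocess.run(
        build_sender_command(server, host, key, value),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.returncode == 0


# Hypothetical usage (all names here are placeholders, not real hosts):
# send_value("zabbix.example.org", "koji01.example.org", "koji.s390x.waiting", 7)
```

This could run from cron or a systemd timer on any host that can reach both koji and Zabbix. A UserParameter on an agent host or a Zabbix external check would be alternatives with a similar shape.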
Well, the data we want is about the builders, but getting it doesn’t need anything running on the builders.
The data we need is on the hubs (koji01/02): how many builds are waiting for s390x builders to become available. Even there, we don’t actually need to run anything on those hubs; we just need to make an API call to koji to get that data from the hubs.
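As a rough illustration of that API call, here is a minimal sketch that asks the hub for s390x buildArch tasks still in the FREE state (i.e., not yet picked up by a builder) and counts them. The function name is made up for this example, and the session object is assumed to be a koji.ClientSession (or anything with a compatible listTasks method); the filter keys follow the koji Python library's listTasks options as I understand them, so treat the details as assumptions to verify.

```python
# State number for tasks that no builder has taken yet.
# This mirrors koji.TASK_STATES["FREE"] in the koji Python library.
TASK_FREE = 0


def count_waiting_builds(session, arch="s390x"):
    """Return how many buildArch tasks for `arch` are waiting for a builder.

    `session` is assumed to behave like koji.ClientSession, i.e. to expose
    listTasks(opts=...) returning a list of task dicts.
    """
    opts = {
        "state": [TASK_FREE],   # only tasks not yet assigned to a builder
        "arch": [arch],         # only tasks for this architecture
        "method": "buildArch",  # only per-arch build tasks
    }
    tasks = session.listTasks(opts=opts)
    return len(tasks)


# Hypothetical usage against the public hub (needs the koji library and network):
# import koji
# session = koji.ClientSession("https://koji.fedoraproject.org/kojihub")
# print(count_waiting_builds(session))
```

Because this only reads from the hub, it could run from any host with network access to koji, which fits the point that nothing needs to run on the builders or the hubs themselves.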
As to monitoring builders: we stopped doing it a long time ago because so much of the normal monitoring is not something we care about on them, i.e., we don’t want to know about ‘high cpu’ or ‘large disk i/o’ or the like, because they are building things and that’s normal for building things. We could look at doing a subset of things, but I also worry about the overhead of the agent there: memory/cpu taken up by the agent means less for builds, though perhaps that is too small to really matter.