Include PodSpec.nodeName as a label in goldpinger_peers_response_time_s_* metrics #40
Comments
Glad we could help! I don't see why not. It would put some extra strain on Prometheus because of the additional time series, but it would indeed make for better graphs and alerts. 👍 |
👍 Looking at the source, it seems the code only cares about the HostIP of ping targets (at least in the Prometheus metrics). I believe it would be more useful when debugging issues if we could somehow give more context about the target nodes. In the monitoring dashboard/alerting, instead of this
this would be better
|
Hey @seeker89 😄 Do you think this is something you would consider implementing in the near future? We particularly feel that this is missing for alerting, where we know some node(s) are unhealthy but can't pinpoint them from the metrics. |
Great news! I'll try and summarize our current experience:
/edit: I think it would also be great if the host name could be used in Goldpinger's UI and logs instead of the host IP; it would make troubleshooting much easier. |
wdyt @seeker89? I tried going through the code yesterday to see if I could work out how to implement this, but to be honest I got a bit lost. It seems the podIPs (and hostIPs) are used as primary identifiers almost everywhere and passed around in ad-hoc maps, so I suppose we'd need to define a struct for that and use it throughout the code? Generally, I think it would be great if we could substitute pod IPs with pod names and host IPs with host names (probably keeping the IP data as secondary information), because having just the IP addresses displayed is rather confusing. It would be (at least from my point of view) much more human-friendly if the metrics and the UI (graph, heatmap, etc.) referenced the pod names and host (node) names instead of plain IPs. Sorry if I'm going out of scope here. |
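To make the struct idea above concrete, here is a minimal sketch in Go. It is not Goldpinger's actual code: the `Peer` type, its fields, `DisplayName`, and `PeersFromIPMap` are all hypothetical names, and the input is assumed to be an ad-hoc map from pod IP to host IP like the ones described in the comment.

```go
package main

import (
	"fmt"
	"sort"
)

// Peer is a hypothetical record for one ping target. Names are the
// primary identifiers; IPs are kept as secondary data.
type Peer struct {
	PodName  string
	PodIP    string
	NodeName string
	HostIP   string
}

// DisplayName returns "pod@node", falling back to the IPs when the
// names are not known yet.
func (p Peer) DisplayName() string {
	pod, node := p.PodName, p.NodeName
	if pod == "" {
		pod = p.PodIP
	}
	if node == "" {
		node = p.HostIP
	}
	return pod + "@" + node
}

// PeersFromIPMap converts an ad-hoc podIP -> hostIP map (an assumed
// input shape) into a slice of Peer values sorted by pod IP, so the
// rest of the code can pass one type around instead of loose maps.
func PeersFromIPMap(podToHost map[string]string) []Peer {
	peers := make([]Peer, 0, len(podToHost))
	for podIP, hostIP := range podToHost {
		peers = append(peers, Peer{PodIP: podIP, HostIP: hostIP})
	}
	sort.Slice(peers, func(i, j int) bool { return peers[i].PodIP < peers[j].PodIP })
	return peers
}

func main() {
	// Example addresses are made up for illustration.
	for _, p := range PeersFromIPMap(map[string]string{
		"10.0.0.5": "192.168.1.10",
		"10.0.0.9": "192.168.1.11",
	}) {
		fmt.Println(p.DisplayName())
	}
}
```

Keeping the IPs on the struct means existing lookups by IP can keep working while the UI and metrics gradually switch to names.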
Some good points here. I'll have a look when I get some free bandwidth. 👍 |
@seeker89 just curious if you had a chance to look into this? 🙏 |
Sorry, not yet. I did set aside some time for Goldpinger next week, so I might be able to get into this. |
An example latency check that works well!
|
Any update? |
Hey guys!
Happy New Year!!! 🎉
Super cool project, we've recently deployed this in our clusters and it's proving to be super useful.
I was wondering if it would be possible to add a `nodeName` label to these (`goldpinger_peers_response_time_s_*`) metrics so it would be easier to identify the target nodes being probed. The node name should be available in the `PodSpec.nodeName`.
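As a rough illustration of the request, the sketch below derives a node-name label from a host IP using only the two pod fields involved. The `Pod` type here is a local stand-in for the Kubernetes pod object (only `spec.nodeName` and `status.hostIP`), and the label names `host_ip` and `node_name` as well as both helper functions are assumptions for illustration, not Goldpinger's actual metric schema.

```go
package main

import "fmt"

// Pod is a local stand-in carrying only the two Kubernetes pod fields
// this sketch needs.
type Pod struct {
	NodeName string // corresponds to spec.nodeName
	HostIP   string // corresponds to status.hostIP
}

// NodeNamesByHostIP builds a hostIP -> nodeName lookup from the peer
// pods, skipping pods that are not scheduled or have no host IP yet.
func NodeNamesByHostIP(pods []Pod) map[string]string {
	names := make(map[string]string, len(pods))
	for _, p := range pods {
		if p.HostIP == "" || p.NodeName == "" {
			continue
		}
		names[p.HostIP] = p.NodeName
	}
	return names
}

// TargetLabels returns the label set for one ping target. The label
// names are hypothetical. An unknown host IP yields an empty node_name
// so the label set keeps the same keys for every series.
func TargetLabels(hostIP string, names map[string]string) map[string]string {
	return map[string]string{
		"host_ip":   hostIP,
		"node_name": names[hostIP],
	}
}

func main() {
	// Example values are made up for illustration.
	names := NodeNamesByHostIP([]Pod{
		{NodeName: "node-a", HostIP: "192.168.1.10"},
		{NodeName: "node-b", HostIP: "192.168.1.11"},
	})
	fmt.Println(TargetLabels("192.168.1.10", names))
}
```

Since each host IP maps to one node name, a label derived this way mostly adds context to existing series; new series would appear mainly when a node's IP or name changes.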