Big Bang Nightly CI Failure (2/22/23)
https://repo1.dso.mil/big-bang/bigbang/-/jobs/19043134
Failure while waiting on pod/deployment health:
error: timed out waiting for the condition on deployments/neuvector-prometheus-exporter-po
Activity
- Micah Nagel assigned to @micah.nagel
- Author Guest
neuvector pod/neuvector-prometheus-exporter-pod-5cc64b9b-5fb5l 0/1 CrashLoopBackOff 10 (27s ago) 27m
Exporter pod is hitting a crashloopbackoff repeatedly.
- Author Guest
Events do not provide any additional insight into the failure. I think we've run into similar failures in some previous runs:
- Micah Nagel added statusdoing label
- Micah Nagel added teambigbang label
- Micah Nagel added 1 deleted label
- Author Guest
https://repo1.dso.mil/big-bang/bigbang/-/jobs/19084141
Rob ran a debug pipeline to try to reproduce the error. The logs are not very helpful, although it looks like the consul agent is causing a crashloop on the controllers, which may explain why the exporter can't connect.
- Rob Ferguson mentioned in issue dsop/neuvector/neuvector/controller#21 (closed)
- Author Guest
Nightly is getting more and more mad about this :cry-hard:
- Rob Ferguson mentioned in merge request !2541 (merged)
- Author Guest
Temp solve to prevent nightly issues, thanks @rob.ferguson
- Micah Nagel assigned to @rob.ferguson
- Author Guest
https://repo1.dso.mil/big-bang/bigbang/-/jobs/19279241
This job has logs from the failure. A bit of testing showed that disabling NetworkPolicies oddly seems to fix the issue in RKE2.
- Developer
Addressing the missing egress rules to allow controller component access to the API server: big-bang/product/packages/neuvector!23 (merged)
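For anyone following along, here is a rough sketch of what an egress rule of this kind can look like. It is not the content of the linked MR: the policy name, the `app: neuvector-controller-pod` selector, the `neuvector` namespace, the open CIDR, and port 6443 are all assumptions for illustration. The script only builds the manifest and prints it as JSON.

```python
import json


def controller_apiserver_egress(namespace="neuvector",
                                api_cidr="0.0.0.0/0",
                                api_port=6443):
    """Build a NetworkPolicy manifest that lets controller pods reach the API server.

    All defaults are illustrative assumptions, not values taken from the MR.
    """
    return {
        "apiVersion": "networking.k8s.io/v1",
        "kind": "NetworkPolicy",
        "metadata": {
            # Hypothetical policy name.
            "name": "allow-controller-egress-apiserver",
            "namespace": namespace,
        },
        "spec": {
            # Assumed label for the controller pods.
            "podSelector": {"matchLabels": {"app": "neuvector-controller-pod"}},
            "policyTypes": ["Egress"],
            "egress": [
                {
                    # Assumed API server address range and port; tighten per cluster.
                    "to": [{"ipBlock": {"cidr": api_cidr}}],
                    "ports": [{"protocol": "TCP", "port": api_port}],
                }
            ],
        },
    }


if __name__ == "__main__":
    print(json.dumps(controller_apiserver_egress(), indent=2))
```

The printed JSON could be saved to a file and applied with `kubectl apply -f` in a test cluster, after adjusting the CIDR and port to wherever the API server actually listens.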
- Micah Nagel mentioned in merge request !2548 (merged)
- Author Guest
Kicking off an RKE2 pipeline or 2 here - !2545 (merged)
- Author Guest
k3d pipeline with 3 replicas: https://repo1.dso.mil/big-bang/bigbang/-/pipelines/1412695
rke2 pipeline with 3 replicas: https://repo1.dso.mil/big-bang/bigbang/-/pipelines/1412696
- Author Guest
Testing these to see if the NetworkPolicy fix resolved our multi-replica issues too.
- Micah Nagel mentioned in merge request !2545 (merged)
- Micah Nagel closed with merge request !2545 (merged)
- Micah Nagel mentioned in commit 4f62da08