[Product Update] "Infrastructure Failure" Badge in CircleCI UI

sebastian-lerner · February 28, 2023, 5:55pm

The “job status” badges in the CircleCI UI now include a badge for “Infrastructure Failure”. An infrastructure failure is the result of CircleCI running into issues with the underlying infrastructure that the job runs on.

Previously, it was difficult to tell whether a job failed because of an infrastructure failure or because of a legitimate failure such as a failing test, since both cases resulted in a “Failure” badge.

[Image: infra_fail_badge]

xakraz · April 11, 2023, 3:02pm

Hi there!

Many thanks for this new addition, it is very timely.

We have just switched our workflows to CircleCI self-hosted runners running on GCP preemptible VMs.
Due to the nature of these VMs, the CircleCI agent gets killed and the CircleCI “control plane” no longer receives updates from it. As a consequence, some of our workflow jobs end up in this new infrastructure fail state.

Are there some metrics available we can monitor to measure the number and rate of infrastructure fail jobs?

This would be very handy for us to measure and prioritize troubleshooting for these kinds of issues.

sebastian-lerner · April 12, 2023, 2:17pm

Unfortunately there are no metrics of that kind. I’ll pass along this feedback to the relevant team at CircleCI to see if this might be something they would consider enabling.
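Until something like that exists, the count and rate asked about above could be computed client-side from whatever job records you already collect. The following is only a minimal sketch: it assumes each record is a mapping with a `status` key and that infrastructure failures carry the value `infrastructure_fail`. Both the record shape and the helper name `infra_failure_rate` are hypothetical, so check them against your own data before relying on the numbers.

```python
from collections import Counter
from typing import Iterable, Mapping, Tuple

# Assumption: infrastructure failures are recorded with this status string.
# Verify the exact value against your own job data.
INFRA_FAIL_STATUS = "infrastructure_fail"


def infra_failure_rate(jobs: Iterable[Mapping[str, str]]) -> Tuple[int, float]:
    """Return (count, rate) of infrastructure-failed jobs among the records.

    Records without a "status" key are counted as "unknown" and still
    contribute to the total.
    """
    counts = Counter(job.get("status", "unknown") for job in jobs)
    total = sum(counts.values())
    infra = counts[INFRA_FAIL_STATUS]
    # Avoid dividing by zero when no jobs were supplied.
    rate = infra / total if total else 0.0
    return infra, rate
```

Feeding the function one batch of records per day or per week would give a rough trend line for prioritizing troubleshooting.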

xakraz · August 9, 2023, 12:41pm

Hi @sebastian-lerner,

Any news on this topic?

We would like to monitor such things and provide some SLA to our dev teams.

Thank you

sebastian-lerner · August 9, 2023, 3:45pm

Unfortunately there are no updates on our end. It is still something the relevant team is considering, but there are no timelines to share.

vijaytholpadi · June 1, 2024, 4:05am

Curious,

Do the credits consumed in infrastructure failure builds/workflows get counted against billing for commercial plans?

We have been seeing these infrastructure failures quite often after switching to M1 machines as part of CircleCI’s Intel deprecation plan.

Reference build

sebastian-lerner · June 5, 2024, 11:21pm

@vijaytholpadi I don’t believe infrastructure failure builds consume credits. Let me know if you are seeing different behavior.

pguinard-public-com · October 11, 2024, 8:48pm

We’d also like to see metrics on builds that have the infrastructure failure tag, or perhaps a more detailed response instead of the following:

This job appears to have stopped responding, try re-running it.

We’re having a really hard time when we run into these issues as they’re not reproducible on our end unless we disable our build cache.

borfig · February 23, 2025, 12:36pm

I have just noticed that our builds are failing due to this new error.
The same commit but different branch works for some reason…

amogh-orolabs · May 5, 2025, 12:34am

We are running into a similar issue. The AWS EC2 instance has the same resource specifications as the previous CircleCI-managed resource, but the job is failing with “This job appears to have stopped responding, try re-running it.” and the “Infrastructure Fail” badge. How can we troubleshoot this?

wyardley · June 26, 2025, 9:40pm

A few weeks ago, we’d been running into a lot of timeouts on jobs that had previously been working; eventually, these frequent timeouts stopped again, even after we reverted some tweaks / changes we’d made to try to resolve them.

Today, we got the “infrastructure failure” badge specifically, despite all tests seeming to have succeeded across 12 containers.

Are there any infra level issues that have been ongoing recently that might not have been well communicated about?
