We’d also like to see metrics on builds that have the infrastructure failure tag, or perhaps a more detailed response instead of the following:
This job appears to have stopped responding, try re-running it.
We’re having a really hard time debugging these issues when we run into them, as they’re not reproducible on our end unless we disable our build cache.