At 22:52 UTC on 21 October (15:52 PDT), GitHub experienced a network partition and subsequent database failure. This has caused intermittent issues with webhook delivery and other events that CircleCI depends on to manage your CircleCI workflows and jobs. The downtime has also prevented us from making API calls to GitHub to check on authorization and project/organization status.
Until GitHub has ended their outage, we will be unable to know fully what changes or issues this has caused with your projects or jobs within our system. Furthermore, when GitHub does start delivering webhooks again, we will see a surge of jobs starting, and we will immediately scale in response and remain overprovisioned until the surge is complete.
A quick script I put together for our team to trigger builds for now.
Aside, I’m surprised the API doesn’t return a reference to the newly created workflow.
#!/bin/bash
PROVIDER=github # or bitbucket
ORG=my-project-org-or-user
PROJECT=my-project-name
if [ -z $CIRCLECI_TOKEN ]
then
echo "Create a token at https://circleci.com/account/api and export it as CIRCLECI_TOKEN."
exit 1
fi
if [ -z $1 ]
then
echo "Usage: $0 <branch>"
exit 2
fi
echo "Triggering build for branch $1..."
curl -X POST -H "Content-Type: application/json" \
-d '{"branch": "'$1'"}' \
https://circleci.com/api/v1.1/project/$PROVIDER/$ORG/$PROJECT/build\?circle-token\=$CIRCLECI_TOKEN
is there any way to see the depth of the queue? the work day is winding down where i’m at and it might be better to call it early if we’re looking at several hours of backlog still to go.
i was asking about the depth of circleci’s queue. github was reporting that they’d finished their backlog while you were reporting that you still had a backlog. it was a noticeably missing feature where, for example, travis ci’s response to the outage was to show the depth of their queue generated by github reopening the floodgates.
I misunderstood. I dont know that we had a queue, but if we did i dont remember seeing it shared in our incident channel. I know we ramped up capacity to prepare for it, but im not in a place to say if we did or not.
Edit: there were updates i missed after i signed off yesterday. It looks like we had queue spikes, and they were quickly cleared. The notes on status.circleci.com are the most acurate since they came from our SRE team. I apparently mispoke, sorry.