GitHub was down

  • I would love to see a chart of traffic to other sites when GitHub goes down. My bet is that HackerNews and Twitter both get significant spikes from all those bored developers.

  • When I break our GitHub webhooks, I joke it's time for people to practice our Disaster Recovery (DR) procedures. In all seriousness, this is a good opportunity to practice work without GitHub. Any service can go down; can you deploy a critical bug fix without it? If not, why not and what can you do to fix it?

  • I had to change a username from capitalized to uncapitalized and use my updated remote afterwards, apologies if I broke it for everyone.

  • If anyone is interested, I've been working with a git host that is actually distributed across a p2p network using SSB.

    see:

    https://github.com/clehner/git-ssb

    https://github.com/noffle/git-ssb-intro

    It's been working fairly well so far. We are using git-ssb to manage a few projects instead of putting them into Github.

  • Status now shows Major Service Outage:

    12:32 EDTMajor service outage.

    https://status.github.com/

  • Pages Builds Failure Rate spiked to over 2000%. I don't know how that's possible, but it seems pretty bad.

  • Insert remark on why we use a centralized service for a distributed source control system, etc. No one seems to care, unfortunately

  • Looking at the status graphs, it seems like there was some clearly anomalous data starting around midnight, about 9 hours before the actual outage "began". Maybe a gradual botnet ramp-up, and 9:27 AM is when it got bad enough to overload some critical service? (Or really any other threshold-based failure scenario.)

  • What was happening to Github for a week or so in late June - early July? I see "The status is still red at the beginning of the day" for a whole week.

    https://status.github.com/messages/2017-07-03

  • Do these general Github outages affect GH Pages as well, or is that service portion segmented to some degree?

  • I think it started as minor as I was receiving a unicorn once per 10 pages. It's currently happening on almost all.

    Of course, I'm trying to dig into a WebKit issue and need the issues to load!

  • Where is github hosted?

    Do they use AWS or another commercial cloud provider, or do they have their own servers in data centers (hopefully scattered around the globe)?

    If AWS, are their services spread among multiple availability groups? I'm just wondering how this could happen.

  • Github is back online.

  • It has leveled up to a major outage!

  • Dang. It's too bad their customers' source control files aren't distributed and decentralized, or they could keep working and ignore this.

  • Anyone have any knowledge of what specifically happened?

  • I saw a comment earlier mentioning that GitHub allegedly doesn't release post mortems publicly? If this is true, that's upsetting.

  • My apologies. I knew my Perl 6 wrapper for GLFW was bad, but never realized it'd be so bad that GitHub would choke to death on it.

  • Are there any other major sites that are down?

  • It just became a major service outage.

  • This is happening too frequently now.

  • It's starting to work again for me. I was able to approve a PR and merge it.

  • Looks like I am still able to push to/pull from my repos without issue.

  • undefined

  • Whatever happened to gittorrent?

  • Everyone calm down. GitHub is working just fine.

  • Well you all know to commit your vendor libs so you can still deploy when Github is down, right? RIGHT?!

  • Pretty shocking how accustomed we've all become to Github outages. Mission critical developer tools should have gratuitously high uptime. Github is a 100M company whose most important operational objective is just keeping the site running. I call shenanigans.

  • undefined

  • Thoughts on the cause?

  • How does this affects all your dependencies?

  • >GitHub is having a minor service outage

    It's definitely not minor.

  • I knew I shouldn't have released the new version of my project yesterday. :p

    Sorry everyone

  • Githubs uptime is pretty bad. Isn't it under 95% for the year now?

  • In the face of a lack of information, HN comments begin to throw around unfounded speculation & tongue-in-cheek jokes run rampant. I suppose that in the absence of information, many stay silent, & the remaining see a thread lacking comments

    & now we've got this meta one in the mix

  • undefined

  • How many more can we expect before they develop appreciation for testing _before_ they push to prod?