Incident with Actions and Pages

a10c · 2026-05-26T12:11:34 1779797494

My action failed with "Unexpected error fetching GitHub release for tag refs/heads/master: HttpError: Sorry. Your account was suspended"

Which certainly made me shit myself, briefly.

neya · 2026-05-26T15:02:08 1779807728

It's an eye opener. Think about it - today, it was a mistake. But, what if it really happened? What if you really lost access to all your years of hard work? It's a wake up call. A blessing in disguise to store what matters to you the most locally, backed up offline. Never trust any single provider. Be it MS or Google or Apple. RAID is the way.

onion2k · 2026-05-26T15:29:35 1779809375

People should use something that keeps a local copy of their code and just copies it to Github and to other contributors with a sync process to push and pull changes. Some sort of 'distributed source control system' maybe. Then people would only need a 'hub' to connect to people, and it'd be easier to move somewhere else.

gopalv · 2026-05-26T16:18:54 1779812334

> Some sort of 'distributed source control system' maybe

The day it broke away and became centralized was when we had a PR + mandatory "Required actions" to merge to main.

ruszki · 2026-05-26T18:43:13 1779820993

That’s only mandatory on the “hub”. I can do that locally anytime.

bergie · 2026-05-26T23:09:49 1779836989

I'm looking at setting up rngit mirrors of all my repos on our boat NAS. Conceivably it also allows issue tracking and collaboration without centralized infra

https://reticulum.network/manual/git.html#mirroring-reposito...

marricks · 2026-05-26T16:37:46 1779813466

I like how tech seems to be all about stacking more and more turtles on top of each other:

Gosh, it's hard figuring out what changes Lorne made if only we had a system to merge those changes. Enter git

Gosh it's hard figuring out what packages Rachel had to make this work. Enter rubygems/pip/npm

Gosh it's hard figuring out sync these changes across a network. Enter github

Gosh it's hard figuring out how to get those packages working on my operating system. Enter docker

Gosh centralizing our distributed version control software system onto one website is getting really unreliable. Enter fossil(?????)

If we go any further having one computer per business with a sign up sheep is starting to sound pretty fucking attractive.

fusishch · 2026-05-26T15:57:43 1779811063

What you just described is Fossil. It has an auto-sync feature that makes everything feel distributed.

Just set up a Kubernetes deployment and you’re set.

But as others mention, GitHub’s primary strength is collaboration. If you want decentralized, solve this by creating a decentralized collaboration tool on top of fossil and/or git.

For example, how to do pull requests and code reviews?

40four · 2026-05-26T16:37:36 1779813456

Why they just described is Git :) pretty sure it was a joke

coldpie · 2026-05-26T15:42:39 1779810159

This gets tiresome. Github is a lot more than a host for Git repositories. If you want to suggest that people use something else, you need to suggest a replacement that has the features people use Github for.

ornornor · 2026-05-26T15:46:33 1779810393

Increasingly less and less so as they “upgrade” their offering and have more and more downtime.

danudey · 2026-05-26T16:31:31 1779813091

I think you missed the joke, which is that the parent poster you're replying to is suggesting a 'solution' to the problem which evolved in complexity until he was just describing Github again.

doctorpangloss · 2026-05-26T16:07:28 1779811648

yeah, #1, it is free private file storage, and #2, it's a download portal for free as in beer software replacing paid offerings. that's what it is for 99.99% of people.

being a host for git repositories has never been its core competency. neither has its groupware offering.

does it even serve OSS well? a very interesting criteria is, "Have mature or adopted end-user-facing OSS recently merged a large PR from an unallied contributor?" The answer is overwhelming no. This is why there is so much innovation in this space.

mpaco · 2026-05-26T15:51:55 1779810715

I recently got my GitHub account suspended for 4 months. When it was finally reinstated, their support just said it was a "mistake".

Proudly self-hosting Forgejo since then.

MatthiasPortzel · 2026-05-26T16:23:23 1779812603

This happened to me as well—thankfully not my personal account that I use for work, but the organization associated with an open source project I worked on was suspended. It similarly took 2 months for GitHub to restore the organization.

> Our team is currently experiencing an unexpectedly high volume of tickets which has resulted in longer response times than we prefer. We acknowledge the long wait and apologize for the experience.

> Sometimes our abuse detecting systems highlight accounts that need to be manually reviewed. We've cleared the restrictions from your account…

Fully self-hosted IMO can be an overcorrection. The issue isn’t “relying on other people”—it’s relying on GitHub, when they’ve made it clear they don’t care about uptime and they don’t care about support turn-around-time.

corvad · 2026-05-26T15:33:56 1779809636

RAID is not a backup.

PokemonNoGo · 2026-05-26T15:51:25 1779810685

They... Didn't describe RAID? More 3-2-1.

filleduchaos · 2026-05-26T15:56:28 1779810988

The last sentence in the comment is literally "RAID is the way".

jrockway · 2026-05-26T16:20:38 1779812438

I think they were intending to evoke the image of RAID rather than literally referring to a redundant array of inexpensive disks. You host your code on Github, Gitlab, and at home, then you survive a Github outage. It's a redundant array. Not sure it's inexpensive, though.

iso1631 · 2026-05-26T16:14:50 1779812090

Well yes, my git repositories sit on my laptop, that's the entire point. If github banned my country because its president has a tis, I can push my entire commit history to another company. Same with anyone else who's working on it.

It would be a pain as I'd have to set up a few integrations again, but github is far lower down the risk scale than the vast majority of SAAS providers

grim_io · 2026-05-26T12:18:53 1779797933

A brownout redefined.

DonHopkins · 2026-05-26T14:51:51 1779807111

ShitHub

https://www.youtube.com/watch?v=LGeOee7x5lY

lachieh · 2026-05-26T14:33:51 1779806031

Good thing I'm wearing my brown pants today.

drcongo · 2026-05-26T12:26:38 1779798398

Same. It's weird how I always find out that GitHub is down before GitHub does. Took 15 minutes before it appeared on githubstatus.com

jaapz · 2026-05-26T12:28:55 1779798535

All these monitoring rules are of the format "when 500 errors > baseline for x minutes". Otherwise you'd have monitoring alerts every second. So it is normal for users to already see errors before github officially counts it as an outage.

logifail · 2026-05-26T14:28:31 1779805711

> All these monitoring rules are of the format "when 500 errors > baseline for x minutes". Otherwise you'd have monitoring alerts every second. So it is normal for users to already see errors before github officially counts it as an outage.

Is it true that official service status pages are updated automatically?

baby_souffle · 2026-05-26T15:00:10 1779807610

> it true that official service status pages are updated automatically?

Depends. Typically no because there’s an art to crafting the actual message around impact… but sometimes yes it is automated

hnlmorg · 2026-05-26T13:12:14 1779801134

You'd expect them to be monitoring more than just the HTTP response codes from user requests for precisely this reason.

If the first they hear of an outage is when user requests start to fail, then that's a failure in their monitoring as well.

But effective monitoring is harder than people assume.

dncornholio · 2026-05-26T14:16:58 1779805018

> If the first they hear of an outage is when user requests start to fail, then that's a failure in their monitoring as well.

Isn't that what monitoring actually is? The issue seems to be in their testing, not monitoring.

hnlmorg · 2026-05-26T14:33:29 1779806009

No, monitoring for HTTP response code is a subset of observability and not one that generally gives you the best insights into which subsystems are misbehaving nor why.

There are synthetic tests, where you can generate API request calls or even simulate an entire user journey. These allow you to control the user agent, the payloads, and thus you know anything errors back are actual errors. These are triggered by the observability platform (think like running a cron-job) and thus you're not tied to user activity to see when problems arise.

There are other metrics outside of HTTP response codes too. Think like free RAM, CPU usage, disk space, etc. This is just naming some obvious ones because these types of metrics are generally bespoke to the type of application your monitoring. And with these types of monitors, you'd not just have an alert when things have failed, but ideally have alerts when an irregular trend is showing that things are likely to fail too. This latter type of monitors helps you get ahead of the problem before it become customer facing.

Then you have more traditional stuff like logs. This will also be bespoke to the application. But you'd expect errors in logs to get surfaced quickly. Assuming Github have good hygiene in what's being logged.

Tie that up with APMs, RUM, and other goodies like that and you'll have diagnostics to investigate issues when they appear.

(this is just a super high level view of observability too)

lokar · 2026-05-26T14:48:33 1779806913

Even a synthetic probe needs a few failures to trigger an alert.

You should not alert on cpu, ram, etc

hnlmorg · 2026-05-26T15:09:51 1779808191

> Even a synthetic probe needs a few failures to trigger an alert.

It doesn't "need" that. That just how most people set it up because it’s an easy sane default that allows for network jitter without inexperienced engineers thinking about different conditions triggering different types of responses.

If you’re measuring internal APIs from an observablity solution that’s has nodes already inside you’re network enclave, then there is a strong argument for alerting early.

> You should not alert on cpu, ram, etc

That’s not true to say as an absolute statement. And a generalisation it heavily depends on the system your monitoring and how it behaves under pressure.

But in any case, I wasn’t suggesting CPU alerts were the end goal. I said:

> these types of metrics are generally bespoke to the type of application your monitoring.

Ie you’ll use metrics but those metrics will be highly specific.

The CPU examples were an illustration as to what a “metric” is (it might seem obvious but not everyone is an expert) but the point was HTTP response codes aren't the only types of metrics one should be capturing and watching.

lokar · 2026-05-26T15:48:26 1779810506

Ah, yes, I misunderstood. And I have seen cases where a direct CPU alert makes sense, but 99 times out of 100 times I see it, it's nothing but trouble. Worse, I tend to see the cpu alert when there are no end to end synthetic alerts, 500 alerts, queue depth alerts, etc.

If your requests are fast and cheap, you can probe frequently relative to your goals, but often that's not really possible (think, long SQL queries, or scheduling a container/pod). There you need several datapoints, or possible fewer augmented with other signals.

hnlmorg · 2026-05-26T16:02:00 1779811320

Yeah very true.

Talking about long SQL queries, I quite like throwing CPU alerts on database servers. They'll be a low priority alert (ie no out of hours "pagers") so just something that goes into a slack channel. But they're a good indicator of when developers have poorly optimized SQL, or the DB schema is poorly defined (eg missing indexes), or the DB server itself is poorly sized.

This wouldn't be something you'd expect to need in production and definitely not something you'd rely on as a notice of a production outage. But it is an example of one of those 1% occasions where a CPU alert does add value to the overall observability of the application.

But this also ties into your excellent point about how you'd use CPU and other data points to build a picture of what's happening in your application.

lokar · 2026-05-26T16:18:06 1779812286

Oh, I was thinking about it as the person running SQL as a service. People run queries that go on for days....

idle CPU is often wasted CPU

re-thc · 2026-05-26T17:23:49 1779816229

> But effective monitoring is harder than people assume.

Who says public status page equals internal monitoring.

They likely know faster than you. Whether they post it publicly is a different issue (hint: SLA penalties, news impacting stock etc)

hnlmorg · 2026-05-26T17:40:28 1779817228

I never mentioned anything about status pages.

Are you sure you’re replying to the right comment?

re-thc · 2026-05-26T18:39:04 1779820744

> I never mentioned anything about status pages.

For context, the parent comment you replied to started with status page.

Then are you talking about internal leaks or just guessing? Otherwise besides what's public how do you know they don't know?

hnlmorg · 2026-05-26T19:45:38 1779824738

It was two comments prior to mine that mentioned status pages.

Someone then replied about how it takes a bunch of HTTP response errors for problems to be alerted and thus I commented that application observability would consist of more than just waiting for users to hit errors.

echelon · 2026-05-26T12:41:18 1779799278

In a high performance service with good maintenance and upkeep, you page for all 500s. A noisy pager forces the team to fix the 500s.

Maybe the Github Actions infrastructure isn't run like that.

edit: my oncall rotation notified on all 500s, 24/7, not just rates - https://news.ycombinator.com/item?id=48279262

Doohickey-d · 2026-05-26T12:53:11 1779799991

Im curious about this: because in my experience (working on smaller services though), a small number of errors is always there, as a "baseline".

Recently there was this: https://news.ycombinator.com/item?id=47252971 "10% of Firefox crashes are caused by bitflips"

Which makes me think a small amount of random issues which happen even though nothing is broken, is normal everywhere. Especially once move things around on a network, there's potential for a lot more random errors.

bobthepanda · 2026-05-26T14:38:59 1779806339

It’s where monitoring for 9s is more important at that scale than absolute errors. So long as degradation is graceful or retried it should not be a massive problem.

It does require constant tuning and adjustment though.

KPGv2 · 2026-05-26T13:24:23 1779801863

Bitflips are something that can happen in consumer-grade RAM, so that tracks (and it's comforting that wayward cosmic rays are a substantial reason for an application's crashes!), but on enterprise servers, they will run ECC RAM that is very resistant to bit flips.

This is why data hoarders who have NASes with lots of space insist on running their servers with ECC RAM despite it being significantly more expensive. Because bit flips, for all intents and purposes, cannot happen. The RAM itself detects and corrects for them.

I wouldn't expect bit flips to be a significant contributor to enterprise problems.

Anon1096 · 2026-05-26T13:41:05 1779802865

Bitflips specifically may not be; things like network issues, noisy neighbors, row/rack/host maintenance (leading to a downed and migrated host) absolutely are things that happen at high frequency at scale and cause your background level of errors to be more than 0.

maccard · 2026-05-26T13:41:52 1779802912

You've completely missed the point - It's not about bitflips it's about errors that are outside the scope of what's fixable.

KPGv2 · 2026-05-26T19:28:50 1779823730

I suppose I misunderstood what the "random error" was supposed to mean. I wouldn't call a network error a "random error" because it's caused by things that are internal to the system (entities using a network). A bit flip is caused by an external factor: cosmic radiation. To me, that's what a "random" error is.

If your network goes down because of a DDOS, or part of your system overheating, that's an internal issue you had control over.

If a bit flips because of cosmic radiation, you can't really do anything about that, and it's utterly unpredictable. That's "random" to me.

TheDong · 2026-05-26T12:49:40 1779799780

Do you know of a single service at a single company that actually does that?

I know all of Gmail, every GCE service I can think of, every AWS service I can think of, Amazon.com, Netflix, and Github all do not page on just a single 500.

I know none of those are particularly "high performance" though. Curious where your experience is coming from.

CBLT · 2026-05-26T12:59:50 1779800390

I've been oncall for a different G service that nearly paged on every error. It used the standard error budget tooling, but on hundreds of user buckets because the engineering around locality-specific configuration was... suspect. Many of these buckets had single-digits user. If a user was on their phone and lost signal, I was paged. Very poor oncall experience.

theta_d · 2026-05-26T14:02:55 1779804175

The sub-service at IBM cloud I worked on had an insanely small error budget such that pages were nearly constant. On call was hell week until a few of us insisted on fixing the issues. The "few" of us were contractors. The employees seemed more than willing to just let the pages continue.

alexfoo · 2026-05-26T16:46:28 1779813988

Some companies pay more if people are paged. It can create a perverse incentive not to fix problems or, in extreme cases, to watch things going wrong, waiting for the page, and then being ready to fix it straight away.

echelon · 2026-05-26T13:04:08 1779800648

I worked at a large fintech moving billions of dollars in volume a day.

I had a fairly long tenure, where I maintained multiple key services in critical online payments flow. Authentication, authorization, core business and risk data, as well as some cross-cutting control plane stuff, etc. You needed one or more of our services to take a payment, serve any request from the employee dashboard - pretty much everything hit our services. The entire company ground to a halt without my team.

We paged for every single 500. In instances where a particular class of 500 was spurious or not worth fixing, we would leave it acked or mark it as noise. But typically we'd just put in a fix as soon as possible so we didn't page.

Our graceful shutdown and traffic shaping stack was great, but occasionally we'd get a few pages during deploys or failovers.

Oncall was typically not bad, but when it did get bad it was terrible. I've been involved in huge outages that cost hundreds of millions of dollars. Usually it was the fault of multiple teams having compounding runaway failures rather than one service or bug in particular.

It's inexcusable to have a customer's payments not go through. We engineered around resilience. We had strict five nines SLAs and p99 targets and evaluated our adherence with even the smallest partial outage. Hundreds of other services depended on ours, and downstream impacts were huge, so we had to keep a tight ship.

We didn't have "business hours"-only paging either as our platform was available globally, including a heavy install base in Asia.

sunrunner · 2026-05-26T13:33:17 1779802397

> We paged for every single 500.

Assuming the existence of some kind of network (with zero guarantee of 100% reliability), how does this work in practice? Is each 500 treated as an event that needs investigation, even if the result of that would end up as 'a router dropped something from an internal buffer but the transaction as a whole was re-tried by a parent so the service itself recovered'?

LPisGood · 2026-05-26T14:03:10 1779804190

A reliability engineer from Jane Street gave a great talk about this, five nine’s of correctness in reporting, etc isn’t enough for the SEC.

https://youtu.be/zR9PpXWsKFQ

eithed · 2026-05-26T14:36:05 1779806165

Client network timeout shouldn't result in 500. With 408 and retry you should, dependent on the business criteria, get either an upsert (transaction is retried) or 422 (validation that given entry already exists).

Even if it's "DB in datacenter I tried to save to was hit by meteor" event, you can cater for this not to result in 500 (ie - DB unreachable, retry in a couple of minutes); the question is if you want to.

compumike · 2026-05-26T14:00:16 1779804016

Re: "page for all 500s": there's a world of difference between "page me with a critical alert at 3am" and "notify me on Monday morning when my normal workday starts". At the extremes:

If my DB health check endpoint is returning 500s for N consecutive checks over M minutes, yeah, please wake me up at 3am!

If one user hit a weird edge case in form validation and got a one-off 500, please don't! We can fix that on Monday.

Not always easy to distinguish those clearly or configure those business hours rules, but for my team at https://heyoncall.com/ that is the goal -- otherwise your team burns out fast. Waking up someone at 3am has a real cost, so you better be sure it's worth it.

wasmitnetzen · 2026-05-26T14:25:16 1779805516

Shouldn't Github be large enough to not have anyone on-call, but just rotate the responsible team around the world?

alexfoo · 2026-05-26T16:43:27 1779813807

One team can't troubleshoot AND FIX every possible subsystem, so you just end up with lots (growing to hundreds) of people "on-call" anyway.

As others have said, follow-the-sun type models do exist, usually staffed by people in their normal working hours (EMEA, Americas, APAC) but this means you've still got to cover the weekend and public holidays (which there are a lot of when you factor in plenty of different countries).

Where you need a quick response you can have a core ops/noc team that looks at things with lower thresholds and shorter windows, and their job is to do the initial triage and then page the appropriate team earlier than they would have been alerted by their own alert thresholds/monitoring.

Actually clicking the button to change the status on a public status page is a whole different topic that becomes very political in certain companies.

bobthepanda · 2026-05-26T14:37:29 1779806249

At least when I worked at a Bigcorp a lot of that was being cut to save costs.

lokar · 2026-05-26T15:56:35 1779810995

I've worked in large orgs where we could (at at some times did) have around the world rotations. They don't work well. It've very hard to maintain real team cohesion, and you end up with really superficial operations. People tend not to dig in really deep, find good fixes, etc. Lots of superficial bandages.

awithrow · 2026-05-26T12:53:31 1779800011

that is absolutely not the case for any system of size and scale. that would just burn out the on-call team and not result in improvements. Error rates/budgets are used instead.

hnlmorg · 2026-05-26T13:36:35 1779802595

It depends what you're monitoring. If it's response codes from user generated queries, then I'd agree with you.

But if it is synthetic queries sent from the monitoring platform, then you control the user agent, payload, and endpoints. So any failed requests are a symptom of a misconfiguration and/or failure that should be investigated. Albeit not necessarily as a P1 priority.

hvb2 · 2026-05-26T14:02:26 1779804146

> A noisy pager forces the team to fix the 500s.

I'm sure you're not in ops. Or in a dev org of a service with decent request rates.

What you're asking for is a service to fail silently. There's no way a service with a decent request rate to have 0 500s. Not when it still sees development.

A 50 year old bank API? Maybe...

rhyperior · 2026-05-26T14:06:33 1779804393

You only do this when you’re trying to use incident management as a hammer to make a point to somebody whom you have otherwise failed to convince to fix something through persuasive argument. Ie, it’s punitive.

swiftcoder · 2026-05-26T13:54:51 1779803691

Yeah, no, nobody runs cloud services like that. At AWS most alarms required failures in 3 consecutive 5 minute periods. Critical things could be on 3 consecutive 1 minute windows - but that alarm starts a 15 minute escalation for the oncall engineer to check in, and they have to validate the issue isn't a false alarm before updating the status page would even be considered

jordemort · 2026-05-26T13:10:38 1779801038

forget it, Jake; it’s Azure

registeredcorn · 2026-05-26T15:29:07 1779809347

I'm not arguing with what you're saying, but it does make me wonder: What exactly is the point of the status page, if "it is normal for users to already see errors before GitHub officially counts it as an outage"?

Is it more so to have something to link to for managers who aren't using the service have a pretty bar to look at and feel like they are "doing something"? Or is it more of a kind of a way to prevent confirming what you already suspect to be true. E.g. "Huh. Me and Jim are seeing problems. How about you Tom? Oh wait, crud. The service page is confirming it's down now. Never mind! Who wants coffee?!"

filleduchaos · 2026-05-26T16:03:05 1779811385

There is oddly enough a middle ground between "zero errors whatsoever" and "outage".

simonjgreen · 2026-05-26T12:30:04 1779798604

More likely that 'update the Status site' lives a long way down their incident response plan, and they have alarms going off well before that

jordemort · 2026-05-26T13:09:05 1779800945

yeah I mean a company the size of GitHub certainly can’t be expected to have enough staff to walk and chew gum at the same time

swiftcoder · 2026-05-26T13:51:45 1779803505

If it's like other BigTechs I have worked at, you need director-level signoff and comms team approval to post an outage notice

PunchyHamster · 2026-05-26T13:54:22 1779803662

it should be automatic tho. Probably isn't so they can at least get the one nine on availability

simonjgreen · 2026-05-26T14:35:34 1779806134

Marketing definitely takes interest in status sites

re-thc · 2026-05-26T12:40:33 1779799233

> It's weird how I always find out that GitHub is down before GitHub does

No, it's not. Official updates = potential SLA penalties. Always requires approval.

drcongo · 2026-05-26T15:56:16 1779810976

This is the most plausible reply.

chrisjj · 2026-05-26T16:30:46 1779813046

> githubstatus.com

There's a threshold. It shows only once 1000 users complain.

/i

ridiculous_leke · 2026-05-26T19:15:13 1779822913

> Which certainly made me shit myself, briefly.

Can you sue companies for inducing such anxiety?

Imustaskforhelp · 2026-05-26T19:21:26 1779823286

IANAL, but I can probably imagine a case being made if a person really got so stressed that for example any health condition got invoked from the stress. It might be up to the lawyer to explain how exactly the service caused the stress and its direct relation to health condition though and up to the judge.

but I suppose that there might be some terms of conditions within using github (ahem Microsoft) that you can probably not sue them for something like this.

It really depends upon the severity of situation (imo)

For example, if a person had any heart condition and they got so stressed because of an error at github (which to be fair, I can understand the stress part, imagine losing some part of your software because it was on github and the amount of direct damage to livelihood if your income depended on it)

and I think that the judge might have to be in just the right technical know-spot as well and someone who can understand the situation from programmer's perspective hopefully.

Then I can see a case being made.

once again not a lawyer but an interesting question, would love reading other replies to your comment.

also for what its worth, you can sue any company for X,Y or Z. The question worth asking is if you can win such lawsuit.

Personally I believe it might be hard but not impossible but for all practical use cases it might as well be but the only answer can probably be found in court. I am just guessing at this point.

dvduval · 2026-05-26T13:15:50 1779801350

Yes, Thais can be be really frustrating when you’re trying to get work done. There needs to be more competition and better alternatives and the LLMs need to offer easier connection to these alternatives.

weird-eye-issue · 2026-05-26T13:23:46 1779801826

What do the Thai people have to do with this? :(

denisw · 2026-05-26T13:39:58 1779802798

Pretty sure that they wanted to write "this", typed something different by accident, and auto-correct struck.

weird-eye-issue · 2026-05-26T13:47:05 1779803225

Oh gee thanks

superxpro12 · 2026-05-26T13:34:29 1779802469

Reminded me of the "Thai Fighter" joke from family guy's star wars spoof lol

cpfohl · 2026-05-26T12:08:38 1779797318

Wasn’t my fault this time! I haven’t started work yet.

https://news.ycombinator.com/item?id=47237377

folkrav · 2026-05-26T12:31:26 1779798686

Hah, I know the feeling. I installed Ubuntu on a PC recently, it obviously happened to be one of the days they got DDOSed and apt repos were unreachable. I had other things to take care of, so I put it aside for the next week or so. It didn't help very much, cause after picking it back up, halfway through, Snapcraft went down.

Waterluvian · 2026-05-26T12:09:41 1779797381

Yeah but you thought about it, didn’t you?

cpfohl · 2026-05-26T12:58:44 1779800324

I did....maybe my powers are growing.

thesdev · 2026-05-26T12:39:58 1779799198

Next thing you're gonna tell us you're SRE at GitHub.

JsonDemWitOster · 2026-05-26T13:05:56 1779800756

Sorry guys it might be me.

I vibe coded a script that interacts with both Gitlab and Github via their APIs and I've been using it pretty heavily since this morning. I crossed the streams! Goodness, I didn't know it would be _this_ bad!

zombot · 2026-05-26T14:29:57 1779805797

It's only natural that this kind of promiscuity provoked an allergic reaction from Microslop.

swyx · 2026-05-26T15:22:41 1779808961

> I haven’t started work yet.

spooky action at a distance

Andrex · 2026-05-26T12:25:34 1779798334

Uh oh. That means there's at least one more like you out there that we don't know about.

cpfohl · 2026-05-26T12:59:59 1779800399

I always wanted superpowers, but I never dreamed it'd be like this.

- So many super-heroes/super-villains

ramon156 · 2026-05-26T12:21:59 1779798119

Was about to send my bill to you.

... You're off the hook this time./s

bouk · 2026-05-26T12:09:54 1779797394

Insane, we have to come up with contingency plans now for long-duration GitHub outages because we can't safely do deployments. For a service we're paying thousands of $ per year for even though we host runners ourselves...

Salgat · 2026-05-26T14:52:53 1779807173

It's funny, when we were acquired they started moving us to Github actions but it seems that maybe we should stay on our old crusty self-hosted Jenkins setup...

decodebytes · 2026-05-26T12:15:02 1779797702

Same thoughts - we use an action to ship to production, its builds an image, pushes it to ECS which triggers a deployment.

We can't be blocked here. Seems silly what we settled on this, but for a long time GitHub had been reliable enough for many years, but things are sliding down the pan as of late.

mystifyingpoi · 2026-05-26T12:41:25 1779799285

Sounds like a very easy process to rewrite in bash/python and have it on hand if needed.

cryo32 · 2026-05-26T14:11:23 1779804683

You should never entirely depend on a third party service for deployments.

Been burned too many times on that one.

999900000999 · 2026-05-26T14:37:48 1779806268

Ok.

Move to EC2.

Darn AWS is down.

Alright, run it on a Mac Mini in your basement. Ahh dawn, your ISP is having issues. Good thing you have a backup 5G hotspot.

Ohh no, the power is out.

Eventually you have to trust someone else.

GitHub is a tragedy of the Commons. Too many people are using it, and Microsoft isn't willing to handle it correctly.

Feels like a very good business opportunity. Minimum 50k yearly contracts, GitHub with actual uptime. GitPro ?

cryo32 · 2026-05-26T15:16:24 1779808584

We’re actually moving back to redundant data centres due to all of those problems.

Aggregate risk is too high.

sleight42 · 2026-05-26T15:15:14 1779808514

It's almost as though GitHub should never have let itself be sold to Microsoft...

999900000999 · 2026-05-26T15:32:40 1779809560

I'm sure the VCs who invested in GitHub disagree.

This is supposed to be Hacker News! Who is coming up with a startup to fill the gap !

bee_rider · 2026-05-26T15:08:44 1779808124

Maybe we need a split between source management and distribution? The former looks like git[hub] to me, the latter maybe more like a Linux distro repo?

bouk · 2026-05-26T15:18:18 1779808698

We could still deploy manually but it's suboptimal! And we're 'flying blind' without CI runs

matt_kantor · 2026-05-26T15:33:09 1779809589

> And we're 'flying blind' without CI runs

You should never entirely depend on a third party service to run your tests, either.

cryo32 · 2026-05-26T21:56:24 1779832584

   make test

Should work without CI

dnnddidiej · 2026-05-26T12:18:59 1779797939

It is a control pain

the8472 · 2026-05-26T12:42:13 1779799333

./deploy.sh

yoyohello13 · 2026-05-26T14:43:20 1779806600

Self host gitlab. If you already host runners it’s not a big lift.

xaerise · 2026-05-26T21:04:17 1779829457

Even if there is features that are similar, most of gitlabs features are for paying customers only.

yoyohello13 · 2026-05-27T00:13:29 1779840809

OP said they already pay for GitHub. We pay for the premium tier of Gitlab at my work and it’s definitely worth it.

Cthulhu_ · 2026-05-26T14:31:33 1779805893

It's always best to be portable - always be able to do builds and releases locally (at least, once you get the keys - it shouldn't be possible by default), then add things like github actions on top as convenience.

sebmellen · 2026-05-26T12:11:04 1779797464

Same here. You’d think they could at least separate out the GitHub-hosted and self-hosted runners, so you’re still able to dispatch jobs if the self-hosted runners are down.

ketzu · 2026-05-26T12:17:03 1779797823

If the job queue is down, that wouldn't help, would it?

On my repo the jobs do not get scheduled on the PRs at all, so I assume that separation wouldn't help for todays issue.

voxic11 · 2026-05-26T13:42:00 1779802920

They have the github enterprise domain separated out and its working fine right now https://us.githubstatus.com/posts/dashboard

anon7000 · 2026-05-26T14:28:11 1779805691

I’m not convinced they actually do, because GHE on the cloud tends to have the same problems as the main outages. Probably costs extra to be “single tenant” or whatever

sofixa · 2026-05-26T12:14:36 1779797676

Depending on how many thousands of $ per year, it would probably be cheaper and more reliable to self-host GitLab. It's better in terms of organisational structure (you can have one, including access and secret inheritance), and (personal view) Gitlab-CI is better than GitHub Actions because it doesn't push you towards a JavaScript/NPM style dependency hell. And it's actually fairly easy to self-hosted, with options from a single machine with an omnibus package that handles everything to a full blown autoscaling Kubernetes deployment.

hsbauauvhabzb · 2026-05-26T12:34:51 1779798891

Sounds good until you see their cvedetails page

lazystone · 2026-05-26T13:13:13 1779801193

Hide it behind VPN, so it's not accessible from outside.

hsbauauvhabzb · 2026-05-26T23:25:33 1779837933

Now patching becomes a responsibility, unless your organisation is willing to run knowingly vulnerable software.

PunchyHamster · 2026-05-26T13:56:42 1779803802

When you own it you can just limit it into vpn-ed company users, that significantly cuts down on the area that can be hit

sofixa · 2026-05-26T12:37:36 1779799056

I mean, the GitHub Actions supply chain risks and attacks definitely compensate for any GitLab security vulnerabilities you can think of.

re-thc · 2026-05-26T12:41:34 1779799294

> For a service we're paying thousands of $ per year for even though we host runners ourselves...

Wait until you charge you for self-hosting runners.

Oh wait. They already tried.

pluc · 2026-05-26T13:54:25 1779803665

Sure. Don't use GitHub.

You can now hire me as an overpriced consultant instead of paying Microsoft.

bob1029 · 2026-05-26T14:17:19 1779805039

The last two projects I built I did the CI/CD manually with a small win32 service that polls git and builds+deploys the main service locally. It's barely 200 lines of code. Not much to go wrong. "dotnet publish" is not difficult to wrap.

The latest language models have enabled this sort of thing for me. I can integrate a mini Jenkins into every project within a 5-10 minute prompting session. This sort of code isn't hard. It's just tedious, and the LLMs absolutely rock at boring repetitive stuff. Having a win32 service start up successfully on the very first try is something I haven't experienced until 2026.

starik36 · 2026-05-26T14:55:06 1779807306

That works for relatively simple scenarios. When you have to add deploying sql changes or something having to update something in the cloud, you'd have to include a lot more plumbing.

Yokohiii · 2026-05-26T15:20:12 1779808812

In my world CI/CD and db migrations are 2 different things working together. CI/CD at heart is rather simple for many setups. Migrations need quite a lot scrutiny, you really want to mess up there. But if you run on gihub actions with 50/50 uptime, does it matter?

bob1029 · 2026-05-26T17:32:14 1779816734

Deploying SQL changes is actually trivial if you are using SQLite.

I agree in a hosted+shared SQL scenario you have to be a little bit more careful with all of this. Arguably, you should have a separate schema management phase in these cases.

But if you are just SQLite embedded in the service, you can use the user_version pragma to track schema version and perform deterministic migrations (assuming a user didn't manually jack with the file in-between).

peheje · 2026-05-26T15:07:03 1779808023

Deploying SQL changes? Why not just let the application do that on startup. Ofcourse be backward and forward compatible. SQL change only deploy.

"Update something in the cloud" <- What do you mean?

Yokohiii · 2026-05-26T15:24:46 1779809086

> Why not just let the application do that on startup.

That only works on extremely simple setups and has risks. If you have only a single server, you can stall it. Now, how to roll back?

peheje · 2026-05-26T19:10:53 1779822653

We try to keep things simple. Everything has risks. No stall, run async, backward compatible. DB handles rollback via transactions. Happy to expand if interested.

ValentineC · 2026-05-26T15:46:20 1779810380

Not reflected on GitHub Status: most of the frontier models disappearing from most people's subscriptions:

https://www.reddit.com/r/GithubCopilot/comments/1toa9tf/mode...

mcrittenden · 2026-05-26T16:34:49 1779813289

I believe that's here now: https://www.githubstatus.com/incidents/xflkh26pm7vv

sigbottle · 2026-05-26T16:11:00 1779811860

Sad times ahead.

bezier-curve · 2026-05-26T16:02:04 1779811324

Meanwhile people using a copilot proxy [1] with a third party harness have zero issues. Very clumsy enforcement if not a bug.

[1] https://github.com/ericc-ch/copilot-api

nomilk · 2026-05-26T16:06:08 1779811568

As an Indy hacker I want to see GitHub succeed, but I ditched actions years ago - (shocking) false economy. Spend entire nights pushing to actions over and over only for complaints about weird/niche dependency issues and other oddities - the cycle time's just too slow and the DX is no fun (my pain doesn't even factor in outages; just the feature itself as it's intended to be experienced). I want to spend time talking to users and building features, not debugging weird syntax or dependency issues on a remote machine non-interactively.

So why are Actions so unreliable anyway? Occam's Razor would probably suggest the domain is inherently complex/difficult; but other providers show that reliability is possible. What would Occam's Razor suggest next? Poor management..?

frisbee6152 · 2026-05-26T16:08:07 1779811687

What did you switch to, and what do you like about it?

nomilk · 2026-05-26T16:12:26 1779811946

Running tests locally. It's primitive, but incredibly reliable, and a breeze to debug if (big if) there is any dependency issue.

gchamonlive · 2026-05-26T18:37:39 1779820659

I have Gitlab with a runner on a notebook I have running as a server. Pretty solid and if you need to bail on Gitlab SaaS you can BYOI and selfhost. Plus the CI is many streets ahead of GitHub in terms of pretty much everything.

xixixao · 2026-05-26T16:46:22 1779813982

How do you ensure you or your contributors didn’t forget to run the tests?

You’d need at least some hash of sources + test results, and check that it matches that (in CI).

And you’d still deal with environment differences.

nomilk · 2026-05-26T16:59:18 1779814758

> How do you ensure you didn’t forget to run the tests?

Reasonable concern. In ~10 years of indy development, I haven't forgotten to run tests before pushing to main, ever. So setting up and maintaining complicated machinery to solve a problem that could (but never has) happened doesn't justify taking focus off other more important things, namely building.

The benefit probably increases with team size (I'm a team of 1, so I appreciate the luxury of being able to dodge CI/CD entirely).

csomar · 2026-05-26T17:39:10 1779817150

So you switched to nothing? That’s not the purpose of github actions or remote ci/cd. Anyone can run tests/builds locally.

nomilk · 2026-05-26T17:56:08 1779818168

I think it comes down to risk tolerance. For an established company that wants to avoid upsetting users at all costs, CI/CD makes sense. But for a nimble 'move fast and break things' startup, it can steal dev time for very little upside.

Say a disaster happens and someone pushes to main without running tests, 9 times out of 10 it will be of ~zero consequence (either the code works first time, it was a cosmetic change that hardly affected users etc).

I know there are horror stories and CI/CD would have prevented some of those, but IME they're just not that common nor severe for small operations, and even when they happen, only a small subset are irreversible/unfixable.

csomar · 2026-05-26T19:21:14 1779823274

That's not the purpose of a remote CI/CD. Your pipeline can be as strict or as loose as you wish. It's there to show you a log of the execution as it happened in a neutral environment (remote server).

Basically, what you are suggesting is that everyone advertises their tests/builds run on slack? Also when two devs merge their changes, who compile/tests the master branch?

nomilk · 2026-05-26T19:37:39 1779824259

I see the benefit (it avoids the “works on my machine” problem), but my rails app isn’t too fancy and works on heroku ~100% of the time when it works on the dev machine. Making an intermediate build redundant (technically not entirely but it’s just not worth the effort).

For small teams it could be as simple as everyone agreeing to ensure tests pass on main before pushing to prod.

juanre · 2026-05-26T17:02:03 1779814923

A good Makefile goes a long way.

efromvt · 2026-05-26T12:16:01 1779797761

Incredible how reliable the heuristic of "something seems off - probably github being down" has gotten these days

comboy · 2026-05-26T12:22:19 1779798139

It's big enough that every time it goes down, it surely stops somebody from pushing fix for what they currently have broken, so I wonder if status page services see some kind of ripple from github outages.

JsonDemWitOster · 2026-05-26T13:19:40 1779801580

About an hour ago I was having trouble browsing repo files in the browser and I thought "A disturbance in the force, is Github down?" Refreshed HN and loaded up their status site. Nada.

(Ofc, in a sensible universe, we just brush that off to a JS/Firefox glitch or my ISP.)

And yet, here I am. My code is not compiling, my AI isn't vibing, nonetheless I can't work! Two more hours before I can get off!

peterspath · 2026-05-26T12:46:15 1779799575

I moved a while back to Forgejo -> https://forgejo.org couldn't be happier. Highly recommended.

cryptos · 2026-05-26T13:19:31 1779801571

Looks good, but I'm not sure about security: https://bearyangry.com/2026/04/29/carrot-disclosure-forgejo-...

gib444 · 2026-05-26T13:35:11 1779802511

Looks lik a terrible source. Like someone ran Claude on the codebase, didn't analyse the results, then vibe coded a blog post. And the dustri.org link doesn't work for me

Anyway. Forgejo's response to it: https://floss.social/@forgejo/116494295922963052

jonathanbull · 2026-05-26T12:53:21 1779800001

https://www.dayswithoutgithubincident.com

jillesvangurp · 2026-05-26T13:11:44 1779801104

I've been against self hosting internal tools for a long time mainly because of the devops and other overhead. But AI based devops makes it so easy now to spin up whatever you want now that I'm reconsidering that. I use a lot of ansible for several of our deployments. At this point, most of that is managed via codex.

For Git, all you technically need is ssh access and some backup strategy for your server. It would be bare bones but workable. And there are of course plenty of OSS things that are a lot nicer than that.

I'm still using gh and gh actions and we are mostly below the freemium layer with that. But it is kind of slow and honestly a dedicated vm plus some high CPU/memory workers we can spin up on a need to have basis might be a lot faster. With GH outages becoming more common, my hand might be forced a bit.

In recent weeks, I've spun up listmonk (mailing list solution), matrix (as a slack alternative), and a few other things specific to our software stack. A github alternative would be more of the same. We don't need a lot.

The main objection is that with more moving parts to worry about, the workload for me also increases. Things need updating, monitoring, backups, alerting (and responding to alerts), etc. That sucks up my time and that is scarce.

Another reason for self hosting these days is that with agentic AI tools, self hosted things are a lot easier to integrate into agentic systems. If it is self hosted, you don't have to worry about API limitations, rate limitations, walled gardens, etc. All the traditional SAAS silos are becoming a problem from that point of view. The more locked down it is, the bigger the motive for moving away from it. That's why we ditched Slack for Matrix. Slack is hopelessly locked down and tedious to deal with. Matrix is super easy for this.

Barbing · 2026-05-26T15:27:49 1779809269

Did HN forgive Slack for their business with the kids at Hack Club? https://news.ycombinator.com/item?id=45283887

halapro · 2026-05-26T13:58:15 1779803895

> For Git, all you technically need is

Technically Dropbox is just rsync.

Also https://xkcd.com/1319/ but for maintenance.

altern8 · 2026-05-26T12:10:34 1779797434

Why do they go down so often? Is it true that the reason is that they've incorporated too much AI without human review?

insanitybit · 2026-05-26T12:13:43 1779797623

It's (a) they're under massively increased load because everyone's vibing up new projects these days, (b) they've been in a weird frankenstein "on azure but also we have our own control plane" state for years and they're pushing to no longer have that be the case.

I don't think vibecoding at Github has much to do with it.

altern8 · 2026-05-26T12:14:36 1779797676

Ah, yes. A lot more repos, commits, and most importantly huge PRs.

That makes sense. Thank you!

gilrain · 2026-05-26T12:20:12 1779798012

No, it doesn’t. Their competition is not similarly unstable, despite existing in the same world of LLMs. Think critically.

datsci_est_2015 · 2026-05-26T12:28:18 1779798498

Devil’s advocate, Pareto heuristic would let us speculate that 80% of LLM traffic would be aimed directly at the largest provider, i.e. GitHub.

abejfehr · 2026-05-26T12:53:37 1779800017

I think it’s much more than 80%, it’s probably the default recommendation and folks who aren’t technical would just accept it. Probably closer to 95% or more

necovek · 2026-05-26T16:02:01 1779811321

Isn't the relative increase more of interest? If someone was only owning 10% of the market, and they've only gotten 8% (percentage points) of the 20%-not-GH LLM-related increase, they'd still be seeing a very similar spike compared to their baseline as GitHub.

gilrain · 2026-05-26T12:48:40 1779799720

Your speculation is that their competitors would naturally not see a commensurate increase in instability while “only” handling 20% of the same crisis?

I don’t buy the excuse. I want to hitch my wagon to those “mysteriously lucky” competitors. (And have. And haven’t had similar issues to Github, since.)

datsci_est_2015 · 2026-05-26T15:00:13 1779807613

Competitors would be long tail, so a different mode of traffic entirely. Maybe they get spikes that are more easily whack-a-moled than the constant hammering that GitHub receives.

Tough to say as this is all speculative, though.

porridgeraisin · 2026-05-26T15:00:50 1779807650

It's probably a threshold thing isn't it? You wouldn't get 20% of the effect at 20% of the traffic. There's a step function in there somewhere.

vitally3643 · 2026-05-26T14:53:59 1779807239

Their competition doesn't have nearly the same scale of traffic because they don't have nearly the same scale of users or network effects.

Think critically.

ModernMech · 2026-05-26T13:45:13 1779803113

I started using an agent (Codex) on my repo and it went from a a few dozen clones to thousands (3383 this week). I dunno what the agents are doing to clone the repo so many times -- I'm not running 3000 agents or prompts, maybe 10 or so this week. But if this is typical, a 1000x increase in usage across the board can't be good on the system.

12_throw_away · 2026-05-26T15:30:50 1779809450

> I dunno what the agents are doing to clone the repo so many times

agentic "ai" is going great

cautiouscat · 2026-05-26T12:17:49 1779797869

Microsoft has boasted 30% of their code written by AI.[1] However we could only guess if AI generated code is the issue or something else, or a combination of things.

That being said there was a noticeable trend starting around 2022.[2] That being said they’ve also been doing a big migration to Azure. It’s likely a combination of things.

1: https://www.cnbc.com/2025/04/29/satya-nadella-says-as-much-a...

2: https://www.reddit.com/r/sysadmin/s/LOMPaSv3wY

jampekka · 2026-05-26T12:21:19 1779798079

The instability started well before vibecoding, in around 2018-2019, shortly after the Microsoft acquisition.

https://damrnelson.github.io/github-historical-uptime/

https://news.ycombinator.com/item?id=47591928

chilmers · 2026-05-26T12:25:46 1779798346

This gets posted every time GitHub is down. This chart is not accurate. It is based on data scraped from GitHub's status page and that data is missing historical incidents from the pre-Microsoft era.

sarchertech · 2026-05-26T12:36:50 1779799010

Yeah, it’s not even consistent with their own incident history. I spot checked it and consistently found incidents with downtime/elevated error rates in months listed as 100.00000% uptime on that chart.

Gigachad · 2026-05-26T13:25:53 1779801953

The unofficial and offical charts are both lying. The GitHub one ignores actual outages and the unofficial ones count minor display bugs in minor features as a “github outage”.

sarchertech · 2026-05-26T14:23:06 1779805386

The unofficial one has done that for years though so it’s useful for comparison. If you go back a few years it was regularly at 99.9% uptime.

Gigachad · 2026-05-27T01:37:46 1779845866

Just vibes wise, before Microsoft acquired GitHub, they added almost no features on a regular basis. These days they are adding tons of stuff every month.

When I dug in to the latest outages, they were almost all in small newer, features like all the AI stuff. The actual core GitHub platform seems much more stable than the unofficial uptime trackers propose.

cebert · 2026-05-26T12:15:31 1779797731

GitHub had a blog post about this recently. They reported a significant uptick in volume (repos created, PRs, etc.), which they attribute to AI usage and tooling.

rossant · 2026-05-26T15:00:56 1779807656

https://github.blog/news-insights/company-news/an-update-on-...

gilrain · 2026-05-26T12:18:22 1779797902

Do you really believe their competition hasn’t seen the same increase? Because their competition certainly hasn’t seen the same instability issues.

abejfehr · 2026-05-26T12:56:27 1779800187

Yes, I truly believe that GitHub is recommended by an LLM orders of magnitude more frequently than any other forge

dawnerd · 2026-05-26T14:47:44 1779806864

I’ve interviewed a lot of people and when asking about their git experience they’ve said they use GitHub. To a lot of devs they are the same thing.

rwmj · 2026-05-26T12:55:53 1779800153

This plus in a well-designed system an increase in load might cause new jobs to stop running but shouldn't take down the whole system.

llbbdd · 2026-05-26T14:39:21 1779806361

What competition?

coreyh14444 · 2026-05-26T12:27:46 1779798466

I personally trigger github actions approximately 50x more than I did prior to AI-driven developer coding and I'm not alone.

martinald · 2026-05-26T13:01:51 1779800511

Totally agree. There's days (or even afternoons) where I trigger more actions than I would have done in a month.

r0b05 · 2026-05-26T12:54:13 1779800053

Okay so the recent outages are also likely due to increased load due to AI assisted development speeding up workflows.

AlienRobot · 2026-05-26T13:01:59 1779800519

It could be many things. Microsoft mismanaging stuff. Azure. Vibe-coded Github. So much AI slop being committed it adds an extra burden on the servers, etc.

thepaulmcbride · 2026-05-26T15:02:13 1779807733

We’ve had GitHub actions for long enough, it’s time for GitHub consequences.

cyanydeez · 2026-05-26T15:09:05 1779808145

>Copilot: Do you want me to implement consequences for you or babble on and on about what might entirely be a figment of your imagination (Github is up and you're on a 48 hour bender without sleep)

pnvdr · 2026-05-26T15:09:39 1779808179

i would like to see consequences for "secure sleep" XD.

shevy-java · 2026-05-26T16:05:52 1779811552

I think this can only happen if there are viable alternatives.

For instance, the UI at setups such as https://git.devuan.org/Daemonratte/gtk2-ng is quite ok-ish, in my opinion. Granted, it is mostly copy/paste from github but that still is about 1000000x better than sourceforge's interface - and gitlab's UI too (I just hate gitlab's UI, they seem to love complexity and a billion features only 0.000001% ever need; GitHub, with all its faults, is for the most part really simple - not everywhere, e. g. GitHub wiki setup sucks, but by and large I think it is simple overall).

danudey · 2026-05-26T16:09:46 1779811786

Github definitely has the better UI but if it weren't for network effects I'd be pushing to migrate to Gitlab pretty hard.

Bnjoroge · 2026-05-26T16:14:58 1779812098

Gitlab’s UI is extremely terrible. It’s hard to even explain how bad it is.

dilawar · 2026-05-26T16:41:01 1779813661

Once you get used to it, it's not so bad which is probably true for all functional UIs. I switch between gitlab and GitHub quite a lot and I can't say which one is objectively better. I do like that cross-linking is easier in GitHub but I prefer gitlab ci over GitHub actions. Too bad that gitlab ci runner has removed the command to run ci locally but third party foss solutions are there.

wongarsu · 2026-05-26T16:32:35 1779813155

If replacing github wholesale isn't viable, how does the story for replacing GitHub Actions look like currently? I don't remember the pre-Github-Actions days of everyone using CircleCI with a github integration in a negative light. I've noticed that since then a couple of CI providers have sprung up that differentiate themselves with faster build speeds, but I haven't really kept up with that market

joshuanapoli · 2026-05-26T16:33:27 1779813207

I've been thinking of reverting back to Circle CI.

thesurlydev · 2026-05-26T16:38:30 1779813510

"A little less conversation, a little more action please"

pistoriusp · 2026-05-26T12:19:51 1779797991

Whilst you're waiting for it to come back, try out AGENT-CI (which is a project I built.), which runs GitHub Actions on your machine: https://agent-ci.dev. (Open source, etc.)

No, it's not like "act," because it uses the standard Github runner, the difference is that the control plane is an emulation of api.github.com, because of this we can do all kinds of nice things:

Caching in ~0 ms. Pause on failure, so you can let your AI agent fix it and retry without pushing.

skinfaxi · 2026-05-26T13:13:13 1779801193

You're affiliated with the project. You should definitely be upfront about that when shilling.

pistoriusp · 2026-05-26T13:35:18 1779802518

You're right, figured it was implied, but now fixed.