More

tlb · 2026-06-13T09:27:56 1781342876

As a kid I found saltpeter at an old-fashioned pharmacy and made gunpowder, and it also barely fizzled. I think you have to grind the ingredients much finer than a kid has patience for.

tlb · 2026-06-13T09:08:38 1781341718

Generator excitation is not the hard part of a black start. You have to run coal feeders, blowers, and water pumps for an hour before you can spin the generator. Then you get power instantly upon applying power to the field windings.

msandford · 2026-06-13T11:53:16 1781351596

And even that's not the hard part of a black start. The plant control is relatively easy. What's hard is grid coordination.

All generation and consumption have to be almost perfectly balanced every second of every day. And the power company doesn't have good addressability of load. Worse when you restore power to an area all their stuff turns on in parallel giving an inrush that could be 3x or more the steady state.

A black start is a very drawn out process of bringing generation and load online in a balanced way and with wait times between load increases for stabilization.

cesarb · 2026-06-13T16:08:45 1781366925

> You have to run coal feeders, blowers, and water pumps for an hour before you can spin the generator.

That's probably the reason most grid black start facilities in my country (Brazil) are hydroelectric dams, which need none of these.

tlb · 2026-06-11T17:44:17 1781199857

Yes, that is basically the plan. It's based on the belief that unfettered AI would let anyone be a supervillain and destroy the world. There are enough would-be supervillains out there, but they rarely get far because they can't get teams of smart people to build doomsday machines for them. So the AI has to not let anyone do evil with it.

Unfortunately, that won't feel very much like freedom.

lebovic · 2026-06-11T18:37:40 1781203060

It sounds like you might not agree with that belief.

While I don't agree with their actions here, I do think there's sufficient reason to hold that belief.

On some fronts (e.g. security, on which you've experienced more than me), I think there are surmountable challenges. But on other fronts (e.g. bio), a single errant actor could reasonably kill millions or billions of people with sufficiently powerful AI. We don't have good defenses here, and those actors do exist.

I still don't agree with these actions, but I do think I agree with their assumptions.

zozbot234 · 2026-06-11T19:06:02 1781204762

The model release cards for Opus have repeatedly and consistently stressed that the model doesn't have the fiddly know-how that's required to provide meaningful assistance in possibly dangerous subfields of biology. Mythos (Fable without the overly strict guardrails) has shown improvements in things like drug design, but even then the situation isn't really that different. This risk is ridiculously overblown, and the way to manage it sensibly is to introduce meaningful oversight for actors that seek to order the actual specialized materials involved (especially any synthetically generated genes/proteins/whatever).

lebovic · 2026-06-11T19:30:32 1781206232

No, Anthropic's model cards have claimed that the models don't show considerably more uplift than previous ASL-3 models, which already showed material uplift.

I participated in the internal bioweapons uplift test for Sonnet 3.7, and even then, one non-expert got huge uplift from the model [1]. I'd consider evals a lower bound of capabilities that can be elicited from a model.

The team behind Biomni, a biomedical agent that's widely used by researchers, has continued to find consistent gains between models [2]. I trust them, because I visited them to build their HPC tool [3], which the model is quite capable of using – moreso than most grad students. The Biomni team cares a lot about about real usability for real researchers, so they have a great pulse on capabilties.

SecureBio also has some public evals [4], which have continued to show increasing uplift.

And while synthesis monitoring is a part of the solution, I think you might underestimate how much goes under the radar. See the Reedley lab incident for an example [5].

Is Anthropic still effectively throttling beneficial biomedical research? Yes! And so is OpenAI. But the underlying capability is still actually dual use.

[1]: See page 25 in https://www-cdn.anthropic.com/9ff93dfa8f445c932415d335c88852...

[2]: Their benchmark has a preprint at https://www.biorxiv.org/content/10.64898/2026.05.12.724604v1...

[3]: https://x.com/phylo_bio/article/2029233694775624096

[4]: https://securebio.org/

[5]: Search for "ebola" in the public report for the Reedley lab incident at https://chinaselectcommittee.house.gov/sites/evo-subsites/se...

zozbot234 · 2026-06-11T19:47:20 1781207240

> No, Anthropic's model cards have claimed that the models don't show considerably more uplift than previous ASL-3 models, which already showed material uplift.

Doesn't this simply amount to disagreeing about what counts as "meaningful" from a bio-safety POV? Also, even the ASL-3 deployment safeguards for Opus 4 and higher were always adopted as a mere matter of caution; it's not clear that even Anthropic believed at any point that this reflected any genuine "threshold crossing" event. So it's just not obvious how much weight we're supposed to place on that particular stance.

lebovic · 2026-06-11T20:26:43 1781209603

In normal bio, there are standardized biosafety levels, because without it there would be no standard agreement on what "meaningful" safety is. So yes, I do think there's ambiguity here.

But I don't think I've found any domain expert who thinks granting everyone raw access to the most capable models wouldn't meaningfully increase risk. OpenAI recently staffed a biological threat modeler to help quantify this risk.

(Edit: just saw your edit, this includes at Anthropic. ASL tiers were "rule-out" to exclude rather than "rule-in", so exact thresholds were murkier, but I think it's clear that models have passed that threshold by now.)

That said, there are clear steps and requirements to set up a BSL-2 or BSL-3 lab, and I think there should be similarly clear rules around model capabilties and access. The process for Anthropic and OpenAI is murky and still implictly gated on spend, which I think is holding back research.

For example, anyone who has access to a BSL-3 lab should have a clear and low-cost path to a model with corresponding capabilities, as long as they set up corresponding precautions for model access.

I think it would be a bad outcome for only frontier labs and a select few groups they choose to have access to the most capable models – which is sadly the precedent that's currently being set.

zozbot234 · 2026-06-11T20:31:41 1781209901

> But I don't think I've found anyone who is a domain expert who thinks granting everyone access to raw modes wouldn't meaningfully increase risk.

It depends how capable these raw models are. Biology as a field depends most on real-world knowledge, which is an expensive capability for open models targeting widespread deployment. It's quite plausible that even Opus 4 would be a lot more capable in these domains than the best universally accessible "raw models" today, quite unlike other domains such as coding or pure math. The securebio.org benchmark has spotty representation of openly available models, but it does show Kimi 2.5 being no more capable than GPT 5 mini, and clearly below o4-mini and Opus 4.0; which may be a plausible summary of where things stand today.

lebovic · 2026-06-11T20:43:11 1781210591

That's a good clarification. I've updated my comment to the "most capable models" to refer to the most recent releases.

And sure, and I love open models – I spent much of the past couple months doing additional RL on Qwen 3.6 35B A3B, Gemma 4, Kimi K2.6, and GLM 5.1. Without these open models, I'd be forced to do my research inside a frontier lab.

There's a balance to strike here, but I don't think the biological risk is overplayed. It would be very easy to accidentally cross the threshold of "meaningful" without adequate safeguards, and then be unable to undo what you've released to the world.

charcircuit · 2026-06-12T05:17:14 1781241434

>and those actors do exist

Do they? We don't even have single errant actors who go and kill 1000 people. I don't believe human motivations support the idea of killing so many people unrelated to you.

tlb · 2026-06-11T16:19:19 1781194759

It's ridiculous to consider MITM attacks out of scope for taking over your computer. Also, there are probably ways to exploit this without a true MITM like DNS cache poisoning. But it's best to just assume the whole internet is MITMed.

tptacek · 2026-06-11T20:24:44 1781209484

It's not out of scope "for taking over your computer". It's out of scope for the specific goals of the bug bounty program. Bug bounties are (usually) about prioritizing internal engineering effort; they are to vulnerability remediation what market feedback is to feature/function decisions in the rest of the product.

Everyone's judging this by the standard of "how good a bug" this is. But that's not necessarily how a bug bounty should function. Important prior to frame this with: neither any individual bug bounty submission nor the sum of all valid submissions materially alters the security of a serious product, at least not on their own. The system they feed into (for instance: security engineers taking a validated bounty submission and then quickly auditing the entire tree for variants of the same bug) can move the dials. The bounty bugs themselves though are mostly a sideshow.

What's especially weird (you didn't say this, but the sentiment has popped up on all 3 threads about this story) is the idea that AMD would be trying to cover this up. Why would they care? They run a bug bounty program. They've accepted the premise that they have vulnerabilities.

(From earlier today, in add'n: https://news.ycombinator.com/item?id=48492908).

inlined · 2026-06-11T23:22:32 1781220152

But it should be their job to protect against MitM in their threat model. There is no rational reason to exclude them from the bug bounty. Doing so only leaves MitM attacks like this undisclosed.

tptacek · 2026-06-11T23:33:09 1781220789

I just gave a rational reason to exclude them from the bug bounty, which I can summarize as "the bug bounty is not their entire security program and does not have the goal you've axiomatically derived for it".

Cards on the table I am not a fan of bug bounty programs, and the fact that they're an engineering process that turns out to be impossible to have public engineering discussions about is definitely one of many reasons why. Most companies should not run bug bounty programs.

tiberious726 · 2026-06-12T23:11:54 1781305914

Who would exploits be sold to then?

tptacek · 2026-06-13T00:55:18 1781312118

Nobody is selling this dumb mitm bug.

amiga386 · 2026-06-11T16:37:19 1781195839

MITM where attacker needs to install their own CA certs on the victim's device -- sure, out of scope.

MITM because you used http instead of https and you don't have any other verified cryptographic signature on your data -- get tae fuck, fix it pronto.

pietervdvn · 2026-06-11T17:23:10 1781198590

I'd even count this as "having local access to the device", as that is what is needed to install such a cert

arcfour · 2026-06-11T18:31:06 1781202666

I think it's fair to say that requiring local administrative access to the device is out of scope, since you have already completely pwned the device in that case, which is what what you need to install a CA cert on any OSes.

inlined · 2026-06-11T23:25:08 1781220308

In honor of The Old New Thing I call these “Vogon vulnerabilities”: I have a marvelous exploit in mind that pwns anyone I have root access to

josephg · 2026-06-12T03:18:15 1781234295

The list of preinstalled CAs is long. I think its a safe bet that many nation-states have covert control over at least one CA on that list. (Or they have one of the root signing certs). HTTPS is way better than HTTP. But I'd personally rather if these random organisations didn't have RCE on my computers.

I've never heard of most of them. AAA Certificate Services? AC RAIZ FNMT-RCM? ACCVRAIZ1? Actalis? AffirmTrust? Even Godaddy is in there. I know I don't trust those guys.

Trust has gotta start somewhere. But its much better to TOFU, then pin signing keys in the updater.

joxdosba · 2026-06-11T17:53:03 1781200383

Why would anyone ever exclude true mitm?

Various domain registrars have been compromised over and over again (often by children!), resulting in companies like Tesla and Cloudflare getting owned.

The reality is that any vaguely competent attacker can compromise a court clerk and just compel e.g. the .com registry to hand over whatever domain they want.

Although I suppose the aforementioned problem has significant implications beyond dns…

gruez · 2026-06-11T18:26:03 1781202363

>Why would anyone ever exclude true mitm?

Same reason security programs exclude social engineering, even though that's a pretty common way for companies to get pwned.

zulln · 2026-06-11T19:08:22 1781204902

Excluding SE is to make sure people do not spam customer support and launch annoying phishing campaigns. None of that is applicable for local software running on your own computer.

tptacek · 2026-06-11T21:16:15 1781212575

No, excluding SE is to make sure the bounty program is incentivizing things that inform the product security team. Social engineering is a corpsec function; they're not even the same teams.

joxdosba · 2026-06-12T05:36:59 1781242619

Sure, but this is more akin to dismissing a 1click RCE as “social engineering” because an employee has to be convinced to click a link.

sigmoid10 · 2026-06-11T16:25:18 1781195118

Out of scope does not necessarily mean out of impact. It is merely a question of how far a company wants to be responsible for the environment their software is run in. Most of the time that answer is "not much."

tuckerpo · 2026-06-11T16:45:28 1781196328

Out of scope in this case means "we don't wanna pay you"

cogman10 · 2026-06-11T18:52:39 1781203959

Apparently it also means "We don't want to pay our engineers to fix this".

dlcarrier · 2026-06-11T16:35:37 1781195737

But I use a Wi-Fi password, so my phone says it's secure!

tlb · 2026-06-11T16:13:14 1781194394

They should compile it to WASM and host it.

slow_typist · 2026-06-11T19:42:01 1781206921

I did not know COBOL compiles to WASM until I read your comment and looked it up, thanks!

tlb · 2026-06-03T14:35:03 1780497303

Building a swap device at user level used to be one of those classic unsolvable problems, because what if your daemon needs to swap in a page in order to swap in a page? Or at least it was discussed at a reason why microkernels will never work. I’m not sure what the solution is here.

kccqzy · 2026-06-03T15:03:41 1780499021

Your daemon can be smart enough to know which are its own pages and prevent them from being swapped out. The Linux kernel also prevents its own text pages from being swapped out, so the solution exists and I don’t see why it doesn’t apply to microkernel designs.

netbsdusers · 2026-06-03T14:56:45 1780498605

The general principle is that what is involved in paging should not be paged itself. Wiring the memory of that whole daemon is then a trivial solution to the problem.

tlb · 2026-06-03T12:36:14 1780490174

There's a package to do this in Julia: https://juliaarrays.github.io/StructArrays.jl/stable/

tlb · 2026-05-22T17:29:11 1779470951

There are still things you can't do with an open-weight model without the training data, like modifying the architecture and training from scratch. That's different from true open-source code, where you can do anything the authors could do.

tlb · 2026-05-16T12:28:41 1778934521

Linux on Arm works great. I barely notice the difference except everything is a bit faster. Most SaaS companies can and should switch.

tlb · 2026-05-13T21:31:54 1778707914

Yes, for example taking off or landing a rocket on the surface blasts particles of sand out sideways at 1000s of m/s. The particles can fly in the thin atmosphere for kilometers and sandblast everything. Our intuitions about how far and fast tiny things can fly are only true in an atmosphere of similar density.

Recurecur · 2026-05-15T22:29:55 1778884195

Your intuition about how far sand particles can fly at high velocity in the Martian atmosphere is way off base…

ActorNightly · 2026-05-16T04:48:32 1778906912

While he is exaggerating a bit, the problem still remains - dust can be deadly to equipment because the grains will move way faster. You also have the problem of dust particles colliding and becoming charged with nothing to dump the charge to. A human habitat has to hold positive air pressure, which means that it has to generate its oxygen or get it from the atmosphere.

If we don't have the experience of buildings stuff on earth where we can test things, we sure as shit not gonna be able to do it on Mars.