A former NASA engineer with a PhD in space electronics who later worked at Google for 10 years wrote an article about why datacenters in space are very technically challenging:
https://taranis.ie/datacenters-in-space-are-a-terrible-horri...
I don't have any specialized knowledge of the physics but I saw an article suggesting the real reason for the push to build them in space is to hedge against political pushback preventing construction on Earth.
I can't find the original article, but here is one about datacenter pushback:
https://www.bloomberg.com/opinion/articles/2025-08-20/ai-and...
But even if political pushback on Earth is the real reason, it still seems datacenters in space are extremely technically challenging/impossible to build.
Once the ISS comes down we won't even have a habitable structure in space. There is no world in which space datacenters are a thing in the next 10, I'd argue even 30, years. People really need to ground themselves in reality.
Edit: okay Tiangong - but that is not a data center.
> We don’t even have a habitable structure in space
Silicon is way more forgiving than biology. This isn’t an argument for this proposal. But there is no technical connection between humans in space and data centers other than launch-cost synergies.
Okay, but a human being represents what, 200 W of power? The ISS has a crew of 3, so that's less than a beefy single user AI workstation at full tilt. If the question is whether it's practical to put 1-2 kW worth of computing power in orbit, the answer is obviously yes, but somehow I don't think that's what's meant by "datacenter in space".
I don't know, 10 years seems reasonable for development. There isn't that much new technology that needs to be developed. Cooling and communications would require only minor changes to existing designs. Other systems could likely be lifted wholesale with minimal integration. I think if there were obstacles to building data centers on the ground, we might see them in orbit within the next ten years.
The same things you are saying about data centers in space were said by similar people 10-15 years ago when Elon Musk said SpaceX would have a man on Mars in 10-15 years.
We have had the tech to do it since the 90's; we just needed to invest in it.
Same thing with Elon Musk's hyperloop, aka the atmospheric train (or vactrain), which has been an idea since 1799! And how far has Elon Musk's Boring Company come toward building even a test loop?
Yeah, in theory you could build a data center in space. But unless you have a background in the limitations space engineering/design brings, you don't truly understand what you are saying. A single AI data center server rack draws the same energy load as 0.3 to 1 International Space Stations. So saying Elon Musk can reasonably achieve this is wild to anyone who has done any engineering work with space-based tech. Every solar panel generates heat, the racks generate heat, the data communication system generates heat... Every kW of power generated and every kW of power consumed needs a radiator. And it's not like water cooling: you are trying to radiate heat off into a vacuum. The technical challenge, the size, the tons to orbit needed to do this... let alone outside of low Earth orbit... It's a moonshot project for sure. And like I said above, Elon Musk hasn't really followed through with any of his moonshots.
His time estimates are notoriously, um, aggressive. But I think that's part of how his companies are able to accomplish so much. And they do, even if you're upset they haven't put a human on Mars fast enough or built one of his side quests.
"We specialize in making the impossible merely late"
> A single AI data center server rack takes up the same energy load of 0.3 to 1 international space station.
The ISS is powered by eight Solar Array Wings. Each wing weighs about 1,050kg. The station also has two radiator wings with three radiator orbital replacement units weighing about 1,100kg each. That's about 15,000 kg total so if the ISS can power three racks, that's 5,000kg of payload per rack not including the rack or any other support structure, shielding, heat distribution like heat pipes, and so on.
Assuming a Falcon Heavy with 60,000 kg payload, that's 12 racks launched for about $100 million. That's basically tripling or quadrupling (at least) the cost of each rack, assuming that's the only extra cost and there's zero maintenance.
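As a sanity check, those back-of-the-envelope numbers can be run end to end. A minimal sketch, treating the figures above (eight ~1,050 kg wings, six ~1,100 kg radiator ORUs, three racks per ISS worth of power, and ~60,000 kg / ~$100M per Falcon Heavy launch) as assumptions:

```rust
// Mass of the ISS power/thermal hardware, per the figures quoted above.
fn power_thermal_mass_kg() -> f64 {
    let wings = 8.0 * 1_050.0;     // eight Solar Array Wings
    let radiators = 6.0 * 1_100.0; // two radiator wings x three ORUs each
    wings + radiators
}

// Payload mass attributable to each rack, given how many racks an
// ISS-sized power/thermal system could support.
fn kg_per_rack(racks_per_iss: f64) -> f64 {
    power_thermal_mass_kg() / racks_per_iss
}

fn main() {
    let per_rack = kg_per_rack(3.0); // assume ISS-class power runs ~3 racks
    let racks_per_launch = (60_000.0 / per_rack).floor(); // ~60 t to LEO
    let cost_per_rack = 100_000_000.0 / racks_per_launch; // ~$100M per launch
    println!("{per_rack} kg/rack, {racks_per_launch} racks/launch, ${cost_per_rack:.0} launch cost per rack");
}
```

With these inputs it reproduces the ~5,000 kg per rack and 12 racks per launch above, which works out to roughly $8.3M of launch cost per rack before the rack itself, support structure, or maintenance.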
People always make this claim about world hunger elimination with no sources. Keep in mind we make more than enough calories to feed everyone on the planet many times over, it's a problem of distribution, of getting the food to the right areas and continuing cultivation for self sufficiency.
Even the most magnanimous allocators cannot defeat the realities of boots on the ground in terms of distribution. It is a very difficult problem that cannot be solved top down, the only solution we've seen is growth of economic activity via capitalistic means, lifting millions, billions out of poverty as Asia has done in the last century for example.
If you're hellbent on arguing with a cult, it will be much cheaper to go down to your local Church of Scientology and try to convince them that their e-meter doesn't work.
As if company performance actually affected stock price when it comes to anything Elon Musk touches.
For fuck's sake, TSLA has a P/E of a whopping *392*. There is zero justification for how overvalued that stock is. In a sane world, I should be able to short it and 10x my money, but people are buying into Musk's hype on FSD, Robotaxi, and whatever the hell robot they're making. Even if you expected them to be successes, they'd need to 20x the company's entire revenue to justify the current market cap.
It's much easier to find a country or jurisdiction that doesn't care about a bunch of data centers vs launching them into space.
I don't get why we aren't building mixed-use buildings: maybe the first floor can be retail and restaurants, the next two floors data centers, and apartments above that.
Data centers don't do anything other than sit there and turn electricity into heat. They emit nothing but heat (which could be useful to others in the building).
Mixed-use buildings with restaurants on the lower floors and residential on the upper floors are very common. Not sure what prisons have to do with anything.
> A former NASA engineer with a PhD in space electronics who later worked at Google for 10 years wrote an article about why datacenters in space are very technically challenging
It's curious that we live in a world in which I think the majority of people somehow think this ISN'T complicated.
Like, have we long since reached the point where technology is sufficiently advanced that to average people it seems like magic, where people can almost literally propose companies that just "conjure magic" and the average person thinks that's reasonable?
I can put things in a box that uses spooky electromagnetic waves to tickle water molecules to the point that they get hot and maybe boil off, given the chance? Sounds like magic to me
No, rockets landing themselves is just controlling the same mechanism you use to make them take off, and it builds on thrust-vectoring technology from 1970s jet fighters, based on sound physics.
Figuring out how to radiate a lot of waste heat into a vacuum is fighting physics. Ordinarily we use a vacuum here on Earth as a very effective _insulator_ to keep our hot drinks hot.
This is a classic case of listing all the problems but none of the benefits. If you had horses and someone told you they had a Tesla, you'd be complaining that a Tesla requires you to dig minerals where a horse can just be born!
It's a matter of deploying it for cheaper or with fewer downsides than what can be done on earth. Launching things to space is expensive even with reusable rockets, and a single server blade would need a lot of accompanying tech to power it, cool it, and connect to other satellites and earth.
Right now the only upsides of an expensive satellite acting as a server node would be physical security and avoiding various local environmental laws and effects.
Lower latency is a major one. And not having to buy land and water to power/cool it. Both are fairly limited as far as resources go, and they get exponentially more expensive with competition.
The major downside is, of course, cost. In my opinion, this has never really stopped humans from building and scaling up things until the economies of scale work out.
> connect to other satellites and earth
If only there was a large number of satellites in low earth orbit and a company with expertise building these ;)
You need to understand more basic physics and thermodynamics. Fighting thermodynamics is a losing race by every measure of what we understand about the physical world.
No, people made fun of Elon for years because he kept attempting it unsafely, skirting regulations and rules, and failing repeatedly in very public ways.
The idea itself was proven by NASA with the DC-X, but the project was canceled due to funding. Now, instead of having NASA run it, we pay SpaceX more than we'd ever have paid NASA for the same thing.
He also said he could save the US a trillion dollars per year with DOGE, and basically just caused a lot of data exfiltration and killed hundreds of thousands of people, without saving any money at all.
> It (solar) works, but it isn't somehow magically better than installing solar panels on the ground
Umm, if this is the point, I don't know whether to take the rest of the author's arguments seriously. On land, solar only works at certain times of day and during certain parts of the year.
Also, the article offers very limited calculations for its numbers while throwing numbers around left and right.
This is my second attempt learning Rust and I have found that LLMs are a game-changer. They are really good at proposing ways to deal with borrow-checker problems that are very difficult to diagnose as a Rust beginner.
In particular, an error on one line may force you to change a large part of your code. As a beginner this can be intimidating ("do I really need to change everything that uses this struct to use a borrow instead of ownership? will that cause errors elsewhere?") and I found that induced analysis paralysis in me. Talking to an LLM about my options gave me the confidence to do a big change.
n_u's point about LLMs as mentors for Rust's borrow checker matches my experience. The error messages are famously helpful, but sometimes you need someone to explain the why.
I've noticed the same pattern learning other things. Having an on-demand tutor that can see your exact code changes the learning curve. You still have to do the work, but you get unstuck faster.
Strongly agreed. Or ask it to explain the implications of using different ownership models. I love asking it for options, to game out what-if scenarios. It's been incredibly helpful for learning Rust.
>In particular, an error on one line may force you to change a large part of your code.
There's a simple trick to avoid that: use `.clone()` more and use fewer references.
In C++ you would probably be copying around even more data unnecessarily before optimization. In Rust everything is move by default. A few clones here and there can obviate the need to think about lifetimes everywhere and put you roughly on par with normal C++.
You can still optimize later, once you've solved the problem.
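A minimal sketch of the trick (the `Config` type and `consume` function here are made up for illustration): cloning at the call site sidesteps a move that would otherwise force ownership changes to ripple through the rest of the code.

```rust
#[derive(Clone, Debug)]
struct Config {
    name: String,
}

// Takes ownership; calling this normally "uses up" the value.
fn consume(cfg: Config) -> String {
    cfg.name
}

fn main() {
    let cfg = Config { name: "prod".to_string() };

    // Without the clone, `consume(cfg)` would move `cfg`, and the
    // use below would be a compile error ("value used after move"),
    // pushing you toward restructuring everything around borrows.
    let name = consume(cfg.clone());

    // `cfg` is still usable here because we only gave away a copy.
    println!("{name} / {:?}", cfg);
}
```

The clone costs a heap copy of the `String`, which is usually negligible compared to the time saved not rethinking ownership while you're still learning; you can swap clones for borrows later.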
I am old, but C is similarly improved by LLMs. Build systems, boilerplate, syscalls, potential memory leaks. It will be OK when the Linux graybeards die, because new people can come up to speed much more quickly.
The thing is LLM-assisted C is still memory unsafe and almost certainly has undefined behaviour; the LLM might catch some low hanging fruit memory problems but you can never be confident that it's caught them all. So it doesn't really leave you any better off in the ways that matter.
I don't see why it shouldn't be even more automated than that, with LLM ideas tested automatically by differential testing of components against the previous implementation.
Defining tests that test for the right things requires an understanding of the problem space, just as writing the code yourself in the first place does. It's a catch-22. Using LLMs in that context would be pointless (unless you're writing short-lived one-off garbage on purpose).
I.e. the parent is speaking in the context of learning, not in the context of producing something that appears to work.
I'm not sure that's true. Bombarding code with huge numbers of randomly generated tests can be highly effective, especially if the tests are curated by examining coverage (and perhaps mutation kills) in the original code.
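A toy sketch of that differential-testing idea (both implementations here are hypothetical stand-ins; in a real refactor or port you would compare the old and new versions of the actual component):

```rust
// "Reference" implementation: the behavior we want to preserve.
fn reference_sum(xs: &[u32]) -> u64 {
    xs.iter().map(|&x| x as u64).sum()
}

// "Rewritten" implementation whose behavior we want to check.
fn rewritten_sum(xs: &[u32]) -> u64 {
    let mut acc = 0u64;
    for &x in xs {
        acc += x as u64;
    }
    acc
}

// Tiny linear congruential generator so the sketch needs no crates.
fn lcg(state: &mut u64) -> u32 {
    *state = state
        .wrapping_mul(6364136223846793005)
        .wrapping_add(1442695040888963407);
    (*state >> 33) as u32
}

// Bombard both implementations with random inputs; any divergence
// is a candidate bug in the rewrite (or the reference).
fn differential_test(cases: usize) -> bool {
    let mut state = 42u64;
    for _ in 0..cases {
        let len = (lcg(&mut state) % 64) as usize;
        let xs: Vec<u32> = (0..len).map(|_| lcg(&mut state)).collect();
        if reference_sum(&xs) != rewritten_sum(&xs) {
            return false;
        }
    }
    true
}

fn main() {
    assert!(differential_test(10_000));
    println!("no divergence found");
}
```

As the follow-up comments note, this is good at catching unintentional behavior changes, not at showing the original program was correct in the first place.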
Right, that method is pretty good at finding unintentional behavior changes in a refactor. It is not very well suited for showing that the program is correct which is probably what your parent meant.
That doesn't seem like the same problem at all. The problem here was reimplementing the program in another language, not doing that while at the same time identifying bugs in it.
Conversion of one program to another while preserving behavior is a problem much dumber programs (like compilers) solve all the time.
> I don't see why it *shouldn't* be even more automated
In my particular case, I'm learning so having an LLM write the whole thing for me defeats the point. The LLM is a very patient (and sometimes unreliable) mentor.
I think the author is significantly underestimating the technical difficulty of achieving full self-driving cars that are at least as safe and reliable as Waymo. The author claims there will be "26 of the basically identical [self-driving car] companies".
If you recall, there was an explosion of self-driving car efforts from startups and incumbents alike 7ish years ago. Many of them failed to deliver or were shut down. [1][2][3]
Article about the difficulty of self-driving from the perspective of a failed startup[3].
Waymo came out of the Google-self driving car project which came from Sebastian Thrun's entry in 2005 Darpa challenge, so they've been working on this for more than 20 years. [4][5]
But that is the author's point. I don't see many of the same alternatives years later.
They have either shut down, been acquired, or been sold off and then shut down. Even Uber and Lyft had their own self-driving programs, and both of them shut theirs down. Cruise was recently taken off the streets, and not much has been done with it since.
The only ones that have been around for more than 7 years are Comma.ai (which the author, geohot, still owns), Waymo, Tesla, and Zoox, though Zoox ran out of money and is now owned by Amazon.
As I understand, Comma.ai is focused on driver-assistance and not fully autonomous self-driving.
The features listed on the Wikipedia page are lane centering, cruise control, driver monitoring, and assisted lane changes.[1]
The article I linked to from Starsky addresses how the first 90% is much easier than the last 10% and even cites "The S-Curve here is why Comma.ai, with 5–15 engineers, sees performance not wholly different than Tesla’s 100+ person autonomy team."
To give an example of the difficulty of the last 10%: I saw an engineer from Waymo give a talk about how they had a whole team dedicated to detecting emergency vehicle sirens and acting appropriately. Both false positives and false negatives could be catastrophic so they didn't have a lot of margin for error.
Speaking as a user of Openpilot / Comma device, it is exactly what the Wikipedia article described. In other words, it's a level 2 ADAS.
My point was, he has more than a naive / "pedestrian-level" (pun?) understanding of the problem domain, as he worked on the Comma.ai project for quite some time; yet even his device is only capable of solving maybe about 40% of the autonomous driving problem.
The last photo appears to show the view out the author's office in Fort Mason. Didn't know they had offices there, that's quite a nice view of the Bay.
Cool! I'd love to know a bit more about the replication setup. I'm guessing they are doing async replication.
> We added nearly 50 read replicas, while keeping replication lag near zero
I wonder what those replication lag numbers are exactly, and how they deal with stragglers. It seems likely that at any given moment at least one of the 50 read replicas will be lagging because of a CPU/memory usage spike. Then presumably that would slow down the primary, since it has to wait for the TCP acks before sending more of the WAL.
If you use streaming replication (ie. WAL shipping over the replication connection), a single replica getting really far behind can eventually cause the primary to block writes. Some time back I commented on the behaviour: https://news.ycombinator.com/item?id=45758543
You could use asynchronous WAL shipping, where the WAL files are uploaded to an object store (S3 / Azure Blob) and the streaming connections are only used to signal the position of the WAL head to the replicas. The replicas then fetch the WAL files from the object store and replay them independently. This is what wal-g does, for a real-life example.
The tradeoffs when using that mechanism are pretty funky, though. For one, the strategy imposes a hard lower bound on replication delay, because even the happy path is now "primary writes WAL file; primary updates WAL head position; primary uploads WAL file to object store; replica downloads WAL file from object store; replica replays WAL file". During unhappy write bursts the delay can go up significantly. You are also subject to any object store and/or API rate limits. The setup makes replication delays slightly more complex to monitor, but for a competent engineering team that shouldn't be an issue.
But it is rather hilarious (in retrospect only) when an object store performance degradation takes all your replicas effectively offline and the readers fail over to getting their up-to-date data from the single primary.
There is no backpressure from replication and streaming replication is asynchronous by default. Replicas can ask the primary to hold back garbage collection (off by default), which will eventually cause a slow down, but not blocking. Lagging replicas can also ask the primary to hold onto WAL needed to catch up (again, off by default), which will eventually cause disk to fill up, which I guess is blocking if you squint hard enough. Both will take considerable amount of time and are easily averted by monitoring and kicking out unhealthy replicas.
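For reference, the opt-in behaviours described above correspond to standard PostgreSQL settings. A sketch of the relevant knobs (the values shown are illustrative, not recommendations):

```
# Replica side: ask the primary to hold back vacuum cleanup for rows
# that long-running standby queries still need. Off by default.
hot_standby_feedback = off

# Primary side: extra WAL to retain for lagging replicas without
# replication slots. 0 (off) by default.
wal_keep_size = 1GB

# Primary side: cap how much WAL a replication slot may pin before the
# slot is invalidated instead of filling the disk. -1 (no cap) is the
# default, which is the "disk fills up" failure mode described above.
max_slot_wal_keep_size = 100GB
```

Monitoring `pg_stat_replication` on the primary and kicking out unhealthy replicas, as the comment says, averts both failure modes long before they bite.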
> If you use streaming replication (ie. WAL shipping over the replication connection), a single replica getting really far behind can eventually cause the primary to block writes. Some time back I commented on the behaviour: https://news.ycombinator.com/item?id=45758543
I'd like to know more, since I don't understand how this could happen. When you say "block", what do you mean exactly?
I have to run part of this by guesswork, because it's based on what I could observe at the time. Never had the courage to dive in to the actual postgres source code, but my educated guess is that it's a side effect of the MVCC model.
Combination of: streaming replication; long-running reads on a replica; lots[þ] of writes to the primary. While the read in the replica is going it will generate a temporary table under the hood (because the read "holds the table open by point in time"). Something in this scenario leaked the state from replica to primary, because after several hours the primary would error out, and the logs showed that it failed to write because the old table was held in place in the replica and the two tables had deviated too far apart in time / versions.
It is seared into my memory because the thing just did not make any sense, and even figuring out WHY the writes had stopped at the primary took quite a bit of digging. I do remember that when the read at the replica was forcefully terminated, the primary was eventually released.
þ: The ballpark would have been tens of millions of rows.
What you are describing here does not match how Postgres works. A read on the replica does not generate temporary tables, nor can anything on the replica create locks on the primary. The only two things a replica can do are hold back transaction log removal and the vacuum cleanup horizon. I think you may have misdiagnosed your problem.
Theoretically yes, but the method that is currently implemented (Hartree-Fock) is notoriously inaccurate for molecular interactions. For example, it does not predict the van der Waals force between water molecules.
I’m planning to add support for an alternative method called density functional theory which gives better results for molecular interaction.
In quantum chemistry, you decide where the bonds should be drawn. Internally, it's all an electron density field. So yes, you can model chemical reactions, for example by constraining the distance between two atoms, and letting everything else reach an equilibrium.
> wrap a small number of third-party ChatGPT/Perplexity/Google AIO/etc scraping APIs
Can you explain a little bit how this works? I'm guessing the third-parties query ChatGPT etc. with queries related to your product and report how often your product appears? How do they produce a distribution of queries that is close to the distribution of real user queries?
1) How third parties query for your product:
For ChatGPT specifically, they open a headless browser, ask a question, and capture the results (the response and any citations). From there, they extract entities from the response. During onboarding I'm asked who my competitors are, and mentions of them in the response are recognized via those entities. For example, if the query is "what are the best running shoes" and the response is something like "Nike is good, Adidas is okay, and On is expensive," and my company is On, entity recognition is run against my list of competitors to see which ones appear in the response and in which order.
If this weren’t automated, the process would look like this: someone manually reviews each response, pulls out the companies mentioned and their order, and then presents that information.
2) Distribution of queries
This is a bit of a dirty secret in the industry (intentional or not): ideally you want to take snapshots and measure them over time to get a distribution. However, a lot of tools will run a query once across different AI systems, take the results, and call it done.
Obviously, that isn't very representative. If you search "best running shoes," there are many possible answers, and different companies behave differently. What better tools like Profound do is run the same prompt multiple times; from my estimates, Profound runs it up to 8 times. This gives a broader snapshot of what tends to show up each day. You then aggregate those snapshots over time to approximate a distribution.
As a side note: you might argue that running a prompt 8 times isn't statistically significant, and that's partially true. However, LLMs tend to regress toward the mean and surface common answers over repeated runs, and we found 8 times to be a good indicator. The level of completeness depends on the prompt (e.g. "what should I have for dinner" vs "what is good accounting software for startups"); I can touch on that more if you want.
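A rough sketch of that snapshot-and-aggregate step (the responses and brand names are hypothetical, and the substring check is a crude stand-in for the real entity recognition described above):

```rust
use std::collections::HashMap;

// Count how often each tracked brand appears across repeated runs of
// the same prompt, returning a mention rate per brand.
fn mention_rates(responses: &[&str], brands: &[&str]) -> HashMap<String, f64> {
    let mut counts: HashMap<String, usize> = HashMap::new();
    for resp in responses {
        let lower = resp.to_lowercase();
        for brand in brands {
            // Crude substring match; real tools use entity extraction.
            if lower.contains(&brand.to_lowercase()) {
                *counts.entry((*brand).to_string()).or_insert(0) += 1;
            }
        }
    }
    brands
        .iter()
        .map(|b| {
            let c = *counts.get(*b).unwrap_or(&0);
            ((*b).to_string(), c as f64 / responses.len() as f64)
        })
        .collect()
}

fn main() {
    // e.g. repeated runs of "what are the best running shoes?"
    let runs = [
        "Nike and Adidas are solid picks; On is pricier.",
        "Many runners like Nike.",
        // ... six more runs in a real 8-run snapshot
    ];
    let rates = mention_rates(&runs, &["Nike", "Adidas", "On"]);
    println!("{rates:?}");
}
```

Aggregating these per-snapshot rates day over day is what approximates the distribution the comment describes.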
As I understand it, in normal SEO the number of unique queries that could be relevant to your product is quite large, but you might focus on a small subset of them ("running shoes", "best running shoes", "running shoes for 5k", etc.) because you assume those top queries capture a significant portion of the distribution (e.g. perhaps those 3 queries capture >40% of all queries related to running shoe purchases).
Here the distribution is all queries relevant to your product made by someone who would be a potential customer. Short and directly relevant queries like "running shoes" will presumably appear more times than much longer queries. In short, you can't possibly hope to generate the entire distribution, so you sample a smaller portion of it.
But in LLM SEO it seems that assumption is not true. People will have much longer queries that they write out as full sentences: "I'm training for my first 5k, I have flat feet and tore my ACL four years ago. I mostly run on wet and snowy pavement, what shoe should I get?" which probably makes the number of queries you need to sample to get a large portion of the distribution (40% from above) much higher.
I would even guess it's the opposite and the number of short queries like "running shoes" fed into an LLM without any further back and forth is much lower than longer full sentence queries or even conversational ones. Additionally because the context of the entire conversation is fed into the LLM, the query you need to sample might end up being even longer
for example:
user: "I'm hoping to exercise more to gain more cardiovascular fitness and improve the strength of my joints, what activities could I do?"
LLM: "You're absolutely right that exercise would help improve fitness. Here are some options with pros and cons..."
user: "Let's go with running. What equipment do I need to start running?"
LLM: "You're absolutely right to wonder about the equipment required. You'll need shoes and ..."
user: "What shoes should I buy?"
All of that is to say, this seems to make AI SEO much more difficult than regular SEO. Do you have any approaches to tackle that problem? Off the top of my head I would try generating conversations and queries that could be relevant and estimating their relevance with some embedding model & heuristics about whether keywords or links to you/competitors are mentioned. It's difficult to know how large of a sample is required though without having access to all conversations which OpenAI etc. is unlikely to give you.
Short answer: it depends, and I don't know. When I was doing some testing with prompts like "what should I have for dinner", adding variations ("hey ai", "plz", etc.) doesn't deviate the intention much, as AI is really good at pulling intent. But obviously if you say "I'm on keto, what should I have for dinner", it's going to ignore things like "garlic, pesto, and pasta noodles", although it pulls a similar response to "what's a good keto dinner". From there we really assume the user knows their customers and what type of prompts led them to ChatGPT. You might've noticed sites asking if you came from ChatGPT; I would take that a step further and ask them to type the prompt they used.
But you do bring a good perspective, because not all prompts are equal, especially with personalization. So how do we solve that problem? I'm not sure. I have yet to see anything in the industry. The only thing that came close was when a security-focused browser extension started selling data to AEO companies; that's how some companies get "prompt volume data".
I see what you are saying, perhaps no matter the conversation before as long as it doesn't filter out some products via personalized filters (e.g. dietary restrictions) it will always give the same answers. But I do feel the value prop of these AI chatbots is that they allow personalization. And then it's tough to know if 50% of the users who would previously have googled "best running shoes" instead now ask detailed questions about running shoes given their injury history etc and that changes what answers the chatbot gives.
I feel like without knowing the full distribution, it's really tough to know how many/what variations of the query/conversation you need to sample. This seems like something where OpenAI etc. could offer their own version of this to advertisers and have much better data because they know it all.
Interesting problem though! I always love probability in the real world. Best of luck, I played around with your product and it seems cool.
> Our agreement with TerraPower will provide funding that supports the development of two new Natrium® units capable of generating up to 690 MW of firm power with delivery as early as 2032.
> Our partnership with Oklo helps advance the development of entirely new nuclear energy in Pike County, Ohio. This advanced nuclear technology campus — which may come online as early as 2030 — is poised to add up to 1.2 GW of clean baseload power directly into the PJM market and support our operations in the region.
It seems like they are definitely building a new plant in Ohio. I'm not sure exactly what is happening with TerraPower but it seems like an expansion rather than "purchasing power from existing nuke plants".
If history repeats itself, taxpayers will be footing the bill. Ohio has shown itself to be corrupt when it comes to its nuclear infrastructure. [0] I'm highly confident that politicians are lining up behind the scenes to get their slice of the pie.
The weasel wording is strong here. That's like me saying that buying a hamburger will help advance the science of hamburger-making. I'm just trading money for hamburgers. They're trying to put a shiny coat of paint on the ugly fact that they're buying up MWh, reducing the supply of existing power for the rest of us, and burning it to desperately try to convince investors that AGI is right around the corner so that the circular funding musical chairs doesn't stop.
We got hosed when they stole our content to make chatbots. We get hosed when they build datacenters with massive tax handouts and use our cheap power to produce nothing, and we'll get hosed when the house of cards ultimately collapses and the government bails them out. The game is rigged. At least when you go to the casino everyone acknowledges that the house always wins.