Eh. It depends what your bottleneck is. If the bottleneck is now, say, CPU cache contention because you've doubled your thread count, it's entirely possible that FL1 running on the new server generation is operating in a different regime than on the previous generation. You can see some hints of that happening, since doubling thread count didn't result in a doubling of throughput.
In fact, I suspect based on the throughput doubling with FL2, we're back in the same regime as the baseline.
It would be useful to see what the latency is of FL2 on Gen12 compared to baseline (FL1 on Gen12), just to confirm.
Yes, fair points. I think it's also indicative of how important it is that code be optimized for the specific hardware it will run on. Systems need to be considered and optimized as a whole. Still an interesting post.
It depends what dates you're looking at, but energy (gas prices and more) and food (including eggs) are generally recognized as way more volatile than the rest of the CPI.
Eggs were actually quite stable for the 20 years prior to 2001, so maybe don't put your life savings into egg futures...
That is very curious, yes. Egg prices seem to start increasing dramatically after 2000 and indeed outpace the CPI, setting aside the peaks and valleys from the various shocks to egg production like COVID and the avian flu.
I read that the price includes free-range, eco, etc. varieties, which are more expensive and in higher demand nowadays; that alone probably explains a good chunk of the price increase.
Or maybe they are? I'm not an expert in this, and reading through some of the government literature, there's no mention of it.
Then at least you would know that a given price marker is a good empirical index of how other prices are changing also, at least for a given dimension/component.
The preference is to use a separate pair of communal chopsticks that is not used directly for eating.
> Kosuribashi
I have heard that this one is because it's considered to be an insult implying that the chopsticks are low-quality. (That said, if your chopsticks are indeed low-quality, then avoiding splinters is probably preferable to then visibly plucking splinters out of your fingers.)
Looks like the repo owner force-pushed a bad commit to replace an existing one. But then, why not forge it to maintain the existing timestamp + author, e.g. via `git commit --amend -C df8c18`?
The value of the technique, I suppose, is that it hides a large payload a bit better. The part you can see stinks (a bunch of magic numbers and eval), but I suppose it's still easier to overlook than a 9000-character line of hexadecimal (whether still encoded, or decoded but still encrypted) or anything mentioning Solana and Russian timezones (I just decoded and decrypted the payload out of curiosity).
But really, it still has to be injected after the fact. Even the most superficial code review should catch it.
Agreed on all those fronts. I'm just dismayed by all the comments suggesting that maintainers just merged PRs with this trojan, when the attack vector implies a more mundane form of credential compromise (and not, as the article implies, AI being used to sneak malicious changes past code review at scale).
Yeah, the attack vector seems to be stolen credentials. I would be much more interested in an attack which actually uses invisible characters as the main vector.
> if indeed he went all-in on AI in 2015, that seems to me like a damn near prophetic vision.
Also note that 7 years later, when ChatGPT came out, built on top of Google Brain research (transformers), Google was caught flat-footed.
Even supposing that Pichai really had the right vision a decade ago, he completely failed in leading its execution until a serious threat to the company's core business model materialized.
Well, it shouldn't be slower than "Read 1,000,000 bytes sequentially from memory" (741 ns), which in turn shouldn't be slower than "Read 1,000,000 bytes sequentially from disk" (359 µs).
That said, all those numbers feel a bit off by 1.5-2 orders of magnitude -- that disk read speed translates to about 3 GB/s which is well outside the range of what HDDs can achieve.
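For the record, the arithmetic behind that "about 3 GB/s" figure (just a sanity check on the quoted numbers, not anything from the original table):

```python
# 1,000,000 bytes read sequentially in 359 µs -> implied throughput
bytes_read = 1_000_000
seconds = 359e-6
throughput_gb_s = bytes_read / seconds / 1e9
print(round(throughput_gb_s, 2))  # 2.79, i.e. "about 3 GB/s"
```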
// NIC bandwidth doubles every 2 years
// [source: http://ampcamp.berkeley.edu/wp-content/uploads/2012/06/Ion-stoica-amp-camp-21012-warehouse-scale-computing-intro-final.pdf]
// TODO: should really be a step function
// 1Gb/s = 125MB/s = 125*10^6 B/s in 2003
which means that in 2026 we'll have seen 11 doublings since gigabit speeds in 2003, so we'll all have > terabit speeds available to us.
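Spelling that extrapolation out, taking the quoted 2-year doubling cadence at face value (the years and starting rate are the ones from the comment above):

```python
# 1 Gb/s in 2003, doubling every 2 years (per the quoted comment)
doublings = (2026 - 2003) // 2        # 11 full doublings by 2026
projected_gbps = 1 * 2 ** doublings   # 2048 Gb/s, i.e. ~2 Tb/s
print(doublings, projected_gbps)      # 11 2048
```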
> that disk read speed translates to about 3 GB/s which is well outside the range of what HDDs can achieve.
That’s PCIe 3.0 x4 or PCIe 4.0 x2, which a decent commodity M.2 NVMe SSD can use and can possibly saturate, at least for reads.
> which means that in 2026 we'll have seen 11 doublings since gigabit speeds in 2003, so we'll all have > terabit speeds available to us.
We’re not that far off. 100GbE hardware is not especially expensive these days. Little “AI” boxes with 400-800 Gbps of connectivity are a thing.
That being said, all the connections over 100Gbps are currently multi-lane AFAIK, and the heroic efforts and multiplexing needed to exceed 100Gbps at any distance are a bit in excess of the very simple technology that got us to 100Mbps “fast Ethernet”.
> That’s PCIe 3.0 x4 or PCIe 4.0 x2, which a decent commodity M.2 NVMe SSD can use and can possibly saturate, at least for reads.
Given that there's a separate item for sequential disk reads vs SSD reads, I think it's pretty clear that particular item meant hard drives specifically. Agreed that modern SSDs should be able to pull that off.
> That being said, all the connections over 100Gbps are currently multi-lane AFAIK, and the heroic efforts and multiplexing needed to exceed 100Gbps at any distance are a bit in excess of the very simple technology that got us to 100Mbps “fast Ethernet”.
Yeah. Terabit networking is not here yet, and it's certainly not "commodity network"-grade. We can LACP a bunch of 100G optics together, but we're probably 5-10 years out from 800G Ethernet becoming widely adopted and from 1600G even being developed.
You probably meant to say oversubscribing, not overprovisioning.
Oversubscription is expected to a certain degree (this is fundamentally the same concept as "statistical multiplexing"). But even oversubscription in itself is not guaranteed to result in bufferbloat -- appropriate traffic shaping (especially to "encourage" congestion control algorithms to back off sooner) can mitigate a lot of those issues. And, it can be hard to differentiate between bufferbloat at the last mile vs within the ISP's backbone.
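On the "appropriate traffic shaping" point: the classic building block is a token bucket, which forwards packets only while there is banked credit, so queues stay short and the sender's congestion control sees pushback early. A toy sketch of the idea (the class name and parameters are mine, purely illustrative, not any real qdisc's API):

```python
class TokenBucket:
    """Toy token-bucket shaper: `rate` bytes/sec of credit, at most `burst` banked."""

    def __init__(self, rate, burst):
        self.rate = rate
        self.burst = burst
        self.tokens = burst  # start with a full bucket
        self.last = 0.0

    def allow(self, nbytes, now):
        # Refill credit for the elapsed time, capped at the burst size.
        self.tokens = min(self.burst, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= nbytes:
            self.tokens -= nbytes
            return True  # forward now
        return False     # queue (or drop), nudging the sender to back off sooner

# Offer 1500-byte packets every 1 ms (12 Mb/s) against an 8 Mb/s shaper.
tb = TokenBucket(rate=1_000_000, burst=10_000)
sent = sum(1500 for t in range(2000) if tb.allow(1500, now=t / 1000.0))
print(sent)  # ~2,000,000 bytes over 2 s: held to the shaped rate
```

Real shapers (e.g. Linux tc's token bucket filter) add a queue in front of the drop decision, but the credit mechanism is the same.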
So, for 10 pairs, 45 guesses (9 + 8 + 7 + 6 + 5 + 4 + 3 + 2 + 1) in the worst case, and roughly half that on average?
It's interesting how close 22.5 is to the 21.8 bits of entropy for 10!, and that has me wondering how often you would win if you followed this strategy with 18 truth booths followed by one match up (to maintain the same total number of queries).
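The two numbers being compared, spelled out (the "roughly half" average is the estimate from upthread, not something I'm asserting independently):

```python
import math

worst_case = sum(range(1, 10))                # 9 + 8 + ... + 1 = 45 guesses
avg_estimate = worst_case / 2                 # the "roughly half" figure: 22.5
entropy_bits = math.log2(math.factorial(10))  # log2(10!) ~ 21.79 bits
print(worst_case, avg_estimate, round(entropy_bits, 2))
```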
Simulation suggests about 24% chance of winning with that strategy, with 100k samples. (I simplified each run to "shuffle [0..n), find index of 0".)