It's a great language.
Its dependent-types, theorem-proving-oriented type system, combined with AI assistants, makes it the language of the future IMO.
Yes, it is true that the model has undergone SFT, RLHF, and other alignment procedures, so the logprobs no longer reflect the next-token probabilities of the pre-training corpus.
Nevertheless, in concrete applications such as our main internal use case, structured data extraction from PDF documents, it has proved very valuable.
When a value was clearly well extracted, the logprob was high; when the information was very hard to find or absent, the model would still output, or hallucinate, some value, but with a much lower logprob.
We need to build a syntax tree and be able to map each value (number, boolean, string) to a range of characters, and then to the GPT tokens (for which OpenAI produces logprobs).
This is the reason we use Lark.
Same token usage.
Actually, OpenAI returns the logprob of each token, conditional on the previous ones, when the request sets the option `logprobs=true`.
This lib simply parses the output JSON string with `lark` into an AST with value nodes. The value nodes are mapped back to character ranges in the JSON string. The characters are then mapped back to the GPT tokens overlapping those ranges, and the tokens' logprobs are summed.
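Concretely, the last step, summing the logprobs of the tokens that overlap a value's character range, can be sketched like this (the token boundaries and logprob values are made up for illustration and are not what the API would actually return):

```python
def value_logprob(token_logprobs, start, end):
    """Sum logprobs of tokens overlapping chars [start, end) of the output."""
    total, offset = 0.0, 0
    for token, logprob in token_logprobs:
        tok_start, tok_end = offset, offset + len(token)
        if tok_start < end and tok_end > start:  # ranges overlap
            total += logprob
        offset = tok_end
    return total

# Hypothetical tokenization of the JSON output '{"price": 12.5}'.
tokens = [('{"', -0.1), ('price', -0.2), ('":', -0.1),
          (' 12', -0.9), ('.5', -0.7), ('}', -0.1)]
text = "".join(t for t, _ in tokens)
start = text.index("12")  # value "12.5" spans chars [10, 14)
# Sums the logprobs of ' 12' and '.5': -0.9 + -0.7 = -1.6
print(value_logprob(tokens, start, start + 4))
```

Summing works because the logprobs are conditional on the preceding tokens, so the sum is the log-probability of the whole value span given its prefix.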