sudb

2026-03-10 12:56

Commented: "Xkcd: Change in Slope"

there definitely are some areas in software-land where graphing data and/or directly eyeballing it genuinely helps to spot patterns where statistical methods might be cumbersome/tricky/otherwise annoying, like log analysis[1]

[1] https://jvns.ca/blog/2022/12/07/tips-for-analyzing-logs/

2026-03-10 12:48

Commented: "Perhaps not Boring Technology after all"

This sharp uptick in LLM in-context "learning" capabilities means I'm more excited than ever to try to get to grips with "new" languages like Nim or Gleam (but worried that using LLMs to help me get to a working end state will rob me of some of the experience of learning).

2026-03-02 12:10

Commented: "When does MCP make sense vs CLI?"

Every MCP vs CLI argument I've seen really glosses over _where_ the agent is running, and how that makes a difference. For individual users where you're running agents locally, I totally agree that CLIs cover the vast majority of use cases, where available.

I think something I've not seen anyone mention is that MCPs make much more sense to equip agents on 3rd party platforms with the tools they need - often installing specific CLIs isn't possible and there's the question of whether you trust the platform with your CLI authentication key.

2025-07-17 4:45

Submitted: "How We Beat Claude Code on Terminal Bench"

3 points0 commentswww.enginelabs.ai

Engine Labs has posted a state of the art score on Terminal Bench - the top score with a Claude Sonnet 4 class model and #2 overall. This improves on Claude Code's equivalent score by ~25%, or around…

2025-07-07 6:59

Submitted: "Terminals for LLMs: A Halting Problem"

2 points0 commentscto.new

cto.new is the world's first completely free AI code agent. Use the latest frontier models from Anthropic, OpenAI and more. No credit card or API keys required.

Hacker News

sudb

26

2023-02-14

Recent Activity

Commented: "Xkcd: Change in Slope"

Commented: "Perhaps not Boring Technology after all"

Commented: "When does MCP make sense vs CLI?"

Submitted: "How We Beat Claude Code on Terminal Bench"

Submitted: "Terminals for LLMs: A Halting Problem"

HackerNews