Show HN: Why write code if the LLM can just do the thing? (web app experiment)

2025-11-0117:45436324github.com

Contribute to samrolken/nokode development by creating an account on GitHub.

Show article

A web server with no application logic. Just an LLM with three tools.

One day we won't need code. LLMs will output video at 120fps, sample inputs in realtime, and just... be our computers. No apps, no code, just intent and execution.

That's science fiction.

But I got curious: with a few hours this weekend and today's level of tech, how far can we get?

I expected this to fail spectacularly.

Everyone's focused on AI that writes code. You know the usual suspects, Claude Code, Cursor, Copilot, all that. But that felt like missing the bigger picture. So I built something to test a different question: what if you skip code generation entirely? A web server with zero application code. No routes, no controllers, no business logic. Just an HTTP server that asks an LLM "what should I do?" for every request.

The goal: prove how far away we really are from that future.

Contact manager. Basic CRUD: forms, database, list views, persistence.

Why? Because most software is just CRUD dressed up differently. If this works at all, it would be something.

// The entire backend
const result = await generateText({ model, tools: { database, // Run SQL queries webResponse, // Return HTML/JSON updateMemory // Save user feedback }, prompt: `Handle this HTTP request: ${method} ${path}`,
});

Three tools:

database - Execute SQL on SQLite. AI designs the schema.
webResponse - Return any HTTP response. AI generates the HTML, JavaScript, JSON or whatever fits.
updateMemory - Persist feedback to markdown. AI reads it on next request.

The AI infers what to return from the path alone. Hit /contacts and you get an HTML page. Hit /api/contacts and you get JSON:

// What the AI generates for /api/contacts
{ "contacts": [ { "id": 1, "name": "Alice", "email": "alice@example.com" }, { "id": 2, "name": "Bob", "email": "bob@example.com" } ]
}

Every page has a feedback widget. Users type "make buttons bigger" or "use dark theme" and the AI implements it.

It works. That's annoying.

Every click or form submission took 30-60 seconds. Traditional web apps respond in 10-100 milliseconds. That's 300-6000x slower. Each request cost $0.01-0.05 in API tokens—100-1000x more expensive than traditional compute. The AI spent 75-85% of its time reasoning, forgot what UI it generated 5 seconds ago, and when it hallucinated broken SQL that was an immediate 500 error. Colors drifted between requests. Layouts changed. I tried prompt engineering tricks like "⚡ THINK QUICKLY" and it made things slower because the model spent more time reasoning about how to be fast.

But despite all that, forms actually submitted correctly. Data persisted across restarts. The UI was usable. APIs returned valid JSON. User feedback got implemented. The AI invented, without any examples, sensible database schemas with proper types and indexes, parameterized SQL queries that were safe from injection, REST-ish API conventions, responsive Bootstrap layouts, form validation, and error handling for edge cases. All emergent behavior from giving it three tools and a prompt.

So yes, the capability exists. The AI can handle application logic. It's just catastrophically slow, absurdly expensive, and has the memory of a goldfish.

The capability exists. The AI can handle application logic.

The problems are all performance: speed (300-6000x slower), cost (100-1000x more expensive), consistency (no design memory), reliability (hallucinations → errors).

But these feel like problems of degree, not kind:

Inference: improving ~10x/year
Cost: heading toward zero
Context: growing (eventual design memory?)
Errors: dropping

But the fact that I built a working CRUD app with zero application code, despite it being slow and expensive, suggests we might be closer to "AI just does the thing" than "AI helps write code."

In this project, what's left is infrastructure: HTTP setup, tool definitions, database connections. The application logic is gone. But the real vision? 120 inferences per second rendering displays with constant realtime input sampling. That becomes the computer. No HTTP servers, no databases, no infrastructure layer at all. Just intent and execution.

I think we don't realize how much code, as a thing, is mostly transitional.

.env:

LLM_PROVIDER=anthropic
ANTHROPIC_API_KEY=sk-ant-...
ANTHROPIC_MODEL=claude-3-haiku-20240307

Visit http://localhost:3001. First request: 30-60s.

What to try:

Check out prompt.md and customize it. Change what app it builds, add features, modify the behavior. That's the whole interface.

Out of the box it builds a contact manager. But try:

/game - Maybe you get a game?
/dashboard - Could be anything
/api/stats - Might invent an API
Type feedback: "make this purple" or "add a search box"

⚠️ Cost warning: Each request costs $0.001-0.05 depending on model. Budget accordingly.

MIT License

Page 2

You can’t perform that action at this time.

Read the original article

Comments

By sunaurus 2025-11-0118:2511 reply

The question posed sounds like "why should we have deterministic behavior if we can have non-deterministic behavior instead?"

Am I wrong to think that the answer is obvious? I mean, who wants web apps to behave differently every time you interact with them?

By jstummbillig 2025-11-0118:5611 reply

Because nobody actually wants a "web app". People want food, love, sex or: solutions.

You or your coworker are not a web app. You can do some of the things that web apps can, and many things that a web app can't, but neither is because of the modality.

Coded determinism is hard for many problems and I find it entirely plausible that it could turn out to be the wrong approach in software, that is designed to solve some level of complex problems more generally. Average humans are pretty great at solving a certain class of complex problems that we tried to tackle unsuccessfully with many millions lines of deterministic code, or simply have not had a handle on at all, like (like build a great software CEO).

By latexr 2025-11-0213:124 reply

> Because nobody actually wants a "web app". People want food, love, sex or: solutions.

Talk about a nonsensical non-sequitur, but I’ll bite. People want those to be deterministic too, to a large extent.

When people cook a meal with the same ingredients and the same times and processes (like parameters to a function), they expect it to taste about the same, they never expect to cook a pizza and take a salad out of the oven.

When they have sex, people expect to ejaculate and feel good, not have their intercourse morph into a drag race with a clown half-way though.

And when they want a “solution”, they want it to be reliable and trustworthy, not have it shit the bed unpredictably.

By mavamaarten 2025-11-0217:05

Exactly this. The perfect example is Google Assistant for me. It's such a terrible service because it's so indeterministic. One day it happily answers your basic question with a smile, and when you need it most it doesn't even try and only comes up with "Sorry I don't understand".

When products have limitations, those are usually acceptable to me if I know what they are or if I can find out what the breaking point is.

If the breaking point was me speaking a bit unclearly, I'd speak more clearly. If the breaking point was complex questions, I'd ask simpler ones. If the breaking point is truly random, I simply stop using the service because it's unpredictable and frustrating.

By tomcam 2025-11-031:25

> When they have sex, people expect to ejaculate and feel good, not have their intercourse morph into a drag race with a clown half-way though.

speak for yourself

By pempem 2025-11-0217:08

Ways to start my morning...reading "When they have sex, people expect to ejaculate and feel good, not have their intercourse morph into a drag race with a clown half-way though."

Stellar description.

By davnicwil 2025-11-0217:271 reply

This thing of 'look, nobody cares about the details really, they just care about the solution' is a meme that I think will be here forever in software. It was here before LLMs, they're now just the current socially accepted legitimacy vehicle for the meme.

In the end, useful stuff is built by people caring about the details. This will always be true. I think in LLMs and broadly AI people see an escape valve from that where the thinking about the details can be taken off their hands, and that's appealing, but it won't work in exactly the same way that having a human take the details off your hands doesn't usually work that well unless you yourself understand the details to a large extent (not necessarily down to the atoms, but at the point of abstraction where it matters, which in software is mostly about deterministically how do the logic flows of the thing actually work and why).

I think a lot of people just don't intuit this. An illustrative analogy might be something else creative, like music. Imagine the conversation where you're writing a song and discussing some fine point of detail like the lyrics, should I have this or that line in there, and ask someone's opinion, and their answer is 'well listen, I don't really know about lyrics and all of that, but I know all that really matters in the end is the vibe of the song'. That contributes about the same level of usefulness as talking about how software users are ultimately looking for 'solutions' without talking about the details of said software.

By mojoe 2025-11-033:16

Exactly, in the long run it's the people who care the most who win, it's tautological

By 113 2025-11-0119:339 reply

> Because nobody actually wants a "web app". People want food, love, sex or: solutions.

Okay but when I start my car I want to drive it, not fuck it.

By jstummbillig 2025-11-0120:264 reply

Most of us actually drive a car to get somewhere. The car, and the driving, are just a modality. Which is the point.

By kennywinker 2025-11-030:352 reply

If this was a good answer to mobility, people would prefer the bus over their car. It’s non-deterministic - when will it come? How quick will i get there? Will i get to sit? And it’s operated by an intelligent agent (driver).

Every reason people prefer a car or bike over the bus is a reason non-deterministic agents are a bad interface.

And that analogy works as a glimpse into the future - we’re looking at a fast approaching world where LLMs are the interface to everything for most of us - except for the wealthy, who have access to more deterministic services or actual human agents. How long before the rich person car rental service is the only one with staff at the desk, and the cheaper options are all LLM based agents? Poor people ride the bus, rich people get to drive.

By aryehof 2025-11-036:21

Bus vs car hit home for me as a great example of non vs deterministic.

It has always seemed to me that workflow or processes need to be deterministic and not decided by an LLM.

By soco 2025-11-0322:00

Here in Switzerland the bus is the deterministic choice. Just saying.

By 63stack 2025-11-029:291 reply

Most of us actually want to get somewhere to do an activity. The getting there is just a modality.

By jermaustin1 2025-11-0212:211 reply

Most of us actually want to get some where to do an activity to enjoy ourselves. The getting there, and activity, are just modalities.

By tags2k 2025-11-0212:412 reply

Most of us actually want to get somewhere to do an activity to then have known we did it for the rest of our lives as if to extract some intangible pleasure from its memory. Why don't we just hallucinate that we did it?

By shswkna 2025-11-0213:083 reply

This leads to us asking the deepest question of all: What is the point of our existence. Or as someone suggests lower down, in our current form all needs could ultimately be satisfied if AI just provided us with the right chemicals. (Which drug addicts already understand)

This can be answered though, albeit imperfectly. On a more reductionist level, we are the cosmos experiencing itself. Now there are many ways to approach this. But just providing us with the right chemicals to feel pleasure/satisfaction is a step backwards. All the evolution of a human being, just to end up functionally like an amoeba or a bacteria.

So we need to retrace our steps backwards in this thought process.

I could write a long essay on this.

But, to exist in first place, and to keep existing against all the constraints of the universe, is already pretty fucking amazing.

Whether we do all the things we do, just in order to stay alive and keep existing, or if the point is to be the cosmos “experiencing itself”, is pretty much two sides of the same coin.

By narrator 2025-11-0215:15

>Or as someone suggests lower down, in our current form all needs could ultimately be satisfied if AI just provided us with the right chemicals. (Which drug addicts already understand)

When you suddenly realize walking down the street that the very high fentanyl zombie is having a better day than you are.

Yeah, you can push the button in your brain that says "You won the game." However, all those buttons were there so you would self-replicate energy efficient compute. Your brain runs on 10 watts after all. It's going to take a while for AI to get there, especially without the capability for efficient self-repair.

By tags2k 2025-11-0215:16

Indeed - stick me in my pod and inject those experience chemicals into me, what's the difference? But also, what would be the point? What's the point anyway?

In one scenario every atom's trajectory was destined from the creation of time and we're just sitting in the passenger seat watching. In another, if we do have free will then we control the "real world" underneath - the quantum and particle realms - as if through a UI. In the pod scenario, we are just blobs experiencing chemical reactions through some kind of translation device - but aren't we the same in the other scenarios too?

By awesomecomment 2025-11-0222:22

[dead]

By 63stack 2025-11-0310:52

This was actually my point as well. You can follow this thought process all the way up to "make those specific neuron pathways in my brain fire", everything else is just the getting there part.

By GTP 2025-11-0213:45

But I want that somewhere to be deterministic, i.e. I want to arrive to the place I choose. With this kind of non-determinism instead, I have a big chance of getting to the place I choose. But I will also every now and then end up in a different place.

By 113 2025-11-021:143 reply

Yeah but in this case your car is non-deterministic so

By mikodin 2025-11-021:512 reply

Well the need is to arrive where you are going.

If we were in an imagined world and you are headed to work

You either walk out your door and there is a self driving car, or you walk out of your door and there is a train waiting for you or you walk out of your door and there is a helicopter or you walk out of your door and there is a literal worm hole.

Let's say all take the same amount of time, are equally safe, same cost, have the same amenities inside, and "feel the same" - would you care if it were different every day?

I don't think I would.

Maybe the wormhole causes slight nausea ;)

By didericis 2025-11-025:261 reply

> Well the need is to arrive where you are going.

In order to get to your destination, you need to explain where you want to go. Whatever you call that “imperative language”, in order to actually get the thing you want, you have to explain it. That’s an unavoidable aspect of interacting with anything that responds to commands, computer or not.

If the AI misunderstands those instructions and takes you to a slightly different place than you want to go, that’s a huge problem. But it’s bound to happen if you’re writing machine instructions in a natural language like English and in an environment where the same instructions aren’t consistently or deterministically interpreted. It’s even more likely if the destination or task is particularly difficult/complex to explain at the desired level of detail.

There’s a certain irreducible level of complexity involved in directing and translating a user’s intent into machine output simply and reliably that people keep trying to “solve”, but the issue keeps reasserting itself generation after generation. COBOL was “plain english” and people assumed it would make interacting with computers like giving instructions to another employee over half a century ago.

The primary difficulty is not the language used to articulate intent, the primary difficulty is articulating intent.

By simianwords 2025-11-029:062 reply

this is a weak argument.. i use normal taxis and ask the driver to take me to a place in natural language - a process which is certainly non deterministic.

By chii 2025-11-0210:061 reply

and the taxi driver has an intelligence that enables them to interpret your destination, even if ambiguous. And even then, mistakes happen (all the time with taxis going to a different place than the passenger intended because the names may have been similar).

By simianwords 2025-11-0210:20

Yes so a bit of non determinism doesn’t hurt anyone. Current LLMs are pretty accurate when it comes to these sort of things.

By didericis 2025-11-0215:34

> a process which is certainly non deterministic

The specific events that follow when asking a taxi driver where to go may not be exactly repeatable, but reality enforces physical determinism that is not explicitly understood by probabilistic token predictors. If you drive into a wall you will obey deterministic laws of momentum. If you drive off a cliff you will obey deterministic laws of gravity. These are certainties, not high probabilities. A physical taxi cannot have a catastrophic instant change in implementation and have its wheels or engine disappear when it stops to pick you up. A human taxi driver cannot instantly swap their physical taxi for a submarine, they cannot swap new york with paris, they cannot pass through buildings… the real world has a physically determined option-space that symbolic token predictors don’t understand yet.

And the reason humans are good at interpreting human intent correctly is not just that we’ve had billions of years of training with direct access to physical reality, but because we all share the same basic structure of inbuilt assumptions and “training history”. When interacting with a machine, so many of those basic unstated shared assumptions are absent, which is why it takes more effort to explicitly articulate what it is exactly that you want.

We’re getting much better at getting machines to infer intent from plain english, but even if we created a machine which could perfectly interpret our intentions, that still doesn’t solve the issue of needing to explain what you want in enough detail to actually get it for most tasks. Moving from point A to point B is a pretty simple task to describe. Many tasks aren’t like that, and the complexity comes as much from explaining what it is you want as it does from the implementation.

By hyperadvanced 2025-11-024:01

I think it’s pretty obvious but most people would prefer a regular schedule not a random and potentially psychologically jarring transportation event to start the day.

By chii 2025-11-0210:012 reply

> your car is non-deterministic

it's not as far as your experience goes - you press pedal, it accelerates. You turn the steering, it goes the way it turns. What the car does is deterministic.

More importantly, it does this every time, and the amount of turning (or accelerating) is the same today as it was yesterday.

If an LLM interpreted those inputs, can you say with confidence, that you will accelerate in a way that you predicted? If that is the case, then i would be fine with an LLM interpreted input to drive. Otherwise, how do you know, for sure, that pressing the brakes will stop the car, before you hit somebody in front of you?

of course, you could argue that the input is no longer your moving the brake pads etc - just name a destination and you get there, and that is suppose to be deterministic, as long as you describe your destination correctly. But is that where LLM is at today? or is that the imagined future of LLMs?

By iliaxj 2025-11-031:282 reply

Sometimes it doesn't though. Sometimes the engine seizes because a piece of tubing broke and you left your coolant down the road two turns ago. Or you steer off a cliff because there was coolant on the road for some reason. Or the meat sack in front of the wheel just didn't get enough sleep and your response time is degraded and you just can't quite get the thing to feel how you usually do. Ultimately the failure rate is low enough to trust your life on it, but that's just a matter of degree.

By pepoluan 2025-11-032:13

The situations you described reflects a System that has changed. And if the System has changed, then a change in output is to be expected.

It's the same as having a function called "factorial" but you change the multiplication operation to addition instead.

By chii 2025-11-032:40

all of those situations are the "driver's own fault", because they could've had a check to ensure none of that happened before driving. Not true with an LLM (at least, not as of today).

By crote 2025-11-0213:08

Tesla's "self-driving" cars have been working very hard to change this. That piece of road it has been doing flawlessly for months? You're going straight into the barrier today, just because it feels like it.

By nurettin 2025-11-029:29

I mean, as long as it works and it is still technically "my car", I would welcome the change.

By stirfish 2025-11-0120:431 reply

But do you want to drive, or do you want to be wherever you need to be to fuck?

By codebje 2025-11-021:582 reply

For me personally, the latter, but there's definitely people out there that just love driving.

Either way, these silly reductionist games aren't addressing the point: if I just want to get from A to B then I definitely want the absolute minimum of unpredictability in how I do it.

By theendisney 2025-11-023:351 reply

That would ruin the brain placticity.

I wonder now, if everything is always different and suddenly every day would be the same. How many times as terrifying would that be compared to the opposite?

By whilenot-dev 2025-11-024:16

A form of Alexei Yurchak's hypernormalisation?

By mewpmewp2 2025-11-0211:16

Only because you think the driving is what you want. The point is that what you want is determined by our brain chemicals. Many steps could be skipped if we could just give you the chemicals in your brain that you craved.

By ozim 2025-11-0121:191 reply

I feel like this is the point where we start to make jokes about Honda owners.

By bfkwlfkjf 2025-11-024:201 reply

Go on, what about honda owners? I don't know the meme.

By hathawsh 2025-11-028:11

The "Wham Baam" YouTube channels have a running joke about Hondas bumping into other cars with concerning frequency.

By lambdaone 2025-11-0211:06

Sadly, this is not true of a (admittedly very small) number of individuals.

By hinkley 2025-11-0120:22

Christine didn’t end well for anyone.

By OJFord 2025-11-0121:431 reply

...so that you can get to the supermarket for food, to meet someone you love, meet someone you may or may not love, or to solve the problem of how to get to work; etc.

Your ancestors didn't want horses and carts, bicycles, shoes - they wanted the solutions of the day to the same scenarios above.

By sublinear 2025-11-0123:571 reply

As much as I love your point, this is where I must ask whether you even want a corporeal form to contain the level of ego you're describing. Would you prefer to be an eternal ghost?

To dismiss the entire universe and its hostilities towards our existence and the workarounds we invent in response as mere means to an end rather than our essence is truly wild.

By anonzzzies 2025-11-026:271 reply

Most people need to go somewhere (in a hurry) to make money or food etc which most people don't want to do if they didn't have to, so yeah it is mostly a means to an end.

By sublinear 2025-11-028:231 reply

And yet that money is ultimately spent on more means to ends that are just as inconvenient from another perspective?

My point was that there is no true end goal as long as whims continue. The need to craft yet more means is equally endless. The crafting is the primary human experience, not the using. The using of a means inevitably becomes transparent and boring.

By mewpmewp2 2025-11-0211:151 reply

It should finalize into introducing satisfaction to the whims directly, so the AI would be directly managing the chemicals in our brains that would trigger feelings of reward and satisfaction.

By sublinear 2025-11-0318:351 reply

I think you're just describing drugs

By mewpmewp2 2025-11-049:48

Yes, but current drugs have many issues such as tolerance build up and withdrawals. If AI could figure out how to directly manage chemicals in the brain in such a way that it keeps working, it would be able to attain its goals of making people happy.

By lazide 2025-11-0120:261 reply

Even if it purred real nice when it started up? (I’m sorry)

By ozim 2025-11-0211:05

Looks like we have a Civic owner xD

By zahrevsky 2025-11-0213:09

Weird kink

By mjevans 2025-11-0122:49

Food -> 'basic needs'... so yeah, Shelter, food, etc. That's why most of us drive. You are also correct to separate Philia and Eros ( https://en.wikipedia.org/wiki/Greek_words_for_love ).

A job is better if your coworkers are of a caliber that they become a secondary family.

By cheema33 2025-11-0120:433 reply

> Average humans are pretty great at solving a certain class of complex problems that we tried to tackle unsuccessfully with many millions lines of deterministic code..

Are you suggesting that an average user would want to precisely describe in detail what they want, every single time, instead of clicking on a link that gives them what they want?

By ethmarks 2025-11-020:111 reply

No, but the average user is capable of describing what they want to something trained in interpreting what users want. The average person is incapable of articulating the exact steps necessary to change a car's oil, but they have no issue with saying "change my car's oil" to a mechanic. The implicit assumption with LLM-based backends is that the LLM would be capable of correctly interpreting vague user requests. Otherwise it wouldn't be very useful.

By sarchertech 2025-11-020:532 reply

The average mechanic won’t do something completely different to your car because you added some extra filler words to your request though.

The average user may not care exactly what the mechanic does to fix your car, but they do expect things to be repeatable. If car repair LLMs function anything like coding LLMs, one request could result in an oil change, while a similar request could end up with an engine replacement.

By ethmarks 2025-11-021:391 reply

I think we're making similar points, but I kind of phrased it weirdly. I agree that current LLMs are sensitive to phrasing and are highly unpredictable and therefore aren't useful in AI-based backends. The point I'm making is that these issues are potentially solvable with better AI and don't philosophically invalidate the idea of a non-programmatic backend.

One could imagine a hypothetical AI model that can do a pretty good job of understanding vague requests, properly refusing irrelevant requests (if you ask a mechanic to bake you a cake he'll likely tell you to go away), and behaving more or less consistently. It is acceptable for an AI-based backend to have a non-zero failure rate. If a mechanic was distracted or misheard you or was just feeling really spiteful, it's not inconceivable that he would replace your engine instead of changing your oil. The critical point is that this happens very, very rarely and 99.99% of the time he will change your oil correctly. Current LLMs have far too high of a failure rate to be useful, but having a failure rate at all is not a non-starter for being useful.

By sarchertech 2025-11-022:13

All of that is theoretically possible. I’m doubtful that LLMs will be the thing that gets us to that though.

Even if it is possible, I’m not sure if we will ever have the compute power to run all or even a significant portion of the world’s computations through LLMs.

By array_key_first 2025-11-022:302 reply

Mechanics, and humans, are non-deterministic. Every mechanic works differently, because they have different bodies and minds.

LLMs are, of course, bad. Or not good enough, at least. But suppose they are. Suppose they're perfect.

Would I rather use an app or just directly interface with an LLM? The LLM might be quicker and easier. I know, for example, ordering takeout is much faster if I just call and speak to a person.

By theendisney 2025-11-023:50

Old people sometimes call rather than order on the website. They never fail to come up with a query that no amount of hardcoded logic could begin to attack.

By sarchertech 2025-11-035:02

> Every mechanic works differently, because they have different bodies and minds.

Yes but the same LLM works very differently on each request. Even ignoring non-determinism, extremely minor differences in wording that a human mechanic wouldn’t even notice will lead to wildly different answers.

> LLMs are, of course, bad. Or not good enough, at least. But suppose they are. Suppose they're perfect.

You’re just talking about magic at that point.

But suppose the do become “perfect”, I’m skeptical we’ll ever have the compute resources to replace a significant fraction of computation with LLMs.

By anonzzzies 2025-11-026:32

There would be bookmarks to prompts and the results of the moment would be cached : both of these are already happening and will get better. We probably will freeze and unfreeze parts of neural nets to just get to that point and even mix them up to quickly mix up different concept you described before and continue from there.

By samdoesnothing 2025-11-020:04

I think they're suggesting that some problems are trivially solvable by humans but extremely hard to do with code - in fact the outcome can seem non-deterministic despite it being deterministic because there are so many confounding variables at play. This is where an LLM or other for of AI could be a valid solution.

By Aerroon 2025-11-021:37

When I reach for a hammer I want it to behave like a hammer every time. I don't ever want the head to fly off the handle or for it to do other things. Sometimes I might wish the hammer were slightly different, but most of the time I would want it to be exactly like the hammer I have.

Websites are tools. Tools being non-deterministic can be a really big problem.

By majormajor 2025-11-020:451 reply

Companies want determinism. And for most things, people want predictability. We've spent a century turning people into robots for customer support, assembly lines, etc. Very few parts of everyday life that still boil down to "make a deal with the person you're talking to."

So even if it would be better to have more flexibility, most business won't want it.

By pigpop 2025-11-021:121 reply

Why sell to a company when you can replace it?

I can speculate about what LLM-first software and businesses might look like and I find some of those speculations more attractive than what's currently on offer from existing companies.

The first one, which is already happening to some degree on large platforms like X, is LLM powered social media. Instead of having a human designed algorithm handle suggestions you hand it over to an LLM to decide but it could go further. It could handle customizing the look of the client app for each user, it could provide goal based suggestions or search so you could tell it what type of posts or accounts you're looking for or a reason you're looking for them e.g. "I want to learn ML and find a job in that field" and it gives you a list of users that are in that field, post frequent and high quality educational material, have demonstrated willingness to mentor and are currently not too busy to do so as well as a list of posts that serve as a good starting point, etc.

The difference in functionality would be similar to the change from static websites to dynamic web apps. It adds even more interactivity to the page and broadens the scope of uses you can find for it.

By majormajor 2025-11-021:41

Sell to? I'm talking about buying from. How are you replacing your grocery store, power company, favorite restaurants, etc, with an LLM? Things like vertical integration and economies of scale are not going anywhere.

By pepoluan 2025-11-032:07

The issue with not having something deterministic is that when there's regression, you cannot surgically fix the regression. Because you can't know how "Plan A" got morphed into "Modules B, C, D, E, F, G," and so on.

And don't even try to claim there won't ever be any regression: Current LLM-based A.I. will 'happily' lie to you that they passed all tests -- because based on interactions in the past, it has.

By Ghos3t 2025-11-025:571 reply

So basically you say the future of web would be everyone gets their own Jarvis, and like Tony you just tell Jarvis what you want and it does it for you, theres no need for a preexisting software or to even write a new one, it just does what's needed to fulfill the given request and give you the results you want. This sounds nice but wouldn't it get repetitive and computationally expensive, life imagine instead of Google maps, everyone just asks the AI directly for the things people typically use Google maps for like directions and location reviews etc. A centralized application like maps can be more efficient as it's optimized for commonly needed work and it can be further improved from all the data gathered from users who interact with this app, on the other hand if AI was allowed to do it's own thing, it could keep reinventing the wheel solving the same tasks again and again without the benefit of building on top of prior work, while not getting the improvements that it would get from the network effect of a large number of users interacting with the same app.

By acomjean 2025-11-0213:44

You might end up with ai trying to get information from ai, which saves us the frustration..

knows where we’d end up?

On the other hand the logs might be a great read.

By rafaelmn 2025-11-0218:13

We're used to dealing with human failure modes, AI fails in so unfamiliar ways it's hard to deal with.

By anonzzzies 2025-11-026:34

But it is still very early days. And if you have the AI generate code for deterministic things and fast execution, but the ai always monitors the code and if the user requires things that don't fit code, it will jump in. It's not one or the other necessarily.

By hshdhdhehd 2025-11-029:25

Determinism is the edge these systems have. Granted in theory enough AI power could be just as good. Like 1,000,000 humans could give you the answer of a postgres query. But the postgres gonna be more efficient.

By samrolken 2025-11-0118:291 reply

No, I wouldn’t say that my hypothesis is that non-deterministic behavior is good. It’s an undesirable side effect and illustrates the gap we have between now and the coming post-code world.

By killingtime74 2025-11-0118:341 reply

AI wouldn't be intelligent though if it was deterministic. It would just be information retrieval

By finnborge 2025-11-0120:291 reply

It already is "just" information retrieval, just with stochastic threads refining the geometry of the information.

By nxor 2025-11-0123:52

Haha u mean it isn't AGI? /s

By admax88qqq 2025-11-0118:283 reply

Web apps kind of already do that with most companies shipping constant UX redesigns, A/B tests, new features, etc.

For a typical user today’s software isn’t particularly deterministic. Auto updates mean your software is constantly changing under you.

By Jaygles 2025-11-0118:351 reply

I don't think that is what the original commenter was getting at. In your case, the company is actively choosing to make changes. Whether its for a good reason, or leads to a good outcome, is beside the point.

LLMs being inherently non-deterministic means using this technology as the foundation of your UI will mean your UI is also non-deterministic. The changes that stem from that are NOT from any active participation of the authors/providers.

This opens a can of worms where there will always be a potential for the LLM to spit out extremely undesirable changes without anyone knowing. Maybe your bank app one day doesn't let you access your money. This is a danger inherent and fundamental to LLMs.

By admax88qqq 2025-11-0118:431 reply

Right I get tha. The point I’m making is that from a users perspective it’s functionally very similar. A non deterministic llm or a non deterministic company full of designers and engineers.

By lazide 2025-11-0120:271 reply

Regardless of what changes the bank makes, it’s not going to let you access someone else’s money. This llm very well might.

By array_key_first 2025-11-022:321 reply

Well, software has been known to have vulnerabilities...

Consider this: the bank teller is non-deterministic, too. They could give you 500 dollars of someone else's money. But they don't, generally.

By an_guy 2025-11-0216:141 reply

Bank tellers are deterministic though. They have a set protocol for each cases and escalate unknown cases to a more deterministic point of contact.

It will be difficult to incorporate relative access or restrictions to features with respect to users current/known state or actions. Might as well write the entire web app at that point.

By array_key_first 2025-11-0218:351 reply

I think the bank teller's systems and processes are deterministic, but the teller itself is not. They could even rob the bank, if they wanted to. They could shoot the customers. They don't, generally, but they can.

I think, if we can efficiently capture a way to "make" LLMs conform to a set of processes, you can cut out the app and just let the LLM do it. I don't think this makes any sense for maybe the next decade, but perhaps at some point it will. And, in such time, software engineering will no longer exist.

By yencabulator 2025-11-1122:55

The actual app is the set of processes.

By paulhebert 2025-11-0118:48

The rate of change is so different it seems absurd to compare the two in that way.

The LLM example gives you a completely different UI on _every_ page load.

That’s very different from companies moving around buttons occasionally and rarely doing full redesigns

By jeltz 2025-11-0118:35

And most end users hate it.

By reissbaker 2025-11-0122:131 reply

I think it's actually conceptually pretty different. LLMs today are usually constrained to:

1. Outputting text (or, sometimes, images).

2. No long term storage except, rarely, closed-source "memory" implementations that just paste stuff into context without much user or LLM control.

This is a really neat glimpse of a future where LLMs can have much richer output and storage. I don't think this is interesting because you can recreate existing apps without coding... But I think it's really interesting as a view of a future with much richer, app-like responses from LLMs, and richer interactions — e.g. rather than needing to format everything as a question, the LLM could generate links that you click on to drill into more information on a subject, which end up querying the LLM itself! And similarly it can ad-hoc manage databases for memory+storage, etc etc.

By pepoluan 2025-11-032:19

Or, maybe, just not use LLMs?

LLM is just one model used in A.I. It's not a panacea.

For generating deterministic output, probably a combination of Neural Networks and Genetic Programming will be better. And probably also much more efficient, energy-wise.

By visarga 2025-11-0215:38

Every time you need a rarely used functionality it might be better to wait 60s for an LLM with MCP tools to do its work than to update an app. It only makes sense to optimize and maintain app functionalities when they are reused.

By vidarh 2025-11-0214:31

For some things you absolutely want deterministic behaviour. For other things, behaviour that adapts to usage and the context provided by the data the user provides sounds like it could potentially be very exciting. I'm glade people are exploring this. The hard part will be figuring out where the line goes, and when and how to "freeze" certain behaviours that the user seems happy with vs. continuing to adapt to data.

By ddalex 2025-11-0120:353 reply

Like, for sure you can ask the AI to save it's "settings" or "context" to a local file in a format of its own choosing, and then bring that back in the next prompt ; couple this with temperature 0 and you should get to a fixed-point deterministic app immediately

By dehsge 2025-11-0120:50

There still maybe some variance at temperature 0. The outputted code could still have errors. LLMs are still bounded by the undecidable problems in computational theory like Rices theorem.

By guelo 2025-11-0122:08

Why wouldn't the llm codify that "context" into code so it doesn't have to rethink through it over and over? Just like humans would. Imagine if you were manually operating a website and every time a request came in you had come up with sql queries (without remembering how you did it last time) and manually type the responses. You wouldn't last long before you started automating.

By geraneum 2025-11-0120:51

> couple this with temperature 0

Not quite the case. Temperature 0 is not the same as random seed. Also there are downsides to lowering temperature (always choosing the most probable next token).

By anon291 2025-11-0221:11

Llms are easily made deterministic by choosing the selection strategy. More than being deterministic they are also fully analayzable and you don't run into issues like the halting problem if you constrain the output appropriately.

By SecretDreams 2025-11-031:47

Why do good thing consistently when we can do great thing that only works sometimes??? :(

By myhf 2025-11-0118:413 reply

Designing a system with deterministic behavior would require the developer to think. Human-Computer Interaction experts agree that a better policy is to "Don't Make Me Think" [1]

[1] https://en.wikipedia.org/wiki/Don%27t_Make_Me_Think

By krapp 2025-11-0118:511 reply

That book is talking about user interaction and application design, not development.

We absolutely should want developers to think.

By crabmusket 2025-11-0121:101 reply

As experiments like TFA become more common, the argument will shift to whether anybody should think about anything at all.

By krapp 2025-11-029:571 reply

What argument? I see a business model here, not an argument.

By crabmusket 2025-11-032:33

I meant "the discourse", "the conversation we are all having", interpreting the experiment in TFA as an entry in that discourse.

By _se 2025-11-0119:091 reply

This is such a massive misunderstanding of the book. Have you even read it? The developer needs to think so that the user doesn't have to...

By finnborge 2025-11-0120:101 reply

My most charitable interpretation of the perceived misunderstanding is that the intent was to frame developers as "the user."

This project would be the developer tool used to produce interactive tools for end users.

More practically, it just redefines the developer's position; the developer and end-user are both "users". So the developer doesn't need to think AND the user doesn't need to think.

By stirfish 2025-11-0120:41

I interpreted it like "why don't we simply eat the orphans"? It kind of works but it's absurd, so it's funny. I didn't think about it too hard though, because I'm on a computer.

By AstroBen 2025-11-0118:55

..is this an AI comment?

By thih9 2025-11-0118:354 reply

> who wants web apps to behave differently every time you interact with them?

Technically everyone, we stopped using static pages a while ago.

Imagine pages that can now show you e.g. infinitely customizable UI; or, more likely, extremely personalized ads.

By ozim 2025-11-0211:11

Small anecdote. We were releasing UI changes every 2 weeks making app better more user friendly etc.

Product owners were happy.

Until users came for us with pitchforks as they didn’t want stuff to change constantly.

We backed out to releasing on monthly cadence.

By ehutch79 2025-11-0118:40

No.

When I go to the dmv website to renew my license, I want it to renew my license every single time

By anthk 2025-11-0123:57

Ah, sure; that's why everyone got Adblock and UBo in first place. Even more under phones.

By hansmayer 2025-11-0123:46

> infinitely customizable UI; or, more likely, extremely personalized ads

Yeah, NO.

By finnborge 2025-11-0119:201 reply

This is amazing. It very creatively emphasizes how our definition of "boilerplate code" will shift over time. Another layer of abstraction would be running N of these, sandboxed, responding to each request, and then serving whichever instance is internally evaluated to have done the best. Then you're kind of performing meta reinforcement learning with each whole system as a head.

The hard part (coming from this direction) is enshrining the translation of specific user intentions into deterministic outputs, as others here have already mentioned. The hard part when coming from the other direction (traditional web apps) is responding fluidly/flexibly, or resolving the variance in each user's ability to express their intent.

Stability/consistency could be introduced through traditional mechanisms: Encoded instructions systematically evaluated, or, via the LLMs language interface, intent-focusing mechanisms: through increasing the prompt length / hydrating the user request with additional context/intent: "use this UI, don't drop the db."

From where I'm sitting, LLMs provide a now modality for evaluating intent. How we act on that intent can be totally fluid, totally rigid, or, perhaps obviously, somewhere in-between.

Very provocative to see this near-maximum example of non-deterministic fluid intent interpretation>execution. Thanks, I hate how much I love it!

By SkiFire13 2025-11-0120:49

> serving whichever instance is internally evaluated to have done the best. Then you're kind of performing meta reinforcement learning

I thought this didn't work? You basically end up fitting your AI models to whatever is the internal evaluation method, and creating a good evaluation method most often ends up having a similar complexity as creating the initial AI model you wanted to train.

By Finbarr 2025-11-0121:013 reply

If you added a few more tools that let the LLM modify code files that would directly serve requests, that would significantly speed up future responses and also ensure consistency. Code would act like memory. A direct HTTP request to the LLM is like a cache miss. You could still have the feedback mechanism allowing a bypass that causes an update to the code. Perhaps code just becomes a store of consistency for LLMs over time.

By samrolken 2025-11-020:052 reply

This was an unserious experiment meant to illustrate the gap and bottlenecks that are still there. I agree that there's a lot that could be done to optimize this kind of approach. But even if you did, I'm not sure the results would be viable and I'm pretty sure classic coding (with LLM assistance and all) would still outperform such a product.

By Finbarr 2025-11-020:21

I found it thought provoking and interesting. Thanks for sharing.

By theendisney 2025-11-023:21

You need to do more unserious experments. This one is perhaps the best stupid idea ive seen.

Maybe the browser should learn to talk back.

You could store the pages in the database and periodically generate a new version based on the current set of pages and the share of traffic they enjoy. You would get something that evolves and stabilizes in some niche. Have an innitial prompt like; "dinosaurs!" Then sit back and see the magic unfold.

By kinduff 2025-11-021:15

Creating instructions and adding boundaries on how to grow, and you end up with a seed.

By hartator 2025-11-0123:121 reply

You should try making this.

By Finbarr 2025-11-020:21

I'm tempted to!