I think you underestimate how impossible of a task it is to write sufficient tes...

brookst · 2026-01-23T20:29:05 1769200145

How do we check humans’ work for these unknown errors?

gpm · 2026-01-23T20:40:45 1769200845

An expectation of professionalism, training and written material on software design, providing incentives (like promotions) to not produce crap, etc.

It's not a world where everything produced is immediately verified.

If a human consistently produced only the quality of work Claude Opus 4.5 is capable of, I would expect them to be fired from just about any job in short order. Yes, they'd get some stuff done, but they'd do too much damage to be worth it. Of course, humans are much more expensive than LLMs to manage, so this doesn't mean it can't be a useful tool... it's just not that useful a tool yet.

ASalazarMX · 2026-01-23T20:42:43 1769200963

Humans may be prone to err, but they don't confabulate like LLMs do. Also, the unit tests are written by people who intimately know the expected behavior of the code, and who, surprisingly, are frequently the same programmer who wrote it.

This can be abused because the programmer is both judge and jury, but people tend to handle this paradox much better than LLMs.

AstroBen · 2026-01-23T21:36:56 1769204216

We have many layers to prevent them:

1. Competent humans architecting and leading the system who understand the specs and business needs, have critical thinking skills, and are good at their job

2. Automated tests

3. Competent human reviewers

4. QA

5. Angry users

Cutting out 1 and 3 in favor of more tests isn't gonna work.

kaydub · 2026-01-23T22:52:58 1769208778

Ugh, I just think everyone in these threads is talking past each other.

I'm personally not advocating for not having humans in the loop. I don't know of anybody using LLM tools or advocating for them who is saying there shouldn't be humans in the loop. "Vibe coding" seems to mean different things to different people.