Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think you underestimate how impossible of a task it is to write sufficient test coverage to keep AI in line

You can assert that something you want to happen is actually happening

How do you assert all the things it shouldn't be doing? They're endless. And AI WILL mess up

It's enough if you're actively reviewing the code in depth.. but if you're vibe coding? Good luck



How do we check humans’ work for these unknown errors?


An expectation of professionalism, training and written material on software design, providing incentives (like promotions) to not produce crap, etc.

It's not a world where everything produced is immediately verified.

If a human consistently only produced the quality of work Claude Opus 4.5 is capable of I would expect them to be fired from just about any job in short order. Yes, they'd get some stuff done, but they'd do too much damage to be worth it. Of course humans are much more expensive than LLMs to manage so this doesn't mean it can't be a useful tool... just it's not that useful a tool yet.


Humans may be prone to err, but they don't confabulate like LLMs do. Also, the unit tests are done by people who know intimately the expected behavior of the code, which surprisingly, it's frequently the same programmer.

This can be abused because the programmer is both judge and jury, but people tend to handle this paradox much better than LLMs.


We have many layers to prevent them:

1. Competent humans architecting and leading the system who understand the specs, business needs, have critical thinking skills and are good at their job

2. Automated tests

3. Competent human reviewers

4. QA

5. Angry users

Cutting out 1 and 3 in favor of more tests isn't gunna work


Ugh, I just think everyone in these threads are talking past each other.

I'm personally not advocating for not having humans in the loop. I don't know of anybody using llm tools or advocating for them that are saying there shouldn't be humans in the loop. "vibe coding" seems to mean different things to different people.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: