Except humans learn from your PR comments and in other interactions with more ex...

org3 · 2025-09-21T19:04:27 1758481467

Some people say we're near the end of pre-training scaling, and RLHF etc is going to be more important in the future. I'm interested in trying out systems like https://github.com/OpenPipe/ART to be able to train agents to work on a particular codebase and learn from my development logs and previous interactions with agents.

krageon · 2025-09-22T13:54:03 1758549243

If they're unmotivated enough to not get there after four review rounds for a junior-appropriate feature, they're not going to get better. It's a little impolite to say, but if you spend any significant amount of time coaching juniors you'll encounter exactly what I'm talking about.

EdwardDiego · 2025-09-23T08:28:49 1758616129

I have spent plenty, rest assured.

shepherdjerred · 2025-09-21T18:19:11 1758478751

LLMs can learn if you provide it rules in your repo, and update those rules as you identify the common mistakes the LLM makes

300hoogen · 2025-09-22T07:12:56 1758525176

retarded take

EdwardDiego · 2025-09-23T08:28:22 1758616102

Thanks for the insightful reply that showed me where I went astray.

dayjaby · 2025-09-22T08:12:40 1758528760

Can you elaborate or you call it a day after insulting?

player1234 · 2025-09-25T10:23:54 1758795834

Aggreed