so how would you eval your own claude.md? Each context is unique to the project, team, and personal root claude.md. Do you just take a given task and ask it to redo the same one over and over against a known solution? Do you just keep using it and "feel" whether or not it's working? How is that different from what everyone is already doing?
The review eval tests the language, activation, etc. of skills. I guess you could quickly move it all into a skill and then run an eval on that if you're using Tessl. This checks whether the way you write the instructions is being well understood by the agent.
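For what it's worth, the "redo the same task against a known solution" idea can at least be made repeatable instead of vibes-based. A minimal sketch in Python — `run_agent` is a hypothetical stand-in for however you actually invoke the agent with a given claude.md, stubbed out here so the example runs on its own:

```python
def run_agent(claude_md: str, task: str) -> str:
    # Hypothetical stand-in: invoke the agent with this claude.md and task,
    # and return its final answer. Stubbed so this sketch is self-contained.
    return "42"

def passed(output: str, expected: str) -> bool:
    # Simplest possible check; a real eval would run tests or a grader.
    return output.strip() == expected.strip()

def eval_claude_md(claude_md: str, cases: list[tuple[str, str]], runs: int = 5) -> float:
    # Run each (task, known_solution) pair several times to smooth out
    # nondeterminism, and report the overall pass rate.
    results = []
    for task, expected in cases:
        for _ in range(runs):
            results.append(passed(run_agent(claude_md, task), expected))
    return sum(results) / len(results)

rate = eval_claude_md("Always answer tersely.", [("What is 6*7?", "42")])
print(rate)  # 1.0 with the stub above
```

That still doesn't answer whether the pass rate transfers to your real day-to-day tasks, which I think is the harder part of the question.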
Is it just me, or is vibe coding only useful for greenfield projects with minimal complexity? Seems like those projects collapse once enough complexity has built up.
I've tried to vibe code small stuff a few times, but there's not one success story. After about 2-4 hours, I'd hit a wall and ultimately throw the project away, because salvaging it wasn't worth the manual programming effort it would have required (avoiding that effort being why I tried vibe coding it in the first place).
I think vibe coding might be more successful for people doing things an experienced developer can do in their sleep with a few lines of code in Django or something. Something a non-programmer might have previously done with some no-code tool.
I'm someone who hated leetcode-style interviews for the longest time, but I'm starting to come around on them. I get that this style of question is easy to game, but I still think they have _some_ value. The point of these questions was supposed to be to test your ability to problem-solve and come up with a good solution given the tools you knew. That being said, I don't think every company should be using this type of question in their interviews. I think leetcode-style questions should be reserved for companies that are pushing the boundary of the industry, since they're exploring uncharted territory and need people who can come up with unique solutions to problems no one really knows how to solve. I think most companies would be fine with some kind of pairing problem, since most people are solving engineering problems rather than computer science problems. But none of this matters, since we all know that even if we went that direction as an industry, the business people would fuck it up somehow anyways.
> reserved for companies that are pushing the boundary of the industry
In a world where every company believes (or wants to believe) that it's doing some ground-breaking, bleeding-edge work (look at any tech company blog and you'll only find hyped technologies in there), I don't think one can expect companies to fairly assess whether they're really doing such work.