Not sure how accurate this is, but found contextarena benchmarks today when I had the same question.
It appears only gemini has actual context == effective context in these benchmarks. That said, I wasn't able to test this in either gemini cli or antigravity with my pro subscription because, well, it appears nobody actually uses these tools at Google.
Claude is Anthropic's property which they rent to the government. Is there any other place where rental agreements don't come with clauses on how the property can and can't be used?
Enforce this with deterministic guardrails. Use the strictest linting config you possibly can, and even have it write custom, domain-specific linters for things that must never happen. Then you won't have to hand-hold it as much.
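To make the "custom, domain-specific linter" idea concrete, here's a minimal sketch in Python using the stdlib `ast` module. The banned-call rule itself is invented for illustration (assume your team has decided agents must never emit bare `sleep()` or `eval()` calls); swap in whatever rules your domain actually needs.

```python
import ast
import sys

# Hypothetical domain rule for illustration: code must never call
# sleep() or eval() directly (e.g. all waiting goes through a retry helper).
BANNED_CALLS = {"sleep", "eval"}

def lint(source: str, filename: str = "<src>") -> list[str]:
    """Return a list of violation messages for one source file."""
    violations = []
    tree = ast.parse(source, filename=filename)
    for node in ast.walk(tree):
        if isinstance(node, ast.Call):
            func = node.func
            # Handle both bare calls (sleep(...)) and attribute calls (time.sleep(...)).
            name = func.attr if isinstance(func, ast.Attribute) else getattr(func, "id", "")
            if name in BANNED_CALLS:
                violations.append(f"{filename}:{node.lineno}: banned call '{name}()'")
    return violations

if __name__ == "__main__":
    # Wire this into CI or the agent's post-edit hook; any printed line is a violation.
    for path in sys.argv[1:]:
        for msg in lint(open(path).read(), path):
            print(msg)
```

Because the check is a deterministic AST walk rather than another LLM pass, the agent can't talk its way past it.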
Yeah, people seem to forget one of the L's in LLM stands for Language, and human language is likely the largest chunk in training data.
A cli that is well designed for humans is well designed for agents too. The only difference is that you shouldn't dump pages of content that can pollute context needlessly. But then again, you probably shouldn't be dumping pages of content for humans either.
It's not obvious that human language is, or should be, the largest share of training data. It's much easier to generate training data from computers than from humans, and having more training data is very valuable. In particular, for example, one could imagine creating a vast number of debugging problems, with logs and associated command outputs, and training on them.
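A toy sketch of what machine-generated debugging data could look like: render a small buggy program from a template, actually run it, and capture the real traceback as the "evidence" half of a (problem, logs) training pair. The templates and field names here are invented for illustration.

```python
import io
import random
import traceback

# Invented templates: each pairs a buggy snippet with the error it triggers.
TEMPLATES = [
    ("items = {items}\nprint(items[{idx}])", "IndexError"),
    ("d = {{'a': 1}}\nprint(d['{key}'])", "KeyError"),
]

def make_example(seed: int) -> dict:
    """Generate one synthetic (code, traceback, label) debugging example."""
    rng = random.Random(seed)
    template, expected = rng.choice(TEMPLATES)
    code = template.format(items=[1, 2, 3], idx=10, key="missing")
    buf = io.StringIO()
    try:
        exec(code, {})  # run the buggy snippet to harvest a genuine traceback
    except Exception:
        traceback.print_exc(file=buf)
    return {"code": code, "traceback": buf.getvalue(), "label": expected}
```

Scaled up with richer templates (shell sessions, failing tests, log files), this kind of pipeline can mint far more labeled examples than humans would ever write by hand.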
I learned TS after a few years with JS. I thought having strict types was cool. Many of my colleagues with much more (JS) experience than me thought it was a hassle. Not sure if they meant the setup or TS or what but I always thought it was weird.
Isn't that NVMe also very expensive to replace because it's tied to hardware identifiers? If you keep swapping all the time, surely the NVMe would be the first part to fail.
> But you are forgetting your skills (seen it first hand), and you're not learning anything new.
This is just false. I may forget how to write code by hand, but I'm playing with things I never imagined I would have time and ability to, and getting engineering experience that 15 years of hands on engineering couldn't give me.
> Your next interview won't be testing your AI skills.
Which will be a very good signal to me that it's not a good match. If my next interview is leetcode-style, I will fail catastrophically, but then again, I no longer have any desire to be a code writer - AI does it better than me. I want to be a problem solver.
> getting engineering experience that 15 years of hands on engineering couldn't give me.
This is the equivalent of how watching someone climb Mount Everest in a TV show or on YouTube makes you feel like you did it too. You never did; your brain just got the feeling that you did, and it'll never motivate you to do it yourself.
This is only true for fully unsupervised "vibe coding". But you'll find this will not work for anything beyond a basic todo list app.
You'll free up your time from actually writing code, but on the other hand you'll have to do far more reading, planning, and making of architectural decisions. This is what engineering should feel like.
If you want an answer to the OP question, just ask AI to analyze the session jsonl files in your user directory and give you statistics of what's in there.
You'll find that at least half of it is noise.
If you put that in commits, you lose the ability to add "study git commits to ground yourself" in your agents.md or prompts. Because now you'll have 50%+ noise in your active session's context window.
Context window is precious. Guard it however you can.
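A minimal sketch of the suggested analysis, assuming the session logs are JSONL files where each entry has a "type" field (the actual directory layout and field names vary by tool, so adjust to whatever your agent writes):

```python
import json
from collections import Counter
from pathlib import Path

def session_stats(log_dir: str) -> Counter:
    """Count entry types across all *.jsonl session logs in a directory."""
    counts = Counter()
    for path in Path(log_dir).glob("*.jsonl"):
        for line in path.read_text().splitlines():
            if not line.strip():
                continue  # skip blank lines between records
            entry = json.loads(line)
            counts[entry.get("type", "unknown")] += 1
    return counts
```

Printing `session_stats("~/.your-agent/sessions")` (path hypothetical) gives a quick signal/noise breakdown: how many entries are actual messages versus tool invocations, retries, and other bookkeeping.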
https://contextarena.ai/?showLabels=false