Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For $50/month, it's a non-starter. I hope they can find a way to use all this excess bandwidth to put out a $10 equivalent to Claude Code instead of a 1000 tok/s party trick I can't use properly.


I feel the same and it's also why I can't understand all these people using small local models.

Every local model I've used and even most open source are just not good


the only good-enough model I still use it gpt-oss-120b-mxfp4 (not 20b) and glm-4.6 at q8 (not q4).

quantization ruins models and some models aren't that smart to begin with.


GLM-4.6 is on par with Sonnet 4.5. Sometimes it is better, sometimes it is worse. Give it a shot. It's the only model that made me (almost) ditch Claude. The only problem is, Claude Code is still the best agentic program in town and search doesn't function without a proper subscription.


Have you tried Claude Code Router with GLM 4.6?

https://github.com/musistudio/claude-code-router


z.ai hosted GLM 4.6 works great with claude code, drops right in


Have you tried opencode?


Cerebras offers pay-per-token. What are you asking for? Claude Code starts at $100, or $15/mtok. Cerebras is already much cheaper, but you want it to be even cheaper at $10?


$600 per year is a trivial cost for a professional tool


$600 per anything is Herman Miller territory, pal. I'm not paying that for a SaaS.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: