chiragrohit's comments

chiragrohit · 2026-01-26T04:41:10 1769402470

How many tokens are you burning daily?

storystarling · 2026-01-26T09:26:53 1769419613

The real cost driver with agents seems to be repeated context transmission, since you re-send the full history on every step. I found I had to implement tiered model routing or prompt caching just to make the unit economics work.
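
To make the arithmetic concrete, here is a rough sketch of how input tokens add up when the whole history is re-sent on each step, and how a cached-prefix discount would change the bill. All numbers (1,000 system tokens, 500 tokens added per step, 20 steps, cached tokens billed at 10% of the normal rate) are made-up assumptions for illustration, not any provider's actual pricing, and the function names are hypothetical.

```python
def total_input_tokens(system_tokens: int, tokens_per_step: int, steps: int) -> int:
    """Input tokens billed when the full history is re-sent on every step."""
    total = 0
    history = system_tokens
    for _ in range(steps):
        total += history            # the whole context so far is sent again
        history += tokens_per_step  # this step's output joins the history
    return total


def effective_tokens_with_cache(
    system_tokens: int,
    tokens_per_step: int,
    steps: int,
    cached_rate: float = 0.1,  # assumed discount; real rates vary by provider
) -> float:
    """Cost-equivalent input tokens if the previously sent prefix is cached.

    Simplification: ignores cache-write surcharges and cache expiry.
    """
    total = 0.0
    cached_prefix = 0            # nothing is cached before the first call
    new_tokens = system_tokens   # the first call sends only the system prompt
    for _ in range(steps):
        total += cached_prefix * cached_rate + new_tokens
        cached_prefix += new_tokens   # what was just sent is cached for next step
        new_tokens = tokens_per_step  # later calls add only the newest turn
    return total


if __name__ == "__main__":
    naive = total_input_tokens(1000, 500, 20)
    cached = effective_tokens_with_cache(1000, 500, 20)
    print(f"naive: {naive} tokens, with caching: {cached:.0f} token-equivalents")
```

The naive total grows roughly with the square of the step count, which is why long agent runs get expensive even when each individual step is small.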

gls2ro · 2026-01-26T05:01:05 1769403665

Not the OP, but I think for scanning and tagging/summarization you can run a local LLM, and it will be accurate enough for this use case.