Any suggestions for a simple tool to set up your own local evals?

dimava · 2025-12-18T13:52:44 1766065964

Just ask an LLM to write one on top of OpenRouter, AI SDK and Bun, to take your .md input file and save the outputs as md files (or whatever you need). Take https://github.com/T3-Content/auto-draftify as an example.
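For a rough idea of how small such a script can be, here is a minimal sketch. It is not taken from auto-draftify, and it uses plain `fetch` against OpenRouter's OpenAI-compatible chat completions endpoint rather than the AI SDK. The names `runEvals`, `outputName`, and `Complete`, the directory layout, and the output file naming are all made up for illustration. It assumes an `OPENROUTER_API_KEY` environment variable.

```typescript
import { mkdirSync, readFileSync, readdirSync, writeFileSync } from "node:fs";
import { basename, join } from "node:path";

// Hypothetical type: sends one prompt to one model and resolves to the reply text.
type Complete = (model: string, prompt: string) => Promise<string>;

// Calls OpenRouter's OpenAI-compatible chat completions endpoint.
// Assumption: the API key lives in the OPENROUTER_API_KEY environment variable.
export const openRouter: Complete = async (model, prompt) => {
  const res = await fetch("https://openrouter.ai/api/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${process.env.OPENROUTER_API_KEY}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model, messages: [{ role: "user", content: prompt }] }),
  });
  if (!res.ok) throw new Error(`OpenRouter returned ${res.status}`);
  const data = (await res.json()) as { choices: { message: { content: string } }[] };
  return data.choices[0].message.content;
};

// Builds a file-system-safe output name from the model ID and the prompt file name.
export function outputName(model: string, promptFile: string): string {
  const safeModel = model.replace(/[^A-Za-z0-9._-]/g, "_");
  return `${basename(promptFile, ".md")}__${safeModel}.md`;
}

// Runs every .md prompt in promptDir against every model and writes one
// markdown file per (prompt, model) pair into outDir. Returns the written paths.
export async function runEvals(
  promptDir: string,
  outDir: string,
  models: string[],
  complete: Complete = openRouter,
): Promise<string[]> {
  mkdirSync(outDir, { recursive: true });
  const written: string[] = [];
  const promptFiles = readdirSync(promptDir).filter((f) => f.endsWith(".md")).sort();
  for (const file of promptFiles) {
    const prompt = readFileSync(join(promptDir, file), "utf8");
    for (const model of models) {
      const reply = await complete(model, prompt);
      const outPath = join(outDir, outputName(model, file));
      writeFileSync(outPath, `# ${file} / ${model}\n\n${reply}\n`);
      written.push(outPath);
    }
  }
  return written;
}
```

With Bun you could then call something like `await runEvals("prompts", "outputs", ["some/model-id"])` at the bottom of the file, where the model ID is a placeholder for whatever OpenRouter model you want to compare. Passing `complete` as a parameter means you can swap in the AI SDK, or a stub for a dry run, without touching the loop.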

theshrike79 · 2025-12-18T11:00:22 1766055622

My "tool" is just prompts saved in a text file that I feed to new models by hand. I haven't built a bespoke framework on top of it.

...yet. Crap, do I need to now? =)

ggsp · 2025-12-18T12:34:57 1766061297

Yeah, I’ve wondered about the same myself… My evals are also a pile of text snippets, as are some of my workflows. Thought I’d have a look to see what’s out there and found Promptfoo and Inspect AI. Haven’t tried either, but will for my next round of evals.

kedihacker · 2025-12-18T14:04:20 1766066660

Well, you need to stop them from getting incorporated into the models' training data

lobsterthief · 2025-12-18T12:12:23 1766059943

_Brain backlog project #77 created_