Thanks for sharing. Are some local models better than others? Can small models work well, or do you want 8B+?


So in my experience smaller models tend to produce worse results, BUT I actually got really good transcription cleanup with chain-of-thought (CoT) models like Qwen, even quantized down to 8b.
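Roughly what I mean, as a sketch using the ollama Python client - the model tag, prompt, and sample text here are just illustrative, not my exact setup:

    import ollama  # pip install ollama; assumes a local ollama server is running

    raw = "um so basically the uh the deploy failed because because the token expired"

    # Ask the model to clean up the transcript without changing its meaning.
    response = ollama.chat(
        model="qwen3:8b",  # example tag; any local CoT-capable model works
        messages=[
            {"role": "system",
             "content": "Clean up this speech transcript: remove filler words, "
                        "fix punctuation, and keep the wording otherwise unchanged."},
            {"role": "user", "content": raw},
        ],
    )
    print(response["message"]["content"])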


I think the 8B+ question was about parameter count (8 billion+ parameters), not quantization level (8 bits per weight).


Yeah, I should have been more specific - Qwen 8B at a Q5_K_M quant worked very well.
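If you want to try the same thing, here's a minimal sketch with llama-cpp-python. The GGUF filename and prompt are placeholders; point it at whichever Qwen 8B Q5_K_M build you downloaded:

    from llama_cpp import Llama  # pip install llama-cpp-python

    # Example path; substitute your own Qwen 8B GGUF at the Q5_K_M quant.
    llm = Llama(model_path="./Qwen3-8B-Q5_K_M.gguf", n_ctx=8192)

    result = llm.create_chat_completion(
        messages=[
            {"role": "system",
             "content": "You clean up speech-to-text transcripts. Remove filler "
                        "words and fix punctuation; do not change the meaning."},
            {"role": "user",
             "content": "so yeah the uh meeting got moved to to thursday"},
        ],
        temperature=0.2,  # low temperature keeps the cleanup close to the input
    )
    print(result["choices"][0]["message"]["content"])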



