For those that have homebrewed a base model, does your output have the same AI-i... | Hacker News


pwython 31 days ago | on: LLM from scratch, part 28 – training a base model ...

For those that have homebrewed a base model, does your output have the same AI-isms like overusing em dashes? If so/not, what dataset did you use?

itissid 31 days ago

Does yours also use the Oxford comma, and more commas in general?

miki123211 31 days ago

AFAIK, those are mostly a consequence of post-training.

whimsicalism 31 days ago

that is a post-training artifact
