malshe on Sept 20, 2021 | on: Show HN: 40k HN comments mentioning books, extract...

This is really impressive! Could you elaborate on how you labeled the data? There is usually a lot to learn from labeling methods.

tracyhenry on Sept 21, 2021

I generated candidate training comments by matching book names. Roughly one in five of those comments actually mentions a book. I then used the Doccano labeling tool to label the tokens in those comments.
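
The candidate-generation step described above could look roughly like the sketch below. This is not the author's actual code: the function names, the sample titles, and the sample comments are all hypothetical, and the JSON Lines output with a "text" field is an assumption about what the labeling tool will import, so check the Doccano documentation before relying on it.

```python
import json
import re


def find_candidate_comments(comments, book_titles):
    """Return (comment, matched_titles) pairs for comments containing a known title.

    Hypothetical helper: plain case-insensitive title matching, nothing more.
    """
    # One compiled pattern per title; the lookarounds keep "Dune" from
    # matching inside a longer word such as "Dunes".
    patterns = {
        title: re.compile(r"(?<!\w)" + re.escape(title) + r"(?!\w)", re.IGNORECASE)
        for title in book_titles
    }
    candidates = []
    for text in comments:
        matched = [title for title, pattern in patterns.items() if pattern.search(text)]
        if matched:
            candidates.append((text, matched))
    return candidates


def to_jsonl(candidates):
    """Serialize candidates as JSON Lines, one {"text": ...} object per line.

    Assumption: the labeling tool accepts JSONL records with a "text" field.
    """
    return "\n".join(json.dumps({"text": text}) for text, _ in candidates)


# Made-up sample data for illustration only.
book_titles = ["Dune", "The Pragmatic Programmer"]
comments = [
    "I reread Dune every few years.",
    "The sand dune behind our house keeps moving.",
    "Nothing about books here.",
]

candidates = find_candidate_comments(comments, book_titles)
jsonl = to_jsonl(candidates)
```

Note that the second sample comment matches "Dune" without being about the book. Title matching only produces candidates, which is consistent with only about one in five matched comments containing a real book mention, and is why the token-level labeling pass is still needed.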

malshe on Sept 22, 2021

Thanks! I will check out Doccano now.
