Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is really impressive! Can you please elaborate more on the way you labeled the data? I think usually there is a lot to learn from labeling methods.


I generated training comments by matching book names. Roughly, there are one in five of those comments that actually have a book mention. Then I use the Doccano labeling tool to label the tokens in the comments.


Thanks! I will check out Doccano now




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: