
Using external memory, instead of encoding all of the knowledge in the model's weights, will take over all branches of applied ML.

A recognition model should use a similar mechanism: a memory buffer that stores short-term context from previous frames, plus a large external database of long-term key-value pairs that retain the relevant semantic information for a given embedding.

Doing so would make it possible to update and expand models without retraining, and would enable much better zero/few-shot learning.
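A minimal sketch of what that lookup could look like, in numpy. The memory layout (key embeddings paired with value vectors) and the soft top-k fusion are assumptions for illustration, not a specific published design:

    import numpy as np

    rng = np.random.default_rng(0)

    # Hypothetical external memory: key embeddings paired with value vectors
    # that carry the semantic payload for each key.
    keys = rng.normal(size=(10_000, 128)).astype(np.float32)
    values = rng.normal(size=(10_000, 128)).astype(np.float32)

    def memory_lookup(query, k=8):
        """Soft attention over the top-k entries of the external memory."""
        scores = keys @ query                    # dot-product similarity
        top = np.argpartition(scores, -k)[-k:]   # indices of the k best keys
        w = np.exp(scores[top] - scores[top].max())
        w /= w.sum()                             # softmax over retrieved entries
        return w @ values[top]                   # blended value vector

    frame_embedding = rng.normal(size=128).astype(np.float32)
    context = memory_lookup(frame_embedding)     # fuse downstream with model features

Because the keys/values live outside the model, you can add or overwrite entries at serving time without touching the weights.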

We already have a hacky version of this in our production app for food recognition. For new users we use a standard CNN to predict the items present in the image; once a user logs a few meals, we use nearest-neighbor search to match new images against their previously submitted entries, which works extremely well.
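Roughly, the fallback logic looks like this (a simplified sketch, not our actual code; cnn_classify and labels_for are stand-ins for our classifier and label store, and the threshold is illustrative):

    import numpy as np

    def predict_food(image_embedding, user_history, labels_for, threshold=0.8):
        """Match a new meal photo against the user's previously logged entries,
        falling back to the generic CNN classifier for cold-start users."""
        if len(user_history) > 0:
            # Cosine similarity between the new embedding and each logged meal.
            q = image_embedding / np.linalg.norm(image_embedding)
            h = user_history / np.linalg.norm(user_history, axis=1, keepdims=True)
            sims = h @ q
            best = int(np.argmax(sims))
            if sims[best] >= threshold:
                return labels_for(best)       # reuse the user's earlier label
        return cnn_classify(image_embedding)  # hypothetical standard CNN prediction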



Yes! I have long thought that GPT-type models are huge because they are forced to encode a lot of raw knowledge; giving them the ability to search a database for that knowledge solves the problem, which should help make them smaller while scaling to larger datasets.

The cherry on top is that you could get not only information from the model but also the sources it used to make up its mind!
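As a toy illustration of retrieval with provenance (the corpus, embeddings, and scoring here are all made up):

    import numpy as np

    rng = np.random.default_rng(0)

    # Toy knowledge base: passages paired with the source they came from.
    passages = [
        ("Paris is the capital of France.", "wikipedia.org/wiki/Paris"),
        ("The Eiffel Tower opened in 1889.", "wikipedia.org/wiki/Eiffel_Tower"),
    ]
    # Stand-in embeddings; a real system would use a trained text encoder.
    doc_vecs = rng.normal(size=(len(passages), 64)).astype(np.float32)

    def retrieve(query_vec, k=1):
        """Top-k passages by similarity, returned with their sources."""
        sims = doc_vecs @ query_vec
        top = np.argsort(sims)[::-1][:k]
        return [passages[i] for i in top]

    hits = retrieve(rng.normal(size=64).astype(np.float32))
    # The generator conditions on hits[0][0] and can cite hits[0][1] as the source.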


Do you apply the NN search to the raw images themselves or to the latent vectors from the CNN?


Image embeddings. NN on raw pixels would scale horribly and wouldn't return anything relevant.
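One common way to get such an embedding is to take the CNN's penultimate-layer activations; a sketch with torchvision (ResNet-18 is an illustrative choice, not necessarily what we run):

    import torch
    import torchvision.models as models

    # Pretrained CNN with its classification head removed, so the
    # output is the pooled feature vector rather than class logits.
    backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
    backbone.fc = torch.nn.Identity()       # drop the classifier head
    backbone.eval()

    with torch.no_grad():
        x = torch.rand(1, 3, 224, 224)      # a preprocessed image tensor
        embedding = backbone(x).squeeze(0)  # 512-d vector for NN search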



