Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is it possible to prompt this model with two or more texts for each image and get masks for each? Something like this inputs = processor(images=images, text=["cat", "dog"], return_tensors="pt").to(device)?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: