Hacker Newsnew | past | comments | ask | show | jobs | submit | guivr's commentslogin

In the field of personalized image generation, the ability to create images preserving concepts has significantly improved. Creating an image that naturally integrates multiple concepts in a cohesive and visually appealing composition can indeed be challenging. This paper introduces "InstantFamily," an approach that employs a novel masked cross-attention mechanism and a multimodal embedding stack to achieve zero-shot multi-ID image generation. Our method effectively preserves ID as it utilizes global and local features from a pre-trained face recognition model integrated with text conditions. Additionally, our masked cross-attention mechanism enables the precise control of multi-ID and composition in the generated images. We demonstrate the effectiveness of InstantFamily through experiments showing its dominance in generating images with multi-ID, while resolving well-known multi-ID generation problems. Additionally, our model achieves state-of-the-art performance in both single-ID and multi-ID preservation. Furthermore, our model exhibits remarkable scalability with a greater number of ID preservation than it was originally trained with.


Good point. Indeed looks like only a few have. I'll try to add focus styles to all of them in the future. Thanks!


Thanks, merek. Happy it was helpful to you :D


Thanks swyx <3

Nice, yes, I plan to add "copy to react" feature soon. Will let you know when it's ready


Thanks for the heads up


No JS, all made with pure CSS :D (you can copy the code to see how it works)


Thanks @rrishi!!


Awesome, thanks for buying it! I'm glad it's useful to you


Thanks a lot @dmje, really appreciate all your support!


Yess, it's on the roadmap! :D


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: