One shot image resynthesis experiments using the cover art from the WMF cd "Margaritaville" as the input image (no text prompting). Above is using the new Versatile Diffusion model, which is an integrated multi-modal model that is indeed quite versatile. Above is using it in image to image mode.
Below is using it in an alternate image to text embedding to image mode. I tried to negate the cars and trucks in the intermediate text embedding space, which was a big fail. However, it did act to break the image resynthesis out of the tight hold it has on the input in an interesting way. So a useful knob in some respects that is an interesting artistic tool.
No comments:
Post a Comment