Captioning photos with KI III

Having captioned lots of images with BLIB, CLIP I was not happy. But I stumbled onto a GEM

The GEM was

It discusses the different img2text models and gave me a pointer to Waifu Diffusion 1.4.

Waifu Diffusion 1.4. is trained on Mangas.

The captions from WD14 are quite detailed but expressed as an list of attributes in contrast to formulated sentences from BLIB/CLIP.

In the end I will use BLIB, CLIP and WD14 for the caption of my images.