Captioning photos with KI III
Having captioned lots of images with BLIB, CLIP I was not happy. But I stumbled onto a GEM
The GEM was
https://nednex.com/en/image-to-text-models-clip-blip-and-wd14/
It discusses the different img2text models and gave me a pointer to Waifu Diffusion 1.4.
Waifu Diffusion 1.4. is trained on Mangas.
The captions from WD14 are quite detailed but expressed as an list of attributes in contrast to formulated sentences from BLIB/CLIP.
In the end I will use BLIB, CLIP and WD14 for the caption of my images.