Various nodes interacting with or running LLM models are used by AI artists to enhance prompts during image and video generation. This page collections information on some of them.
btw, I fired Josie as my prompt editor, and hired Mistral 7b Q4 running on
ComfyUI_Searge_LLM
Apparently unlike LMStudio, VLLM can do parallel queries.
gemma-4-26b-a4b-it-heretic.q4_k_m.ggufsupergemma4-26b-uncensored-fast-v2-Q4_K_M.ggufgemma-4-26B-A4B-it-UD-IQ2_M.ggufgemma-4-...-mmproj.. to enable Gemma to “see” imagesStef:
I used both Qwen3.6 and Gemma4. Both are equally good at describing the reference images, I had the impression Gemma4 folllows specific instructions a bit better (as “mention the reference images always as ‘from image0’, ‘from image1’ etc. Qwen 3.6 sometimes forgot that)