6/5/2026
at
9:53:37 PM
I kind of disagree, because describing an image is more like writing an svg than rendering pixels.If I ask you to describe how would you draw a cat head, you could do it in text like: "a big circle as the head, 2 small circles as the eyes, 2 triangles for ears, 3 lines on each side of the mouth as moustaches, etc..."
by XCSme
6/5/2026
at
11:37:54 PM
Hmm, sure. I'm still surprised - it also has to say where they are in coordinate space. It feels like the way genAI works ("what comes next") is not amenable to this use case (demonstrably I'm wrong, of course).
by happytoexplain
6/5/2026
at
11:48:25 PM
I remember when I was learning WebGL/OpenGL and had to draw some test shapes to test my shader, I would manually think what the vertex positions should be to draw a triangle, a pyramid, etc. I think for AIs it's quite easy, because most are quite decent at math, and have been trained on many geometry problems, and probably also a lot of OpenGL code too, and 3D assets.
by XCSme