Friday Fun Thread for December 8, 2023

Be advised: this thread is not for serious in-depth discussion of weighty topics (we have a link for that), this thread is not for anything Culture War related. This thread is for Fun. You got jokes? Share 'em. You got silly questions? Ask 'em.


Can't beat the cost and convenience for a good-enough image!

I'm sure we'll get to a spot in 2 or 3 years where this gets a lot better. I do have Stable Diffusion but it's slow and hard to wrestle with. I do use Dall-E 3 for work but it's not a large part of what I do. Let's say I generate 2-3 images a day.

My whine here is specifically about the stylistic awfulness of Dall-E 3 images, which I now see cropping up everywhere. Prompt-hacking doesn't work. I try stuff like this: "Simple, not complex, no extra characters, restrained, not saturated", but it doesn't seem to really give me what I want.

which I now see cropping up everywhere

I haven't noticed it -- do you have an example?

"Simple, not complex, no extra characters, restrained, not saturated"

Maybe it has trouble with negatives? I wonder if it would respond to directions about specific color palettes (yellow ochre, Paynes grey, cadmium red?), where to place the focal point, or name dropping Rembrandt?

I haven't noticed it -- do you have an example?

Sure. Here are some examples from a blog that was posted to the slatestarcodex subreddit.

1, 2, 3

Once you recognize the "style" you see it everywhere. The main thing is that they are just way too busy.

I invite you to show me anything that makes all these images I've generated samey.

You're prompting it wrong.

The common link is that they all have far too many unnecessary elements that detract from the image. I will grant that only image #2 looks like a 100% match for the Dall-E 3 archetype.

What do you mean by "unnecessary things"?

They're precisely what I asked for, within the limits of my prompting and the model. Without knowing the prompts, I have no idea what you think they're missing.

At the very least the last one is a minimal brutalist logo for a PMC, I can hardly imagine what could be less so.

Maybe you just don't have an eye for this stuff. It seems really obvious to me how these images are cluttered.

Maybe you're not explaining yourself very well.

1 - looks video game inspired. I would expect it to have about that level of detail; it makes sense for the genre. It looks like it's trying to put in slums and high-rises by the explosion, and it seems likely the prompt asked for both slums and high-rises. In which case, I can't think of a less cluttered way to show them.

2 - The monitors are presumably part of the prompt. There seems to be a bit of office chair and some extra table legs and some lights intersecting on the ceiling back there, which, yeah, probably would be better left out.

3 - Do you mean you don't like the jagged shapes breaking up the main image? That's clearly an intentional stylistic choice.
