Using AI for image transcripts, yay or nay?

Gonzako@lemmy.world · 2 days ago

Using AI for image transcripts, yay or nay?

Lumidaub@feddit.org · 2 days ago

If you can get an AI to produce an actually useful description, that would be extremely interesting. However, AIs don’t know what’s important about an image and will fill up the description with useless information, effectively spam for the person that needs a description.

Write just a sentence, describe the thing that is important, while keeping in mind why you’re even posting the image, and it’s going to take less time than asking the AI.

Frank Heijkamp@mastodontech.de · 2 days ago

@Lumidaub
Writing a short description will be faster and more accurate.

It will tale less time than checking and correcting the output of #ai.
@Gonzako

Gonzako@lemmy.world · 2 days ago

So you posted this from mastodon? Is @Lumidaub your tag there?

Lumidaub@feddit.org · 2 days ago

“@Lumidaub” is a reference to me. The system added that because they were, technically, replying to my comment here.

Gonzako@lemmy.world · 2 days ago

Gotcha, these look so full of links on my client

Lumidaub@feddit.org · 2 days ago

Yep, same, it’s a bit of a weakness of the Fediverse imho.

HappyFrog@lemmy.blahaj.zone · 2 days ago

For those that need it, any description is better than none.

Lumidaub@feddit.org · 2 days ago

True and one sentence written by a human who understands the image is better than twenty sentences by a word prediction machine.

HappyFrog@lemmy.blahaj.zone · 2 days ago

No matter how good human written descriptions are, people just won’t do them. So having a automated system is much more preferable.

Lumidaub@feddit.org · 2 days ago

I know what you’re saying but I truly think for most people it’s simply that they’re overthinking it. They think every single thing needs to be in the description, with references explained and sourced and whatnot. That does sound exhausting. And I have written a handful of descriptions like that for pictures where I thought the details were interesting enough to justify the effort. But really, a simple “The thirteenth Doctor and Rose Tyler embracing and deeply kissing” is already very sufficient in most cases (add “standing on an asteroid in front of a field of glittering stars - digital colour painting” if you have the spoons). So imho it’s better to educate them and encourage short, concise descriptions than to give in to the slop.

x74sys@programming.dev · 2 days ago

Yeah, apart from the fact that I imagine that people who need alt text don’t appreciate LLM output. It‘s very boring. It’s either extremely technical and ice-cold or so cringe that you have to stop reading. Just what I think.

At least for me, if I realize that I’m reading an AI blog article or AI generated text in some other form, I don’t read it.