Hi!

While I really enjoy seeing many of my fellow man being accommodating to people with disabilities. I find manually transcribing every image I post to be very tiring.

I thought that I could at least use some sort of AI to help with image transcripts, tho, that could probably be better used by the actual person with the disability.

So thats the question, should I skip the transcribing of an image or let an AI do it?

  • Lumidaub@feddit.org
    link
    fedilink
    English
    arrow-up
    28
    arrow-down
    5
    ·
    2 days ago

    If you can get an AI to produce an actually useful description, that would be extremely interesting. However, AIs don’t know what’s important about an image and will fill up the description with useless information, effectively spam for the person that needs a description.

    Write just a sentence, describe the thing that is important, while keeping in mind why you’re even posting the image, and it’s going to take less time than asking the AI.

      • Lumidaub@feddit.org
        link
        fedilink
        English
        arrow-up
        15
        arrow-down
        3
        ·
        2 days ago

        True and one sentence written by a human who understands the image is better than twenty sentences by a word prediction machine.

        • HappyFrog@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          14
          arrow-down
          4
          ·
          2 days ago

          No matter how good human written descriptions are, people just won’t do them. So having a automated system is much more preferable.

          • Lumidaub@feddit.org
            link
            fedilink
            English
            arrow-up
            7
            arrow-down
            3
            ·
            2 days ago

            I know what you’re saying but I truly think for most people it’s simply that they’re overthinking it. They think every single thing needs to be in the description, with references explained and sourced and whatnot. That does sound exhausting. And I have written a handful of descriptions like that for pictures where I thought the details were interesting enough to justify the effort. But really, a simple “The thirteenth Doctor and Rose Tyler embracing and deeply kissing” is already very sufficient in most cases (add “standing on an asteroid in front of a field of glittering stars - digital colour painting” if you have the spoons). So imho it’s better to educate them and encourage short, concise descriptions than to give in to the slop.

        • x74sys@programming.dev
          link
          fedilink
          English
          arrow-up
          7
          arrow-down
          3
          ·
          2 days ago

          Yeah, apart from the fact that I imagine that people who need alt text don’t appreciate LLM output. It‘s very boring. It’s either extremely technical and ice-cold or so cringe that you have to stop reading. Just what I think.

          At least for me, if I realize that I’m reading an AI blog article or AI generated text in some other form, I don’t read it.