New accessibility feature coming to Firefox, an “AI powered” alt-text generator.


"Starting in Firefox 130, we will automatically generate an alt text and let the user validate it. So every time an image is added, we get an array of pixels we pass to the ML engine and a few seconds after, we get a string corresponding to a description of this image (see the code).

Our alt text generator is far from perfect, but we want to take an iterative approach and improve it in the open.

We are currently working on improving the image-to-text datasets and model with what we’ve described in this blog post…"

  • jherazob@beehaw.org
    link
    fedilink
    English
    arrow-up
    10
    ·
    6 months ago

    Now i want this standalone in a commandline binary, take an image and give me a single phrase description (gut feeling says this already exists but depending on Teh Cloudz and OpenAI, not fully local on-device for non-GPU-powered computers)

      • jherazob@beehaw.org
        link
        fedilink
        English
        arrow-up
        4
        ·
        5 months ago

        So, it’s possible to build but no one has made it yet? Because i have negative interest in messing with that kinda tech, and would rather just “apt-get install whatever-image-describing-gizmo” so i wouldn’t be the one who does it

        • Swedneck@discuss.tchncs.de
          link
          fedilink
          arrow-up
          3
          ·
          5 months ago

          this is how i feel about basically all technology nowadays, it’s all so artificially limited by capitalism.

          nothing fucking progresses unless someone figures out a way to monetize it or an autistic furry decides to revolutionize things in a weekend because they were bored and inventing god was almost stimulating enough

        • The Doctor@beehaw.org
          link
          fedilink
          English
          arrow-up
          1
          ·
          5 months ago

          Folks have made it - I think ollama was name-checked specifically because it’s on Github and in Homebrew and in some distros’ package repositories (it’s definitely in Arch’s). I think some folks (at least) aren’t talking about it because of the general hate-on folks have for LLMs these days.