AltMaarri [they/them]

  • 2 Posts
  • 43 Comments
Joined 3 months ago
cake
Cake day: March 13th, 2026

help-circle

  • You could argue in the same way that car safety technologies are “bandaid after bandaid to try and compensate for the fact that cars are fundamentally huge hunks of metal going really fast”.

    The parallel doesn’t really work (there is nothing fundamentally incompatible between the purpose of a car and the basic design idea); but to try and answer, this would be a viable answer if the huge “AI labs” admitted the tech doesn’t work and were collectively working on entirely new architectures (likely ones that include actual symbolic representation of the world, not just statistical links between words).

    But they’re not, they’re using the same fundamental approach and doing ridiculous shit around that approach - ultimately a genuine, and interesting, NLP breakthrough - to try and sell it as what it’s not and will never be.

    you still need to do quite a bit of handholding and double checking it gets stuff right but it definitely can get medium-high complexity stuff right and overall it is a productivity boost.

    See, that’s one part I disagree about. First as a direct productivity boost; I realize you probably think it’s making you more productive; but that’s something you have to demonstrate, at scale. And the few genuine studies I’ve seen suggest the effect, if it exists, is marginal at best; at least two I remember suggest it decreases productivity (and at least one, possibly both, can’t remember, said self-reporting from devs was that it increased it despite the effect).

    More importantly, though: very little of the code you produce this way will be maintainable. Very little will also be secure, too (both “direct” vulns - they’re not actually good at finding them, FYI - and more fundamental logic vulnerabilities - they’ll never find these at all outside of trivial cases). If you want a concrete example of that, just look at the leaked Claude harness code: it’s genuinely pathetic. They’re vibe coding it all and it shows.

    Also, I notice you say “you still need to…”; do you think this will eventually get fixed ? it’s been nine years. There’s a point where “it’ll get better” is starting to become more of a meme than an answer. It was getting better, then a lot less, then lately almost not at all, despite greater and greater cost increases.

    Shit, lately I’ve even started to wonder if they’re even training new models anymore; I’ve been wondering if they’re just not shipping the same with a different harness / different bullshit surrounding it. It’s not like they can improve on the models themselves anyway (not without at least one, likely several, genuine fundamental breakthrough) - they’re all out of human-produced data by now.



  • Fair enough, if it works for you and you’re aware of the downsides (which you seem to be, contrary to most users); two things I haven’t even mentioned though are the power/environmental costs (obviously) but also the potential cognitive impact.

    You say it doesn’t change much compared to when you were parsing the reddit comments yourself; doesn’t it ? how sure are you “wasting” time parsing these comments wasn’t exercising an important muscle mentally for you (getting the jist of a text rapidly - excluding braindead content quickly, etc.) ?

    Here the example is pretty ridiculous - I doubt your mental faculties depend much on parsing reddit comments - but you get the idea. It’s very early to tell but several papers now suggest the negative cognitive impact is very real and potentially very fast.


  • China isn’t counting the same numbers i tell you that

    That’s true; the very quality of results you can obtain with some recent models locally suggests the possible optimizations are huge. But it’s also diminishing returns: the moat between state of the art models and one I run on a P40 locally is really small these days; the amounts they need to sink to get even slightly better results at this stage are more and more. They’re text generators; yes, if you run ten of them in parallel, vote; have them cross-check their results, introduce harnesses at every steps, etc. (all examples of actual bandaids I evoked above) you can improve results. But all of this is trying to make a tech that fundamentally doesn’t answer the problem answer it nevertheless. And all of it costs a lot.

    And I dearly hope China is not spending too much effort on backing these domestic initiatives, because again: outside of a few limited use cases (easily identifiable, I’ve listed a few in another comment in this very thread: those where having no relation to the truth in a portion of the generated text is acceptable), the tech doesn’t work.

    It’s incredibly useful for letting humans interact with a computer system in natural language and maybe you shouldn’t take from that “hey we should have it do highly sensitive stuff where the slightest error could have great consequences”

    There are use cases where having the computer completely invent actions or do shit randomly isn’t that bad, I guess; games come to mind. But ultimately and more generally I disagree, it’s shit for that too. Try one as a daily runner, just for laughs. Or just try an entire shell session where instead of typing the commands, you complete a description of what you want to do through one and then do it. It goes bad very fast, let me tell you.

    And I get it: what you describe would be awesome - a SF dream. I like tech and I wish all of this would work; and like many initially I genuinely wondered as well if scaling/attention was all you needed, and had some measure of hope; quickly dashed, though.

    it does a great job of summarizing reddit threads

    Again, no it does not. You have no guarantee it won’t pull shit out of its virtual ass; and just as crucially (and even more likely), no guarantee it won’t ignore significant parts of the source material. You’ll get a result that seem like it summarizes the thread, with no confidence level, and no guarantee of its reliability.


  • We don’t think it has remained static, we realize the “AI companies” have been implementing bandaid after bandaid to try and compensate for fundamental lacks in the technology itself, and that as a consequence results are better now (for like 1000x the cost but who’s counting - well we all will, soon, but you get the point).

    But this is still a credible text generator, ultimately close to a larger, N-dimensional markov chain. It still “hallucinates”. It’s still shit. It’s still almost entirely useless for coding (you know, if you care about long or even medium-term maintainability or security), never mind the other use cases they’re trying to sell it for. It does not work.

    And if it did (it doesn’t !), the aforementioned bandaids would have made the cost impossible to justify anyway.


  • The maximalist “anything using any kind of neural network math is exactly the same and is equally bad in all cases” position is nonsensical

    Does that position exist outside of twitter, though ?

    When GenAI-critical people say “AI”, they almost universally mean LLMs (and sometimes also diffusion models / multimodal ones). And in fact they usually mean “LLMs as sold by the VCs”. LLMs themselves do have a few legitimate use cases (for perhaps 0.1% of the target population they’re currently being sold to, but still).

    They do not mean machine learning in general - which is awesome, extremely useful, and has been in widespread use for decades now. Including traditional neural-network-based classifiers.






  • The few use cases for which they actually kinda work:

    • (not very high quality) Entertainment
    • Pre-processing large datasets used for training traditional ML models (I’m thinking specifically of the img2text models here), for applications that aren’t too outside of their training set.
    • Large-scale automated disinformation

    The first two can be fully performed locally for negligible costs; the same is mostly true also for the third one, which is immoral on top of it.

    The main ongoing use case/need they currently answer though is that of the capitalists: pretending to have discovered a new Internet in terms of economic growth (the VC morons still don’t seem to get that the web was a one-time thing) to make a new bubble and try and make the whole disgusting circus continue a little longer.

    Previous attempts - none of them as successful, many of them a lot less: ‘web 3.0’, ‘big data’, ‘smart cities’, ‘crypto/NFTs’, and ‘the metaverse’. Likely next one once the AI bubble pops (assuming the world economy doesn’t collapse with it): ‘quantum computing’ - the quick ones among the shitheads have already started drumming that beat; and once we’re in a new AI winter I also fully expect them to connect the two - ie, ‘the only reason we didn’t get AGI is the lack of quantum computing’.

    LLMs are a lot more successful than previous attempts specifically due to the appearance of working; if you don’t look at it too closely / are not even slightly well-versed in the tech itself you could believe what the people selling them are saying. It’s the perfect grift vector.




  • I have no idea what “Infrared” is but contextually and based on your comment I imagine some sort of antizionist youtuber. Not so with Dieudonné; he is antisemitic, it’s not even a debate.

    He did shows with Faurisson, an actual holocaust denier that pretends that gas chambers didn’t exist. He had, in one of his shows, the same Faurisson receiving an “insolence award” given out by Dieudonné’s assistant dressed as a death camp inmate. And that’s only the top of the rotten iceberg.