OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

L4sBot@lemmy.world · 10 months ago

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series

bachalxyz@lemmy.world · 10 months ago

How are they going to prove if something was written by an AI?

stevedidWHAT@lemmy.world · 10 months ago

It’s a complicated answer I’m unqualified to answer but essentially there exists some sort of baseline either for people or for how gpt responds usually and then they can figure out which way the text “leans”

redw04@lemmy.ca · 10 months ago

There’s not how it works lol

stevedidWHAT@lemmy.world · 10 months ago

https://medium.com/@rxtang/the-science-of-detecting-llm-generated-texts-e816a14c18d

But yes it is tho ;)

“Existing detection methods can be roughly grouped into two categories: black-box detection and white-box detection, black-box detection methods are limited to API-level access to LLMs. They rely on collecting text samples from human and machine sources, respectively, to train a classification model that can be used to discriminate between LLM- and human-generated texts. An alternative is white-box detection, in this scenario, the detector has full access to the LLMs and can control the model’s generation behavior for traceability purposes. In practice, black-box detectors are commonly constructed by external entities, whereas white-box detection is generally carried out by LLM developers.”

stevedidWHAT@lemmy.world · 10 months ago

Apologies about the other comments, mixed two conversations together lmao)

stevedidWHAT@lemmy.world · edit-2 10 months ago

deleted by creator

redw04@lemmy.ca · 10 months ago

Nah, you can go research it yourself and stop spreading misinformation online. Just because you don’t understand how something works doesn’t give you the right to be angry at it.

stevedidWHAT@lemmy.world · 10 months ago

deleted by creator