4/19/2025 at 3:59:02 PM
Intuitive and expected result (maybe without the prediction of performance). I'm glad somebody did the hard work of proving it.Though, if this is so clearly seen, how come AI detectors perform so badly?
by PunchTornado
4/19/2025 at 5:29:33 PM
This experiment involves each LLM responding to 128 or 256 prompts. AI detection is generally focused on determining the writer of a single document, not comparing two analagous sets of 128 documents and determining if the same person/tool wrote both. Totally different problem.by Calavar
4/19/2025 at 4:38:29 PM
It might be because detecting if output is AI generated and mapping output which is known to be from an LLM to a specific LLM or class of LLMs are different problems.by haltingproblem