Thank you for trying! I first built it as 'detect the human' response, but that was counter to the 'slop or not' framing. Yeah I'm also observing the same based on the first few hundred people's results. The harder models seem to write almost too well and that's generally not how humans write on the internet unless it is a blog post/essay. The easier models seem to be the ones that are tripping people the most.