Coder Social home page Coder Social logo

Comments (4)

slgphantom avatar slgphantom commented on August 11, 2024 1

I tried the same text on GPTZero.sg with another machine, and the result aligns this time...

Any how, let me also share the result of using v1 (GPTZero) of the GPT2PPL model of the same text here:

Perplexity 10
Perplexity per line 14.0
Burstiness 20
(OrderedDict([('Perplexity', 10),
              ('Perplexity per line', 14.0),
              ('Burstiness', 20),
              ('label', 0)]),
 'The Text is generated by AI.')

from detectgpt.

BurhanUlTayyab avatar BurhanUlTayyab commented on August 11, 2024 1

The results are same on every end. I think the fluctuation might occur because of different Pytorch / Transformer versions.
image

For GPTZerov1, you can watch this video to understand how it works: https://www.youtube.com/watch?v=x9H-aY5sCDA (we've added one more feature where perplexity per line is also calculated and aggregated).

For DetectGPT: We use Log Prob function from a LLM to detect text are similar to what it can produce. Also done line by line.

from detectgpt.

BurhanUlTayyab avatar BurhanUlTayyab commented on August 11, 2024

Additionally, it would be helpful if the threshold could be set as an input argument of the model so that users can customize it.
Sure, we can add that as an input parameter for our next release.

from detectgpt.

slgphantom avatar slgphantom commented on August 11, 2024

Thank you for the prompt response!

Screenshot_20230414-130040

I noticed that the results are fluctuating. It might be helpful if you could test it on your device as well.

Could you please provide more information about the relationship between the confidence level, threshold, and labeling? As someone who is not well-versed in the internal calculations, I believe that the text is only labeled as "human" when the confidence level is above the threshold (0.7). However, this does not seem to be the case. It is frustrating to see that other methods are still reporting the text as being written by AI, such as GPTZero in v1 and DetectGPT on both your and the OG's website.

It would also be highly beneficial if the module could return an object that includes the confidence, label, and highlighted text (AI generated) so that users can better interpret the results and further apply them in their work.

from detectgpt.

Related Issues (7)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.