DetectGPT: Distinguishing between Machine-Generated and Human-Written Text

This repository extends the original DetectGPT work by adding three datasets that cover new writing styles and evaluating DetectGPT's performance on them.

Original implementation of the experiments in the DetectGPT paper.

An interactive demo of DetectGPT can be found here.

Instructions

First, install the Python dependencies:

    python3 -m venv env
    source env/bin/activate
    pip install -r requirements.txt

Second, run the script with the command python run.py, passing the appropriate command-line arguments.

To include a new dataset, add a loader for it to custom_datasets.py, then run run.py as described above.
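As a rough illustration of what adding a dataset could look like, the sketch below registers a loader under a dataset name and returns a list of cleaned text samples. Everything in it is an assumption made for illustration: the names register_dataset, DATASET_LOADERS, load, and my_essays are hypothetical, as is the CSV layout with a text column. Check custom_datasets.py for the structure it actually expects.

```python
import csv
import io

# Hypothetical registry mapping dataset names to loader functions.
# The real custom_datasets.py may be organized differently.
DATASET_LOADERS = {}


def register_dataset(name):
    """Decorator that records a loader function under a dataset name."""
    def decorator(fn):
        DATASET_LOADERS[name] = fn
        return fn
    return decorator


@register_dataset("my_essays")  # hypothetical dataset name
def load_my_essays(source):
    """Read a CSV file object with a 'text' column; return non-empty samples."""
    reader = csv.DictReader(source)
    # Collapse runs of whitespace so every sample is a single clean line.
    texts = [" ".join((row.get("text") or "").split()) for row in reader]
    return [t for t in texts if t]


def load(name, source):
    """Look up the loader registered for `name` and run it on `source`."""
    if name not in DATASET_LOADERS:
        raise ValueError(f"Unknown dataset: {name!r}")
    return DATASET_LOADERS[name](source)


if __name__ == "__main__":
    sample = io.StringIO("text\n  First   human-written essay. \nSecond essay.\n")
    print(load("my_essays", sample))
```

A registry like this keeps each dataset's parsing logic in one function, so supporting another writing style only requires one more decorated loader.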

Refer to the scripts for details on how each function works and which command-line arguments to pass.

Here, we extend the original DetectGPT paper. We apply the method to new datasets and compare the results with those reported by the original authors, verifying that the algorithm works well on new datasets (especially ones with a different style of text). We also document the original source code and provide a document outlining how we ran our code and tuned the associated hyperparameters.

Future Work:

Improving the GPT-2 model using ensemble methods
Further exploring the relationship between prompting and detection
Determining whether negative log-likelihood curvature is present for generative models in other domains: audio, video, and images.
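For context on the curvature test mentioned in the last item: DetectGPT compares the log-probability of a passage under a model with the average log-probability of slightly perturbed versions of that passage. A large positive gap suggests the passage sits near a local maximum of the model's log-probability, which is characteristic of machine-generated text. The sketch below shows only that core computation. The helpers toy_log_prob and toy_perturb are hypothetical stand-ins, assumed here so the example runs on its own. In the paper, log-probabilities come from the source language model and perturbations from a mask-filling model such as T5.

```python
import math
import random
import statistics


def perturbation_discrepancy(text, log_prob, perturb, n_perturbations=20, seed=0):
    """Return log p(text) minus the mean log p over perturbed copies of text."""
    rng = random.Random(seed)
    original = log_prob(text)
    perturbed = [log_prob(perturb(text, rng)) for _ in range(n_perturbations)]
    return original - statistics.mean(perturbed)


# Hypothetical toy unigram "model", used only to make the sketch runnable.
TOY_WORD_LOGP = {"the": math.log(0.5), "cat": math.log(0.3), "sat": math.log(0.2)}


def toy_log_prob(text):
    """Sum per-word log-probabilities; unknown words get a small floor value."""
    return sum(TOY_WORD_LOGP.get(w, math.log(1e-3)) for w in text.split())


def toy_perturb(text, rng):
    """Replace one randomly chosen word with a random vocabulary word."""
    words = text.split()
    i = rng.randrange(len(words))
    words[i] = rng.choice(sorted(TOY_WORD_LOGP))
    return " ".join(words)


if __name__ == "__main__":
    score = perturbation_discrepancy("the the the", toy_log_prob, toy_perturb)
    print(f"perturbation discrepancy: {score:.3f}")
```

Testing the same idea in another domain would mean swapping in a likelihood estimate and a perturbation function suited to audio, video, or images, while the comparison itself stays the same.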
