Plagiarism-checker-Python

This repo contains the source code of a Python script that detects plagiarism in text documents using cosine similarity.

How is it Done?

You might be wondering how plagiarism detection on textual data is done. Well, it ain't as complicated as you may think.

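We all know that computers are good with numbers. So, to compute the similarity between two text documents, the raw text is first transformed into vectors (arrays of numbers). From there, basic vector math does the rest: the cosine of the angle between the two vectors is used as the similarity score. For vectors built from word counts or weights, a score near 1 means the documents use almost the same words, and a score near 0 means they have little in common.

This repo contains a basic example of how to do that.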
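The idea above can be sketched in a few lines of plain Python. This is only an illustration under simple assumptions, not necessarily how app.py does it: the helper names `vectorize` and `cosine_similarity` are made up for this example, and the vectors here are raw word counts rather than any particular weighting scheme.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Turn raw text into a sparse word-count vector (word -> count)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse vectors: dot(a, b) / (|a| * |b|)."""
    # Words missing from vec_b count as 0, so only shared words contribute.
    dot = sum(count * vec_b[word] for word, count in vec_a.items())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document has no direction to compare
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    a = vectorize("the cat sat on the mat")
    b = vectorize("the cat lay on the mat")
    print(cosine_similarity(a, b))
```

Two documents that share most of their words score close to 1, while documents with no words in common score exactly 0.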

Getting Started

To get started with the code in this repo, clone or download it to your machine as shown below:

git clone https://github.com/Kalebu/Plagiarism-checker-Python

Dependencies

Before you begin playing with the source code, install the dependencies as shown below:

pip3 install -r requirements.txt

Running the App

To run this code, place your text documents in the project directory with the .txt extension. When you run the script, it automatically loads every document with that extension and computes the similarity between them. Each output line shows a pair of file names followed by their similarity score:

$-> cd Plagiarism-checker-Python
$ Plagiarism-checker-Python-> python3 app.py
('john.txt', 'juma.txt', 0.5465972177348937)
('fatma.txt', 'john.txt', 0.14806887549598566)
('fatma.txt', 'juma.txt', 0.18643448370323362)
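For readers curious about what a run like this involves, here is a minimal sketch of the load-and-compare loop. It is an assumption-laden illustration rather than the actual contents of app.py: the function name `compare_documents` is hypothetical, and plain word counts are used as vectors, so the scores it produces will not match the numbers shown above.

```python
import math
import re
from collections import Counter
from itertools import combinations
from pathlib import Path


def vectorize(text):
    """Turn raw text into a sparse word-count vector (word -> count)."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse word-count vectors."""
    dot = sum(count * vec_b[word] for word, count in vec_a.items())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def compare_documents(directory="."):
    """Return (file_a, file_b, score) for every pair of .txt files in directory."""
    paths = sorted(Path(directory).glob("*.txt"))
    # Vectorize each document once, then reuse the vectors for every pair.
    vectors = {
        path.name: vectorize(path.read_text(encoding="utf-8", errors="ignore"))
        for path in paths
    }
    return [
        (a.name, b.name, cosine_similarity(vectors[a.name], vectors[b.name]))
        for a, b in combinations(paths, 2)
    ]
```

Calling `compare_documents(".")` from the project directory and printing each returned tuple gives output in the same shape as the run above: two file names and a score.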

A Python Library?

Would you rather use a Python library to compare strings and documents, without spending time writing the vectorizers yourself? Then take a look at Pysimilar.

Explore it

Explore it and twist it to fit your own use case. If you have any questions, feel free to reach me directly at isaackeinstein@gmail.com.

Issues

If you run into any difficulties while trying to run the script, you can raise an issue.

Pull Requests

If you have something to add, I welcome pull requests with improvements; helpful contributions will be merged as soon as possible.

Give it a Star

If you find this repo useful, give it a star so that more people can get to know it.

Credits

All the credit goes to kalebu.
