KOPI plagiarism checker

Description:

Nowadays most students can speak at least one foreign language, they are more and more willing to use foreign sources in their works translated word-by-word and sometimes not referencing properly. The application is both suitable for teachers to check the students‚ to work sentence by sentence and also for students to check their essays looking for the references. KOPI is special amongst many other plagiarism search tool for running on Desktop Grids, being cross-lingual and using Wikipedia as its main database.

The English Wikipedia consists of nearly 4 million articles, and is about 30GB, without the pictures and the accessory data. To process such a great amount of data large computation capacity is a must. To ensure that the database of the plagiarism-checker is up-to-date, we have to process the datasets from Wikipedia on a monthly basis. We are able to do this with the help of the donors linked to the SZTAKI Desktop Grid: we split the dataset into smaller parts, transfer them into textual format, split them into sentences, and then we take the stem of all words.

Activities:

The aim is to present the technology of KOPI Plagiarism Checker.

Link to KOPI
http://kopi.sztaki.hu/

Click for more on the Multilingual plagiarism search application.

 

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>