KOPI plagiarism checker
Description:
Now that most students speak at least one foreign language, they are increasingly willing to use foreign-language sources in their work, translated word for word and sometimes without proper references. The application suits both teachers, who can check students' work sentence by sentence, and students, who can check their own essays for missing references. KOPI stands out among plagiarism search tools because it runs on Desktop Grids, is cross-lingual, and uses Wikipedia as its main database.
The English Wikipedia consists of nearly 4 million articles and is about 30 GB in size, excluding pictures and accessory data. Processing this much data requires large computation capacity. To keep the plagiarism checker's database up to date, we have to process the Wikipedia datasets on a monthly basis. We are able to do this with the help of the donors linked to the SZTAKI Desktop Grid: we split the dataset into smaller parts, convert them to plain text, split them into sentences, and then take the stem of every word.
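The four processing steps described above (splitting into parts, converting to text, splitting into sentences, stemming) can be sketched roughly as follows. This is only an illustration under stated assumptions, not KOPI's actual code: every function name is hypothetical, the markup removal and sentence splitting are deliberately naive, and `toy_stem` is a simple suffix stripper standing in for whatever stemmer the project really uses.

```python
import re
from typing import Iterable, Iterator, List


def split_into_chunks(articles: List[str], chunk_size: int) -> Iterator[List[str]]:
    """Split the dataset into smaller parts (one part per grid work unit)."""
    for start in range(0, len(articles), chunk_size):
        yield articles[start:start + chunk_size]


def to_plain_text(wikitext: str) -> str:
    """Rough markup removal: [[target|label]] -> label, drop ''/''' quotes."""
    text = re.sub(r"\[\[(?:[^\]|]*\|)?([^\]]*)\]\]", r"\1", wikitext)
    text = re.sub(r"'{2,}", "", text)
    return re.sub(r"\s+", " ", text).strip()


def split_into_sentences(text: str) -> List[str]:
    """Naive split after '.', '!' or '?' followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def toy_stem(word: str) -> str:
    """Toy suffix stripper (assumption: a stand-in for a real stemmer)."""
    word = word.lower()
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def process_chunk(chunk: Iterable[str]) -> List[List[str]]:
    """Return one list of stems per sentence for every article in the chunk."""
    result = []
    for article in chunk:
        for sentence in split_into_sentences(to_plain_text(article)):
            words = re.findall(r"[A-Za-z]+", sentence)
            result.append([toy_stem(w) for w in words])
    return result
```

In this sketch each chunk is independent of the others, which is what would make it possible to hand the parts out to separate donor machines and process them in parallel.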
Activities:
The aim is to present the technology behind the KOPI Plagiarism Checker.

Link to KOPI
http://kopi.sztaki.hu/
Click for more on the Multilingual plagiarism search application.
Glimpse into the project

lab excursion

lab excursion

GLOBAL excursion team

project meeting

GLOBAL excursion team

GLOBAL - lab - excursion

BIFI and SZTAKI partners

Barbara and Teresa, ZSI

Agueda (EUN), Sue (UCAM), Claudia (ZSI)

Barbara and Teresa (ZSI)

Working group

ZSI, BIFI

Fermin (BIFI)

Agnes (SZTAKI)

working group


