Por favor utiliza este link para citar o compartir este documento: http://repositoriodigital.academica.mx/jspui/handle/987654321/83056
Título: Detecting Derivatives using Specific and Invariant Descriptors
Palabras clave: Textual derivatives
detection of derivations
near-duplicates
revisions
linguistic descriptors
French corpus
Fecha de publicación: 31-Jul-2012
Editorial: Polibits
Descripción: This paper explores the detection of derivation links between texts (otherwise called plagiarism, near-duplication, revision, etc.) at the document level. We evaluate the use of textual elements implementing the ideas of specificity and invariance as well as their combination to characterize derivatives. We built a French press corpus based on Wikinews revisions to run this evaluation. We obtain performances similar to the state of the art method (n-grams overlap) while reducing the signature size and so, the processing costs. In order to ensure the verifiability and the reproducibility of our results we make our code as well as our corpus available to the community.
Other Identifiers: http://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S1870-90442011000100001
Aparece en las Colecciones:Polibits

Archivos de este documento:
No hay archivos asociados a este documento.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.