Abstract
Pattern of writing a document is often said to be strongly influenced by the mother tongue, but do not guarantee produce writing that is always the similar pattern. If the trend similarity patterns caused by intentional copying of documents, then it is necessary to be created a detection tool to identify the terms pattern in those documents. This phenomena initiate this paper to acquaint the further investigation on text document pattern recognition for terms appearances by employing latent semantic analysis (LSA) method couple with terms distance between two documents. This study also describes determination of text documents similarity, which in turn can be used for early plagiarism detection.
Original language | English |
---|---|
Pages (from-to) | 322-329 |
Number of pages | 8 |
Journal | Journal of Theoretical and Applied Information Technology |
Volume | 79 |
Issue number | 2 |
Publication status | Published - 20 Sept 2015 |
Keywords
- Latent semantic analysis
- Pattern
- Plagiarism
- Term distance
- Text document