Lazyweb: literary forensics
Aug. 6th, 2012 04:34 pm
Are there any websites / Linux programs for helping determine if the same person wrote two different sets of message board posts?
I can find plagiarism checkers, but these tend to be looking for whether the texts have a common source, rather than a common author. (If I write two texts, one about soft fruit and one about deckchairs, they are liable to pass a plagiarism test, but should still have enough in common for someone to be able to say that the same person wrote both...)
(no subject)
Date: 2012-08-07 06:55 am (UTC)
Your problem might be a very hard one, though. These things work best with decent corpuses of training text - like dozens of novels' worth. And they can generally only tell you, for a given test text, which of the authors in the training corpus is most likely to have written it - they assume that it must have been one of the authors in the training materials, not any random writer.
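To make the "which of the known authors is closest" limitation concrete, here is a rough Python sketch of that kind of closed-set comparison. It is not how any particular tool works: the word list, the function names and the cosine-similarity scoring are all assumptions picked for illustration, and a real tool would use far more features and far more text.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny list of topic-independent "function words".
# Real stylometry tools use hundreds of such features.
FUNCTION_WORDS = ["the", "a", "of", "and", "to", "in", "but", "that", "is", "i"]


def profile(text):
    """Relative frequency of each function word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words) or 1  # avoid dividing by zero on empty input
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either one is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_likely_author(test_text, training):
    """Closed-set attribution: return the candidate with the closest profile.

    `training` maps an author name to a sample of that author's text.
    Some name is always returned, even if the real author is not in
    `training` at all.
    """
    target = profile(test_text)
    return max(training, key=lambda name: cosine(profile(training[name]), target))
```

Note that `most_likely_author` has no way of answering "none of the above": it just ranks the candidates it was given. Deciding whether two sets of posts share an author with no list of suspects would need some extra threshold on the similarity score, and choosing that threshold is the hard part.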
(no subject)
Date: 2012-08-07 06:59 am (UTC)