Date: 2013-01-15 04:12 pm (UTC)
amielleon: The three heroes of Tellius. (Default)
From: [personal profile] amielleon
Hahahahaha.

Okay but even if you had a good algorithm, I think my corpus (or at least, the way you've chosen it) may be inherently problematic. While I don't deny having a "general" voice and some very strong generalities in terms of theme, I'm fond of deliberately using slightly different voices in different pieces. A Terrycloth Mother is probably closest to what I consider some kind of "standard serious," though I haven't actually used the "standard serious" voice much aha. lucius, Coin in Palm, and New World I'd classify as "whimsical." And the rest are pretty much separate categories unto themselves, unless you dive into my fic not posted there at FFN. (visitor is exceptionally and markedly different.)

But, theoretically, if you were trying to build an algorithm that could identify the author of a piece even when the author were trying to consciously disguise it, I would be an excellent test.

Also, shouldn't accuracy also be counted for false positives? That Mark result looks much less impressive when you consider that it gave Mark 4 false positives. Before I realized that fact I was tempted to chalk up Mark's unusually high accuracy to the fact that she uses a very similar writing style throughout her corpus... though granted it still does better with her than either of us.

Incidentally, if it's a matter of reading score and frequencies I suspect it might okay with [personal profile] blankspectrum's stuff, as she has a fairly consistent writing voice and some favorite words. (*cough words based on "soft" cough*)

tl;dr yeah it's just one of those "fun waste of time" things.
This account has disabled anonymous posting.
If you don't have an account you can create one now.
HTML doesn't work in the subject.
More info about formatting

Expand Cut Tags

No cut tags