This is also true of Anti-Virus software.
Trapper wrote: "By default K9 will examine all emails, however large they are. Because it can take time for an email to first be downloaded into K9 and then passed on to the email program, some email applications time out or complain, despite K9 sending 'keep alive' messages to keep them awake."
SFCurley wrote: On the what-POPFile-does-with-new-words question. My recollection from reading about how POPFile works is that it assigns never-before-seen words a probability equal to 1/(10 * Corpus Size).
POPFile assigns a probability to every word. For words that it has seen, it naturally assigns the probability from the corpus. For unseen words, it assigns the probability 1/(10 * size of that bucket's corpus), i.e. a probability that indicates that the word is "unlikely" to appear.
The other possible choices are 0 (which would screw up classification since all classifications would be 0) or 1 (which would be a mistake since it would indicate that the word always appears).
A long time ago I investigated different values for the unknown-word probability and determined that what we have is the most efficient.
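A minimal sketch of the rule described above, assuming a plain product of per-word probabilities for each bucket. The function names (`word_probability`, `bucket_score`), the `unseen_divisor` parameter, and the toy word counts are hypothetical and are not taken from POPFile's source; only the 1/(10 * corpus size) fallback comes from the post. It also shows why assigning 0 to unseen words would be a problem: every bucket's score collapses to 0.

```python
from collections import Counter


def word_probability(word, bucket_counts, unseen_divisor=10):
    """Probability of `word` in one bucket.

    Seen words use their relative frequency in the bucket's corpus.
    Unseen words fall back to 1 / (unseen_divisor * corpus size),
    the rule quoted in the post. Passing unseen_divisor=None models
    the "assign 0" alternative, for comparison only.
    """
    total = sum(bucket_counts.values())
    count = bucket_counts.get(word, 0)
    if count > 0:
        return count / total
    if unseen_divisor is None:
        return 0.0
    return 1.0 / (unseen_divisor * total)


def bucket_score(words, bucket_counts, unseen_divisor=10):
    """Product of per-word probabilities (no priors, illustration only)."""
    score = 1.0
    for w in words:
        score *= word_probability(w, bucket_counts, unseen_divisor)
    return score


# Toy corpora invented for this example; each holds 10 words in total.
spam = Counter({"viagra": 6, "free": 4})
inbox = Counter({"meeting": 5, "free": 1, "lunch": 4})

message = ["free", "xqzt"]  # "xqzt" appears in neither bucket

# With the 1/(10 * size) fallback, the buckets can still be ranked.
spam_score = bucket_score(message, spam)
inbox_score = bucket_score(message, inbox)

# With a fallback of 0, one unseen word zeroes out every bucket.
zero_spam = bucket_score(message, spam, unseen_divisor=None)
zero_inbox = bucket_score(message, inbox, unseen_divisor=None)
```

Here the unseen word contributes the same small factor to both buckets, so the ranking is decided by the word that has been seen. A production filter would more likely sum logarithms than multiply raw probabilities, to avoid underflow on long messages; the product is used here only to keep the arithmetic visible.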
That, I believe, is why my accuracy improved when I took all the "junk" words out of my spam bucket (about 75% of the word count was random letters, not even words).
It made new "spam words" less "unlikely" ...if you will.
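To put rough numbers on that effect, here is a small sketch of the arithmetic. The bucket size of 40,000 words is a made-up figure and `unseen_probability` is a hypothetical helper; the 75% junk fraction and the 1/(10 * corpus size) rule are the only values taken from this thread.

```python
def unseen_probability(corpus_size, divisor=10):
    """Fallback probability for a word never seen in this bucket."""
    return 1.0 / (divisor * corpus_size)


total_words = 40_000   # hypothetical spam-bucket word count
junk_fraction = 0.75   # share of random-letter "words", per the post

# Word count left in the bucket after the junk is removed.
cleaned_words = int(total_words * (1 - junk_fraction))

before = unseen_probability(total_words)
after = unseen_probability(cleaned_words)

# How many times less "unlikely" a new word becomes in the spam bucket.
ratio = after / before
```

Under these assumptions, removing three quarters of the bucket makes an unseen word about four times more probable in the spam bucket, which is consistent with the explanation above. It does not by itself prove that this was the cause of the accuracy improvement.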