POPFile v0.21.0 Update in OS X
Published March 11, 2004
POPFile 0.21.0 has been released, and with very little trouble I have it up and running in Mac OS 10.3.
The big change in this version is no more BerklyDB back end, POPFile now using a SQL database, and the easiest way to do this in OS X is to use SQLite. Thankfully Michael Tsai (SpamSieve author) has a built version of SQLite for Mac OS X. This means you do not have to compile and install it. The schema is available to use something like MySQL, but why bother?
Next up you have some perl modules to install:
These you will need to download, compile, and install. DBI needs to be installed before you can install the DBD::SQLite module.
More on installing SQLite and these associated perl modules below.
There are a couple really cool new features to POPFile, and in my opinion, worth the work to use them. They are:
- The global options for Subject Modification, X-Text-Classification insertion and X-POPFile-Link insertion have been removed and replaced with individual options on a per bucket basis to give greater choice in configuring POPFile.
- The 'unclassified' bucket is now visible in the UI so that you can see how many messages were unclassified, and configure header modification. This also means that unclassified messages are counted in the accuracy statistics; previously they were not counted which could have skewed the accuracy statistics if there were unclassified messages. — This was something I had asked for, and I assume others did as well.
- The history "page" bar has been simplified so that it uses a fixed amount of screen space, while making navigation easy. Filters and searches on the history are now persistent, for example you can click on the Buckets page and return to the History page without losing your filter or search settings.
- The Buckets page has been modified to only show the 'distinct word' count per bucket and to show the total number of distinct words in the database. Previously we showed two counts with confusing titles: now we show the true number of words in the database, not the "word counts" (which was the number of times each word occurred).
- We've recently seen spam start to use CSS to obscure messages and fool filters like POPFile; in response, this version of POPFile does analysis of CSS in HTML encoded messages. POPFile now correctly uses the SpamAssassin headers to make POPFile more efficient when used in conjunction with SpamAssassin. We now also look at TLDs (Top Level Domains) and store them as pseudowords (most useful for TLDs like .biz). — My favorite new feature.
- POPFile v0.21.0 Update in OS X
- Published: March 11, 2004
- Type:
- Section: Sci/Tech
- Filed Under: Sci/Tech: Software
- Writer: Ken Edwards
- Ken Edwards's BC Writer page
- Ken Edwards's personal site
- Spread the Word
- Like this article?
- Email this
Save to del.icio.us
Comments
Hi, I have been meaning to go to that very thing, but have been busy.
Follow steps A, B, and C.
Just get SQLite from Michael Tsai's web site.
The extra thing you will need is the StartupItem.
Go here and read step 4.5 and 5 (his instructions are not updated for 0.21.0 yet.) You can download a pre-made StartupItem here. Then set its permissions, and you should be all set.
Let me know if it does not work.
Only one word : perfect !!
Thanks for your quick and kind help
Popfile is running under sqlite on my powerbook
Luc





Hello,
All this is great, but this article doesn't exlain step by step how to install this stuff (for example, what to do with SQLite ???) and all the existing texts about popfile with berkeley simply doesn't work for a non-geek user... So is there somewehre a step by step "how to install the new popfile with sqlite under Panther " ??
Thanks in advance