I just tried to clear out blogmatcher's cache directory, and bash said:
[root@d2135 cache]# rm -f *
bash: /bin/rm: Argument list too long
The folder only had 4200 files... it's not
that many files. Or is it?
Tim Swanson, who came up pretty high in my blogmatcher results, somehow discovered my blog and blogmatcher and wrote about it in his blog.
On semi-related news, I've spent the last few hours trying to come up with a database driven version of blogmatcher, with hopes that I'd get better performance. I got Postgres up and running, but as of yet, things don't look too promising. Searches are taking as long (or possibly longer) than the current flat-file based version, and I'm starting to think that either I've got the whole database schema wrong, or it's just one of those problems where using a database doesn't help.
For anyone who's interested, the relations table (linking blogs to links) has over 60,000 entries, there are around 60,000 links, and a little over 1000 blogs.