Jacob Kaplan-Moss

5 items tagged “via:jkocherhans”

📌 Geeking with Greg: Clever method of near duplicate detection

A slick algorithm to “fingerprint” text based on chains for words following stop words. #

📌 Repository - directory - public: enfold.solr/trunk/server

This is awesome: a complete Solr/Jetty setup. This is similar to what I’ve been using, but even nicer. Thanks, Joseph! #

📌 Xapian: Theoretical Background

Some good notes on how relevancy algorithms actually work. Must read through this in more detail. #

📌 Salon.com Mothers Who Think | Tibetan curried potatoes #
📌 jerith.za.net: Socket programming in Erlang

Sweet—I’ve been looking for a quick tutorial of this nature. #