« Herd Of Scenes | Main | Stemming »

07 September 2004

More On Lucene

According to Ted Leung (link courtesy of Jarno Virtanen) "Lupy is much slower than either Lucene or CLucene."

CLucene is a C++ port of Lucene. (I was thinking of Common Lisp when I mentioned clucene.) Clucene claims to be "faster than lucene as it is written in C++."

These two articles by Otis Gospodnetic (linked from the Lucene site) introduce Lucene's API and index structures:

Reading the second article above, this page came to mind. To quote a passage: "Because we have about 2 gigs of static data we need rapid access to, we use C++ code to memory-map huge files containing pointerless C structs (of flights, fares, etc), and then access these from Common Lisp using foreign data accesses." I haven't a clue if Lucene indices are usable in this manner.

Ah, Francesco Bellomi has tried so, using Java's NIO (new I/O) API, which supports memory-mapped files.

BTW, here's the cue for James Robertson's rant on "final": "Some [methods] were final in Directory, so I have used a slightly modified version of Directory.java (BTW, I wonder why so many methods in Lucene are made final...)"


Posted by ngps at 01:04 | Comments (3) | Trackbacks (0)
Comments
Re: More On Lucene

They're all pushed to final to enable the java compiler to optimize them further...

Posted by: Joe at September 08,2004 10:19
Re: More On Lucene

1. Make all methods in a particular class X non-final.

2. Subclass X trivially as X'. Implement a final method in X' for each method in X.

Can X+X' be optimised as if it is only-X-with-all-methods-final?

Posted by: Ng Pheng Siong at September 08,2004 13:05
Java or C++/C for search engine?

with Java you use control in memory and thus the optimization opportunities for indexing and search and caching. I strongly believe C++/C is the way to go. Java may be good for prototyping and concept-proving. Search engine needs fast indexing and fast query into memory. - Roboo Meshfire

Posted by: Roboo Meshfire at September 02,2005 01:07
Trackbacks
Please send trackback to:http://sandbox.rulemaker.net/ngps/107/tbping
There is no trackback.