Whoosh + Tahoe = distributed search engine for documents in Tahoe

Okhin okhin at okhin.fr
Tue Jul 8 13:22:42 UTC 2014


Ohai,

First post here.

I'm working on a Python module to extend Whoosh[1] with a Tahoe-LAFS
storage engine.

Whoosh is a search engine written in pure Python. It can be used to
index documents and metadata and to search through them - well, what a
search engine is supposed to do.

Extending it with a storage backend on top of Tahoe makes it possible to
have a cap containing the documents and their associated metadata,
indexed by Whoosh, and to access it directly from a client (with some
Python code). That way you do not rely on a single directory to manage
and search through the collection of documents.
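
To make the idea concrete, here is a rough sketch. It is not the
module's code and does not use Whoosh's API: DictStore, TinyIndex and
tahoe_file_url are hypothetical names made up for the illustration, and
"URI:DIR2:example" is a placeholder, not a real cap. It only assumes the
Tahoe web API layout (/uri/$DIRCAP/FILENAME on the local gateway, by
default http://127.0.0.1:3456). The point is that the index is just
files in a store, so anything holding the same store can search it.

```python
import json
from urllib.parse import quote


def tahoe_file_url(dircap, name, node_url="http://127.0.0.1:3456"):
    """URL of child `name` under directory cap `dircap` on a Tahoe gateway.

    A real backend would GET/PUT this URL; the helper only builds it.
    """
    return "{}/uri/{}/{}".format(
        node_url.rstrip("/"), quote(dircap, safe=""), quote(name, safe="")
    )


class DictStore:
    """In-memory stand-in for a Tahoe directory: file name -> bytes."""

    def __init__(self):
        self.files = {}

    def write(self, name, data):
        self.files[name] = data

    def read(self, name):
        return self.files[name]


class TinyIndex:
    """Toy inverted index kept as one JSON file inside a store."""

    FILENAME = "index.json"

    def __init__(self, store):
        self.store = store

    def _load(self):
        # An empty store simply means an empty index.
        try:
            return json.loads(self.store.read(self.FILENAME).decode("utf-8"))
        except KeyError:
            return {}

    def add(self, doc_id, text):
        postings = self._load()
        for word in set(text.lower().split()):
            ids = postings.setdefault(word, [])
            if doc_id not in ids:
                ids.append(doc_id)
        self.store.write(self.FILENAME, json.dumps(postings).encode("utf-8"))

    def search(self, word):
        return sorted(self._load().get(word.lower(), []))


store = DictStore()
TinyIndex(store).add("notes.txt", "Tahoe stores encrypted shares")
TinyIndex(store).add("todo.txt", "index documents stored in Tahoe")

# A second "client" holding the same store sees the same index.
results = TinyIndex(store).search("tahoe")
url = tahoe_file_url("URI:DIR2:example", "index.json")
```

Replacing DictStore with something that reads and writes the URLs built
by tahoe_file_url would put the index file in the grid next to the
documents, which is roughly what a Tahoe storage engine has to provide.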

The code is on Le Loop's GitLab[2] and I've published the module on
PyPI[3].

If you have comments, patches, or ideas, they are most welcome :)

--
[1]: https://whoosh.readthedocs.org/en/latest/
[2]: https://git.leloop.org/okhin/tahoe-whoosh
[3]: https://pypi.python.org/pypi/Tahoe-whoosh
-- 
With datalove,
Okhin
:(){ :|:& };: