SIREn Semantic Information Retrieval Engine
SIREn: Efficient semi-structured Information Retrieval for Lucene
http://siren.sindice.com/index.html  AGPL license

Querying graph-structured data (RDF) is commonly done with dedicated solutions called triplestores, typically built on DBMS backends. In Sindice, however, we needed something much more scalable than a DBMS, with the desirable features of typical Web search engines: top-k query processing, real-time updates, full-text search, distributed indexes over shards, etc.

While Lucene has long offered these features, it was not designed for large semi-structured document collections (or documents with very different schemas). For this reason we developed SIREn (Semantic Information Retrieval Engine), a Lucene plugin that overcomes these shortcomings and efficiently indexes and queries RDF, as well as any textual document with an arbitrary number of metadata fields.
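
The schema problem mentioned above can be illustrated with a small sketch. This is plain Python, not Lucene or SIREn code, and the triples and helper names (`group_by_entity`, `distinct_fields`) are hypothetical, chosen only for illustration. It assumes a naive mapping in which each RDF subject becomes one document and each predicate becomes one field, and shows that the set of field names then grows with the number of distinct predicates in the data.

```python
from collections import defaultdict

# Toy RDF triples (subject, predicate, object); all data here is made up.
TRIPLES = [
    ("ex:alice", "foaf:name", "Alice"),
    ("ex:alice", "foaf:knows", "ex:bob"),
    ("ex:bob", "foaf:name", "Bob"),
    ("ex:paper1", "dc:creator", "ex:bob"),
    ("ex:paper1", "dc:title", "Searching Web Data"),
]


def group_by_entity(triples):
    """Group triples into one 'document' per subject, one field per predicate."""
    docs = defaultdict(lambda: defaultdict(list))
    for subject, predicate, obj in triples:
        docs[subject][predicate].append(obj)
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {subject: dict(fields) for subject, fields in docs.items()}


def distinct_fields(docs):
    """Return the field names a naive field-per-predicate index would need."""
    return {predicate for fields in docs.values() for predicate in fields}


docs = group_by_entity(TRIPLES)
fields = distinct_fields(docs)

print(len(docs), "documents,", len(fields), "distinct fields")
for subject, doc_fields in sorted(docs.items()):
    print(subject, doc_fields)
```

On real Web data the number of distinct predicates is not bounded in advance, so under this naive mapping every document may carry a different set of fields. That is the kind of collection the text describes as having very different schemas.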
