search - Looking for something like Lucene for keyword indexing for use in a tree
I've been looking for an open source project (like Apache Lucene) that can perform the following functions on a list of indexed objects (each with a text title and a list of associated keywords):
- Perform searches on both the keywords and the full text of the title.
- Create a ranked tree of keywords (based on the number of occurrences of each keyword). For example, the top 10 keywords across all objects are shown at the top level; selecting a keyword shows the list of top keywords associated with that keyword, and so on.
My idea is to use Apache Lucene, which provides a great way to do full-text and keyword search, but I'm not 100% sure how to translate that into creating the ranked keyword tree. Are there other products I may be missing?
The ranked keyword problem can be elegantly resolved with faceting. Say you have foo present in 10 documents on the keyword field, and bar present in 5 documents on the same field. Faceting on the keyword field will give you 10 for foo and 5 for bar.
You can find a pretty well documented example here: http://lucene.apache.org/core/4_0_0/facet/org/apache/lucene/facet/doc-files/userguide.html
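To make the idea concrete, here is a minimal sketch of facet counting with drill-down in plain Python. It does not use Lucene's facet module; the `facet_counts` function, the `docs` list, and the dictionary layout are hypothetical names chosen for illustration. Each level of the tree is just a facet count restricted to documents that contain every keyword selected so far.

```python
from collections import Counter

def facet_counts(docs, selected=(), top_n=10):
    """Return the top_n (keyword, document count) pairs among documents
    that contain every keyword in `selected`.

    Already-selected keywords are left out of the result, so each call
    yields the children of the current node in the keyword tree.
    """
    selected = set(selected)
    counts = Counter()
    for doc in docs:
        keywords = set(doc["keywords"])
        # Keep only documents matching the whole drill-down path.
        if selected <= keywords:
            counts.update(keywords - selected)
    # Sort by descending count, then alphabetically so ties are stable.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:top_n]

# Hypothetical sample data: each object has a title and a keyword list.
docs = [
    {"title": "Intro to search", "keywords": ["foo", "bar"]},
    {"title": "Indexing basics", "keywords": ["foo", "baz"]},
    {"title": "Ranking results", "keywords": ["foo", "bar", "baz"]},
    {"title": "Tree widgets", "keywords": ["bar"]},
    {"title": "More on foo", "keywords": ["foo"]},
    {"title": "Foo and bar again", "keywords": ["foo", "bar"]},
]

top_level = facet_counts(docs)                    # root of the tree
under_foo = facet_counts(docs, selected=["foo"])  # children of "foo"
```

In a Lucene-based setup the same pattern would presumably apply: run the facet request once for the top level, then re-run it with the selected keyword added as a filter each time the user expands a node.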