solr - Index the data in HDFS


I have loaded data into HDFS using the command hadoop fs -put. The data set consists of rich documents: PDFs, DOC files, and text files. How can I index this data so that I can query it in Solr?

Use Apache Tika. It was created for extracting text and metadata from rich file formats such as PDF or DOC. Solr comes with the Tika jar included, so you just need to have a quick look at the instructions for using the jar as a command line utility, and you're good to go: http://tika.apache.org/1.5/gettingstarted.html
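As a rough sketch of how the pieces could fit together (not a tested pipeline), the Python below runs the Tika app jar with its --text option and posts the extracted text to Solr's JSON update handler. The helper names, the jar path, the Solr URL, and the field names id and content are all assumptions for illustration; adjust them to your own setup and schema. It also assumes each file has first been copied out of HDFS to a local path, for example with hadoop fs -get.

```python
import json
import subprocess
import urllib.request


def tika_command(tika_jar, local_path):
    """Build the command line that asks the Tika app jar for plain text."""
    return ["java", "-jar", tika_jar, "--text", local_path]


def extract_text(tika_jar, local_path):
    """Run Tika and return the extracted text (needs Java and the jar)."""
    result = subprocess.run(
        tika_command(tika_jar, local_path),
        capture_output=True,
        check=True,
    )
    # The output encoding is assumed to be UTF-8; undecodable bytes are replaced.
    return result.stdout.decode("utf-8", errors="replace")


def build_update_request(solr_update_url, docs):
    """Build (but do not send) a JSON POST for Solr's update handler."""
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        solr_update_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def index_file(tika_jar, local_path, solr_update_url):
    """Extract one local file with Tika and send it to Solr."""
    # "id" and "content" are hypothetical field names; they must exist
    # in your Solr schema.
    doc = {"id": local_path, "content": extract_text(tika_jar, local_path)}
    request = build_update_request(solr_update_url, [doc])
    with urllib.request.urlopen(request) as response:
        return response.status
```

A call might look like index_file("tika-app.jar", "report.pdf", "http://localhost:8983/solr/collection1/update?commit=true"), where the jar name, file name, and core name are placeholders. Posting plain JSON keeps the Tika step separate from Solr, so you can inspect the extracted text before indexing it.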

