solr - Index the data in HDFS
I have loaded data into HDFS using the command hadoop fs -put. The data set consists of rich documents: PDFs, DOC files, and text files. How can I index this data so that I am able to query it in Solr?
Use Apache Tika. It was created for extracting text and metadata from rich file formats such as PDF or DOC. Solr comes with a Tika jar included, so you only need to have a quick look at the instructions for using the jar as a command-line utility, and you're good to go: http://tika.apache.org/1.5/gettingstarted.html
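As a rough illustration of that workflow, here is a minimal Python sketch that copies a file out of HDFS, runs the Tika jar on it in text mode, and posts the extracted text to Solr as a JSON update. Several things here are assumptions and not part of the original answer: the jar file name (tika-app-1.5.jar), the Solr URL and core name (collection1), the presence of a field called "text" in your schema, and all of the helper function names. The JSON update path can also differ between Solr versions, so treat this as a starting point, not a tested recipe.

```python
import json
import subprocess
import urllib.request

# Assumed locations; adjust both to match your installation.
TIKA_JAR = "tika-app-1.5.jar"
SOLR_UPDATE_URL = "http://localhost:8983/solr/collection1/update?commit=true"


def hdfs_get_command(hdfs_path, local_path):
    """Build the command that copies one file from HDFS to local disk."""
    return ["hadoop", "fs", "-get", hdfs_path, local_path]


def tika_text_command(local_path, jar=TIKA_JAR):
    """Build the Tika command that prints a document's plain text to stdout."""
    return ["java", "-jar", jar, "--text", local_path]


def build_solr_payload(doc_id, text):
    """Serialize one document as the body of a Solr JSON update request.

    The field name "text" is an assumption; it must exist in your schema.
    """
    return json.dumps([{"id": doc_id, "text": text}]).encode("utf-8")


def index_file(hdfs_path, local_path):
    """Copy a file out of HDFS, extract its text with Tika, and post it to Solr."""
    subprocess.run(hdfs_get_command(hdfs_path, local_path), check=True)
    result = subprocess.run(
        tika_text_command(local_path), check=True, capture_output=True
    )
    text = result.stdout.decode("utf-8", errors="replace")
    request = urllib.request.Request(
        SOLR_UPDATE_URL,
        data=build_solr_payload(hdfs_path, text),  # HDFS path doubles as the id
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status
```

With Hadoop, Java, and a running Solr instance available, a call such as index_file("/user/me/docs/report.pdf", "/tmp/report.pdf") (both paths are hypothetical) would index a single document. Using the HDFS path as the document id keeps re-indexing the same file idempotent, since Solr replaces documents that share an id.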