I am trying to index HBase data using the MapReduceIndexerTool. I am indexing with the command below:

hadoop jar /usr/lib/hbase-solr/tools/hbase-indexer-mr-1.5-cdh5.3.0-job.jar --hbase-indexer-zk localhost --hbase-table-name indexdemo-user --hbase-indexer-name mynewindexer --hbase-indexer-file /home/cloudera/indexdemo-indexer.xml --collection collection1 --go-live

The command above indexed the data, and I can see it in the Solr web UI as well. But when I run the same command a second time, the same data is indexed again and the Solr UI shows 2 documents. My requirement is to have a single document (i.e. the second run has to overwrite the first run's data), so that if I run the same command multiple times I still have 1 document with the latest entries. (Note: when I index directly using the Java API, I get 1 document even if I run the same program multiple times. I need the same behavior using the indexer tool.)
Any suggestions are appreciated. Thanks in advance.
Define a field that has a unique value identifying the document as the uniqueKey in the schema. As long as the uniqueKey is identical across each run, the old documents will be replaced / updated.
If you're generating a unique value each time you're indexing (or haven't configured a uniqueKey), Solr will have no way to tell that the documents are the same document.
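
As a rough illustration of the point above, a uniqueKey declaration in the collection's schema.xml might look like the fragment below. The field name `id` and the `string` field type are assumptions for the sake of the example; the asker's actual schema is not shown, so the names should be adapted to whatever field carries the stable identifier (for example, a value derived from the HBase row key).

```xml
<!-- Sketch of the relevant parts of schema.xml; the field name "id" is an assumption -->
<fields>
  <!-- The key field must be single-valued and present on every document -->
  <field name="id" type="string" indexed="true" stored="true"
         required="true" multiValued="false"/>
</fields>

<!-- Solr treats two documents with the same value in this field as the same document -->
<uniqueKey>id</uniqueKey>
```

With a declaration like this, what matters is that the indexer writes the same `id` value for the same HBase row on every run. If the value changes from run to run, each run produces what Solr sees as a new document.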