Coder Social home page Coder Social logo

solrhbaseindex's Introduction

SolrHbaseIndex

Function: This is a basic HBase common API for HFile to solr.(Hbase table import into solr)

Used: Hadoop-1.1.2 Hbase-0.94.1 Solr-4.2.1

Step

Stored CSV to HDFS file. (Default csv put it into $HADOOP_HOME, and create a new directory in HDFS file system)

=> bin/hadoop fs -copyFromLocal import_data.csv /user/$user_name/new/import_data.csv 2. Importtsv (in $HBASE_HOME)

=> bin/hbase org.apache.hadoop.hbase.mapreduce.ImportTsv '-Dimporttsv.separator=,' -Dimporttsv.columns=HBASE_ROW_KEY, column_family:category,column_family:name,column_family:tel,column_family:province,column_family:address HBASE_TABLE hdfs://localhost:9000/user/$user_name/new/import_data.csv

Assemble jar file

It needs many jar file (please put these jar into HBASE_HOME/lib and HADOOP_HOME/lib replace old version)

commons-cli-1.2.jar commons-io-2.1.jar guava-11.0.2.jar hadoop-core-1.1.2.jar hbase-0.94.1.jar httpclient-4.2.3.jar httpcore-4.2.2.jar httpmime-4.2.3.jar jcl-over-slf4j-1.6.4.jar slf4j-api-1.6.4.jar slf4j-jdk14-1.6.4.jar slf4j-log4j12-1.6.4.jar solr-core-4.2.1.jar solr-solrj-4.2.1.jar wstx-asl-3.2.7.jar zookeeper-3.4.5.jar protobuf-java-2.4.0a.jar

step1: Java to class (Default directory is /opt/data/e/)

javac -classpath commons-cli-1.2.jar:commons-io-2.1.jar:guava-11.0.2.jar:hadoop-core-1.1.2.jar :hbase-0.94.1.jar:httpclient-4.2.3.jar:httpcore-4.2.2.jar:httpmime-4.2.3.jar: jcl-over-slf4j-1.6.4.jar:slf4j-api-1.6.4.jar:slf4j-jdk14-1.6.4.jar:slf4j-log4j12-1.6.4.jar: solr-core-4.2.1.jar:solr-solrj-4.2.1.jar:wstx-asl-3.2.7.jar:zookeeper-3.4.5.jar:protobuf-java-2.4.0a.jar -d /opt/data/e/ solrIndexer.java

step2: Jar generating

jar -cvf solrIndexer.jar -C /opt/data/e/ .

step3: Copy solrIndexer.jar into $HBASE_HOME and then

bin/hadoop jar solrIndexer.jar com.example.solr.solrIndexer

step4: OK

solrhbaseindex's People

Contributors

hushunghung avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.