Ruoyu Li [email protected]
Mini Project for analyzing Apache access log
- hadoopDocker: docker file for configuring hadoop cluster
- NG.java -> NG.jar
- SearchString.java -> searchstring.jar
- SearchPath.java -> searchpath.jar
- SearchSort.java -> searchsort.jar
- SearchIP.java -> searchip.jar
- SearchSIP.java -> searchsip.jar
- Source file: NG.java
- Jar file: NG.jar
Command example: bin/hadoop jar path/to/jar/NG.jar input/ output/ n
- n could be any integer
- Source file: SearchString.java
- Jar file: searchstring.jar
Command example:
bin/hadoop jar path/to/jar/searchstring.jar SearchString input/ output/
/assets/img/home-logo.png 98776
- Source file: SearchSIP.java
- Jar file: searchsip.jar
Command example:
bin/hadoop jar path/to/jar/searchsip.jar input/ output/
10.153.239.5 547
- Source file: SearchPath.java , SearchSort.java
- Jar file: searchpath.jar, searchsort.jar
Instructions: run searchpath.jar first then run searchsort.jar for ranking
Command example:
bin/hadoop jar path/to/jar/searchpath.jar SearchPath input/ output/
hdfs dfs -rm input/*
hdfs dfs -mv output/* input/
hdfs dfs -rmr output
bin/hadoop jar path/to/jar/searchsort.jar SearchSort input/ output/
hdfs dfs -cat output/*
117348 /assets/css/combined.css
- Source file: SearchIP.java , SearchSort.java
- Jar file: searchip.jar, searchsort.jar
Instructions: run searchip.jar first then run searchsort.jar for ranking
Command example:
bin/hadoop jar path/to/jar/searchip.jar input/ output/
hdfs dfs -rm input/*
hdfs dfs -mv output/* input/
hdfs dfs -rmr output/*
bin/hadoop jar path/to/jar/searchsort.jar input/ output/
hdfs dfs -cat output/*
158614 10.216.113.172