• Arguments->VM arguments: -Dlog4j.configuration=file:debug-log4j.properties
o Click on Apply, then click on Run, and then view the Console tab of the parent window
o Wait for execution to complete and for the results to appear in the Console tab.

Naive Bayes (NB) inference on the Iris dataset does the following:
Opens a connection to the iris table
Loads the iris classifications from bayesiancounters-site.xml into local memory
Moves columns in HBase between tiers (e.g. T5MIN, T30MIN, etc.) while computing scores and tracking parent and child counts
The scoring logic is derived from naive Bayes classifier theory
The resulting scores, counts and probabilities are printed to standard output
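
As a rough illustration of how naive Bayes probabilities can be derived from parent and child counts, the sketch below treats the parent count as the number of rows seen for a class and each child count as the number of those rows that also match one observed feature value. This is not the bcounts implementation: the class name NaiveBayesCountsSketch, the posteriors method, the add-one smoothing and the sample counts are all assumptions made for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch, not part of bcounts: naive Bayes posteriors from counts.
public class NaiveBayesCountsSketch {

    /**
     * parentCounts: class label -> number of rows seen with that class.
     * childCounts:  class label -> one count per observed feature value, i.e.
     *               rows of that class that also match the feature value.
     * valuesPerFeature: assumed number of distinct values per feature, used
     *               only for add-one smoothing (an assumption of this sketch).
     * Assumes at least one parent count is greater than zero.
     */
    public static Map<String, Double> posteriors(Map<String, Long> parentCounts,
                                                 Map<String, long[]> childCounts,
                                                 int valuesPerFeature) {
        long total = 0;
        for (long c : parentCounts.values()) {
            total += c;
        }

        // Work in log space so that many small factors do not underflow.
        Map<String, Double> logScores = new LinkedHashMap<>();
        double max = Double.NEGATIVE_INFINITY;
        for (Map.Entry<String, Long> e : parentCounts.entrySet()) {
            long parent = e.getValue();
            double logScore = Math.log((double) parent / total); // class prior
            for (long child : childCounts.get(e.getKey())) {
                // Smoothed estimate of P(feature value | class).
                logScore += Math.log((child + 1.0) / (parent + valuesPerFeature));
            }
            logScores.put(e.getKey(), logScore);
            max = Math.max(max, logScore);
        }

        // Normalize the scores so the probabilities sum to 1.
        double norm = 0.0;
        for (double s : logScores.values()) {
            norm += Math.exp(s - max);
        }
        Map<String, Double> result = new LinkedHashMap<>();
        for (Map.Entry<String, Double> e : logScores.entrySet()) {
            result.put(e.getKey(), Math.exp(e.getValue() - max) / norm);
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up counts for illustration only; not real Iris statistics.
        Map<String, Long> parents = new LinkedHashMap<>();
        parents.put("setosa", 50L);
        parents.put("versicolor", 50L);
        Map<String, long[]> children = new LinkedHashMap<>();
        children.put("setosa", new long[] {40, 45});
        children.put("versicolor", new long[] {5, 10});
        posteriors(parents, children, 3)
                .forEach((label, p) -> System.out.printf("%s %.4f%n", label, p));
    }
}
```

The class with the largest normalized value is the predicted label; the normalized values themselves correspond to the probabilities mentioned above.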

The probabilities output by scoring can be used directly in more complex decision-making algorithms based on benefit/loss analysis.
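
One common form of benefit/loss analysis is to pick the action with the highest expected benefit under the class probabilities. The sketch below shows that idea only; the class name BenefitLossSketch, the bestAction method and the benefit table are hypothetical and are not part of bcounts.

```java
// Hypothetical sketch, not part of bcounts: choose an action by expected benefit.
public class BenefitLossSketch {

    /**
     * classProbabilities[c]: probability that the true class is c.
     * benefit[a][c]: payoff (negative for a loss) of taking action a when the
     *                true class is c.
     * Returns the index of the action with the highest expected benefit.
     */
    public static int bestAction(double[] classProbabilities, double[][] benefit) {
        int best = -1;
        double bestValue = Double.NEGATIVE_INFINITY;
        for (int a = 0; a < benefit.length; a++) {
            double expected = 0.0;
            for (int c = 0; c < classProbabilities.length; c++) {
                expected += classProbabilities[c] * benefit[a][c];
            }
            if (expected > bestValue) {
                bestValue = expected;
                best = a;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Illustrative numbers only: class 0 is likely, but acting on it is
        // very costly when the true class turns out to be class 1.
        double[] probabilities = {0.9, 0.1};
        double[][] benefit = {
            {1.0, -20.0}, // action 0: act as if the class is 0
            {0.0, 0.0}    // action 1: do nothing
        };
        System.out.println("best action index: " + bestAction(probabilities, benefit));
    }
}
```

With a benefit table like this, the chosen action can differ from the most probable class, which is why having the probabilities rather than only the predicted label is useful.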

6.30

Perform clique scoring with random projections
o Run->Run Configurations…
o Java Application-> (right click) -> New
o Click on the New_configuration to edit its settings on the right
o Configure the runtime options as:
• Name: CliqueRandom
• Main->Project: bcounts
• Main->Main class: com.cloudera.bayesiancounters.util.Driver
• Arguments->Program arguments: cr iris 300 2 3
• Arguments->VM arguments: -Dlog4j.configuration=file:debug-log4j.properties
o Click on Apply, then click on Run, and then view the Console tab of the parent window
o Wait for execution to complete and for the results to appear in the Console tab.

Clique scoring can be used to perform variable importance analysis or for emerging trend identification.

6.31

Create a small delta of the ad.data file (its first and last lines)
su poulin
head -n 1 /home/poulin/bcounts-0.1.0-SNAPSHOT/examples/data/ad.data > /tmp/ad.small
tail -n 1 /home/poulin/bcounts-0.1.0-SNAPSHOT/examples/data/ad.data >> /tmp/ad.small

6.32

Load Ad data into HBase via Eclipse
o Run->Run Configurations…
o Java Application-> (right click) -> New
o Click on the New_configuration to edit its settings on the right