6.35

Edit /home/poulin/bcounts-0.1.0-SNAPSHOT/bin/sp_schema.py
Change from: if len(sys.argv) Change to: if len(sys.argv) if len(sys.argv) /tmp/bayesiancounters-example.xml
tail /tmp/bayesiancounters-example.xml

SP_increase
bayesiancounters.dataset.sp.col.valueset.647
-100, -40, 10, 40, 100

6.37

Convert testing files into header-less files for storing in HDFS
su poulin
cd /home/poulin/bcounts-0.1.0-SNAPSHOT/
python273 ./bin/sp_training.py /tmp/bag-of-words \
./examples/data/training_19_2004-18_2005.dat > /tmp/sp-training-file
tail -c 32 /tmp/sp-training-file

0,0,0,0,0,0,0,0,0,0,0,0,0,0,7.2

6.38

Generate a ‘scored_’ file in current directory
su poulin
cd /home/poulin/bcounts-0.1.0-SNAPSHOT/
python273 ./bin/sp_testing.py /tmp/bag-of-words ./examples/data/testing_19_2005-19_2005.dat
tail -c 32 ./scored_testing_19_2005-19_2005.dat

0,1,0,0,0,0,0,0,2,2,2,1,1,3,6,0

6.39

Create small delta of sp-training-file
su poulin
head -n 1 /tmp/sp-training-file > /tmp/sp-training.small
tail -n 1 /tmp/sp-training-file >> /tmp/sp-training.small