Summary: I have started a new book, The Art of R Programming by Norman Matloff, and I'm really digging it. I won't blog about each chapter the way I did with Machine Learning for Hackers, but I did come across something I thought would make good blog material. In Chapter 8 (Doing Math and Simulations in R), Matloff ... Read more
posted @ 2013-06-11 15:07 jinyulogin Views (136) Comments (0) Recommendations (0)
Summary: Customer segmentation. Output Variable Definition. New field: Level (output variable): • VIP: the top 20% of customers by value. • NOT VIP: the remaining 80% of customers, with lower value. 1. VIP customer. Step 1: Generate association rule model: tran.VIP<-rc[rc$Level== ... Read more
posted @ 2013-06-10 01:43 jinyulogin Views (800) Comments (0) Recommendations (0)
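The Level definition in the customer segmentation entry above could be sketched in R roughly as follows. This is only an illustration under assumptions: the `customers` data frame, its `value` column, and the use of `quantile()` to find the 80th-percentile cut-off are hypothetical, since the excerpt does not show how the original post computed Level.

```r
# Hypothetical toy data: 10 customers with a made-up "value" column.
customers <- data.frame(
  id    = 1:10,
  value = c(5, 50, 12, 90, 33, 7, 61, 48, 20, 75)
)

# Assumed rule: customers strictly above the 80th percentile of value are VIP,
# everyone else is NOT VIP.
cutoff <- quantile(customers$value, probs = 0.80)
customers$Level <- ifelse(customers$value > cutoff, "VIP", "NOT VIP")

# Count how many customers fall in each level.
table(customers$Level)
```

With real data, ties at the cut-off may push the VIP share slightly above or below 20%, so the comparison operator is a design choice worth checking.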
Summary: Credit Card Purchases. Data Set. Data download link: card.csv. Number of records: 9999. 28 variables can be used for decision tree generation. 28 input variables: Airlines ~ Travel. Step 1: Read Data (read the data and drop columns 1-9): file.choose(); rc<-read.csv("card.csv",header=TRUE) ... Read more
posted @ 2013-06-10 01:34 jinyulogin Views (635) Comments (0) Recommendations (0)
Summary: Credit Card Purchases. Data Set. Data download link: card.csv. Number of records: 9999. 9 variables can be used for decision tree generation. 1 output variable: Level. 5 input variables: Gender, Age, MaritalStatus, OccupationCategory, TotalTransation. Step 1: Read Data (read the data and keep columns 1-9): fil ... Read more
posted @ 2013-06-08 22:27 jinyulogin Views (873) Comments (0) Recommendations (0)
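The two Credit Card Purchases entries above differ in whether columns 1-9 are dropped or kept after reading card.csv. A minimal sketch of both subsetting forms is below. The small `rc` data frame built here is a hypothetical stand-in for the real file, and the names `rc_first9` and `rc_rest` are made up for illustration.

```r
# Hypothetical stand-in for card.csv: 3 rows, 12 made-up columns (V1..V12).
rc <- as.data.frame(matrix(1:36, nrow = 3))

# Keep only columns 1-9.
rc_first9 <- rc[, 1:9]

# Drop columns 1-9 and keep everything else (negative indices exclude).
rc_rest <- rc[, -(1:9)]

ncol(rc_first9)
ncol(rc_rest)
```

Note the parentheses in `-(1:9)`: writing `-1:9` would mean the sequence from -1 to 9, which R rejects as a mixed positive and negative index.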
Summary: Churn. Data Set. Data download link: churn.csv. Number of records: 1477. Sixteen variables can be used for decision tree generation. 1 output variable: LEAVER = 'T' if CHURNED = 'Vol', 'F' if CHURNED = 'InVol' or 'Current'. 13 input variables: LONGDIST International L ... Read more
posted @ 2013-06-08 21:54 jinyulogin Views (1718) Comments (0) Recommendations (0)
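The LEAVER rule in the Churn entry above maps one CHURNED value to 'T' and the other two to 'F'. A minimal sketch of that recoding in R is below. The four sample rows are hypothetical, and it is an assumption that CHURNED holds exactly the strings 'Vol', 'InVol', and 'Current'.

```r
# Hypothetical sample of the CHURNED column; real values would come from churn.csv.
churn <- data.frame(
  CHURNED = c("Vol", "InVol", "Current", "Vol"),
  stringsAsFactors = FALSE
)

# LEAVER is "T" for voluntary churners and "F" for everyone else.
churn$LEAVER <- ifelse(churn$CHURNED == "Vol", "T", "F")

churn
```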