文章分类 -  R

R语言技术学习交流
摘要:An update to Microsoft R Client, Microsoft's distribution of open source R with additional proprietary packages — including RevoScaleR (for data analy 阅读全文
posted @ 2018-03-01 10:49 payton数据之旅 阅读(222) 评论(0) 推荐(0)
摘要:In this 4th post of my series on Deep Learning from first principles in Python, R and Octave – Part 4, I explore the details of creating a multi-class 阅读全文
posted @ 2018-03-01 10:42 payton数据之旅 阅读(227) 评论(0) 推荐(0)
摘要:Just In Time For Baseball The mlbgameday package has just reached the milestone of version 0.1.0. Designed to facilitate extract, transform and load f 阅读全文
posted @ 2018-03-01 10:41 payton数据之旅 阅读(230) 评论(0) 推荐(0)
摘要:If you've ever wanted to play around with big data sets in a Spark cluster from R with thesparklyr package, but haven't gotten started because setting 阅读全文
posted @ 2018-02-27 09:56 payton数据之旅 阅读(179) 评论(0) 推荐(0)
摘要:Previously we’ve covered the basics of exogenous variables in smooth functions. Today we will go slightly crazy and discuss automatic variables select 阅读全文
posted @ 2018-02-12 17:39 payton数据之旅 阅读(211) 评论(0) 推荐(0)
摘要:In this blog post, I am going to train a random forest on census data from the US to predict the probability that someone is looking for a job. To thi 阅读全文
posted @ 2018-02-12 09:43 payton数据之旅 阅读(184) 评论(0) 推荐(0)
摘要:In order to stay up to date, I try to follow Jeremy Howard on a regular basis. In one of his recent videos, he shows how to use embeddings for categor 阅读全文
posted @ 2018-01-29 10:11 payton数据之旅 阅读(203) 评论(0) 推荐(0)
摘要:BNOSAC is happy to announce the release of the udpipe R package (https://bnosac.github.io/udpipe/en) which is a Natural Language Processing toolkit th 阅读全文
posted @ 2018-01-16 09:57 payton数据之旅 阅读(336) 评论(0) 推荐(0)
摘要:“What does the world outside your head really ‘look’ like? Not only is there no color, there’s also no sound: the compression and expansion of air is 阅读全文
posted @ 2018-01-12 09:59 payton数据之旅 阅读(291) 评论(0) 推荐(0)
摘要:In a number of upcoming posts, I'll be analyzing an interesting dataset I found on Kaggle. The dataset contains information on 18,393 music reviews fr 阅读全文
posted @ 2017-12-08 10:04 payton数据之旅 阅读(207) 评论(0) 推荐(0)
摘要:Introduction Market Basket Analysis or association rules mining can be a very useful technique to gain insights in transactional data sets, and it can 阅读全文
posted @ 2017-11-24 10:22 payton数据之旅 阅读(313) 评论(0) 推荐(0)
摘要:Which layout of an advertisement leads to more clicks? Would a different color or position of the purchase button lead to a higher conversion rate? Do 阅读全文
posted @ 2017-11-23 15:04 payton数据之旅 阅读(207) 评论(0) 推荐(0)
摘要:I developed a tiny toy package, meme, which is now on CRAN. As it’s name indicated, it was designed to create memes, which are captioned photos that a 阅读全文
posted @ 2017-10-26 11:17 payton数据之旅 阅读(428) 评论(0) 推荐(0)
摘要:R is an incredible tool for reproducible research. In the present series of blog posts I want to show how one can easily acquire data within an R sess 阅读全文
posted @ 2017-10-18 10:41 payton数据之旅 阅读(233) 评论(0) 推荐(0)
摘要:At a glance: I explore half a million rows of disaggregated crash data for New Zealand, and along the way illustrate geo-spatial projections, maps, fo 阅读全文
posted @ 2017-10-16 10:42 payton数据之旅 阅读(119) 评论(0) 推荐(0)
摘要:One of the assumptions of Classical Linear Regression Model is that there is no exact collinearity between the explanatory variables. If the explanato 阅读全文
posted @ 2017-09-30 09:26 payton数据之旅 阅读(818) 评论(0) 推荐(0)
摘要:This post explains how to use R to automatically write and send emails based on automatically computed analyses (yep, everything automated). This mean 阅读全文
posted @ 2017-09-12 10:01 payton数据之旅 阅读(439) 评论(0) 推荐(0)
摘要:In the last post, we focused on the preparation of a tidy dataset describing consumer perceptions of beverages. In this post, I'll describe some analy 阅读全文
posted @ 2017-09-11 09:50 payton数据之旅 阅读(193) 评论(0) 推荐(0)
摘要:I have been playing around with spatial modelling in the R INLA package. This blog just records a few thoughts I have had about using INLA for kriging 阅读全文
posted @ 2017-09-08 10:14 payton数据之旅 阅读(253) 评论(0) 推荐(0)
摘要:1、背景 CTR预估,广告点击率(Click-Through Rate Prediction)是互联网计算广告中的关键环节,预估准确性直接影响公司广告收入。CTR预估中用的最多的模型是LR(Logistic Regression)[1],LR是广义线性模型,与传统线性模型相比,LR使用了Logit变 阅读全文
posted @ 2017-09-05 10:07 payton数据之旅 阅读(309) 评论(0) 推荐(0)