Article category - R
R language: technical learning and exchange
Summary: An update to Microsoft R Client, Microsoft's distribution of open source R with additional proprietary packages, including RevoScaleR (for data analy
Summary: In this fourth post of my series on Deep Learning from first principles in Python, R and Octave, I explore the details of creating a multi-class
Summary: Just in Time for Baseball. The mlbgameday package has just reached the milestone of version 0.1.0. Designed to facilitate extract, transform and load f
Summary: If you've ever wanted to play around with big data sets in a Spark cluster from R with the sparklyr package, but haven't gotten started because setting
Summary: Previously we’ve covered the basics of exogenous variables in smooth functions. Today we will go slightly crazy and discuss automatic variables select
Summary: In this blog post, I am going to train a random forest on census data from the US to predict the probability that someone is looking for a job. To thi
Summary: In order to stay up to date, I try to follow Jeremy Howard on a regular basis. In one of his recent videos, he shows how to use embeddings for categor
Summary: BNOSAC is happy to announce the release of the udpipe R package (https://bnosac.github.io/udpipe/en) which is a Natural Language Processing toolkit th
Summary: “What does the world outside your head really ‘look’ like? Not only is there no color, there’s also no sound: the compression and expansion of air is
Summary: In a number of upcoming posts, I'll be analyzing an interesting dataset I found on Kaggle. The dataset contains information on 18,393 music reviews fr
Summary: Introduction. Market Basket Analysis, or association rules mining, can be a very useful technique to gain insights in transactional data sets, and it can
Summary: Which layout of an advertisement leads to more clicks? Would a different color or position of the purchase button lead to a higher conversion rate? Do
Summary: I developed a tiny toy package, meme, which is now on CRAN. As its name indicates, it was designed to create memes, which are captioned photos that a
Summary: R is an incredible tool for reproducible research. In the present series of blog posts I want to show how one can easily acquire data within an R sess
Summary: At a glance: I explore half a million rows of disaggregated crash data for New Zealand, and along the way illustrate geo-spatial projections, maps, fo
Summary: One of the assumptions of the Classical Linear Regression Model is that there is no exact collinearity between the explanatory variables. If the explanato
Summary: This post explains how to use R to automatically write and send emails based on automatically computed analyses (yep, everything automated). This mean
Summary: In the last post, we focused on the preparation of a tidy dataset describing consumer perceptions of beverages. In this post, I'll describe some analy
Summary: I have been playing around with spatial modelling in the R INLA package. This blog just records a few thoughts I have had about using INLA for kriging
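Summary: 1. Background. CTR prediction, that is, ad click-through rate prediction (Click-Through Rate Prediction), is a key component of internet computational advertising, and its accuracy directly affects a company's advertising revenue. The most widely used model in CTR prediction is LR (Logistic Regression) [1]. LR is a generalized linear model; compared with traditional linear models, LR uses the Logit tran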