文章分类 - R
R语言技术学习交流
摘要:We all have words we love to use, and that we perhaps use too much. As an example: I have a tendency to use the same transitional statements, to the p
阅读全文
摘要:You want to understand how your customers make decisions efficiently. Today, customers more so than ever undergo an incredibly complex decision making
阅读全文
摘要:We all hate the experience of calling a service provider and being placed on hold for a very long time. Organisations that take their level of service
阅读全文
摘要:In this blog post I will discuss missing data imputation and instrumental variables regression. This is based on a short presentation I will give at m
阅读全文
摘要:Recently, I attended a presentation where the following graph was shown illustrating the response to stimulation with Thalidomide among a cohort of HI
阅读全文
摘要:The larger and more complex the business the more metrics and dimensions. One day you understand that it is impossible to track them with only your ey
阅读全文
摘要:A new update of my sjstats-package just arrived at CRAN. This blog post demontrates those functions of the sjstats-package that deal especially with B
阅读全文
摘要:Background I am enjoying the magick package at the moment. Reading through the vignette I spotted the image_morph() function. In this post I experimen
阅读全文
摘要:Since I migrated my blog from Github Pages to blogdown and Netlify, I wanted to start migrating (most of) my old posts too - and use that opportunity
阅读全文
摘要:Time series prediction (forecasting) has experienced dramatic improvements in predictive accuracy as a result of the data science machine learning and
阅读全文
摘要:Today I saw this tweet on my timeline: For those of us that just can't wait until RStudio officially supports parallel purrr in #rstats, boy have I go
阅读全文
摘要:LightGBM参数列表 建议大家在使用LightGBM前,先仔细阅读参数介绍,毕竟LightGBM还能实现很多有趣的算法如随机森林,dart以及goss,以及众多使用辅助功能。 参数介绍传送门如下: https://github.com/Microsoft/LightGBM/blob/master
阅读全文
摘要:TL;DR: Use the Vectorize() function! If you’re reading this, you’ve either encountered this problem before, or you just got to this article out of cur
阅读全文
摘要:In their 2017 book “R for Data Science“, Grolemund and Wickham state that data.table is recommended instead of dplyr when working with larger datasets
阅读全文
摘要:At a glance: A negative binomial model isn't adequate for modelling the number of people killed per firearm incident in the USA; the real data has mor
阅读全文
摘要:Regression techniques are one of the most popular statistical techniques used for predictive modeling and data mining tasks. On average, analytics pro
阅读全文
摘要:Introduction The key to getting better at deep learning (or many fields) is practice. Practice on variety of problems – from image processing to speec
阅读全文
摘要:Discriminant analysis is used when the variable to be predicted is categorical in nature. This analysis requires that the way to define data points to
阅读全文
摘要:@drsimonj here to help you embark on git repo analyses! Ever wondered who contributes to git repos? How their contributions have changed over time? Wh
阅读全文
摘要:Data that’s been only partially observed I’ve been updating my skills in fitting models to truncated data and was pleased to find that, like so much e
阅读全文