Post category: Machine Learning with Python

Summary: The dataset was acquired from https://www.kaggle.com/c/titanic. For data preprocessing, I first defined three transformers: DataFrameSelector: Select ... Read more
posted @ 2019-01-04 06:50 Sherrrry Views (806) Comments (0) Recommendations (0)
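
The summary above is cut off before it says what DataFrameSelector does. A minimal sketch, assuming it selects a list of named columns from a DataFrame so that later Scikit-Learn steps receive a plain array; the constructor argument `attribute_names` and the sample values are hypothetical, not taken from the post:

```python
import pandas as pd
from sklearn.base import BaseEstimator, TransformerMixin


class DataFrameSelector(BaseEstimator, TransformerMixin):
    """Select a subset of DataFrame columns and return them as a NumPy array."""

    def __init__(self, attribute_names):
        # attribute_names is an assumed parameter name: the columns to keep.
        self.attribute_names = attribute_names

    def fit(self, X, y=None):
        # Nothing to learn; the transformer is stateless.
        return self

    def transform(self, X):
        return X[self.attribute_names].to_numpy()


# Tiny frame with Titanic-style column names (illustrative values only).
passengers = pd.DataFrame({
    "Age": [22.0, 38.0, 26.0],
    "Fare": [7.25, 71.28, 7.93],
    "Sex": ["male", "female", "female"],
})

numeric_values = DataFrameSelector(["Age", "Fare"]).fit_transform(passengers)
print(numeric_values.shape)
```

Inheriting from `TransformerMixin` provides `fit_transform()` for free, which is what lets a class like this sit inside a Scikit-Learn `Pipeline`.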
Summary: In the previous article, "Data Preparation by Pandas and Scikit-Learn", we discussed a series of steps in data preparation. Scikit-Learn provides t ... Read more
posted @ 2019-01-02 09:40 Sherrrry Views (249) Comments (0) Recommendations (0)
Summary: In this article, we discuss some of the main steps in data preparation. Drop Labels: First, we drop the labels from the training set. Here we use the drop() method in Pandas ... Read more
posted @ 2019-01-02 09:34 Sherrrry Views (641) Comments (0) Recommendations (0)
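
A small sketch of the label-dropping step described above. The frame, its column names, and the choice of `price` as the label are made up for illustration:

```python
import pandas as pd

# Hypothetical training set; "price" plays the role of the label column.
train_set = pd.DataFrame({
    "rooms": [3, 5, 2],
    "income": [4.2, 8.1, 2.9],
    "price": [210_000, 450_000, 130_000],
})

# drop() returns a new DataFrame by default, so train_set itself is unchanged.
features = train_set.drop("price", axis=1)

# Keep the labels separately so they can be passed to fit() later.
labels = train_set["price"].copy()

print(list(features.columns))
```

Because `drop()` does not modify the original frame unless `inplace=True` is passed, the features and labels can be split without losing the raw training data.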
Summary: Train the model: Show the parameters of the best model: Show the score on the training set: Fit on the test set: Show the score on the test set: More about GridSearchCV: https:/ ... Read more
posted @ 2019-01-02 05:06 Sherrrry Views (234) Comments (0) Recommendations (0)
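
The code for the steps listed above is not part of this summary. A minimal sketch of the same sequence with GridSearchCV, assuming a k-nearest-neighbors classifier on the bundled iris data; the estimator, dataset, and parameter grid are stand-ins, not the post's own choices:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import GridSearchCV, train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y
)

# Train the model: try each candidate value with 5-fold cross-validation.
param_grid = {"n_neighbors": [3, 5, 7]}
search = GridSearchCV(KNeighborsClassifier(), param_grid, cv=5)
search.fit(X_train, y_train)

# Parameters of the best model found by the search.
print(search.best_params_)

# Score on the training set (uses the refitted best estimator).
print(search.score(X_train, y_train))

# Apply the best model to the held-out test set and score it.
test_predictions = search.predict(X_test)
print(search.score(X_test, y_test))
```

With the default `refit=True`, the search object retrains the best configuration on the whole training set, so `predict()` and `score()` can be called on it directly.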
Summary: Before you can plot anything, you need to specify which backend Matplotlib should use. The simplest option is to use Jupyter’s magic command %matplotl ... Read more
posted @ 2018-12-29 05:06 Sherrrry Views (464) Comments (0) Recommendations (0)
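
The magic command mentioned above only works inside Jupyter. As a rough sketch of the same idea in a plain Python script, the backend can be set explicitly with `matplotlib.use()`; the choice of the non-interactive "Agg" backend and the sample data are assumptions made for this example:

```python
import io

import matplotlib

# Choose the backend before creating any figure. "Agg" renders to image
# buffers or files and needs no display.
matplotlib.use("Agg")

import matplotlib.pyplot as plt

fig, ax = plt.subplots()
ax.plot([0, 1, 2, 3], [0, 1, 4, 9])
ax.set_xlabel("x")
ax.set_ylabel("x squared")

# Render the figure as PNG bytes in memory instead of opening a window.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
plt.close(fig)

print(matplotlib.get_backend())
```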
Summary: Here I list some useful functions in Python for getting familiar with your data. As an example, we load a dataset named housing which ... Read more
posted @ 2018-12-29 04:37 Sherrrry Views (113) Comments (0) Recommendations (0)
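
The summary does not say which functions the post covers. As a sketch, these are pandas methods commonly used for a first look at a DataFrame; the small `housing` frame and its column names are invented here and are not the dataset from the post:

```python
import pandas as pd

# Stand-in data; the real housing dataset is not reproduced here.
housing = pd.DataFrame({
    "median_income": [8.3, 7.2, 5.6, 3.8],
    "median_house_value": [452600, 358500, 341300, 226700],
    "ocean_proximity": ["NEAR BAY", "INLAND", "NEAR BAY", "INLAND"],
})

# First rows, to see what the columns look like.
print(housing.head())

# Column types and non-null counts (info() prints directly).
housing.info()

# Summary statistics for the numeric columns.
print(housing.describe())

# How often each category occurs in a text column.
proximity_counts = housing["ocean_proximity"].value_counts()
print(proximity_counts)
```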
Summary: Using the Pandas Library: The simplest way is to read data from .csv files and store it as a DataFrame object: You can also read .xls files and directly s ... Read more
posted @ 2018-12-29 04:18 Sherrrry Views (200) Comments (0) Recommendations (0)
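
A short sketch of reading CSV data into a DataFrame, as described above. To keep the example self-contained it reads from an in-memory string; the column names and values are made up:

```python
import io

import pandas as pd

# read_csv() accepts a file path or any file-like object; a StringIO
# stands in for a real .csv file here.
csv_text = "name,age\nAda,36\nLinus,29\n"
people = pd.read_csv(io.StringIO(csv_text))

print(people)
print(people.dtypes)
```

For a file on disk, the call would take the path instead, for example `pd.read_csv("data.csv")` with a hypothetical file name. Spreadsheet files are read in a similar way with `pd.read_excel()`, which needs an optional Excel engine package installed.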