浙江省高等学校教师教育理论培训

微信搜索“毛凌志岗前心得”小程序

  博客园  :: 首页  :: 新随笔  :: 联系 :: 订阅 订阅  :: 管理

ReAKKT: First steps in text mining with R

First steps in text mining with R

Everyone is preparing for Christmas Eve's Dinner.  No one is calling, little email. Looks like a perfect time to start researching text mining in R :)

The problem I'm trying to solve:

  • extract keywords from multiple texts
  • try to summarize texts > sentence extraction
  • group and relate products based on their descriptions > classification / clustering
  • add relevant information to text based on similar / related text
I've started with tm package.


Then I've jumped to TextRank algorithm for keywords & sentence extraction. Seems, TextRank is not present in tm, but there is Java source code available so should be possible to call it from R.

Will need to compare TextRank to KEA. The later is implemented for R in RKEA.

Looks promising.
posted on 2012-12-04 11:38  lexus  阅读(247)  评论(0)    收藏  举报