﻿<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/"><channel><title>博客园-First we try, then we trust-文章分类-ICTCLAS</title><link>http://www.cnblogs.com/zhenyulu/category/85598.html</link><description /><language>zh-cn</language><lastBuildDate>Sat, 17 May 2008 03:14:26 GMT</lastBuildDate><pubDate>Sat, 17 May 2008 03:14:26 GMT</pubDate><ttl>60</ttl><item><title>SharpICTCLAS分词系统简介(9)词库扩充</title><link>http://www.cnblogs.com/zhenyulu/articles/718375.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Wed, 18 Apr 2007 07:46:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/718375.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/718375.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/718375.html#Feedback</comments><slash:comments>4</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/718375.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/718375.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 1、SharpICTCLAS中词库的扩充如果对SharpICTCLAS目前词库不满意的化，可以考虑扩充现有词库。扩充方法非常简单，代码如下：CopyCode词库扩充st...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/718375.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/718375.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-04-18 15:46 <a href="http://www.cnblogs.com/zhenyulu/articles/718375.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(8)其它</title><link>http://www.cnblogs.com/zhenyulu/articles/675218.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Wed, 14 Mar 2007 15:10:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/675218.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/675218.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/675218.html#Feedback</comments><slash:comments>5</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/675218.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/675218.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 前文对SharpICTCLAS中的一些主要内容做了介绍，本文介绍一下SharpICTCLAS中一些其它考虑，包括事件机制以及如何使用SharpICTCLAS。1、SharpICTCLAS中的事件...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/675218.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/675218.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-14 23:10 <a href="http://www.cnblogs.com/zhenyulu/articles/675218.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(7)OptimumSegment</title><link>http://www.cnblogs.com/zhenyulu/articles/675217.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Wed, 14 Mar 2007 15:09:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/675217.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/675217.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/675217.html#Feedback</comments><slash:comments>2</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/675217.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/675217.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 上一篇文章说到经过NShortPath计算后，我们得到了数个候选分词方案，那么这么多个候选分词方案是如何最终成为一个分词结果的呢？其实这个过程是靠OptimumSegment完成的。SharpICTC...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/675217.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/675217.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-14 23:09 <a href="http://www.cnblogs.com/zhenyulu/articles/675217.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(6)Segment</title><link>http://www.cnblogs.com/zhenyulu/articles/673650.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Tue, 13 Mar 2007 14:24:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/673650.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/673650.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/673650.html#Feedback</comments><slash:comments>10</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/673650.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/673650.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: DynamicArray与NShortPath是ICTCLAS中的基础类，本人在完成了基础改造工作后，就着手开始对Segment分词进行移植与改造。SharpICTCLAS中的改造主要体现在以下几方面...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/673650.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/673650.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-13 22:24 <a href="http://www.cnblogs.com/zhenyulu/articles/673650.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(5)NShortPath-2</title><link>http://www.cnblogs.com/zhenyulu/articles/672442.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Mon, 12 Mar 2007 14:42:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/672442.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/672442.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/672442.html#Feedback</comments><slash:comments>2</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/672442.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/672442.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 在了解了1-最短路径的计算方式后，我们看看N-最短路径的计算。N-最短路径的计算方式与1-最短路径基本相同，只是在记录所有可达路径时，要保留最短的前N个结果。让我们仍然以上篇文章的案例来看看如何实...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/672442.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/672442.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-12 22:42 <a href="http://www.cnblogs.com/zhenyulu/articles/672442.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(4)NShortPath-1</title><link>http://www.cnblogs.com/zhenyulu/articles/669795.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Fri, 09 Mar 2007 14:47:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/669795.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/669795.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/669795.html#Feedback</comments><slash:comments>12</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/669795.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/669795.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: N-最短路径中文词语粗分是分词过程中非常重要的一步，而原有ICTCLAS中该部分代码也是我认为最难读懂的部分，到现在还有一些方法没有弄明白，因此我几乎重写了NShortPath类。要想说明N-最短路径...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/669795.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/669795.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-09 22:47 <a href="http://www.cnblogs.com/zhenyulu/articles/669795.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(3)DynamicArray</title><link>http://www.cnblogs.com/zhenyulu/articles/668695.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Thu, 08 Mar 2007 15:13:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/668695.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/668695.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/668695.html#Feedback</comments><slash:comments>6</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/668695.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/668695.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 从前文可以看出，ICTCLAS中DynamicArray类在初步分词过程中起到了至关重要的所用，而ICTCLAS中DynamicArray类的实现比较复杂，可以说是包罗万象，在一个GetElement...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/668695.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/668695.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-08 23:13 <a href="http://www.cnblogs.com/zhenyulu/articles/668695.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(2)初步分词</title><link>http://www.cnblogs.com/zhenyulu/articles/668035.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Thu, 08 Mar 2007 06:27:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/668035.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/668035.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/668035.html#Feedback</comments><slash:comments>5</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/668035.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/668035.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: ICTCLAS初步分词包括：1）原子切分；2）找出原子之间所有可能的组词方案；3）N-最短路径中文词语粗分三步。例如：&#8220;他说的确实在理&#8221;这句话。1）原子切分的目的是完成...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/668035.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/668035.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-08 14:27 <a href="http://www.cnblogs.com/zhenyulu/articles/668035.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>SharpICTCLAS分词系统简介(1)读取词典库</title><link>http://www.cnblogs.com/zhenyulu/articles/668024.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Thu, 08 Mar 2007 06:25:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/668024.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/668024.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/668024.html#Feedback</comments><slash:comments>6</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/668024.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/668024.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: ICTCLAS分词的总体流程包括：1）初步分词；2）词性标注；3）人名、地名识别；4）重新分词；5）重新词性标注这五步。就第一步分词而言，又细分成：1）原子切分；2）找出原子之间所有可能的组词方案；3...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/668024.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/668024.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-08 14:25 <a href="http://www.cnblogs.com/zhenyulu/articles/668024.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>实现ICTCLAS到C#平台的移植</title><link>http://www.cnblogs.com/zhenyulu/articles/667359.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Wed, 07 Mar 2007 14:38:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/667359.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/667359.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/667359.html#Feedback</comments><slash:comments>11</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/667359.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/667359.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 在研究了一段时间中科院计算所张华平、刘群所开发的ICTCLAS分词系统（Free版）代码后，阅读了大量的相关资料，我开始着手将C++的ICTCLAS分词系统移植到.net平台下，并取得了较好的实验结果...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/667359.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/667359.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-03-07 22:38 <a href="http://www.cnblogs.com/zhenyulu/articles/667359.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>天书般的ICTCLAS分词系统代码（二）</title><link>http://www.cnblogs.com/zhenyulu/articles/657017.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Mon, 26 Feb 2007 05:27:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/657017.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/657017.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/657017.html#Feedback</comments><slash:comments>8</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/657017.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/657017.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: 上篇文章《天书般的ICTCLAS分词系统代码（一）》说了说ICTCLAS分词系统有些代码让人无所适从，需要好一番努力才能弄明白究竟是怎么回事。尽管有很多人支持应当写简单、清晰的代码，但也有人持不同意...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/657017.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/657017.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-02-26 13:27 <a href="http://www.cnblogs.com/zhenyulu/articles/657017.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>天书般的ICTCLAS分词系统代码（一）</title><link>http://www.cnblogs.com/zhenyulu/articles/653254.html</link><dc:creator>吕震宇</dc:creator><author>吕震宇</author><pubDate>Tue, 20 Feb 2007 16:24:00 GMT</pubDate><guid>http://www.cnblogs.com/zhenyulu/articles/653254.html</guid><wfw:comment>http://www.cnblogs.com/zhenyulu/comments/653254.html</wfw:comment><comments>http://www.cnblogs.com/zhenyulu/articles/653254.html#Feedback</comments><slash:comments>30</slash:comments><wfw:commentRss>http://www.cnblogs.com/zhenyulu/comments/commentRss/653254.html</wfw:commentRss><trackback:ping>http://www.cnblogs.com/zhenyulu/services/trackbacks/653254.html</trackback:ping><description><![CDATA[&nbsp;&nbsp;&nbsp;&nbsp; 摘要: ICTCLAS分词系统是由中科院计算所的张华平、刘群所开发的一套获得广泛好评的分词系统，该版的Free版开放了源代码，为初学者提供了宝贵的学习材料。我们可以在&#8220;http://sewm.pk...&nbsp;&nbsp;<a href='http://www.cnblogs.com/zhenyulu/articles/653254.html'>阅读全文</a><img src ="http://www.cnblogs.com/zhenyulu/aggbug/653254.html?type=2" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://zhenyulu.cnblogs.com/" target="_blank">吕震宇</a> 2007-02-21 00:24 <a href="http://www.cnblogs.com/zhenyulu/articles/653254.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item></channel></rss>