摘要: 6.10Exercises 练习 ☼ Read up on one of the language technologies mentioned in this section, such as word sense disambiguation, semantic role labeling, question answering, machine translation, named entity detection. Find out what type and quantity of annotated data is required for... 阅读全文
posted @ 2011-09-05 23:33 牛皮糖NewPtone 阅读(1344) 评论(0) 推荐(0) 编辑
摘要: 6.9Further Reading深入阅读 Please consult http://www.nltk.org/ for further materials on this chapter and on how to install external machine learning packages, such as Weka, Mallet, TADM, and MEGAM. For more examples of classification and machine learning with NLTK, please see the classification HOWTOs . 阅读全文
posted @ 2011-09-05 23:30 牛皮糖NewPtone 阅读(659) 评论(0) 推荐(0) 编辑
摘要: 6.8Summary小结 Modeling the linguistic data found in corpora can help us to understand linguistic patterns, and can be used to make predictions about new language data. 建模语料库中的语言数据可以帮助我们理解语言模型,并且可以用于进行关于新语言数据的预测。 Supervised classifiers use labeled training corpora to build models tha... 阅读全文
posted @ 2011-09-05 23:28 牛皮糖NewPtone 阅读(656) 评论(0) 推荐(0) 编辑