Word histogram

Here is a program that reads a file and builds a histogram of the words in the file:

process_file loops through the lines of the file, passing them one at a time to process_line. The histogram h is being used as an accumulator. process_line uses the string method replace to replace hyphens with spaces before using split to break the line into a list of strings. It traverses the list of words and uses strip and lower to remove punctuation and convert to lower case. (It is a shorthand to say that strings are ‘converted;’ remember that string are immutable, so methods like strip and lower return new strings.)

Finally, process_line updates the histogram by creating a new item incrementing an existing one. To count the total number of words in the file, we can add up the frequencies in the histogram:

from Thinking in Python

posted @ 2014-08-17 15:09 平静缓和用胸音说爱阅读(234) 评论(0) 收藏举报

刷新页面返回顶部

一切很好不缺烦恼

Less is more

Word histogram

一切很好 不缺烦恼

Less is more

Word histogram

一切很好不缺烦恼