代码改变世界

网络爬虫 2

2012-07-20 16:24  youxin  阅读(262)  评论(0)    收藏  举报

  java 获取一个网页的源代码:

import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.URL;

public class Main {
     public static void main(String[] args)  {
            try {
                URL my_url = new URL("http://www.schoolbaike.com/smashwork");
                BufferedReader br = new BufferedReader(new InputStreamReader(my_url.openStream()));
                String strTemp = "";
                while(null != (strTemp = br.readLine())){
                System.out.println(strTemp);
            }
            } catch (Exception ex) {
                ex.printStackTrace();
            }
        }
}
 

InputStream openStream()   Opens a connection to this URL and returns an InputStream for reading from that connection. This method is a shorthand for:    openConnection().getInputStream().