Node.js web scraper

1. Install cheerio

npm install cheerio

2. Load the page's HTML source

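const https = require('https');

// Download the page at `url` and pass its HTML to `callback`.
// On any failure, `callback` receives an empty string.
function LoadHtml(url, callback) {
    https.get(url, (res) => {
        const chunks = [];
        res.on('data', (chunk) => {
            chunks.push(chunk);
        });
        res.on('end', () => {
            // Join the raw chunks before decoding so that multi-byte
            // characters split across chunks are decoded correctly.
            const html = Buffer.concat(chunks).toString();
            if (callback) {
                callback(html);
            }
        });
        res.on('error', () => {
            if (callback) {
                callback('');
            }
        });
    }).on('error', () => {
        // Request-level failures (DNS lookup, refused connection) are
        // emitted on the request object, not on the response.
        if (callback) {
            callback('');
        }
    });
}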
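For use with async/await, the same download step can be wrapped in a Promise. The block below is only a sketch: `fetchHtml` is a hypothetical name that does not appear in the original post, and the choice to reject on non-2xx status codes and to fall back to the `http` module for `http:` URLs are assumptions of this example, not part of `LoadHtml`.

```javascript
const http = require('http');
const https = require('https');

// Hypothetical helper (not from the original post): a Promise-based
// variant of LoadHtml that reports failures instead of returning ''.
function fetchHtml(url) {
    return new Promise((resolve, reject) => {
        // An invalid URL throws here, which rejects the Promise.
        const client = new URL(url).protocol === 'http:' ? http : https;
        const req = client.get(url, (res) => {
            if (res.statusCode < 200 || res.statusCode >= 300) {
                res.resume(); // discard the body so the socket is released
                reject(new Error(`Request failed with status ${res.statusCode}`));
                return;
            }
            const chunks = [];
            res.on('data', (chunk) => chunks.push(chunk));
            res.on('end', () => resolve(Buffer.concat(chunks).toString('utf8')));
            res.on('error', reject);
        });
        // Network-level errors are emitted on the request object.
        req.on('error', reject);
    });
}
```

A caller would then write something like fetchHtml(url).then((html) => { ... }).catch((err) => { ... }), so a failed download can be told apart from a page that is genuinely empty.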

3. Extract node data

const cheerio = require('cheerio');
let $ = cheerio.load(html);
let node = $('tagName.className'); // for example, $('div.title')

posted @ 2024-10-23 13:47  游戏鼻祖