python爬取某瓣top250 demo 輸出到html

原創

2018-12-28 15:05

import re
import urllib.request;
from bs4 import BeautifulSoup


url = 'https://movie.douban.com/top250?start=';
fout = open('douban250.html','w',encoding='utf-8');
fout.write("<html>")
fout.write("<head>")
fout.write("<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\">")
fout.write("<title>豆瓣")
fout.write("</title>")
fout.write("</head>")
fout.write("<body>")

fout.write("<table border = '1'>")
for pageNum in range(10):
    page = (pageNum*25);
    #print(url+str(page));
    resp = urllib.request.urlopen(url+str(page))
    doc = resp.read();
    soup = BeautifulSoup(doc,'html.parser',from_encoding='utf-8')
    card = soup.find('ol',class_='grid_view')
    items = card.find_all('div',class_= 'item');
    for item in items:
        pics = item.find_all('div',class_= 'pic')
        for pic in pics:
            index = pic.find('em');
            a = pic.find('a');
            href = a.get('href')
            img = pic.find('img');
            name = img.get('alt');
            src = img.get('src');
            #print(index.get_text(),href,name)
            print(index.get_text(),name)
            fout.write("<tr>")
            fout.write("<td>")
            fout.write(index.get_text())
            fout.write("</td>")
            fout.write("<td>")
            fout.write(name)
            fout.write("</td>")
            fout.write("<td>")
            fout.write("<img ")
            fout.write("src='"+src+'\' width =50 ')
            fout.write(">")
            fout.write("</td>")
            fout.write("</tr>")
fout.write("</table>")
fout.write("</body>")
fout.write("</html>")
fout.close();

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

python爬取某瓣top250 demo 輸出到html

再談23種設計模式（3）：行爲型模式（學習筆記）

Power Automate Desktop 安裝完，登錄後老是提示one driver 錯誤

微前端學習筆記(4):從微前端到微模塊之EMP與hel-micro方案探索

微前端學習筆記（1）：微前端總體架構概述，從微服務發微

985 碩士程序員，空窗 4 個月沒有 Offer！

一文搞懂 Spring 循環依賴

賽博鬥地主——使用大語言模型扮演Agent智能體玩牌類遊戲。

VScode右鍵打開(添加到右鍵)

記一次 .NET某工控視覺自動化系統卡死分析

WindowsServer--SQL Server搭建主從同步實現讀寫分離 - 事務性分發

基於一個一個物理機搭建Redis的主從架構並設置哨兵機制

Centos安裝Redis [編譯源碼方式]

SpringBoot源碼分析

python爬取某瓣top250 demo 輸出到html

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結