A Second Python Web Crawler Program: Parsing Web Data with the BeautifulSoup Library


Python

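from bs4 import BeautifulSoup
# A small HTML fragment to parse
markup = '<p class="title"><b>The Little Prince</b></p>'
soup = BeautifulSoup(markup, "lxml")
soup.b  # the first <b> tag in the document
Out[32]: <b>The Little Prince</b>
soup.find_all('b')  # a list of every <b> tag
Out[33]: [<b>The Little Prince</b>]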
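The find_all call above can also filter on a CSS class. The following is a minimal sketch of that idea, not part of the original example: the SAMPLE markup and the titles helper are made up for illustration, and the built-in "html.parser" is used instead of "lxml" so that no extra parser has to be installed.

```python
from bs4 import BeautifulSoup

# Hypothetical markup, made up for this illustration.
SAMPLE = """
<p class="title"><b>The Little Prince</b></p>
<p class="author">Antoine de Saint-Exupery</p>
<p class="title"><b>Night Flight</b></p>
"""


def titles(html):
    """Return the text of every <p class="title"> element in html."""
    soup = BeautifulSoup(html, "html.parser")
    # The second positional argument of find_all filters on the CSS class.
    return [p.get_text(strip=True) for p in soup.find_all("p", "title")]


# Attribute access such as soup.b returns only the first matching tag.
first_bold = BeautifulSoup(SAMPLE, "html.parser").b.string

print(titles(SAMPLE))
print(first_bold)
```

Attribute access (soup.b) is convenient when only the first match matters, while find_all is the better fit when every match is needed.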

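# Fetch the short comments for a book on Douban and print them
import requests
from bs4 import BeautifulSoup
r = requests.get('https://book.douban.com/subject/1084336/comments/')
soup = BeautifulSoup(r.text, 'lxml')
# Each short comment sits in a <span class="short"> element
pattern = soup.find_all('span', 'short')
for item in pattern:
    print(item.string)  # the text of the comment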

# Output (one short comment per line), for example:
It is the time you have wasted for your rose that makes your rose so important.
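The scraping example above can be written a little more defensively by separating the download from the parsing. This is only a sketch under stated assumptions: the function names, the User-Agent value, the timeout, and the SAMPLE_PAGE fragment are all hypothetical, and the real page layout may differ from the fragment used here. The parsing half runs on an inline string, so it can be tried without network access.

```python
from bs4 import BeautifulSoup


def extract_short_comments(html):
    """Return the text of every <span class="short"> element in html."""
    soup = BeautifulSoup(html, "html.parser")
    # get_text() still works when a span contains nested tags,
    # a case in which .string would return None.
    return [s.get_text().strip() for s in soup.find_all("span", class_="short")]


def fetch_short_comments(url, timeout=10):
    """Download url and return its short comments (needs network access)."""
    import requests  # imported here so the parsing half runs offline

    # Assumption: a browser-like User-Agent; the site may reject the default.
    headers = {"User-Agent": "Mozilla/5.0"}
    r = requests.get(url, headers=headers, timeout=timeout)
    r.raise_for_status()  # raise on 4xx/5xx instead of parsing an error page
    return extract_short_comments(r.text)


# Hypothetical page fragment, used here in place of a live request.
SAMPLE_PAGE = """
<div><span class="short">A book for grown-ups.</span></div>
<div><span class="short">Read it <i>twice</i>.</span></div>
<div><span class="votes">12</span></div>
"""

comments = extract_short_comments(SAMPLE_PAGE)
for c in comments:
    print(c)
```

Against the live site, the call would be fetch_short_comments('https://book.douban.com/subject/1084336/comments/'). Keeping the parser in its own function makes it easy to test on saved HTML before sending any requests.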
