Pythonでスクレイピング

1166 ワード

Python BeautifulSoup Python テキストリンク

最近モチベーション上がらなくて、上げるためにとりあえず書くことにしました。
本当に基礎の基礎なので悪しからず

とりあえず、BeautifulSoupを実行してみます。

from urllib.request import urlopen
from bs4 import BeautifulSoup

# 取得するページ
html = urlopen('http://www.pythonscraping.com/pages/page1.html')
bs = BeautifulSoup(html.read(), 'html.parser')
# h1タグを返す
print(bs.h1)

2行目の
from bs4 import BeautifulSoup
部分でBeautifulSoupをインポートする。
実行結果は下記となる。

http://www.pythonscraping.com/pages/page1.html
上記ページのh1タグだけ取得できることがわかる。

次回はちゃんとした内容で投稿します。

参考文献

PythonによるWebスクレイピング(オライリー・ジャパン)

Author And Source

この問題について(Pythonでスクレイピング), 我々は、より多くの情報をここで見つけました https://qiita.com/h1r0_1126/items/36cd0292b327fee417dc

著者帰属：元の著者の情報は、元のURLに含まれています。著作権は原作者に属する。

Content is automatically searched and collected through network algorithms . If there is a violation . Please contact us . We will adjust (correct author information ,or delete content ) as soon as possible .