Day 71 of 100DaysOfCode


codenewbie womenintech 100daysofcode beginners

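This is day 71 of my #100DaysOfCode, and I am continuing to learn Python. Today I again worked through DataCamp's material on web scraping. I learned about simple XPath expressions, single and double slashes, attributes, the contains function, extracting data from a SelectorList, and inspecting HTML.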
Below is the Python code for extracting data from a selector list.

Python code

# Import a scrapy Selector
from scrapy import Selector

# Import requests
import requests

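# Placeholder URL (an assumption; the original post does not show which page was used)
url = "https://example.com"

# Create the string html containing the HTML source
html = requests.get( url ).content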

# Create the Selector object sel from html
sel = Selector( text = html )

# Print out the number of elements in the HTML document
# (the figure 1020 is hard-coded and refers to the page used in the original exercise)
print( "There are 1020 elements in the HTML document.")
print( "You have found: ", len( sel.xpath('//*') ) )

Day 71 of #100DaysOfCode and web scraping from DataCamp https://t.co/6UM9B22c7f pic.twitter.com/A5vd6ID1mE (Durga Pokharel)

Reference

More information on this topic (Day 71 of 100DaysOfCode) can be found at https://dev.to/iamdurga/day-71-of-100daysofcode-web-scraping-gc9
