Getting a List of Country Codes


I made a Jupyter notebook that scrapes the country codes from Wikipedia's country-code page.

I am pasting its contents here as well.

Country names: fetching ISO 3166-1 from Wikipedia

Link to the code

#!/usr/bin/env python
# -*- encoding:utf-8 -*-
import os
import sys
import re
import codecs
import requests
import json
from bs4 import BeautifulSoup


exp_url = 'https://ja.wikipedia.org/wiki/ISO_3166-1'
req = requests.get(exp_url)
#req.encoding = 'Shift_JIS'
# Use html.parser as the parser (lxml would need a separate install)
soup = BeautifulSoup(req.text,'html.parser')

# Get every table on the page
tables = soup.find_all('table')
# The first table is the one that holds the links
trs = tables[0].find_all('tr')

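iso3166_list = []
# Create the outer dict first; the original used it without defining it (NameError)
iso3166_dict = {}
iso3166_dict['flag'] = {}
iso3166_dict['jpn']  = {}
iso3166_dict['eng'] = {}
iso3166_dict['numeric'] = {}
iso3166_dict['alpha-2'] = {}
iso3166_dict['location'] = {}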
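for tr in trs:
  try:
    tds = tr.find_all('td')
    # Only rows whose first cell contains a link are treated as country rows
    if tds[0].find('a') is not None:
      # The lowercased alpha-3 code is the key in every per-field dict
      id3 = tds[3].get_text(strip=True).lower()
      iso3166_list.append(id3)
      iso3166_dict['flag'][id3] = tds[0].find('a').attrs['href']
      iso3166_dict['jpn'][id3] = tds[0].find_all('a')[1].attrs['title']
      # get_text(strip=True) drops the whitespace and newlines around cell text
      iso3166_dict['eng'][id3] = tds[1].get_text(strip=True)
      iso3166_dict['numeric'][id3] = tds[2].get_text(strip=True)
      iso3166_dict['alpha-2'][id3] = tds[4].get_text(strip=True)
      iso3166_dict['location'][id3] = tds[5].get_text(strip=True)
  except (IndexError, KeyError):
    # Skip rows without the expected cells or links (for example, header rows)
    pass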
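
The loop above fills one dictionary per field, each keyed by the lowercased alpha-3 code. As a minimal sketch, assuming that layout, the result could be regrouped into one record per country and serialized as JSON. The helper name to_records and the sample values below are illustrative and typed in by hand; they are not part of the original notebook and are not scraped output.

```python
import json


def to_records(field_dict, codes):
    """Pivot {field: {alpha3: value}} into {alpha3: {field: value}}."""
    records = {}
    for code in codes:
        records[code] = {
            field: values[code]
            for field, values in field_dict.items()
            if code in values  # skip fields that are missing for this country
        }
    return records


# Hand-typed sample in the same shape as iso3166_dict (an assumption, not scraped data)
sample_dict = {
    'eng': {'jpn': 'Japan', 'usa': 'United States of America'},
    'numeric': {'jpn': '392', 'usa': '840'},
    'alpha-2': {'jpn': 'JP', 'usa': 'US'},
}
sample_list = ['jpn', 'usa']

records = to_records(sample_dict, sample_list)
# ensure_ascii=False keeps non-ASCII names (such as Japanese titles) readable
json_text = json.dumps(records, ensure_ascii=False, indent=2)
print(json_text)
```

With the real data, to_records(iso3166_dict, iso3166_list) would be called instead of the sample, and json_text could then be written to a file.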

Author And Source

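Original article (Getting a List of Country Codes): https://qiita.com/zanjibar/items/f420dc130ebc487d2cef

Author attribution: information about the original author is available at the original URL. Copyright belongs to the original author.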
