Java web crawler with jsoup: capturing complete province, city, county, town, and village data for all of China

Code download: scofield 7419/chinesePCCTVLocationExtraction
ChinesePCCTVLocationExtraction
This is a cool try, buddy.
Description
Complete Chinese location information in the format province_city_county_town_village.
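As a small illustration of that record format, the sketch below joins the five administrative levels with underscores and splits a line back apart. The class and method names (`LocationLine`, `format`, `parse`) are hypothetical and not taken from the project, and the sketch assumes that place names never contain an underscore.

```java
public class LocationLine {
    // Joins the five administrative levels with underscores, matching
    // the province_city_county_town_village layout.
    static String format(String province, String city, String county,
                         String town, String village) {
        return String.join("_", province, city, county, town, village);
    }

    // Splits a line back into its levels.
    // Assumption: no place name contains an underscore.
    static String[] parse(String line) {
        return line.split("_", -1);
    }
}
```

Each line of the output file would then hold one village together with all of its parent levels.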
Caution on the method calling order:
For the first use of this program, you have to run the “readAllProv();” method first, with the “getAllMaps();” call commented out, in order to generate all the property files. Then comment out “readAllProv();” and call “getAllMaps();”.
Just like this:
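A minimal sketch of that two-phase calling order follows. It assumes both methods are static and take no arguments. The method bodies are hypothetical stubs that only record that they ran, and the class name `CallOrderSketch` is invented for illustration; none of this is the project's actual code.

```java
import java.util.ArrayList;
import java.util.List;

public class CallOrderSketch {
    // Records which steps ran, so the order can be inspected.
    static final List<String> steps = new ArrayList<>();

    // Hypothetical stub: in the project, this step is what produces
    // the property files on the first run.
    static void readAllProv() {
        steps.add("readAllProv");
    }

    // Hypothetical stub: assumed to work from the property files
    // that readAllProv() produced earlier.
    static void getAllMaps() {
        steps.add("getAllMaps");
    }

    public static void main(String[] args) {
        // First run: only generate the property files.
        // getAllMaps();
        readAllProv();

        // Later runs: comment out readAllProv() and
        // uncomment getAllMaps() above.
        System.out.println(steps);
    }
}
```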
The roadmap is constructed in accordance with the following four guidelines:
  • The data was crawled from the National Bureau of Statistics of the People's Republic of China (2015 data).
  • The third-party library used is jsoup.
  • Because of the traffic limits of the target server (National Bureau of Statistics of the People's Republic of China, 2015 data), I couldn’t fetch all the data in a single program run. So I designed an approach that uses property files, which solved the problem.
  • Here is the properties folder:
    properties/北京市.properties
    Here is the outputs folder:
    outputs/province_city_county_town_village.txt
    and the output file is written like this:
    Other files in assets:
    assets/2015年全国都市省市県区行政レベル対照表.xls
    assets/province_city_county.txt
    P.S. This file is in PCC (province_city_county) format,
    and it looks like this:
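Separately from the PCC file, the property-file workaround described in the list above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's implementation: the class name `ProvinceCache`, the method `loadOrFetch`, and the `fetcher` parameter (a stand-in for the real jsoup crawl) are all hypothetical. The idea is that each province is crawled at most once and saved to its own `.properties` file, so a run that is cut off by the server's traffic limit can be resumed without refetching provinces that were already saved.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Properties;
import java.util.function.Supplier;

public class ProvinceCache {
    // Returns the data for one province, crawling only when no
    // properties file for it exists yet.
    // 'fetcher' is a hypothetical stand-in for the real crawl.
    static Properties loadOrFetch(Path dir, String province,
                                  Supplier<Properties> fetcher) throws IOException {
        Path file = dir.resolve(province + ".properties");
        Properties props = new Properties();
        if (Files.exists(file)) {
            // Saved by an earlier run: read from disk, no network traffic.
            try (Reader in = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
                props.load(in);
            }
            return props;
        }
        // Not saved yet: crawl once and persist the result for later runs.
        props = fetcher.get();
        Files.createDirectories(dir);
        try (Writer out = Files.newBufferedWriter(file, StandardCharsets.UTF_8)) {
            props.store(out, province);
        }
        return props;
    }
}
```

Called with the directory `properties` and the province name `北京市`, this sketch would read or create `properties/北京市.properties`, matching the folder layout shown above.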
    Scofield.Phil
    
    Email: feish7419@163.com
    
    move fast, break things.