Python實戰之網絡與爬蟲篇-----如何對比多個庫,看哪個更靠譜
1.問題拆解
如何對比多個庫,看哪個最靠譜?
用哪些參考數值?(生態值、Star數、Fork數)
如何獲取數據?(Srar、Fork數)
如何查詢需要的數據?(requests庫)
2.背景知識
a.開發者網址https://developer.github.com/v3/search/
在裏面瞭解參數情況
b.star數、fork數的獲取
c.生態數
3.編程實現
# repos_api https://api.github.com/search/repositories?q=django
# ecosys_api https://api.github.com/search/repositories?q=topic:django
# https://api.github.com/search/repositories?q=topic:django+language:python+created:2019-01-01
import requests
def get_name():
print('please input separate each name with space')
names = input()
return names.split()
def check_repos(names):
repos_api = 'https://api.github.com/search/repositories?q='
ecosys_api = 'https://api.github.com/search/repositories?q=topic:'
for name in names:
repo_info = requests.get(repos_api+name).json()['items'][0]
ecosys_info = requests.get(ecosys_api+name).json()['total_count']
stars = repo_info['stargazers_count']
forks = repo_info['forks']
print(name)
print(stars)
print(forks)
print(ecosys_info)
print('----------------------------')
names = get_name()
check_repos(names)