使用Python抓取模板之家的CSS模板

2024-04-13 21:13:53 网络

掉漆的栏杆上，反射着夕阳的余晖渐渐消逝。我们的青春就随这样随着夕阳落幕。绮丽多姿扮靓妆，繁花锦簇沁心香。长歌游宝地，徒倚对珠林。家门口的风景，其实眼前的就是最好的，爱你选择的一切!

Python版本是2.7.9，在win8上测试成功，就是抓取有点慢，本来想用多线程的，有事就罢了。模板之家的网站上的url参数与页数不匹配，懒得去做分析了，就自己改代码中的url吧。大神勿喷！



#!/usr/bin/env python

# -*- coding: utf-8 -*-

# by ustcwq

# 2015-03-15



import urllib,urllib2,os,time

from bs4 import BeautifulSoup



start = time.clock()

path = os.getcwd()+u'/模板之家抓取的模板/'

if not os.path.isdir(path):

 os.mkdir(path)



url = "http://www.cssmoban.com/cssthemes/index_80.shtml" # 源网站中的index后面数字怎么编排的？

theme_url ='http://www.cssmoban.com/cssthemes/'

response = urllib2.urlopen(url)

soup = BeautifulSoup(response) 

result = soup.select('p[class="title"] a')

print result



for item in result:

 link = item['href']

 # down_name = item.text # 文件名称

 new_url = theme_url+link.split('/')[-1]

 response = urllib2.urlopen(new_url)

 soup = BeautifulSoup(response) 

 result = soup.select('.btn a')

 down_url = result[1]['href'] # 文件链接



 local = path+time.strftime('%Y%m%d%H%M%S',time.localtime(time.time()))+'.zip' 

 urllib.urlretrieve(down_url, local) # 远程保存函数



end = time.clock()

print u'模板抓取完成！'

print u'一共用时：',end-start,u'秒'

以上所述就是本文的全部内容了，希望大家能够喜欢。

到此这篇关于使用Python抓取模板之家的CSS模板就介绍到这了。和阳光的人在一起，心里才不会变得晦暗；和快乐的人在一起，脸上就会常带面带微笑；和进取的人在一起，行动就不会变得落后；借人之智，要尽自己最大的努力去完善自己；学最好的别人，做最好的我们。更多相关使用Python抓取模板之家的CSS模板内容请查看相关栏目，小编编辑不易，再次感谢大家的支持！

全站频道

大家都在搜索：

使用Python抓取模板之家的CSS模板