scrapy 的 selector 练习-白红宇

网页结构：

     Example website    
        
      Name: My image 1 
   
      Name: My image 2 
   
      Name: My image 3 
   
      Name: My image 4 
   
      Name: My image 5

连接:

scrapy shell http://doc.scrapy.org/en/latest/_static/selectors-sample1.html

Play:

注意xapth与css两种方式的区别与联系

# selectorresponse.selector.xpath('//title/text()').extract()response.selector.css('title::text').extract()# response的selector的xpath与css太常用了，所以提供了简捷写法：# 文本一response.xpath('//title/text()').extract()response.css('title::text').extract()# 文本 包括子节点sel.xpath("//a[1]//text()").extract()# 文本 包括子节点sel.xpath("string(//a[1])").extract()# 属性response.xpath('//img/@src').extract()response.css('img::attr(src)').extract()# 混合response.css('img').xpath('@src').extract()response.xpath('//img').css('::attr(src)').extract()# 精确response.xpath('//div[@id="images"]/a/text()').extract()response.css('div[id=images] a::text').extract()# 模糊response.xpath('//div[contains(@id, "image")]/a/text()').extract()response.css('div[id*=image] a::text').extract()# 正则response.xpath('//a[contains(@href, "image")]/text()').re(r'Name:\s*(.*)')本文转自罗兵博客园博客，原文链接：http://www.cnblogs.com/hhh5460/p/5817574.html，如需转载请自行联系原作者