Questions tagged «scraper»

3
XPath ::获得关注的兄弟姐妹
我有以下HTML结构:我正在尝试构建一种可靠的方法来提取第二个颜色摘要元素,因为DOM中将包含许多此类标记。 <table> <tbody> <tr bgcolor="#AAAAAA"> <tr> <tr> <tr> <tr> <td>Color Digest </td> <td>AgArAQICGQMVBBwTIRQHIwg0GUMURAZTBWQJcwV0AoEDAQ </td> </tr> <tr> <td>Color Digest </td> <td>2,43,2,25,21,28,0,0,0,0,0,0,0,0,0,0,0,0,0,0,33,7,0,0,0,0,0,0,0,0,0,0,0,0,0,0,8,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,25,0,0,0,0,0,0,0,0,0,0,0,0,0,0,20,6,0,0,0,0,0,0,0,0,0,0,0,0,0,0,5,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,9,0,0,0,0,0,0,0,0,0,0,0,0,0,0,5,2,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0, </td> </tr> </tbody> </table> 我正在尝试提取具有解码值的第二个“ Color Digest” td元素。 我写了下面的xpath,但是没有得到第二个,我没有得到第二个td元素。 //td[text() = ' Color Digest ']/following-sibling::td[2] 当我将其从td [2]更改为td [1]时,我得到了两个元素。
82 html  xpath  siblings  scraper 
By using our site, you acknowledge that you have read and understand our Cookie Policy and Privacy Policy.
Licensed under cc by-sa 3.0 with attribution required.