scrape
Determining origin of traffic from Referrer Header characteristics
I\'m writing a web application that will track incoming traffic to a website and track the origin of the traffic and its behaviour on our site, so that we can get some idea of the return on investment[详细]
2023-02-08 02:33 分类:问答Easiest way to scrape Google for URLs via my browser?
I\'d like to scrape all the URLs my searches return when searching for stuff via Google. I\'ve tried making a script, but Google did not like it, and adding cookie support and captcha was too tedious.[详细]
2023-02-06 01:29 分类:问答HTML Agility Pack - Can only load xml document from file system, not from web
I\'ve used HAP successfully before, downloading xhtml pages from web. However, now I\'m trying to load and parse xml documents. HAP will only load xml documents that are located on my file system, \"C[详细]
2023-02-04 11:07 分类:问答Extract Links from Facebook activity feed
I\'m trying to get the links from a facebook activity feed, i\'ve tried extracting the HTML from the iframe, 开发者_如何学Cbut this doesn\'t work because of cross domain. Then I tried cURL but that do[详细]
2023-02-02 20:10 分类:问答Python + Mechanize Async Tasks
So I have this bit of python code that runs through a delicious page and scrapes some links off of it. The extract method contains some magic that pull out the required content. However, running the p[详细]
2023-01-31 02:10 分类:问答Calculate area of HTML element on website?
I\'m trying to figure out if it\'s possible to calculate the area of a HTML element on a website? In pixels, as a percentage or whatever.[详细]
2023-01-29 04:42 分类:问答Please help: How can I scrape this web page?
There\'s a site that offers a search service. You enter a number, search, and it returns results. What I want to do is run that search programmatically through coldfusion instead of having to go to th[详细]
2023-01-27 04:28 分类:问答guidance on python scraping packages
I\'m still a newcomer to python, so I hope this question isn\'t inane. The more I google for web scraping solutions, the more confused I become (unable to see a forest, despite investigating many tr[详细]
2023-01-27 00:07 分类:问答Scraping structured information from hundreds of Word documents?
I\'ve been tasked with extracting some structured information from hundreds of human readable documents (mostly MS Word) and to put it into a database. The data is pretty much embedded in tables throu[详细]
2023-01-25 06:22 分类:问答Http Agility Pack - Accessing Siblings?
Using the HTML Agility Pack is great for getting descendants and whole tables etc... but how can you use it in the below situation[详细]
2023-01-21 23:56 分类:问答
加载中,请稍侯......