screen-scraping
Scrape gmail for the last time external pop accounts were checked and check them if longer than X time since the last check
Goal: To develop an script that will check the last time my external pop accounts were checked by google -- while not being logged in. If the time exceeds some amount, then check the pop account.[详细]
2023-03-18 08:37 分类:问答Avoiding 503 errors with urllib2
I\'m new to web scraping with python, so I don\'t know if I\'m doing this right. I\'m using a script that calls BeautifulSoup to parse the URLs from the first 10 pages of a google search. Tested with[详细]
2023-03-18 06:17 分类:问答Ruby, Scrape page written entirely in JavaScript
I am playing with Ruby + Hpricot and building a simple scraper. I am able to work with other sites with no issues. But if a page is written entirely in JavaScript, can that be scraped?but, google sear[详细]
2023-03-18 01:55 分类:问答help with perl code to parse a file
I am new to Perl and have a question about the syntax. I received this code for parsing a file containing specific information. I was wondering what the if (/DID/) part of the subroutine get_number is[详细]
2023-03-17 06:57 分类:问答how can I use js/coffee to screen scrape an asp page?
I\'ve got a website that I\'d like to pull data from and it\'s really stuck in the stone ages. There\'s no web service, no API and it\'s very much an ASP/Session/table-based-layout page. Pretty fugly.[详细]
2023-03-17 02:27 分类:问答Watir Changing Mozilla Firefox Preferences
I\'m running a Ruby script using Watir to automate some things for me. I\'m attempting to automatically save some files to a certain directory. So, in my Mozilla settings I set my default download dir[详细]
2023-03-16 20:40 分类:问答Python- is there a module that will automatically scrape the content of an article off a webpage?
I know there is lxml and BeautifulSoup, but that won\'t work for my project, because I don\'t know in advance what the H开发者_如何转开发TML format of the site I am trying to scrape an article off of[详细]
2023-03-16 10:24 分类:问答How to update content in html using htmlunit?
I found it very difficult to work with htmlunit in terms of creating new html content on the fly like we can do in jquery.[详细]
2023-03-16 01:31 分类:问答How can I scrape data from the Israeli Bureau of Statistics web query tool?
The following url: http://www.开发者_如何学JAVAcbs.gov.il/ts/ID40d250e0710c2f/databank/series_func_e_v1.html?level_1=31&level_2=1&level_3=7[详细]
2023-03-14 22:13 分类:问答Proper character encoding to display "”"?
I\'m having some nasty character encoding problems that I just can\'t figure out. Essentially, I\'m screen scraping some HTML off of a site using PHP, then running it through PHP\'s DOMDocument to ch[详细]
2023-03-14 18:23 分类:问答