RCurl web scraping: timeout exits program

Posted by user1742368 on Stack Overflow, 2012-10-12.
I am using a loop with RCurl to scrape data from multiple pages. It works fine most of the time, but fails when the server does not respond and the request times out. I am using timeout=30, which traps the timeout error, but the program stops after the timeout. I would like the program to continue to the next page when a timeout occurs, but I can't figure out how to do this.

    curl = getCurlHandle(cookiefile = "", verbose = TRUE)

Here is the statement that causes the timeout. I am happy to share the full code if there is interest.

    webpage = getURLContent(url, followlocation = TRUE, curl = curl,
                            .opts = list(verbose = TRUE, timeout = 90, maxredirs = 2))
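What I am after is something like the following minimal sketch, where the request is wrapped in tryCatch() so a timeout error is caught and the loop moves on to the next page instead of stopping. (The character vector urls of page addresses is a hypothetical stand-in for however my pages are actually generated.)

    library(RCurl)

    curl <- getCurlHandle(cookiefile = "", verbose = TRUE)

    for (u in urls) {
      webpage <- tryCatch(
        getURLContent(u, followlocation = TRUE, curl = curl,
                      .opts = list(timeout = 30, maxredirs = 2)),
        # RCurl signals timeouts as ordinary R errors, so a generic
        # error handler catches them; return NULL so the loop continues.
        error = function(e) {
          message("Skipping ", u, ": ", conditionMessage(e))
          NULL
        }
      )
      if (is.null(webpage)) next
      # ... parse and store webpage here ...
    }

Is this the right approach, or is there a cleaner way to recover from RCurl timeouts inside a loop?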
