Does a multithreaded crawler in Python really speed things up?

Posted by beagleguy on Stack Overflow See other posts from Stack Overflow or by beagleguy
Published on 2010-05-13T23:02:42Z Indexed on 2010/05/13 23:14 UTC
Read the original article Hit count: 353

Filed under:
|
|

Was looking to write a little web crawler in python. I was starting to investigate writing it as a multithreaded script, one pool of threads downloading and one pool processing results. Due to the GIL would it actually do simultaneous downloading? How does the GIL affect a web crawler? Would each thread pick some data off the socket, then move on to the next thread, let it pick some data off the socket, etc..?

Basically I'm asking is doing a multi-threaded crawler in python really going to buy me much performance vs single threaded?

thanks!

© Stack Overflow or respective owner

Related posts about python

Related posts about multithreading