[Twisted-web] Design for HTTP client form login and scraping

David Bern odie5533 at gmail.com
Mon Jul 27 10:07:49 EDT 2009


On Mon, Jul 27, 2009 at 8:01 AM, Jean-Paul Calderone<exarkun at divmod.com> wrote:
> On Mon, 27 Jul 2009 01:05:21 -0500, David Bern <odie5533 at gmail.com> wrote:
>>I want to make multiple HTTP requests using the same set of cookies.
>>Should I call client.getPage a lot thus creating multiple factories to
>>do this?
>
> This is probably the right approach for the near term.  If you're worried
> about the overhead of creating a lot of factories, I don't think you should.
> Creating these objects isn't very expensive (particularly compared to parsing
> html).
>
> Jean-Paul
>
> _______________________________________________
> Twisted-web mailing list
> Twisted-web at twistedmatrix.com
> http://twistedmatrix.com/cgi-bin/mailman/listinfo/twisted-web
>

Thank you for the fast reply. I am more worried about the difficulty
of programming it this way, and was wondering if there is a better
method. I want a class with functions which would correspond to
different forms and pages on the web site. For instance, login(), then
post_message(), or some such set of commands based on a configuration
file. Would creating a class which calls client.getPage without
inheriting anything be the best method to accomplish this? When would
inheritance be the right method, only when extending the functionality
of HTTP client in Twisted and not to make use of it? My main question
here is that of style and ease of programming. I want to get into a
good programming habit with Twisted rather than have to redesign and
rewrite huge portions later because I put the logic in the wrong
place.

-- 
Thanks again,
David Bern



More information about the Twisted-web mailing list