diff options
author | Nick White <hg@njw.me.uk> | 2011-07-29 01:03:44 +0100 |
---|---|---|
committer | Nick White <hg@njw.me.uk> | 2011-07-29 01:03:44 +0100 |
commit | 5945770938a14d2364ab56049df4988cb25890d9 (patch) | |
tree | 05262709ce9414b7979170b7062b89a5efbd68f5 /TODO | |
parent | 568751040761f707deeb46b7032912ac7a7dd34b (diff) |
Retry properly with specific pages
Diffstat (limited to 'TODO')
-rw-r--r-- | TODO | 4 |
1 files changed, 4 insertions, 0 deletions
@@ -1,7 +1,11 @@ +Note: looks like google allows around 3 page requests per cookie session, and about 40 per ip per [some time period]. If I knew the time period, and once stdin retry is working, could make a script that gets all it can, gets a list of failures, waits, then tries failures, etc. Note these would also have to stop at some point; some pages just aren't available + make sure i'm checking all lib calls that could fail make sure all arrays are used within bounds +strace to check paths taken are sensible + use defined constants rather than e.g. 1024 getgbooktxt (different program as it gets from html pages, which getgbook doesn't any more) |