/tech/ - Technology (textboard)

[22 / 0 / ?]

[1187881809] wget and image galleries?

Anonymous ID:88r1ZBQV Fri 24 Aug 2007 02:58:00 No.8191 View View Reply Original Report

Quoted By:

trying to get wget to cooperate with downloading individual pictures from a pr0n site

i've decided against spidering the content, since i dont want be obvious (or piss off the webmaster too much), and on closer examination of the method of retriving the image seems pretty simple, there is a php script with an ID and a page number (readfile.php?arg_id=1234&arg_page=5, for example), so i decided that once i had the command working I'd write a script to download all of the images, one after the other.

i've seemed to make steady progress towards the solution, telling wget to ignore robots.txt and to masquerade as a different browser seems to have gotten me past some of the errors, and figuring out that it's a good idea to escape ampersands helped it not to spawn another process, but right now I seem to be stuck at a brick wall, with me staring down the barrel of a 401 forbidden, whereas I used to be able to download short little error .html files that gave me an idea of what was wrong.

here's the command as it stands, the URL included is fake.

$ wget --cookies=off --load-cookies=cookies.txt -erobots=off --user-agent="Mozilla/4.0 (compatible; MSIE 5.5;Windows NT 5.0)" <url removed>

for clarity the cookies.txt is simply what i copied from a firefox profile that had cookies cleared and then visited the site in question. that said, i got a couple of questions

1. is it likely that i am missing any other obvious "pretend I'm the real thing" bits that might make wget behave?
2. is this obstacle likely to be so immense that trying to bypass it is probably not worth the trouble, like something in the php script itself?

the site in question is e-hentai.org, and although it does require hentaikey for accessing most of it, the doujin stuff does not, so i should be within the rules, the reason i mentioned this last is because to keep the NWS content in this post to a minimum and i'm more interested in how they're keeping wget out than the porn itself, "the more you know" and all that

i honestly have no idea where else i could ask this question and have even a slim chance of getting an informed response, so help me anonymous

Anonymous

2

Anonymous ID:o3kCdtAT Fri 24 Aug 2007 04:05:00 No.8193 Report

Quoted By:

just try mirror option
iirc there was another option where you can tell wget how much to wait until next request, that may help too
if you have netcat or something like that, you can check if wget's request looks like the browser one

Anonymous

3

Anonymous ID:0vaXgWjk Sat 25 Aug 2007 09:49:00 No.8221 Report

Quoted By:

that's essentially the same as some of the other options i used earlier to attempt and spider the site, tried something like:

$ wget -r -l 1 -N -w 5 -nd --cookies=off --load-cookies=cookies.txt -erobots=of
f --user-agent="Mozilla/4.0 (compatible; MSIE 5.5;Windows NT 5.0)" <url removed>

on a specific page, but it seemed to be caught in an infinite loop of trying to redownload index.html over and over again.

Anonymous

4

Anonymous ID:0vaXgWjk Sat 25 Aug 2007 09:51:00 No.8222 Report

Quoted By:

^^^

to be more specific, my first attempt was to spider, but i quickly realized that it was probably idea, the above attempt was to try and mirror one page, and that i hadn't used those commands since my first attempt

Anonymous

5

Anonymous ID:0vaXgWjk Sat 25 Aug 2007 09:54:00 No.8223 Report

Quoted By:

^^^

i meant probably a bad idea. anyway after doing some further reading i've decided the exercize is probably futile, as putting anti-wget into google provides more results than you can shake a stick at and they look like they put effort into the site, so it's probably using one of those, damn you website

Anonymous

6

Anonymous Mon 12 Jan 2009 12:32:00 No.14069 Report

Quoted By:

Ok... I found this thread, and worked a little with it and get it...
Maybe they touch something in this 2 years and it's easier now but still...

This should make it harder for the admin to discover it.

wget --user-agent="Mozilla/5.0 (X11; U; Linux i686; es-AR; rv:1.9.0.1) Gecko/2008070206 Firefox/3.0.1" --header="Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8" --header="Accept-Language: es-ar,es;q=0.8,en-us;q=0.5,en;q=0.3" --header="Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7" --header="Keep-Alive: 300" --header="Connection: keep-alive" --load-cookies $cookies --save-cookies $cookies --keep-session-cookies $URL

I think that the file $cookies must previously exist

The trick is to use the --keep-session-cookies.

Then they have a new anti-AdBlock. I tried to look at the http headers from the browser connection but couldn't figure out exaclty what to download to get the cookie. It seems that the link changes with the Ad.

But it does't works the first time you connect to the site, so I get a page, parse it, download the image, delete de cookies, and continue with the next link. It doesn't works like a real browser but temporally works.

I think it isn't prossible to create an anti-wget if you make it act linke a user. Well maybe with javascript...

replica ebel

9

replica ebel Sun 03 Jan 2010 23:38:00 No.16396 Report

Quoted By:

http://www.iwcwatches.us/Bvlgari/ replica Bvlgari
http://www.iwcwatches.us/Cartier/ replica Cartier
http://www.iwcwatches.us/Chopard/ replica chopard
http://www.iwcwatches.us/Concord/ replica Concord
http://www.iwcwatches.us/Croum/ replica croum
http://www.iwcwatches.us/DeWitt/ replica DeWitt
http://www.iwcwatches.us/Ebel/ replica ebel

Concord watches

10

Concord watches Wed 06 Jan 2010 12:23:00 No.16810 Report

Quoted By:

replica Patek Philippe

11

replica Patek Philippe Fri 08 Jan 2010 06:07:00 No.17127 Report

Quoted By:

Rado replica

12

Rado replica Tue 12 Jan 2010 01:21:00 No.17711 Report

Quoted By:

watches replica

13

watches replica Tue 12 Jan 2010 19:10:00 No.17930 Report

Quoted By:

<A href="http://www.patekphilippewatches.us/">replica watches</A> <A href="http://www.patekphilippewatches.us/">replica watch</A> <A href="http://www.patekphilippewatches.us/Bvlgari/">Bvlgari watch for sale</A> <A href="http://www.patekphilippewatches.us/Oris/">oris watches</A> <A href="http://www.patekphilippewatches.us/Movado/">movado replica</A> <A href="http://www.patekphilippewatches.us/Rolex/GMT/">replica Rolex GMT</A> <A href="http://www.patekphilippewatches.us/Glashutte/">Glashutte watches</A> <A href="http://tiffanynecklaces.org">Tiffany&Co Titanium I.D. Necklace</A>

discount golf equipment

16

discount golf equipment Wed 26 May 2010 05:48:00 No.19323 Report

Quoted By:

hi guys, let's do some sports . healthy life and keeping one good boy needs it, the following links will do good to you:
http://www.golfclubs365.com
http://www.golfequipment18.com
There are two [url=http://www.golfclubs365.com]Golf Clubs[/url] stores,include kinds of [url=http://www.golfequipment18.com]golf equipment[/url],
[url=http://www.buyinggolfonline.com/]wholesale golf clubs[/url].
[url=http://www.golfclubs365.com/goods-214-Callaway+Diablo+Edge+Irons.html]Callaway Diablo Edge Irons[/url]
[url=http://www.golfclubs365.com/goods-214-Callaway+Diablo+Edge+Irons.html]Callaway Diablo Irons[/url]
[url=http://www.golfclubs365.com/goods-214-Callaway+Diablo+Edge+Irons.html]Callaway Edge Irons[/url]

star dvd city

17

star dvd city Fri 23 Jul 2010 05:03:00 No.19736 Report

Quoted By:

hello everybody, try some wonderful medium? Yes! you will get a lot from it, and high quality life needs it! you may be a crazy movie lover like me,there are some hot DVD movies i really like and want to share to you---
Entourage DVD 1-6 http://www.stardvdcity.com/entourage-seasons-16-dvd-boxset-p-420.html
Entourage DVD set http://www.stardvdcity.com/entourage-seasons-16-dvd-boxset-p-386.html
Queer as folk DVD set http://www.stardvdcity.com/queer-as-folk-seasons-15-dvd-boxset-p-329.html
Queer as folk DVD 1-5 http://www.stardvdcity.com/queer-as-folk-seasons-15-dvd-boxset-p-330.html
Boston legal DVD 1-5 http://www.stardvdcity.com/boston-legal-seasons-15-dvd-boxset-p-231.html
Boston legal DVD set http://www.stardvdcity.com/boston-legal-seasons-15-dvd-boxset-p-233.html
Boston legal DVD boxset http://www.stardvdcity.com/boston-legal-seasons-15-dvd-boxset-p-232.html
i believe you will love them! it really has amazing plot , wonderful screen, and also nice musics. we can't miss them!!!

Herve Leger

18

Herve Leger Wed 01 Sep 2010 12:37:00 No.19971 Report

Quoted By:

Are you always vexed about wearing what kind of dress at a banquet?
Herve Leger bandage dress online store eliminates your worries.
Ladies who is beautiful and noble may choose Herve Leger bandage
dress. The dress highlights your perfect feminine body curve.
If you wear this dress, it will send out your inherent glamour.
It is specially designed for you and meets your noble taste.
Wherever you go, you will be the focus. If you haven't a suitable
dress to attend a banquet and want to buy one now, Herve Leger bandage
dress online store is your best choice http://www.buyherveleger.net/

www.jewelry-magic01.com

22

www.jewelry-magic01.com Fri 03 Sep 2010 07:59:00 No.19989 Report

Quoted By:

designer perfumes 2010

23

designer perfumes 2010 Tue 07 Sep 2010 05:43:00 No.20004 Report

Quoted By:

[url=http://www.cheapshoeschina.com] Cheap Gucci Shoes[/url]
[url=http://www.cheapshoeschina.com] New style Armani clothing[/url]
[url=http://www.cheapshoeschina.com] Wholesale coach sunglasses[/url]
[url=http://www.cheapshoeschina.com] Discount Running sneakers[/url]
[url=http://www.cheapshoeschina.com] Fashion LV handbags[/url]

yumira

24

yumira Mon 11 Oct 2010 13:22:00 No.20173 Report

Quoted By:

http://www.jewelry-magic01.com/crystalquartz-bracelet/multicolor-quartz-crystal-bracelet.html
http://www.jewelry-magic01.com/crystalquartz-bracelet/colorized-quartz-crystal-faceted-opalite-bracelet.html
http://www.jewelry-magic01.com/quartiz-pendant/colorful-resin-crystal-glass-flower-pendant.html
http://www.jewelry-magic01.com/clasps/magnetic-ball-golden-bead-necklace-clasp.html
http://www.jewelry-magic01.com/clasps/magnetic-round-ball-bead-necklace-clasp-finding.html

golfshopshopping

25

golfshopshopping Fri 20 May 2011 12:49:00 No.21687 Report

Quoted By:

cheapgolf4u

26

cheapgolf4u Mon 15 Aug 2011 11:44:00 No.22204 Report

Quoted By:

Cheap golf equipment:http://www.cheapgolf4u.com

Taylormade R11 Irons:http://www.cheapgolf4u.com/goods-761-Taylormade+R11+Irons.html
G20 Irons:
http://www.cheapgolf4u.com/goods-764-Ping+G20+Irons.html
G15 Irons:http://www.cheapgolf4u.com/goods-445-Hot+Ping+G15+Irons+for+sale.html
BURNER 2.0 Irons:
http://www.cheapgolf4u.com/goods-671-TaylorMade+Burner+20+Irons.html

Anonymous

27

Anonymous Tue 16 Aug 2011 03:55:00 No.22209 Report

Quoted By:

use curl

cheap nike wholesale shoes

28

cheap nike wholesale shoes Mon 13 Aug 2012 11:17:00 No.24250 Report

Quoted By:

out of each other after right-back schwab of entwine,<a href="http://www.wholesaleonfire.com/">cheap nike wholesale shoes</a> sterling then cut through the defense in the wal-mart central defender sand Ed, and from the other party the other a central defender LaiNa watts fill the tackling,<a href="http://www.wholesaleonfire.com/">nike wholesale sneakers</a> face each other's goalkeeper lai, sterling calmly ride out his right foot curled into the bottom right-hand corner. One goal, reflects the speed and exquisite sterling feet technology.

Supra Society Womens

29

Supra Society Womens Mon 18 Nov 2013 10:59:00 No.25686 Report

Quoted By:

Welcome you back to our website, our company provide Wholesale Supra Society Womens shoes on sale, we guarantee quality, excellent service, providing secure payment methods to purchase our products, so you are assured peace of mind.
http://www.shoessalestor.com/--supra-society-womens_c2.html

Capcode	All Only User Posts Only Verified Posts Only Moderator Posts Only Manager Posts Only Admin Posts Only Developer Posts Only Founder Posts
Show Posts	All Only With Images Only Without Images Only Spoiler Images Only Non-Spoiler Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

On these archives

Your latest searches

[1187881809] wget and image galleries?