Topic: e621.net mass (sort of) Download tool

Posted under Off Topic

I've created a executable jar file that I can customize for use on your computer. It's written in Java and implements the Jsoup Html parser. It's sort of like a bot or web-crawler (I guess?).

It can download all images associated with a tag, So artist, character, or something else, and a single image (if needed). However It is not perfect, so I wouldn't try to download a tag with 1000 pages associated with it.It can download, with fast enough internet speed, ~200 images associated with a tag in 3-4 minutes.

It has the ability to back-list tags, so you don't have to have save unwanted images on you computer. It also will write the image name, artist of image, and character tags to a txt file, for all images saved to your drive, that can be converted to a CSV file that you can store in a database or even run in Excel/libre. It won't (can't or I don't know how) download flash games/movies. It works fine with Windows and if needed I can get it to work with Ubuntu.

I have downloaded all of the files that I want right now and will spend most of my time sifting through Images I want and don't want. So I am willing to share my executable jar so you too can download images more quickly off of e621.

Send me a message if your interested so I can hard code some stuff (for your convenience) into the program and send it to you.

Updated by Pup

cuberope said:
I hope u know about api

I made it as straight forward as possible. I didn't really put too many rules into it, so it can throw exceptions and terminate if you put in a wrong web url.

Updated by anonymous

You should use it in the future. It's easier because it can do XML or JSON requests. In fact, it's the same. Just append a ".xml" or ".json" to your requested URL

For example

http://e621.net/post/index?limit=1 is the standard (web browser) request (returns a HTML document)
http://e621.net/post/index.xml?limit=1 Requests a XML Document (just the useful stuff)
http://e621.net/post/index.json?limit=1 Returns a JSON encoded string. You can use it in many ways (even on Java )

All contain "?limit=1" to reduce server load (I suppose)

Updated by anonymous

Heyas!

Thanks for taking the time to write a tool for e621!

As other posters have mentioned, you might want to consider using the API in the future as it's a bit easier to parse.

One thing that I would like to request with any scripts that use e621 is that you rate limit the request speed for images to 120 requests per minute, ie 2 requests a second, and 30 requests per minute for searches / page results. This is to make sure that your script doesn't have an adverse effect on other users of the site - I would like to avoid having to place restrictions or block scripts from accessing the site if possible as I recognise the value in these scripts to our users.

One final request is that you give your bot a unique user-agent string for each version you publish, and mention what it is (eg 'GroundwalkerBot v0.1'). This is in case one of your older script versions turns out to spin wildly out of control, so in an emergency I can blacklist that specific version without having to blacklist all of the versions of your program.

Thanks,

Varka

Updated by anonymous

Varka said:
Heyas!

Thanks for taking the time to write a tool for e621!

As other posters have mentioned, you might want to consider using the API in the future as it's a bit easier to parse.

One thing that I would like to request with any scripts that use e621 is that you rate limit the request speed for images to 120 requests per minute, ie 2 requests a second, and 30 requests per minute for searches / page results. This is to make sure that your script doesn't have an adverse effect on other users of the site - I would like to avoid having to place restrictions or block scripts from accessing the site if possible as I recognise the value in these scripts to our users.

One final request is that you give your bot a unique user-agent string for each version you publish, and mention what it is (eg 'GroundwalkerBot v0.1'). This is in case one of your older script versions turns out to spin wildly out of control, so in an emergency I can blacklist that specific version without having to blacklist all of the versions of your program.

Thanks,

Varka

Sure I can see what I can do, I haven't had too many takers. I can send you the source code and you can tell me what to change. Wouldn't want to bog down the servers.

Updated by anonymous

We've been talking in email (I'm herqulees) but wanted to join in on the conversation here to see what everyone else thinks of my suggestions.
As I said I'd love to see this grow into a full e621-viewer program, or the website grow to have these features; I think it would be awesome to use it much in basic to how the website looks now, only add in the option to select multiple image thumbnails to do stuff like report selected, add selected to favorites, remove selected from favorites, mass tagging selected, download selected, and so on. I'm sure others could think of even more uses.
EDIT: Also as I mentioned in email I think it would be nice to hear from the admins when this site is least busy, being oh so familiar with the 500 errors on this website I'd like to aim to use this server when it's least used.

Updated by anonymous

Hi! Sorry waste your precious time with this stupid person, but...

I don't known how talk within the e621... Do you have any email or other social network?

Updated by anonymous

MaxWhite said:
Hi! Sorry waste your precious time with this stupid person, but...

I don't known how talk within the e621... Do you have any email or other social network?

The regular way or API

Updated by anonymous

AoBird said:
The regular way or API

Sorry, like I said, I'm a stupid person, so I did not understand ... I wanted to talk to him so he would give me this .jar file since he left no link to download ... What would be the regular way ?
PS: I always look at your avatar ... He's pretty cool! Really! ;)

Updated by anonymous

MaxWhite said:
Sorry, like I said, I'm a stupid person, so I did not understand ... I wanted to talk to him so he would give me this .jar file since he left no link to download ... What would be the regular way ?
PS: I always look at your avatar ... He's pretty cool! Really! ;)

Click on their username, then find and click on the "Send Message" link at the top left of the screen.

But it's been a very long time since that person was active. They're probably gone...

Also, why do you want this guy's tool, in particular? It sounds like they never used the e621 API, so it's probably going to be buggy. There must be other tools out there to do mass downloads...

Updated by anonymous

ikdind said:
Click on their username, then find and click on the "Send Message" link at the top left of the screen.

But it's been a very long time since that person was active. They're probably gone...

Also, why do you want this guy's tool, in particular? It sounds like they never used the e621 API, so it's probably going to be buggy. There must be other tools out there to do mass downloads...

It's because of "tags download function", but thanks for teaching me how to send messages, that's so sweet of you, and you're very kind!
(Besides being stupid, I have myopia, sorry for that ^^\)

Updated by anonymous

Pup

Privileged

MaxWhite said:
It's because of "tags download function", but thanks for teaching me how to send messages, that's so sweet of you, and you're very kind!
(Besides being stupid, I have myopia, sorry for that ^^\)

I'd recommend using Wulfre's e621dl, which also lets you download using tags

https://github.com/Wulfre/e621dl

Updated by anonymous

Pup

Privileged

MaxWhite said:
Owww, I can believe you answer me! I am so happy! Thanks!
But I mean, download the tags of one page in a txt file...

Ah, apologies, I thought you meant to search/download based on tags, not download the tags themselves.

It's the one thing I think's really missing from Wulfre's program, being able to save that stuff. I'd really like one that can download the descriptions of images, for the ones that have stories with them. That and maybe download swf files as well.

Looking into the api makes me want to try and have a go at coding something myself, though it's been a while since I coded anything. I'll have to think about it. It'd probably be something Wulfre could easily add to his program and I'm not sure if it's worth creating a downloader in Java from scratch for one feature. I couldn't really use his code as I don't have a clue when it comes to python, what his is coded in.

Updated by anonymous

  • 1