March 9, 2020:
#archivebot: Dashboard: http://dashboard.at.ninjawedding.org/?showNicks=1 | Docs: http://archivebot.rtfd.io/ | Viewer: https://archive.fart.website/archivebot/viewer/
[04:05] == PlsHelp [webchat@c-73-3-132-105.hsd1.co.comcast.net] has joined #archivebot
[04:06] If it can archive 1,920 pages could you please help with
[04:08] archiving https://xbooru.com/index.php?page=post&s=list&tags=real_person
[04:08] Where it only crawls through pages with https://xbooru.com/index.php?page=post&s=list&tags=real_person&pid=#
[04:08] where # is a number
[04:09] It's 796 pages for real_person
[04:09] 1914 pages for photo:
[04:10] Start at https://xbooru.com/index.php?page=post&s=list&tags=photo&pid=80346 so it crawls/progresses through the pages backwards
[04:10] all links to https://xbooru.com/index.php?page=post&s=list&tags=photo&pid=#
[04:12] Oh, and if there is a setting, each ...&pid=# web page has 42 unique links to https://xbooru.com/index.php?page=post&s=view&id=# pages
[04:13] If there is a setting to crawl links multiple levels deep
[04:14] Each ID page (https://xbooru.com/index.php?page=post&s=view&id=#) has 3 to 4 links:
[04:14] https://xbooru.com/index.php?page=history&type=tag_history&id=#
[04:15] https://xbooru.com/index.php?page=history&type=page_notes&id=#
[04:16] https://img.xbooru.com//images/#/#.ext where "ext" is jpg, jpeg, png, gif and "#" is one or more numbers
[04:21] All of them have those 3. But only some have one or more https://xbooru.com/index.php?page=post&s=view&id=#&pid=# links
[04:21] (>10 comments)
[04:23] If I could figure how to run this thing if I even can or if someone else could archive all the aforewritten links then I would appreciate it. I'll be reading the docs and stuff.
[04:28] !archive https://xbooru.com/index.php?page=post&s=list&tags=photo&pid=80346
[04:28] PlsHelp: Sorry, only channel operators or voiced users may use that command.
[04:29] Now I remember that I cannot actually use this thing.
[04:31] join #archive-b.s.

March 7 or 8, 2020: chat.efnet.org:9090/#archiveteam
#archiveteam: Archive Team: We're not archive.org | https://archiveteam.org/ | Discussion: #archiveteam-bs | Offtopic: #archiveteam-ot
[22:55] == PlsHelp [webchat@c-73-3-132-105.hsd1.co.comcast.net] has joined #archiveteam
[22:55] "there. On march 31 we will purge all real porn off Xbooru. Please take this time to download & save & upload content to realbooru." --Admin at https://xbooru.com/index.php?page=forum&s=view&id=1581
[22:55] Please help me archive this stuff at:
[22:56] https://xbooru.com/index.php?page=post&s=list&tags=photo
[22:56] https://xbooru.com/index.php?page=post&s=list&tags=real_person
[22:57] I think https://xbooru.com/index.php?page=post&s=list&tags=real is the same thing as https://xbooru.com/index.php?page=post&s=list&tags=photo
[22:58] Is there anyone here? I'll be running wget with my 4 TB hard drive.
[23:06] == gbd [~Test@172.58.207.69] has joined #archiveteam
[23:06] == Test_ [~Test@172.58.207.69] has quit [Read error: Connection reset by peer]
[23:09] == PlsHelp_ [webchat@c-73-3-132-105.hsd1.co.comcast.net] has joined #archiveteam
[23:10] I logged into this thing with my laptop and not tablet now.
[23:10] Did anyone see my previous messages?
[23:19] == morgandaw [~MeeDee@c-24-6-56-178.hsd1.ca.comcast.net] has joined #archiveteam
[23:20] == dhyan_nat [~nataraj@91.203.188.92] has quit [Read error: Operation timed out]
[23:23] == dhyan_nat [~nataraj@91.203.188.92] has joined #archiveteam
[23:29] == gbd [~Test@172.58.207.69] has quit [Ping timeout: 276 seconds]
[23:29] == MeeDee [~MeeDee@c-24-6-56-178.hsd1.ca.comcast.net] has quit [Ping timeout: 745 seconds]
[23:40] magic word
[23:40] Do I still have to write the magic word to join? Nevermind, that is just to register at the wiki.
[23:42] If anyone wants to help archive xbooru, do it with web.archive.org/archive.is. I am currently running curl -v --tlsv1.2 "https://xbooru.com/index.php?page=post&s=view&id=#" -o #.html on page IDs 13-200000
[23:44] I just hope they don't have something to block people like me running a batch script.
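
Editor's sketch (not part of the log): the URL patterns described above, where "#" stands for a number, could be expanded into concrete URL lists roughly as follows. The function names are hypothetical. The code assumes that pid is a post offset advancing by 42 per list page and that the first page is pid=0. That assumption is inferred from the numbers in the log (1914 pages for photo, starting at pid=80346, which is 1913 × 42) and has not been checked against the site. The code only builds URLs and fetches nothing.

```python
from urllib.parse import urlencode

BASE = "https://xbooru.com/index.php"
POSTS_PER_PAGE = 42  # from the log: each list page links to 42 post pages


def list_page_urls(tag, pages, per_page=POSTS_PER_PAGE):
    """Yield list-page URLs for a tag, last page first (crawling backwards).

    Assumption: pid is an offset of per_page posts per page, starting at 0.
    """
    for page in range(pages - 1, -1, -1):
        query = urlencode(
            {"page": "post", "s": "list", "tags": tag, "pid": page * per_page}
        )
        yield f"{BASE}?{query}"


def view_page_urls(first_id, last_id):
    """Yield post-view URLs for every ID in [first_id, last_id]."""
    for post_id in range(first_id, last_id + 1):
        query = urlencode({"page": "post", "s": "view", "id": post_id})
        yield f"{BASE}?{query}"


if __name__ == "__main__":
    # Page counts taken from the log: 1914 for photo, 796 for real_person.
    for url in list_page_urls("photo", 1914):
        print(url)
    for url in list_page_urls("real_person", 796):
        print(url)
```

One possible use is to redirect the output to a file and pass it to a downloader that accepts a URL list, for example wget with its -i option. Adding a delay between requests would be a reasonable precaution, given the concern about batch scripts being blocked.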