Archive.ph, .is and .today don't seem to load and I believe this might be linked to the FBI investigation. I'd like to know if anyone knows other sites like this.
I'm wondering if it's linked to the investigation I recently heard about
I’ve had this blog for about 3 years and it has around 6k posts. I would like to print it all out but I have no clue where to start. Could I hire someone to write a program that scrapes each webpage? Sorry I really have very little tech/coding knowledge so I am pretty lost. Any advice would be super helpful as I have not been able to find anything online about this 🙏 thank you :)
I want some basic sanity check to run on files I automatically archive, since it may be years before corruption is noticed manually.
My methods/ideas so far:
- play back the video file (wanted to watch them anyway)
- look at thumbnails of the image files in the file explorer
- generate a preview image for the video/gallery as multiple thumbnails next to one another (had to do that anyway)
- convert the video file with ffmpeg (had to convert them anyway)
- check the metadata of the media file (ffprobe)
- load the image in an image manipulation library and do some basic manipulation (rotate, resize); don't save the result to disk, but make sure the manipulation actually happened
None of these seem like the best way to do it, and I have stopped doing them (besides the stuff I do for other reasons).
I don't mean checksums (SHA..., CR..., blake...), since it's possible that the file was already corrupted on the server I'm downloading it from (has happened to me🙄).
For text files like JSON, HTML or XML it should be enough to parse them to check if they are valid. But even here it's not that easy,
Do you guys check/validate your media files after downloading?
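A minimal sketch of the two ideas above: parse text formats to validate them, and run a full decode pass over media files. It assumes `ffmpeg` is on the PATH, and the function names are made up for illustration. Decoding to the null muxer with `-v error` only prints decode errors, so a zero exit code and empty stderr suggest the file is at least decodable. It cannot tell you whether the content is what you expected.

```python
import json
import subprocess
import xml.etree.ElementTree as ET
from pathlib import Path


def text_file_ok(path: Path) -> bool:
    """Return True if a .json or .xml file parses without error."""
    try:
        if path.suffix.lower() == ".json":
            json.loads(path.read_text(encoding="utf-8"))
        elif path.suffix.lower() == ".xml":
            ET.parse(path)
        else:
            # Other formats are out of scope for this sketch.
            raise ValueError(f"unsupported type: {path.suffix}")
        return True
    except (json.JSONDecodeError, ET.ParseError, UnicodeDecodeError):
        return False


def ffmpeg_decode_cmd(path: Path) -> list[str]:
    """Build an ffmpeg command that decodes every stream and discards the output."""
    return ["ffmpeg", "-v", "error", "-i", str(path), "-f", "null", "-"]


def media_file_ok(path: Path) -> bool:
    """Run a full decode; treat a non-zero exit or any stderr output as a failure."""
    result = subprocess.run(ffmpeg_decode_cmd(path), capture_output=True, text=True)
    return result.returncode == 0 and result.stderr.strip() == ""
```

A full decode reads the whole file, so it is slow on large videos, but unlike an `ffprobe` metadata check it also touches the data in the middle of the file.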
Got 2 refurbished hard drives from Bargain Hardware on eBay. Plugged them in, created a TrueNAS dataset, and they are making sounds like light knocking. They made a constant noise when I first plugged them in, which I thought was just the drives initialising. PSU is 850 W.
I want to back up my 4K discs and Blu-rays, get a few HDDs (like 2 x 20 TB), and play my movies from them through an Nvidia Shield to my 4K TV. No streaming. What is the best HDD setup?
I'm looking for a network switch that I really only need to have two 10gbps RJ45 ports, and then three to four 2.5gbps or even 1gbps ports.
I'm also new to NASs, servers, etc, so I don't want something managed unless it runs out of the box with basically zero setup. I can't overstate this: I barely use command line tools in Windows, have never used another OS, etc. I basically just want the switch to act as a splitter.
It seems though that all the switches I can find either
- Only have the 10gbps ports in SFP
- Have more like 8-12 ports, much more than I need and therefore more expensive
- Are from no-name brands whose reliability I am concerned about.
The least bad options I've found or have been directed to are the QNAP QSW-2104-2T-R2-US and the TRENDnet TEG-S762, which seem ideal, but I have been told they have reliability issues and they share hardware.
yt-channel-downloader is a cross-platform open source desktop application built to simplify downloading YouTube and non-YouTube video and audio content. It has under the hood, paired with an easy-to-use interface (Qt6 GUI). This tool aims to offer you a seamless experience to get your favorite video and audio content saved offline. You can selectively or fully download channels, playlists, or individual videos from multiple platforms, opt for audio-only tracks, download the associated thumbnails, and specify the quality and format for your video or audio to download.
This app is different from similar apps in the sense that it allows to get not just single videos, but selectively or fully get an entire channel or playlist, and customize the audio/video quality to one's liking with an easy clickable GUI, progress indicators, download fallbacks, and heuristics to ensure proper core function.
Easy run in two steps with pip:
pip install yt-channel-downloader
yt-channel-downloader
Source code on .
The binary releases for Windows, macOS, and Linux (Debian-compatible) are available from the .
Suggestions for new features, bug reports, and ideas for improvements are welcome.
Hi, Farmers Almanac is closing after 208 years
Is anyone able to archive the contents? Or able to let me know how to do a bulk save? I'm relatively new to the datahoarder community. Thank you!
Hey, I'm moving about 1 gigabyte of data (around 54k files) from C:\ to E:\, and it takes ~4 minutes. Am I tweaking, or is that slow for such good SSDs? It averages 900 kB/s.
I'm testing the SSDs for use in a server setup later on for APIs.
Specs:
- B650 Tomahawk Wifi Plus II
- Ryzen 7 7700x
- RTX 4070
- 3x 990 Pro 2TB
- 64gb 6000mhz CL36 RAM
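As a quick sanity check on the figures in this post, the sketch below works out the implied average rates, assuming 1 GB = 10^9 bytes and exactly 240 seconds (both are assumptions, since the post only gives rough numbers). At a true 900 kB/s, 1 GB would take about 18 minutes, so the displayed rate and the 4-minute total do not describe the same average. With files this small, per-file overhead rather than raw SSD bandwidth is the likely limit.

```python
def copy_stats(total_bytes: float, file_count: int, seconds: float) -> dict[str, float]:
    """Derive average throughput figures from a timed bulk copy."""
    return {
        "mb_per_s": total_bytes / seconds / 1e6,
        "files_per_s": file_count / seconds,
        "avg_file_kb": total_bytes / file_count / 1e3,
    }


# Figures from the post: ~1 GB, ~54k files, ~4 minutes.
stats = copy_stats(total_bytes=1e9, file_count=54_000, seconds=4 * 60)
for name, value in stats.items():
    print(f"{name}: {value:.1f}")
# mb_per_s: 4.2
# files_per_s: 225.0
# avg_file_kb: 18.5
```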
Shucked two 18 TB drives (WD180EDGZ). I no longer have a use for them and threw the cases out. Wondering if there's a market for them and how to sell them. I'm just not sure how or where to do it!
I’m looking for an enclosure to put 2 or more 20 TB hard drives into. I need something with a power switch so I don’t have to unplug it every night when I unplug my computer. It should be stable and reliable. Any ideas?
Does any protection in 2.5" HDDs prevent external electromagnetic interference, for example: a 29" CRT TV placed near the 2.5" HDD, from causing any loss or degradation in the magnetism and data of that HDD?
Hi,
I had a Toshiba 18 TB MG09ACA18TE HDD report a pending sector that did not disappear with a badblocks (read) scan. I therefore replaced the HDD and started the RMA process. But before sending it in, I overwrote it with /dev/urandom, and now the pending sector is gone. I still see the errors in smartctl:
Error 148 occurred at disk power-on lifetime: 23569 hours (982 days + 1 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
40 43 50 a0 77 3f 40 Error: UNC at LBA = 0x003f77a0 = 4159392
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
60 20 50 a0 77 3f 40 00 2d+14:54:35.800 READ FPDMA QUEUED
60 20 48 80 77 3f 40 00 2d+14:54:35.799 READ FPDMA QUEUED
60 20 40 60 77 3f 40 00 2d+14:54:35.799 READ FPDMA QUEUED
60 20 38 40 77 3f 40 00 2d+14:54:35.799 READ FPDMA QUEUED
60 20 30 20 77 3f 40 00 2d+14:54:35.799 READ FPDMA QUEUED
but the SMART stats are "fine" now, no pending and no reallocated sectors. Will Toshiba reject the RMA now? I have screenshots of the failure warnings in the NAS and from the SMART stats from beforehand.
Hey everyone,
as some of you may already know, Webtoon Translate is being shut down. From what I’ve read, this is because the volunteer-created translations are now considered public domain — meaning Webtoon can’t monetize them, so the platform is being taken offline.
Here’s my problem: I’ve been reading Tower of God in German via Webtoon Translate to improve my language skills. There are hundreds of chapters, and with the shutdown coming soon, it’s impossible to read or save them all in time.
I’m not looking to redistribute or pirate anything — I only want to preserve the translations for personal use and for the sake of language learners who benefited from these volunteer works.
So I wanted to ask:
- Has anyone found a safe and ethical way to archive the German (or other language) versions before they’re gone?
- Are there any tools that work specifically for links (not the main Webtoon site)?
- If some translators are still active, is there a community project for preservation or rehosting under an open license?
It’d be a real shame to lose years of volunteer translation work that helped so many people learn languages and access stories across cultures.
Any help, advice, or discussion welcome 🙏
(If anyone knows of ongoing archival efforts, please tag or link them!)
i swear every time i go back to rewatch or revisit something online it’s just gone. like entire youtube channels wiped, old blogs showing 404 not found, clips i saved in playlists just vanishing overnight.
it’s honestly kinda scary how fragile the internet feels now. all this stuff that used to feel permanent just disappears like it never existed.
i’ve started saving everything i can before it’s too late but it’s getting out of hand lol. terabytes piling up and i’m not even sure what i’m gonna do with half of it.
does anyone else notice this happening more lately?
are y’all backing stuff up too or just letting it go?
I'm running a design and video editing studio. I plan to back up everything to the drive, so I will rarely use it since we already have internal disks in our PCs. Should I get the My Book 4TB or the Elements 5TB for durability? They're the same price in my country. Thank you.
For TrueNAS Scale build, wondering the following for running 64 GB ECC UDIMM RAM:
Would a 2 x 32 GB 3200 RAM kit ($370) have significantly better performance than a 4 x 16 GB 2666 RAM ($250) kit?
For a TrueNAS to be mostly used for:
- Automated backups (from my homelab and other devices)
- Accessing large video files and music projects
Specs:
- Mobo: ASRock B550 Pro4
- CPU: Ryzen 7 PRO 4750G
- HDD storage: 5 x WD Ultrastar DC HC580 (24TB SATA) - (5 wide vdev in RAIDZ2)
Both are basically same price right now (sub-$300) with 10% off discount at Seagate store and $15 off at eBay Seagate recertified store. What’s your pick?
Use case: Home NAS with low IO and the 3-2-1 backup rule implemented.
Corrected: Factory recertified Exos sold by Seagate, not the ones refurbished by 3rd parties.
I’m a total newbie, idk how to use yt-dlp. I couldn’t find how to do this anywhere. I want to download as many tutorials as I possibly can before the Russian government takes away VPNs and my access to free education.
I have a new Seagate Barracuda 6TB SMR drive in an external enclosure that I use as an offline backup for my home server. When I am done with my backups, I do the usual unmount in Windows, then power off the drive. It just occurred to me that even when I safely unmount the drive in Windows, AND with the USB unplugged, you can often still hear drive activity. I am assuming this is the drive's firmware re-shingling itself after a bunch of data has just been written to it, or some other sort of housekeeping.
So this has me concerned. With an SMR drive, is it safe to be turning it off WHILE the internal firmware is doing the re-shingling process? Even if you have unmounted the drive? Or do you need to wait until all drive activity has stopped?
I haven't found any reports online of data loss, but I am also not seeing anything confirming this is handled in a failsafe manner.
It’s my first holiday season as a hoarder. Think I’m left with 1.25tb on my 12tb drive, time for another.
Is it better to buy now or wait for Black Friday/holiday season? I was looking at 6tb drives to tide me over for a bit but I’d rather get more capacity from the get go. I know some retailers jack up prices so they can artificially lower them come holiday season, but I don’t know specific trends for our applications. All I know is prices keep going up.
I've been looking so much for this. I'm finally building my own NAS.
Specs:
- CPU: i3 12100
- RAM: DDR4 3200 MHz, 2x8 GB
- Mobo: Gigabyte H610 ITX
- PSU: Corsair 650e 2025
- Drives: 2x14 TB (X16 Seagate Exos)
I've read that Magician won't work with a NVMe drive in a USB 3 enclosure, but that a NVMe drive in a TB4 / USB4 enclosure looks like an internal PCIe disk and that Magician should work.
I installed a Samsung 990 Pro NVMe in a UGREEN 40Gbps M.2 NVMe Enclosure and connected it to my Mac Mini M2 Pro. I did a benchmark and got reads and writes > 3000 MB/s. Very nice! I then fired up Samsung Magician. It only partly supports the device. I can view SMART data and perform a benchmark test, but I can't check the firmware or display the disk temperature. I can see the temperature through SMART, but not being able to upgrade the firmware is not ideal.
Should I try a different enclosure or am I trying to do something that doesn't work with any enclosure?
Do we know if the fix for this nasty bug, which has been fixed upstream, is available in *ANY* of the TrueNAS releases? If so, which ones? (Hoping for preferably something in Scale):
Pretty much what the title says.
Trying to pick between an Exos drive with 100-1000 bad sectors and a 5-year warranty, and a WD Blue drive that's an extra $30 but has no listed bad sectors and a 3-year warranty.
Both are refurbished/recertified from reputable sellers.
I want to use it in my NAS alongside 3 (non-Pro) IronWolfs I got, since I never bought a 4th drive before prices skyrocketed. These are relatively cheap, even if still more expensive than the $150 per drive I paid months ago; those same drives are $220 now.
Hi, I'm upgrading my pc and I wanted to do a new clean Windows installation. I don't know if this is the correct sub for this, I'm sorry if that's the case, also I don't know much about this.
I currently have 2 NVMe SSDs and 2 internal SATA HDDs. Across the four of them I have about 2.7 TB worth of stuff. I have bought an external 4 TB HDD to copy the files onto.
My goal is not to create a full drive image. I just want to copy all the files onto the external HDD so that I will be able to grab them back when I need them. Then I'll format the four drives and use them, completely empty, in my new PC.
I downloaded TeraCopy, and my plan was to copy everything onto the external HDD and verify the files at the end to check that everything went right.
Now here's the reason why I'm making this post: I saw that there was an option called "test" that lets you check the integrity/readability of your files. I'm running that, but I saw that there are some folders which the program seemingly can't access (it says "folder not found" and other errors).
It's OS stuff like: \ProgramData\Desktop, \user\myname\Documents etc.
Now, I'm sure that I'm running the program as administrator and I'm sure that the folders exist (they were hidden and I can't access them myself but they are there).
My fear is that when I'll actually copy the files these folders will not be copied. Is this just a limitation of the test function? Is there a better way I can go about this?
I'm fine with having my data on single hdd and I want to copy absolutely everything. Ty in advance.
Hello,
I'm not a native English speaker, sorry for bad wording.
I want to get my first NAS and... the choice is wide.
I want something user-friendly, not too hard to set up, without too much maintenance, with a long period of vendor updates, and if possible not too expensive!
I'm into computing but not in that field (developer).
A friend suggested 2 options:
- Synology DS224+ & 2 HGST Ultrastar 4 TB hard disks
- QNAP TS‑216G (or TS‑233, but I think the 216G is better?) & 2 HGST Ultrastar 4 TB hard disks
Can you help me understand the pros and cons of both options?
Are they good options?
Thanks for your help
I am currently using AIMP, but I've tried a dozen other pieces of software today. I want my M3U playlists to be in my music library alongside my albums (like Spotify does). Is there any software that would allow that?
I want to buy a 6TB hard drive mainly to store thousands of family photos and videos, thousands of MP3 and FLAC songs, personal documents, invoices, etc.
Until now I’ve been using 4TB Western Digital Blue drives, but I need to upgrade to 6TB.
I’ve seen two 6TB Western Digital drives: one Blue CMR and one Red Plus, also CMR, both with 256MB cache.
The price is the same. I don’t want it for a NAS or anything like that, just for storage.
At the same price, which one would you choose?
Thanks.
I am in a horrible position with an external hard drive. Does anyone have any guidance on a data extraction that isn’t going to cost $800-$1,200?
I have a PC with a Gigabyte B360 HD3. When the card is inside the PC, it seems to hang the startup and keep the fans at 100%. None of the SAS drives spin up; however, the card has a flashing light on it. I’ve updated the BIOS to the latest possible to no avail, and tried a new slot to no avail.
Been debating getting into hoarding, or at least a NAS setup, for a bit and am on the lookout for cheap capacity. The local uni has some HGST/HC510 8TB drives at their reuse location, if they haven't been snapped up already. The couple I see are from 2015 and 2018, and the listing says 'passed health check'. What are your thoughts on what questions to ask, or would it not be worth picking them up? Obviously the price point comes with risk attached. From various searches and the reliability wiki here, the drives themselves seem to be relatively bulletproof, but the data I was seeing stopped in 2021, or maybe doesn't count this far out?
With the S&P 500 and Bitcoin tearing up the charts, are those red-hot areas the best places to invest $10,000 right now?
In the latest edition of Where to Invest, one expert Bloomberg asked about timely opportunities counsels going long on the US and AI. Others, however, point to areas of the US and European markets that may offer greater value and the potential for continued momentum in coming months and years. Favored sectors run from defense to industrials to life sciences tools companies and banks.
When the four wealth advisers were asked where they’d spend $10,000 on a personal interest, ideas stretched from buying whole genome sequencing for the family, to a trip to Australia with loved ones, to following a favorite sports team around the world.
Hi,
I'm currently running a mergerFS pool and use rsnapshot to back up to another disk.
I also plan to add SnapRAID to my mergerFS pool, but for now I'm thinking about how to extend my backup space as my backup disk is getting full.
I thought about
a) using another mergerFS without snapRaid as rsnapshot target
b) backing up different directories to different disks with multiple rules
c) use zfs or btrfs pools (raid or single)
The general setup is: in the mergerFS pool I have some folders with important stuff that rsnapshot backs up, and some media library stuff that does not get backed up at all currently (not important), but will at least be protected by SnapRAID in the future.
Im not quite sure what to do here and would be happy about some advice :)
Hey everybody!
I just finished building Arkibber, a free app that lets you leverage an LLM-powered middle layer to transform your query into a carefully crafted set of parameters to assist in tuning the output produced by your search.
So, I like to look for royalty-free outlets for viable assets to supplement my creative projects. However, when trying to leverage free content on websites like archive.org, I can sometimes fail to find interesting content. This wasn’t due to it not being present; mainly just a UX that seems heavily oriented towards very rigid-feeling static content retrieval, making it very frustrating for me to explore multi-media content. With hundreds of collections, subjects, and various publication years to sift through, finding a good search felt like striking gold. The issue then was that a few more filter tweaks left me lost in the straw heap.
For me, the best thing about Arkibber is iteration speed - I’m able to cycle through a wide set of natural language searches quickly, and test out my ideas. Some things aren’t available, but I’m still able to find that out way faster. Would really appreciate if some of y'all played around with it for a bit!
Every once in a while my DIY NAS hits the following errors:
[ 552.808886] ata6.00: failed command: WRITE FPDMA QUEUED
[ 552.809952] ata6.00: cmd 61/40:e8:90:2b:c4/00:00:0c:02:00/40 tag 29 ncq dma 32768 out
res 43/84:01:06:4f:c2/00:00:00:00:00/00 Emask 0x10 (ATA bus error)
[ 552.811735] ata6.00: status: { DRDY SENSE ERR }
[ 552.812414] ata6.00: error: { ICRC ABRT }
This in turn will eventually put BTRFS into read-only mode on the affected drive, which is annoying but ultimately a good thing I suppose.
Wiggling the cables around will move the problem to a different drive or fix it altogether. But eventually it'll be back, from the cables settling is my guess.
Building cardboard cable spacers has bought me several months since the last incident, but apparently that wasn't enough to fix it permanently.
This is 10 drives on two ASM1166 PCIe boards (5 drives on each) in an Asus Z270 WS board running Linux (btrfs, snapraid).
older photo without the cardboard spacers.
I never had such problems with the Dell PERC H310 and random Chinese cable whips. I switched to the ASM1166 for power savings (Germany).
Anybody got any other opinions or recommendations on how to deal with this for good?
I'd like to build a backup server for personal use. No streaming, transcoding or other fancy services, only a fileserver.
I've got a large ATX tower case dating from the last century with 8 drive bays as a starting point. The idea is to get a suitable motherboard and two 16+ TB hard drives, adding more drives later as needed.
Motherboard + CPU: would the ASUS Prime N100I-D D4-CSM be enough or should I go for the ASRock N100M? Are there any other reasonable options?
Obviously I'd need a 6 or 8 port PCIe SATA adapter. Any brands/models recommended or to be avoided? Would a single adapter present a bottleneck, so should I rather go for a 4+2 or 3+3 arrangement?
What PSU do I need that can handle 8 drives spinning up and doesn't shut down due to too little load when all of them go to sleep?
Would cooling be an issue? The case has one 80mm fan hole at the bottom/front and one at the top/back.
I can't put it anywhere near to a router, so I'd just plug in a USB WiFi adapter.
For the software I'd prefer some simple Linux-based open source solution. I don't want to be locked into an ecosystem where the vendor might get some funny ideas 5 or 10 years from now. Linux user for 30+ years, so not afraid of the command line, editing text files or writing scripts.
Hello there everyone, as the title said, I'm looking for a good NAS option to get. I have already looked at a few models, more specific Synology models.
But since I also read that Synology did some shitty things with forcing to use their own drives, I'm a little skeptical if I still want to go with this brand, so I'd love your input and suggestions there.
My requirements are as follows:
— at least 10 but better yet 12 TB in a raid 5, meaning at least a 4 slot NAS
— compatible with IP cameras, no specific models as I have none yet but wanna setup some cams in the future
— M.2 NVMe cache, not a killer but I'd like to have
— ability to connect phones/laptop outside of home network without a need for a VPN
I also already thought about making a DIY project like Open NAS, but I have no clue if that's a good idea or not, so if anyone knows something about that I'd love some input.
The webpage itself has hundreds of links, which are themselves compatible with yt-dlp, so I can copy-paste them, but I was looking for an automatic way to download it all.
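One possible way to automate this, sketched with only the Python standard library: save the page from the browser, collect every link in it, and write them to a text file that yt-dlp can read with its `-a` / `--batch-file` option. The file names, the base URL and the function names below are placeholders, and the sketch assumes the links appear as ordinary `<a href>` tags in the saved HTML (pages that build their links with JavaScript would need a different approach).

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            absolute = urljoin(self.base_url, href)
            if absolute not in self.links:  # keep page order, drop duplicates
                self.links.append(absolute)


def extract_links(html: str, base_url: str) -> list[str]:
    """Return all unique link targets found in the HTML, in page order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


def save_links(html_path: str, base_url: str, out_path: str) -> int:
    """Read a saved page, write one URL per line, and return the link count."""
    with open(html_path, encoding="utf-8") as f:
        links = extract_links(f.read(), base_url)
    with open(out_path, "w", encoding="utf-8") as f:
        f.write("\n".join(links) + "\n")
    return len(links)
```

After calling something like `save_links("page.html", "https://example.com/", "links.txt")`, running `yt-dlp -a links.txt` should work through the list one URL at a time. The list will also contain navigation links, so it may be worth filtering it before handing it over.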
Title.
I just bought a small server PC. It lacks sata ports.
I want to compensate with an external box. Something that only acts as a USB hub plus USB<>SATA adapter and holds 2 disks would be enough for me.
I do not need something really complex; the RAID can be handled by the server using LVM. I want to avoid having something to keep up to date, such as NAS/DAS firmware.
Buying the hub and adapter separately could work, but I dislike having 3 separate parts (potentially more if it needs power).
I want it low maintenance, ideally passively cooled
I have a lot of video files that are in MP4 and MKV format. Most of the time, I need to quickly browse and search these with Windows Explorer.
The problem I am encountering is that Explorer has the ability to display some fields of information but not others (using the available columns in Detailed view). Also, with MKV files, you cannot directly edit information in Explorer, as you can with MP4 files. There are also inconsistencies between the available fields for MKV vs MP4, even though they are both video formats.
The meta data that I would like to include in the files are: Director(s), Actors (usually just the top 3-5 stars in the production), Year (released), MPAA Rating (NR, G, PG, etc.), Title, Subtitle (for those shows that have sequels, etc.), Genre or keywords (which will include multiple keywords, like Action / Horror or Action / War / WWII), and comments.
I am looking for a tool that can load multiple files at once so that I can simply click on the pertinent fields and edit them quickly. The fields edited need to be visible in Explorer and consistent between MKV and MP4.
So far, MKVToolNix doesn't work because it doesn't handle MP4 and you can only edit one file at a time. It's also super complicated because it has way more functionality than I need for this task.
I have tried MP3Tag, which does a great job with MP4 and can handle MKV, but is a pain to set up, and the fields in the tool don't match the fields shown in Explorer. E.g. I can edit Producers for 3 MP4 files and 2 MKV files, and the info shows up in Explorer for the MP4 files but not the MKV files.
I have also tried Audio and Video Tag Editor Studio (the version of MKV Tag Editor that also includes MP4 files). However, it is clumsy and again, doesn't have consistency in the available fields that are compatible with Explorer, MP4, and MKV.
It seems that this would be a very common need; i.e. simply editing a few pieces of information for videos in different formats so that you can sort/search/catalogue them with a single tool like Explorer.
If anyone has some suggestions on how to accomplish this goal, I would be very grateful to hear about options. 🙏
So I'm planning on an uncompressed workflow for working on videos, so that YouTube's compression has the best chance of producing the highest quality at their low bitrates, though I'll store older videos in high-bitrate AV1. I'm also planning on having a massive amount of storage to throw whatever I want at, like security camera feeds, "legally" backed-up Blu-rays and such, without having to worry about running out of space.
My question is should I look at LTO drives for this or would just a hard drive array suffice long term?
I ran unRAID for years, loved it, but their weird hatred of SSDs eventually forced me over to TrueNAS, which I know is solid and software-wise, have had no complaints.
But, my "server" is a desktop motherboard with an AMD APU and seems to kill SSDs every so often for some reason, and I'm too tired to figure out why. At the point now where I want something that can serve up raw Blu-Ray rips to Emby/Plex/whatever for watching with the family, and also double as a backup server.
I am not interested in spinning disks as I have a large collection of SSDs already, but am open to hearing an argument if there's a good use-case.
Is there anything anyone can recommend or has good experience with that's ready to go, plug-and-play, and reliable that I can pick up?
Can spend up to $3k, obviously would prefer less, but at this point am tired of having to touch it at all.
I currently have 12 TB (6x 2TB) of SSDs on hand, so the disk type is sort of a foregone conclusion, at least for the immediate time being.
So it seems that Google Photos has changed its authentication policy and rclone is no longer an option.
I would like to have a local backup of my Google Photos library on my NAS. Are there any Linux clients that still support downloading your whole library?
I would really like to automate this, but maybe it isn't possible anymore?
I hope this is the right sub for the question, thanks!
In English there are plenty of aggregators, so there is no need to archive the entire thing. However, this isn't the case for Korean, and yesterday I found out that the only other site I used was shut down, leaving this one as my last go-to for free Korean webnovels. I tried to do it on my own, but the Chrome extensions kept breaking midway through and I don't have much background in coding, so I am lost. Any help would be great.
Site information (as far as I can tell):
- It uses Cloudflare and a captcha that gets triggered every few minutes (the captcha can technically be removed after logging in, except you can only register with a Naver email, which in turn requires a Korean number).
- Limited request rate.
- There is no clear table of contents, so you have to enter the name of the novel to get it, but there is a search function by genre and first letter of the name, which gives a list of 10 pages each. By using a combination of the two it's possible to expand and access more. It will still limit the results, but I am fine with that; something is better than nothing.
- The content itself is text, written kind of like articles but with multiple chapters. There are cases of images containing the text, but those are rare.
What I want to know:
- What tool is best to use in this case? I have a Windows machine. (If there is no easy-to-use tool, what should I focus on learning to scrape this particular website efficiently?)
- How to deal with Cloudflare and the captcha, preferably in as free a way as possible
- How to plan out an optimal search combination, and whether there are tutorials for similar cases to follow
- Estimated storage required (I only have a 2 TB HDD, but if necessary I can get more)
The results I want to achieve: each novel's title and content, preferably as txt or epub, but I will take anything that is readable (website screenshots, HTML files, etc., whatever is easier to get I guess)
Name of the site (please remove the "") book_toki_469._com
I’m planning a home media server and want to make sure I’m heading in the right direction before I start buying everything.
What I want the server to handle:
- Streaming 4K and 1080p media
- Up to 15–20 users max (not all active at once, but that’s the ceiling)
- Running Unraid
- Parity protection so the system can rebuild if a drive fails
- I want the ability to scale the array to at least 14–16 drives minimum (and possibly more later)
- One or two drives for personal backups (photos, documents, files)
- I want something I can grow into, not something I outgrow quickly
Hot swap is not required. It would just be nice to support later. With my current planned build I know I won’t have hot swap right away, but I’d like the setup to be able to move toward it in the future.
I will be starting with 3 drives first, and expanding slowly over time, so scalability and upgrade path really matter here.
Hard drive choice I’m deciding on:
- Seagate IronWolf Pro 28TB (NAS grade)
- Seagate Barracuda 24TB (desktop grade, cheaper)
IronWolf Pros are designed for multi-drive setups, vibration control, RAID rebuild behavior, and have longer warranties. But they cost more. I’m trying to figure out if they are the smarter long-term choice or if the Barracudas (or any other drives) realistically hold up fine in a home Unraid setup.
Current planned build (not purchased yet, open to feedback):
- CPU: Intel i5-14600K
- Motherboard: ASUS Prime B760-PLUS D4
- RAM: 32GB Corsair Vengeance DDR4
- GPU for Jellyfin transcoding: RTX 3050 6GB
- Power Supply: Corsair RM1200e fully modular
- HBA: LSI 9305-16i
- Starting drives: 3 × Seagate IronWolf Pro 28TB
Estimated cost so far is around $3200 before adding more drives.
Still deciding between building in a tower or going straight to a rack.
Option 1: Large tower case (Fractal Define 7 XL)
Simple and quiet, but expanding to 14–16 drives later can get messy, and adding hot swap support is harder.
Option 2: 22U server rack (Sysracks SRW 22.600B)
More space for future storage expansion, easier cable management, easier to add hot swap storage shelves later, room for UPS and networking inside the same rack. Costs more upfront but might avoid rebuilding everything later.
What I’m looking for feedback on:
-
For Unraid and long-term uptime, are IronWolf Pros worth the extra cost vs desktop drives?
-
Has anyone run desktop drives like Barracudas in a larger, always-on array? How did they hold up?
-
For those who planned for growth, did going with a rack pay off in the long run?
-
Any general feedback on the build, approach, or long-term planning is welcome.
Thanks in advance.
Hello all, as part of my project, I have a smaller hard drive backed up with operating systems, VM software, and drivers. I use Snappy Driver Installer (SDIO), which comes with 40 GB worth of drivers that apply to countless pieces of hardware, and I back up the drivers from any new computer I get to add them to my collection. But does anyone know if there is a repository I can siphon from, to have a more complete collection? Just curious if there's one out there.
I’ve created a small command-line tool that generates a hash-based, human-readable list of git repositories and data folders. Its purpose is to capture the exact state of all projects and files in a single plain-text file.
I built it because I work across multiple machines and often worry about which projects are on which computer or whether I’ve left any files in unique locations. Now I can diff the summaries between devices to see what’s out of sync, which repositories have uncommitted changes, and which folders have been modified.
I avoid using cloud sync services, and most of my files are already in git anyway. I find that having clear visibility is enough; I just need to know what to commit, push, pull, or sync manually.
I would be glad if it proves useful to someone besides me.
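For anyone wondering how a summary like this could work in principle, here is a minimal Python sketch. It is not the tool's actual implementation: the function names (`folder_digest`, `git_state`, `summarize`) and the output format are made up for illustration, and it assumes `git` is on the PATH for the repository case.

```python
import hashlib
import os
import subprocess


def folder_digest(root):
    """Return a SHA-256 hex digest over relative paths and file contents."""
    h = hashlib.sha256()
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # sort in place so the traversal order is deterministic
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            rel = os.path.relpath(path, root).replace(os.sep, "/")
            h.update(rel.encode("utf-8") + b"\0")
            with open(path, "rb") as f:
                for chunk in iter(lambda: f.read(1 << 20), b""):
                    h.update(chunk)
    return h.hexdigest()


def git_state(repo):
    """Return (HEAD commit hash, has_uncommitted_changes) for a git work tree."""
    head = subprocess.run(["git", "-C", repo, "rev-parse", "HEAD"],
                          capture_output=True, text=True, check=True).stdout.strip()
    status = subprocess.run(["git", "-C", repo, "status", "--porcelain"],
                            capture_output=True, text=True, check=True).stdout
    return head, bool(status.strip())


def summarize(parent):
    """One line per immediate subfolder: git repos by commit, other folders by content hash."""
    lines = []
    for name in sorted(os.listdir(parent)):
        path = os.path.join(parent, name)
        if not os.path.isdir(path):
            continue
        if os.path.isdir(os.path.join(path, ".git")):
            head, dirty = git_state(path)
            suffix = "  (uncommitted changes)" if dirty else ""
            lines.append(f"git  {name}  {head}{suffix}")
        else:
            lines.append(f"dir  {name}  {folder_digest(path)}")
    return lines
```

Writing the result of `summarize` to a text file on each machine and diffing the two files would show which folders differ, which is the workflow described above.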
I'm aware people have had issues with this drive before, but I need the extra storage on a bit of a tight budget. It's going to be a game drive, as my build has the following:
Patriot P300 NVMe (512GB): Primary boot drive + a few games like Helldivers
Seagate Barracuda 5400RPM HDD (4TB): files, photos, music, films. I wanted to put a few games like Silksong on it, but was advised against it due to an HDD being bad at this speed
So I was looking for an SSD for a game drive and came across this. Is it worth a go?
For a gaming drive (no-OS) would you go for XFS or EXT4 for games?
As far as I understand XFS works best with larger files, while the other works best with smaller files
How do you see the best scenario here?
Games do write a lot of smaller files, but once they are on the drive, is one filesystem faster than the other?
Does anyone know of a way to rip magazines from the Motortrend android app? They offer access to a very large archive of car magazines, and I want to hoard all of them. Is there a way to pull them from a cache or something? I also have a MSI android emulator with the app in it, running in Windows if that makes anything easier.
I’m building out a TrueNAS but a bit lost on what specific ECC RAM to get for my system, and what manufacturers are OK, and which are a no-no.
I would wait for RAM prices to go down, but the TrueNAS server is a high-priority for me.
——————————————
Questions:
1. I think I need at least 64 GB ECC RAM? Would 32 GB be too little for my system?
2. Which specific ECC RAM kit would be (relatively) affordable for my build?
———————————-
Specs:
Mobo: ASRock B550 Pro4 (6 x SATA)
CPU: Ryzen 5700G
Drives: 5 x 18TB SAS Ultrastar vdev (case can fit 11 x 3.5” HDDs total, will add 5 more later)
OS: TrueNAS Scale on 2 x Intel Enterprise SSDs (bought used for cheap) in RAID config
RAM: 64 GB ECC RAM (UDIMM) off eBay (how to get this at a reasonable price though?)
HBA: LSI 9300-8i
Fans: Noctua Industrial
I get the feeling this is probably as stupid as asking whether a pound of feathers is as heavy as a pound of lead, but here goes:
If two drives consume the same amount of power, but one is filled with helium and one is filled with air, will the helium-filled model run any cooler?
Hi all,
I'm not hugely tech savvy, so I apologize if this first post doesn't have all the necessary details or I'm asking a question with an obvious answer.
For years I've been collecting data and storing it across my Google Drive and various WD external drives. Recently I decided to connect my WD Elements to my Sony Bravia so I could watch some videos on it. Obviously whenever I unplugged and replugged the drive it would take the TV a bit to re-read all the data, but as long as I didn't unplug the drive all the data was ready to be accessed right away, even after the drive booted up from sleep.
Then the WD Elements fell two feet and stopped working. I picked up a Seagate Expansion Desktop Hard Drive since I'd been having other issues with WD and decided to switch brands after reading some good stuff about Seagate.
That's all a lot of set up to say that I've finally connected the Seagate to my TV but it seems as if it has to reread the entire disk every time it powers up from sleep, and that takes several minutes due to me having a few TB of videos.
Is there something inherently different about Seagate that causes this to happen? It's not a huge deal, but I did appreciate how WD didn't have this hangup.
Seeking (Re)Direction
I've preserved a VHS tape using a Panasonic DMR-ES45V (circa 2006) by burning it to DVD, but I would like to improve on the conversion workflow and try again. I dedicated time to learning about the problem space via Reddit/forum posts (e.g., digitalfaq and videohelp), YouTube videos (e.g., Technology Connections and Video Capture Guide), as well as trial and error. However, my budgeted time and money have almost run out, along with my patience for rabbit holing.
Current Setup
The VCR/DVD combo has an HDMI output, which is connected to AVerMedia's GC311 (Live Gamer MINI), and then passed into a Windows 11 laptop via USB Type A. I'm using AVerMedia's deprecated Stream Engine software for capturing. This workflow has encountered an HDCP issue, in spite of the VHS tape not having copy protection. The recording has audio without video. However, making a Nintendo Switch recording produced audio and video as expected.
Note: Based on Technology Connections' solution, perhaps I only need to output as S-Video and convert to HDMI first?
Willingness to Learn
I'm prepared to do a "For Dummies" book crash course (or an equivalent) if such a thing exists.
I’m building my first “truenas” box and initially had enough space in my workstation for 2 hdd’s and only needed one to be external, of which I had an enclosure for. As fate would have it, that workstation decided to unalive itself and my new workstation only has space for one full size hdd internally, so now I’m looking for a cheap or even a DIY solution for 2 external drives. Anyone have any suggestions for a low cost drive enclosure? I feel like there’s a ridiculously simple solution but my brain won’t allow me to figure it out. I’ve looked online and honestly am surprised I’m not finding anything inexpensive that’ll work for me.
Editing to add.. I’ve read that using TrueNas with external drives connected via USB can be problematic. I’m specifically looking for an enclosure with a sata connection or even just an enclosure that I can run cables through to the drives.
I've been trying to figure out how to maintain backups of the various PCs in my house. We have the main desktop backed up using backblaze and I'm using iDrive to back up the NAS.
As our use of the NAS has grown I was thinking about flipping the script and using something more reliable for the NAS backup. But as I was researching, I saw that iDrive has iDrive Mirror in beta, so I thought I'd give it a try for backing up the PCs. Any of the important data should be saved on the NAS at this stage, so I just need something that doesn't suck to backup the PCs in case of catastrophic failure. And at $20/year/PC with unlimited storage the price seemed too good to ignore
Unfortunately, iDrive Mirror is not the solution. I set multiple PCs off backing up with it on Friday and it looked like it was working. Then all of a sudden all the backup processes died. Nothing saved. No snapshots. No email warnings. Just a subtle error on the desktop app about Socket errors. Thinking I might have messed up my network, I tried backing up a laptop using a mobile hotspot. Same issue. So as far as I can tell, this backup service can suddenly die without warning. It feels like next to "not losing your backed up data", "successfully backs up your data first time you ask it" ranks pretty highly - Right above keeping the data stored securely (security doesn't matter too much if it doesn't back up in the first place).
I doubt this sub needed any other reasons to dunk on iDrive but I'm going to go back to not being a cheap-ass with my backup plans.
What is the safest archive type for high compression?
After losing everything to a max LZMA2 7z (Headers Error), I've been avoiding 7z like the plague.
What archive type is the safest and has the least chance of being corrupted?
I'm using ARK on Linux and trying to pick between LZMA Tar and Zip
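Whichever format is chosen, testing the archive right after creating it catches a bad file while the originals still exist. A minimal sketch using only Python's standard `zipfile` and `tarfile` modules follows; `verify_zip` and `verify_tar` are hypothetical helper names, and passing this check only shows the archive is readable today, not that it will stay that way.

```python
import tarfile
import zipfile


def verify_zip(path):
    """Return None if every member passes its CRC check, else the first bad member name."""
    with zipfile.ZipFile(path) as zf:
        return zf.testzip()


def verify_tar(path):
    """Read every regular file in a (possibly compressed) tar to the end.

    A damaged or truncated stream typically raises an exception during the
    read; otherwise the number of regular files read is returned.
    """
    count = 0
    with tarfile.open(path, "r:*") as tf:  # "r:*" auto-detects gz/bz2/xz compression
        for member in tf:
            if member.isfile():
                f = tf.extractfile(member)
                while f.read(1 << 20):
                    pass  # discard the data, we only care that it decodes
                count += 1
    return count
```

One design difference worth noting: a zip compresses each member separately, while a compressed tar is one continuous stream, so damage early in a tar.xz can affect everything after it.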
Not a Seagate fan, but this seems to be a huge deal, no?
Hey y’all. I was hoping to ask for your thoughts as a community on the best way to synchronise and share (with family and friends) my data store. I’ve read various reviews and NextCloud looks solid, but I wanted some real-world experience on what works well for you all and any potential issues to watch for. My goals are to share about 15 TB of photos, videos and documents securely, with two or three decentralised copies of the contents. Thanks for your thoughts!
Hello,
I'd say I'm a data hoarder myself, because I've backed up everything personal on my PCs, laptops, and smartphones since ~2007. My current "setup" is just two 4TB HDDs, where one is a mirror of the other.
I was aware of bitrot, but not of how quickly it can affect data. Yesterday I read an article stating that unpowered HDDs can become corrupted after just 5 years.
Since I don't want to lose data, I want to store it more safely. My first thought was burning the backups onto Blu-ray discs. Then I discovered M-Discs, which are claimed to last for 1000 years. I was about to order some, but then I read that for the past 3 years Verbatim has not been selling real M-Discs, only "normal" Blu-ray discs labeled as M-Discs.
Since M-Disc-labeled BD-Rs are much more expensive than "normal" BD-Rs, I'm not sure whether I should go for the M-Discs.
Do you have any suggestions for how I could archive my data for long-term storage? (It does not have to last 1000 years, since I guess I will die a little earlier than that.)
How do you download videos from Fansly? It looks like all the methods I’ve found online have stopped working.
Can anyone suggest a solution that I might not have tried yet?
Hi! I have a very large music collection, most of which has no .lrc files/lyrics. I am looking for some kind of CLI software that would automatically find and save a .lrc file for each song in my library (if available). The reason I'm looking for a CLI is that I can then run it as a cron job on my server.
I haven't really found anything still maintained that fits this criteria. Does anyone know of a good project? Thanks
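As a starting point for the cron-job side of this, here is a minimal Python sketch that only finds the tracks still lacking a sidecar .lrc file; it does not download anything. The function name `tracks_missing_lyrics` and the extension list are assumptions for illustration, and whatever lyrics fetcher ends up being used would take this list as its input.

```python
import os

# Assumed set of audio extensions; adjust to match the actual library.
AUDIO_EXTENSIONS = {".mp3", ".flac", ".m4a", ".ogg", ".opus", ".wav"}


def tracks_missing_lyrics(library_root):
    """Yield paths of audio files with no .lrc file of the same base name beside them."""
    for dirpath, _dirnames, filenames in os.walk(library_root):
        present = {name.lower() for name in filenames}
        for name in sorted(filenames):
            base, ext = os.path.splitext(name)
            if ext.lower() in AUDIO_EXTENSIONS and (base + ".lrc").lower() not in present:
                yield os.path.join(dirpath, name)
```

Calling `list(tracks_missing_lyrics("/path/to/music"))` from a small script run by cron would give the remaining work list on each run, so songs that already have lyrics are skipped.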
I thought it'd be cool to organize the PDFs and videos I have downloaded on a larger scale. Are there any tips you'd give a beginner, or tutorials online on how to get started with data hoarding?
Hi! I'm working on a few data management projects, and I'm not sure where to get started!
The primary issue is that I have multiple full Google accounts and I don't want to pay them for storage.
I don't mind paying for storage in general but I don't want to pay for several different Google accounts.
I would like to download all of my Google Photos images/videos, my Facebook data, my Instagram data, and the data from 3 Gmail accounts and store them somewhere, because while I've used social media as a kind of journal and love scrolling through my memories, I'd like to start deleting content from my accounts for more privacy.
A few issues:
-
I tried using Google Takeout to download my data and Google Photos, but it came in literally hundreds of zip files. I'm willing to put in the time to download them all, but I want to make sure first that that's the best approach.
-
Another issue I'm facing is that in my Google Drive, sub-folders didn't seem to download. I'd open certain folders and then see none of the subfolders. Is the only alternative to download Drive content manually?
-
I need to delete photos from my phone, and therefore from Google Photos. I've synced my phone to a (paid) Dropbox account, but I'm finding the Dropbox interface very counterintuitive and I don't feel confident that my photos will stay in Dropbox if I delete them from my phone.
In total, between my 3 Google accounts (photos, Drive, voice memos), 2 Instagram accounts, and 1 Facebook account, I'm looking at around 750GB of data -- not that much in the scheme of things, but kind of clunky to move around (especially for a data hoarding beginner!).
I'm looking for:
-
Advice on how best to download all this stuff
-
Cloud-based photo management platforms with facial recognition (so far I'm aware of Immich)
-
Any recommendations for a photo sorting platform that lets me easily keep and delete photos.
Thank you!!
Stream Detector used to perform this task successfully, but now it doesn't work on YouTube. I use the Brave browser. Any solutions?
Note: I won’t be able to use Firefox, the platform rejects any browser that isn’t a Chromium-based one.
Hello!
I very much want to save the livestream of my mother's funeral. I have seen other posts with instructions on how to do it using the websites' developer tools via F12, but once in the network tab, I can't find anything that looks like the further instructions. Whether this is something that has changed since the other posts, or is something specific to the website in my country, I don't know.
I could reach out to the streaming company or the funeral director to ask for and pay for the file, but I don't really want to bring it to their attention, as strictly speaking, the file shouldn't still be available. I know that it's late, but I misremembered how long I was supposed to have access to it. There are other family members and friends who were unable to attend who also very much want to see the service.
This is the link:
I want to save Camera 1, Camera 2, and Slideshow.
Thank-you very much to anyone who can help me.
Getting ready for Black Friday and getting a NAS. Is there really any difference between IronWolf, IronWolf Pro, WD Red Plus, Barracuda, etc.? They all seem to have more or less the same specs. I’ve skimmed several websites, but every place mixes its rankings around. Other than cost, is there a real benefit to one over another?
Hi!
I'm looking for a single bay docking station with external power that supports 28-30TB drives. Every dock I see supports up to 22 or 24TB. Any help? Thank you
Is there any way to download pornhub videos?
Please don't say yt-dlp, because I don't want any DNS lookups to be logged; I watch Pornhub from Tor Browser. The websites have stopped working.
I’ve got thousands of old printed photos from the 80s and 90s that I’d like to digitise and organise. I’m looking for advice and recommendations from anyone who has tackled a similar project.
-
What’s the best approach for high-volume scanning—should I do it myself at home, or is it worth paying for a professional service?
-
Are there specific scanners or workflows you’d suggest for speed and quality?
-
Once they’re digitised, how do you organise such a large collection? Any software, tagging methods, or folder structures that have worked well for you?
-
Has anyone used facial recognition or AI-powered tools to help sort and group photos by people or events?
-
Bonus points for any tips on backing up and safely storing large photo libraries.
I’m open to all suggestions—hardware, software, and general workflow. Keen to hear how others in the DataHoarder community have approached this!
I should have started this from the beginning, but I didn't so here I am.
These drives are used for Jellyfin. They are all WD Red Pro (NAS drives, although I've never used a NAS). They are housed in two of these: and connected to my PC via USB-C.
As my hoarding grows, I see this is unmanageable. I've been looking at this: and from my understanding RAID would be the way to go so I can have a single unified storage solution?
The reason being is with my *arr stack this is becoming unmanageable because I have "TV - 1080p", "TV - 4K" (etc) folders across 7 drives. I want a unified solution for this.
Currently I have:
-
1 12TB
-
2 14TB
-
4 22TB
My understanding is with RAID all the drives need to be the same size, or you're limited by the smallest disk. So the 12TB and 14TB drives are basically useless in this so I'd need to get more 22TB drives, start the array and start copying stuff over.
Is this the right thinking? If so, I made a big fuck-up by not doing this sooner.
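The "limited by the smallest disk" rule can be checked with quick arithmetic. The sketch below assumes a conventional striped-parity array (RAID 5 with one parity disk, RAID 6 with two), where every member contributes only as much as the smallest disk; the function name `raid_usable_tb` is made up for illustration, and pooling systems that handle mixed sizes differently are not modeled here.

```python
def raid_usable_tb(disk_sizes_tb, parity_disks=1):
    """Usable capacity of a conventional striped-parity array (RAID 5: 1, RAID 6: 2).

    Every member contributes only as much as the smallest disk in the array.
    """
    if len(disk_sizes_tb) <= parity_disks:
        raise ValueError("need more disks than parity disks")
    return (len(disk_sizes_tb) - parity_disks) * min(disk_sizes_tb)


all_seven = [12, 14, 14, 22, 22, 22, 22]  # the drives listed above, in TB
only_22s = [22, 22, 22, 22]

print(raid_usable_tb(all_seven))  # 72  (6 data disks x 12 TB)
print(raid_usable_tb(only_22s))   # 66  (3 data disks x 22 TB)
print(sum(all_seven))             # 128 (raw capacity)
```

So under these assumptions the smaller drives are not strictly useless in a single RAID 5, but 56 TB of the 128 TB raw capacity would go unused, which is why mixed sizes are usually a poor fit for this kind of array.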
Samsung wins on speed, being double that of the WD one, but since I am looking for an external SSD solution, what do you guys recommend? Also, what enclosure would give the most throughput for these drives?
There is also the Crucial 8TB, SanDisk 8TB, and a few others... which would make the most sense?
So I purchased a course around 5 days ago, and I get access to the course content for only 1 week. Now I want to download the videos so I can access them later. The website (teachcode.in) does not allow downloading the videos directly (obviously). Generally in this type of situation I download videos through Inspect Element (Developer Tools) by going to Network --> Media to find the .mp4 file and downloading it, but in this case, when I open the developer tools, the video shows "Paused in debugger". Are there any ways to download it, even through third-party extensions or any other possible methods (preferably free)?
Help will be really appreciated.
Hi everyone,
I've got some weird behaviour with one of my HDDs and hope to find some answers here.
I have 6 x 20 TB Seagate Exos X X20 20TB (ST20000NM007D) in mdraid, raid6 (md0).
Once a month I run checkarray, and this time I got some errors I can't explain.
I woke up to two mdadm monitoring emails informing me about a fail event and a degraded array event, so I investigated further and checked dmesg:
2025-11-02T23:01:05,921290+01:00 md: data-check of RAID array md0 2025-11-02T23:01:05,942946+01:00 md: data-check of RAID array md1 [*unrelated stuff*] 2025-11-03T00:17:51,185931+01:00 sd 0:0:0:0: [sdc] tag#3258 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s 2025-11-03T00:17:51,185934+01:00 sd 0:0:0:0: [sdc] tag#3258 Sense Key : Not Ready [current] 2025-11-03T00:17:51,185935+01:00 sd 0:0:0:0: [sdc] tag#3258 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,185937+01:00 sd 0:0:0:0: [sdc] tag#3258 CDB: Read(16) 88 00 00 00 00 00 59 1a 6d 98 00 00 01 00 00 00 2025-11-03T00:17:51,185938+01:00 I/O error, dev sdc, sector 1494904216 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186018+01:00 sd 0:0:0:0: [sdc] tag#3260 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s 2025-11-03T00:17:51,186019+01:00 sd 0:0:0:0: [sdc] tag#3260 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186020+01:00 sd 0:0:0:0: [sdc] tag#3260 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186021+01:00 sd 0:0:0:0: [sdc] tag#3260 CDB: Read(16) 88 00 00 00 00 00 59 1a 4f 98 00 00 01 00 00 00 2025-11-03T00:17:51,186021+01:00 I/O error, dev sdc, sector 1494896536 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186090+01:00 sd 0:0:0:0: [sdc] tag#3262 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s 2025-11-03T00:17:51,186091+01:00 sd 0:0:0:0: [sdc] tag#3262 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186092+01:00 sd 0:0:0:0: [sdc] tag#3262 Add. 
Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186093+01:00 sd 0:0:0:0: [sdc] tag#3262 CDB: Read(16) 88 00 00 00 00 00 59 1a 4e 98 00 00 01 00 00 00 2025-11-03T00:17:51,186093+01:00 I/O error, dev sdc, sector 1494896280 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186166+01:00 sd 0:0:0:0: [sdc] tag#3200 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s 2025-11-03T00:17:51,186168+01:00 sd 0:0:0:0: [sdc] tag#3200 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186169+01:00 sd 0:0:0:0: [sdc] tag#3200 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186170+01:00 sd 0:0:0:0: [sdc] tag#3200 CDB: Read(16) 88 00 00 00 00 00 59 1a 69 98 00 00 01 00 00 00 2025-11-03T00:17:51,186171+01:00 I/O error, dev sdc, sector 1494903192 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186250+01:00 sd 0:0:0:0: [sdc] tag#3201 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=3s 2025-11-03T00:17:51,186251+01:00 sd 0:0:0:0: [sdc] tag#3201 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186254+01:00 sd 0:0:0:0: [sdc] tag#3201 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186255+01:00 sd 0:0:0:0: [sdc] tag#3201 CDB: Read(16) 88 00 00 00 00 00 59 1a 6a 98 00 00 01 00 00 00 2025-11-03T00:17:51,186256+01:00 I/O error, dev sdc, sector 1494903448 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186343+01:00 sd 0:0:0:0: [sdc] tag#3204 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s 2025-11-03T00:17:51,186345+01:00 sd 0:0:0:0: [sdc] tag#3204 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186346+01:00 sd 0:0:0:0: [sdc] tag#3204 Add. 
Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186347+01:00 sd 0:0:0:0: [sdc] tag#3204 CDB: Read(16) 88 00 00 00 00 00 59 1a 6e 98 00 00 01 00 00 00 2025-11-03T00:17:51,186348+01:00 I/O error, dev sdc, sector 1494904472 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186423+01:00 sd 0:0:0:0: [sdc] tag#3205 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s 2025-11-03T00:17:51,186425+01:00 sd 0:0:0:0: [sdc] tag#3205 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186426+01:00 sd 0:0:0:0: [sdc] tag#3205 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186428+01:00 sd 0:0:0:0: [sdc] tag#3205 CDB: Read(16) 88 00 00 00 00 00 59 1a 6f 98 00 00 01 00 00 00 2025-11-03T00:17:51,186428+01:00 I/O error, dev sdc, sector 1494904728 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186502+01:00 sd 0:0:0:0: [sdc] tag#3206 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s 2025-11-03T00:17:51,186504+01:00 sd 0:0:0:0: [sdc] tag#3206 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186505+01:00 sd 0:0:0:0: [sdc] tag#3206 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186506+01:00 sd 0:0:0:0: [sdc] tag#3206 CDB: Read(16) 88 00 00 00 00 00 59 1a 70 98 00 00 01 00 00 00 2025-11-03T00:17:51,186507+01:00 I/O error, dev sdc, sector 1494904984 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186584+01:00 sd 0:0:0:0: [sdc] tag#2927 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s 2025-11-03T00:17:51,186586+01:00 sd 0:0:0:0: [sdc] tag#2927 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186587+01:00 sd 0:0:0:0: [sdc] tag#2927 Add. 
Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186590+01:00 sd 0:0:0:0: [sdc] tag#2927 CDB: Read(16) 88 00 00 00 00 00 59 1a 71 98 00 00 01 00 00 00 2025-11-03T00:17:51,186591+01:00 I/O error, dev sdc, sector 1494905240 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,186664+01:00 sd 0:0:0:0: [sdc] tag#3207 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_OK cmd_age=0s 2025-11-03T00:17:51,186666+01:00 sd 0:0:0:0: [sdc] tag#3207 Sense Key : Not Ready [current] 2025-11-03T00:17:51,186667+01:00 sd 0:0:0:0: [sdc] tag#3207 Add. Sense: Logical unit not ready, cause not reportable 2025-11-03T00:17:51,186669+01:00 sd 0:0:0:0: [sdc] tag#3207 CDB: Read(16) 88 00 00 00 00 00 59 1a 72 98 00 00 01 00 00 00 2025-11-03T00:17:51,186669+01:00 I/O error, dev sdc, sector 1494905496 op 0x0:(READ) flags 0x4000 phys_seg 32 prio class 0 2025-11-03T00:17:51,336817+01:00 md/raid:md0: 21036 read_errors > 21035 stripes 2025-11-03T00:17:51,336820+01:00 md/raid:md0: Too many read errors, failing device sdc1. 2025-11-03T00:17:51,336821+01:00 md/raid:md0: Disk failure on sdc1, disabling device. 2025-11-03T00:17:51,336866+01:00 md/raid:md0: Operation continuing on 5 devices. 2025-11-03T00:17:51,565901+01:00 md: md0: data-check interrupted. 2025-11-03T06:39:21,678375+01:00 sd 0:0:0:0: Power-on or device reset occurred 2025-11-03T13:54:33,416711+01:00 md: md1: data-check done.
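For anyone reading the log above, the "CDB: Read(16)" lines can be decoded by hand. A READ(16) command block is 16 bytes: opcode 0x88, a flags byte, an 8-byte big-endian LBA, a 4-byte transfer length in logical blocks, then group and control bytes. The small Python sketch below (the helper name `decode_read16` is made up for illustration) shows that the LBA in the first failed command is the same number the kernel prints as the failing sector.

```python
import struct


def decode_read16(cdb_hex):
    """Decode a SCSI READ(16) CDB given as space-separated hex bytes.

    Layout: byte 0 opcode (0x88), byte 1 flags, bytes 2-9 LBA (big-endian),
    bytes 10-13 transfer length in logical blocks, byte 14 group, byte 15 control.
    """
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 16 or cdb[0] != 0x88:
        raise ValueError("not a READ(16) CDB")
    lba, blocks = struct.unpack(">QI", cdb[2:14])
    return lba, blocks


lba, blocks = decode_read16("88 00 00 00 00 00 59 1a 6d 98 00 00 01 00 00 00")
print(lba, blocks)  # 1494904216 256
```

With 512-byte logical sectors, 256 blocks is a 128 KiB read, so each failed command here is an ordinary read from the data-check. The sense data says the drive reported itself as not ready, rather than reporting a medium error for that sector.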
So I removed sdc from the array, did a short self test (smartctl -t short /dev/sdc) followed by a long self test (smartctl -t long /dev/sdc).
Both reported everything OK:
smartctl 7.4 2023-08-01 r5530 [x86_64-linux-6.1.0-40-amd64] (local build)
Copyright (C) 2002-23, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Device Model: ST20000NM007D-3DJ103
Serial Number: ZVT9PCFE
LU WWN Device Id: 5 000c50 0e69bc39b
Firmware Version: SN03
User Capacity: 20,000,588,955,648 bytes [20.0 TB]
Sector Sizes: 512 bytes logical, 4096 bytes physical
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Device is: Not in smartctl database 7.3/5528
ATA Version is: ACS-4 (minor revision not indicated)
SATA Version is: SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is: Sun Nov 9 07:58:28 2025 CET
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 567) seconds.
Offline data collection
capabilities: (0x7b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 1) minutes.
Extended self-test routine
recommended polling time: (1708) minutes.
Conveyance self-test routine
recommended polling time: ( 2) minutes.
SCT capabilities: (0x70bd) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 074 064 044 Pre-fail Always - 25007752
3 Spin_Up_Time 0x0003 091 090 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 40
5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 072 060 045 Pre-fail Always - 17691187
9 Power_On_Hours 0x0032 080 080 000 Old_age Always - 18145
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 40
18 Unknown_Attribute 0x000b 100 100 050 Pre-fail Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 066 044 000 Old_age Always - 34 (Min/Max 33/39)
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 37
193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 732
194 Temperature_Celsius 0x0022 034 041 000 Old_age Always - 34 (0 19 0 0 0)
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
200 Multi_Zone_Error_Rate 0x0023 100 100 001 Pre-fail Always - 0
240 Head_Flying_Hours 0x0000 100 100 000 Old_age Offline - 18143 (204 138 0)
241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 301968082432
242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 1392228696377
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Short offline Completed without error 00% 18061 -
# 2 Extended offline Completed without error 00% 18026 -
# 3 Short offline Completed without error 00% 17999 -
# 4 Extended offline Completed without error 00% 17889 -
# 5 Extended offline Completed without error 00% 17721 -
# 6 Extended offline Completed without error 00% 17547 -
# 7 Extended offline Completed without error 00% 17394 -
# 8 Extended offline Completed without error 00% 17212 -
# 9 Short offline Completed without error 00% 17024 -
#10 Extended offline Interrupted (host reset) 50% 17017 -
#11 Short offline Completed without error 00% 16992 -
#12 Extended offline Completed without error 00% 16849 -
#13 Extended offline Interrupted (host reset) 90% 16686 -
#14 Extended offline Completed without error 00% 16532 -
#15 Short offline Completed without error 00% 16489 -
#16 Extended offline Completed without error 00% 16361 -
#17 Short offline Completed without error 00% 16321 -
#18 Extended offline Completed without error 00% 16194 -
#19 Short offline Completed without error 00% 16153 -
#20 Extended offline Completed without error 00% 16028 -
#21 Short offline Completed without error 00% 15986 -
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
The above only provides legacy SMART information - try 'smartctl -x' for more
After this I tried writing and reading the whole disk (fio --name=writetest --filename=/dev/sdc --rw=write --bs=1M --direct=1 --ioengine=libaio --iodepth=16 --numjobs=1 --verify=crc32), without errors too:
writetest: (g=0): rw=write, bs=(R) 1024KiB-1024KiB, (W) 1024KiB-1024KiB, (T) 1024KiB-1024KiB, ioengine=libaio, iodepth=16
fio-3.33
Starting 1 process
Jobs: 1 (f=1): [V(1)][100.0%][r=118MiB/s][r=118 IOPS][eta 00m:00s]
writetest: (groupid=0, jobs=1): err= 0: pid=2516205: Thu Nov 6 12:42:32 2025
read: IOPS=208, BW=208MiB/s (218MB/s)(18.2TiB/91695431msec)
slat (usec): min=8, max=18316, avg=35.99, stdev=21.63
clat (msec): min=35, max=1828, avg=74.57, stdev=17.58
lat (msec): min=35, max=1828, avg=74.60, stdev=17.58
clat percentiles (msec):
| 1.00th=[ 55], 5.00th=[ 58], 10.00th=[ 59], 20.00th=[ 61],
| 30.00th=[ 63], 40.00th=[ 65], 50.00th=[ 69], 60.00th=[ 73],
| 70.00th=[ 80], 80.00th=[ 88], 90.00th=[ 103], 95.00th=[ 113],
| 99.00th=[ 125], 99.50th=[ 129], 99.90th=[ 138], 99.95th=[ 155],
| 99.99th=[ 192]
write: IOPS=207, BW=208MiB/s (218MB/s)(18.2TiB/91781273msec); 0 zone resets
slat (usec): min=2386, max=29276, avg=2507.81, stdev=189.37
clat (msec): min=36, max=2690, avg=74.48, stdev=18.05
lat (msec): min=38, max=2692, avg=76.99, stdev=18.02
clat percentiles (msec):
| 1.00th=[ 55], 5.00th=[ 57], 10.00th=[ 59], 20.00th=[ 61],
| 30.00th=[ 63], 40.00th=[ 65], 50.00th=[ 68], 60.00th=[ 73],
| 70.00th=[ 80], 80.00th=[ 88], 90.00th=[ 103], 95.00th=[ 113],
| 99.00th=[ 126], 99.50th=[ 130], 99.90th=[ 159], 99.95th=[ 180],
| 99.99th=[ 243]
bw ( KiB/s): min=61440, max=307815, per=100.00%, avg=212893.85, stdev=44412.65, samples=183562
iops : min= 60, max= 300, avg=207.82, stdev=43.36, samples=183562
lat (msec) : 50=0.05%, 100=88.46%, 250=11.49%, 500=0.01%, 750=0.01%
lat (msec) : 1000=0.01%, 2000=0.01%, >=2000=0.01%
cpu : usr=49.79%, sys=0.65%, ctx=196520266, majf=63407, minf=585050
IO depths : 1=0.1%, 2=0.1%, 4=0.1%, 8=0.1%, 16=100.0%, 32=0.0%, >=64=0.0%
submit : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
complete : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.1%, 32=0.0%, 64=0.0%, >=64=0.0%
issued rwts: total=19074048,19074048,0,0 short=0,0,0,0 dropped=0,0,0,0
latency : target=0, window=0, percentile=100.00%, depth=16
Run status group 0 (all jobs):
READ: bw=208MiB/s (218MB/s), 208MiB/s-208MiB/s (218MB/s-218MB/s), io=18.2TiB (20.0TB), run=91695431-91695431msec
WRITE: bw=208MiB/s (218MB/s), 208MiB/s-208MiB/s (218MB/s-218MB/s), io=18.2TiB (20.0TB), run=91781273-91781273msec
Disk stats (read/write):
sdc: ios=152592305/152592384, merge=0/0, ticks=18446744071880316987/2449605557, in_queue=620370929, util=100.00%
What could have caused those read errors during checkarray? Is the disk failing? Is it a loose SATA connector? Is there anything else I could investigate?
Any idea would be appreciated.
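One thing that could be scripted while investigating is a look at the SMART counters that usually separate a cabling problem from a media problem. This is only a minimal sketch, assuming smartmontools 7.0 or newer (for JSON output via `-j`), an ATA/SATA drive, and the conventional attribute IDs; vendors can name or use them differently, and the helper names `watched_raw_values` and `read_smart` are made up for illustration.

```python
import json
import subprocess

# Attribute IDs often checked after read errors (assumed conventional meanings):
# 199 tends to rise with cable/connector trouble, 5/197/198 with failing media.
WATCHED_IDS = {
    5: "Reallocated_Sector_Ct",
    197: "Current_Pending_Sector",
    198: "Offline_Uncorrectable",
    199: "UDMA_CRC_Error_Count",
}


def watched_raw_values(report: dict) -> dict:
    """Pick the raw values of the watched attributes out of a smartctl JSON report."""
    table = report.get("ata_smart_attributes", {}).get("table", [])
    return {
        WATCHED_IDS[attr["id"]]: attr["raw"]["value"]
        for attr in table
        if attr["id"] in WATCHED_IDS
    }


def read_smart(device: str) -> dict:
    """Run `smartctl -A -j <device>` (normally needs root) and parse its JSON output."""
    # smartctl uses non-zero exit codes for warnings too, so no check=True here.
    result = subprocess.run(
        ["smartctl", "-A", "-j", device], capture_output=True, text=True
    )
    return json.loads(result.stdout)
```

Calling `watched_raw_values(read_smart("/dev/sdc"))` before and after another checkarray run would show whether any of these counters moved. A rising CRC count with flat sector counters would point more towards the cable or connector than the platters.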
Kind of a strange use case, but a friend and I are creating bug-out data cache hard drives for possible apocalyptic scenarios, and we're wondering if there's a way we can download or torrent them all at once instead of needing to pick and choose them one by one.
I should clarify, we intend to use these on scavenged computers, including everything from consumer tech to embedded systems and computerized appliances like cash registers and order systems. So older 32-bit versions from the 90s and early 2000s are just as important.
We also intend on archiving Windows XP and 7 for our data caches.
Hi, I found this JSON viewer for Facebook Messenger exports:
Here's the problem: Meta gave me 6 separate ZIP files, with my data divided in some way across all six. Should I unzip them and try to combine the contents into one folder before trying to use a JSON viewer?
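If combining them turns out to be the way to go, here is a minimal sketch of extracting several archives into one folder. It assumes (unverified) that the six archives share the same internal folder layout so that their contents line up when merged. The function name `merge_zips` and the choice to skip rather than overwrite files that already exist are my own assumptions.

```python
import zipfile
from pathlib import Path


def merge_zips(zip_paths, dest):
    """Extract every archive into `dest`, merging folders with the same name.

    Files that already exist in `dest` are left untouched and reported,
    so nothing from an earlier archive is silently overwritten.
    """
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    skipped = []
    for zip_path in zip_paths:
        with zipfile.ZipFile(zip_path) as archive:
            for info in archive.infolist():
                if info.is_dir():
                    continue  # folders are created as needed by extract()
                if (dest / info.filename).exists():
                    skipped.append((str(zip_path), info.filename))
                    continue
                archive.extract(info, dest)
    return skipped
```

Something like `merge_zips(sorted(Path("downloads").glob("*.zip")), "messenger_merged")` would then produce one folder to point the viewer at (both paths are placeholders). Anything in the returned list is a name collision worth checking by hand.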
I'm building a TrueNAS setup and trying to figure out the best place to keep it. Noise is my main concern since I like my room to stay quiet and I get distracted easily.
I live in an area where the climate is pretty mild and dry, usually on the warmer side. The garage stays dry too, but for about four months a year it can get up to around 80–90°F (27–32°C) and a bit dusty. In the winter it drops to about 40–45°F (5–7°C), so not bad.
The system will start with 5 x 18TB WD Ultrastar DC HC550 (SAS) drives in one vdev, and later I plan to expand to two vdevs (10 drives total). I’ve also considered using consumer NAS drives to keep the noise down, though I’m guessing they’ll still be fairly loud.
Electricity is expensive here, so I don’t plan to run it 24/7. I’ll probably power it on once or twice a week for backups and when working on large music or video projects.
My main question is: if I keep the NAS in the garage instead of my room (which usually stays between 65°F and 80°F year-round), how much shorter should I expect the lifespan to be? I’ll be using Noctua fans for cooling either way.
I recall having some tool or extension that would download all of my liked TikTok videos along with all the videos from the TikTok creators I follow. The cool thing was that it created an HTML file that would display them all. I just can't recall what it was called, and the ones I've looked at don't seem to be it.
Anyone know of it?
I’ve been searching for a DAM that works for mostly media content without enterprise budgets. Managing content for my personal brand (team of 3-4) with iPhone footage, action cameras, and professional camera files in various orientations.
Preferred features:
- Integration with existing Google Drive (not interested in migrating 15TB+ of files)
- AI auto-tagging to find specific content quickly
- Visual previews with clear aspect ratio indicators
- Modern, intuitive interface
- Face recognition across different shoots
The problem is I can't find any affordable options that I like. Anything decent starts at $300+/month.
I created an n8n automation for AI tagging my Drive content for about $1-3/month, which works well for tagging but still leaves me with Google Drive’s limited interface.
I'm thinking of turning that n8n agent into a better solution. I have an early beta and would appreciate feedback from others who manage large media libraries. Targeting under $50/month, but still evaluating if there’s enough interest to fully develop it.
Has anyone found a good solution for this problem? If you’re interested in testing or providing input, comment or DM me. Thank you!!!
Hi, I have a TerraMaster F4-423 NAS. I have a single 8TB disk in there now. I just bought 4 new 10TB drives and want to take the existing drive out and add the new ones, configured as either RAID 5 or 6, or TRAID/TRAID+. Is it safe to simply unmount the old drive, without it getting corrupted, so I can connect it to my PC and transfer the data to the new drives once the RAID is set up? Also, I've seen that a UPS is recommended in case power is lost. If I don't have one and my NAS turns off or needs to be moved to another location, what is the risk to my data? Noob question, sorry; I've been researching a lot but I'm still slightly baffled.
My setup isn't as complicated as some of yours, but I've seen Optane being discussed here quite a lot. Forgive me if this is the wrong subreddit.
Bit of background info:
I have a 16GB Optane lying around and a free PCIe 3.0 x1 slot in my mobo.
I currently have a 1TB boot drive and 3x 4TB drives, all NVMe and PCIe 4.0.
I do have 64GB of RAM, if that's relevant to what I'm about to ask.
I was wondering if I could use the Optane either as a page file/%temp% drive or with something like PrimoCache.
I know the benefits will be very minimal (even more so given the single PCIe 3.0 lane) and not noticeable, but which would be the best option to help Windows chug along?
As for the reason I'm doing this:
I simply have too much time on my hands
Cheers!
Hi all,
I have a NAS with two 8TB HDDs in it, Linux md software RAID, ext4.
I am wanting to do monthly backups, and evaluating the best method.
Things I am NOT asking about:
- Changing filesystems to something with checksumming like ZFS etc.
- Changing my NAS, or rolling my own
- Changing my RAID level.
- Not interested in changing my hardware setup at all right now.
- RAID "not being a backup"
- Scripts to hash all files for bitrot detection.
I want to back up my entire 8TB volume monthly.
Given that ext4 has no checksumming, I am relying on drive ECC during SMART scans for bitrot detection.
I am wanting to minimise drive wear and maximise lifetime.
There are two methods I am comparing:
- 1: rsync file-level backup to an external eSATA disk (with checksumming on, I don't trust metadata-based delta backup).
- 2: 3-disk rotation of RAID1, removing and swapping one out per month to trigger full rebuild.
Here are the comparison points I have evaluated:
Run-time per pass
- rsync -c method: ~6 days runtime, CPU hash limited to 30 MiB/s
- Disk swap + rebuild method: ~1 day runtime, I/O limited to 80 MiB/s
- Comment: Rebuild method finishes far sooner.
Annual read load per drive
- rsync -c method: 192 TB (both source and dest disk full read)
- Disk swap + rebuild method: 96 TB
- Comment: Rebuild halves read duty.
Annual write load per drive
- rsync -c method: ~0 TB (source disk), <= 24 TB (target disk(s))
- Disk swap + rebuild method: ~32 TB (with 3-disk rotation, each disk gets a full write every 3 months, 4 times per year)
- Comment: Rebuild adds sequential writes but still within NAS drive spec.
Heat exposure
- rsync -c method: ~+1 degree Celsius x 6 days = "6"
- Disk swap + rebuild method: ~+2 degrees Celsius x 1 day = "2"
- Comment: Rebuild subjects disks to about one third of the cumulative heat.
Seek activity
- rsync -c method: Millions of random seeks
- Disk swap + rebuild method: Near-zero seeks
- Comment: Rebuild imposes significantly less actuator wear.
Bit-rot detection & repair
- rsync -c method: Catches ECC-failing sectors only (if extended SMART scan done first), residual ~5% risk of ECC-valid bit flips
- Disk swap + rebuild method: Full-disk rewrite every 3 months refreshes ECC as compared to long-static data, residual risk drops to ~0.25%
- Comment: Rebuild greatly lowers remaining silent-corruption risk.
Chance of write-induced silent error
- rsync -c method: None (read-only on live disks)
- Disk swap + rebuild method: Negligible; firmware verification makes failures rarer than 1 in 10¹⁵–10¹⁶ bits
- Comment: Added risk is statistically tiny.
Overall evaluation
Although conventionally frowned upon as "writes are heavier", the rebuild method lowers total heat, has drastically fewer seeks, significantly faster completion, and a twenty-fold reduction in unrecoverable bit-rot risk (~5% down to ~0.25%).
The incremental write burden is well within drive workload ratings and introduces negligible new corruption probability.
Overall the combined parameters make the disk swap + rebuild method objectively superior in this setup.
The only issue is 24 hours of degraded RAID 1 status during the rebuild, but I am comfortable with this: the ejected disk is an exact point-in-time backup during this time, and it's not as if a disk actually died. Functionally I still have a safe RAID mirror; one copy is just up to 24 hours stale, which at my data write rates is irrelevant.
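For anyone who wants to check the arithmetic behind the comparison, here is a rough sketch that recomputes the headline figures from the stated inputs. It assumes decimal terabytes (8 TB = 8 × 10¹² bytes), 12 passes per year and a 3-disk rotation. The variable names are only for illustration, and the percentages are the estimates quoted above, not measured values.

```python
MIB = 1024 ** 2          # bytes per MiB
TB = 10 ** 12            # bytes per (decimal) TB
SECONDS_PER_DAY = 86400

VOLUME_TB = 8
PASSES_PER_YEAR = 12
ROTATION_DISKS = 3


def days_to_stream(size_tb: float, rate_mib_s: float) -> float:
    """Days needed to read or write `size_tb` at a sustained `rate_mib_s`."""
    return size_tb * TB / (rate_mib_s * MIB) / SECONDS_PER_DAY


# One full read of the volume at each stated throughput limit.
rsync_days_per_full_read = days_to_stream(VOLUME_TB, 30)   # CPU-hash limited
rebuild_days = days_to_stream(VOLUME_TB, 80)               # I/O limited

# Annual read load: rsync -c reads both copies in full, a rebuild reads one.
rsync_read_tb = PASSES_PER_YEAR * VOLUME_TB * 2
rebuild_read_tb = PASSES_PER_YEAR * VOLUME_TB

# Annual write load per disk when the rewrites are spread over the rotation.
rebuild_write_tb_per_disk = PASSES_PER_YEAR * VOLUME_TB / ROTATION_DISKS

# "Heat exposure" score: temperature rise (degrees C) times duration (days).
heat_rsync = 1 * 6
heat_rebuild = 2 * 1

# Ratio of the two quoted residual silent-corruption risks (percent).
risk_ratio = 5 / 0.25

print(f"rsync full read: {rsync_days_per_full_read:.2f} days per disk")
print(f"rebuild: {rebuild_days:.2f} days")
print(f"reads/year: rsync {rsync_read_tb} TB, rebuild {rebuild_read_tb} TB")
print(f"rebuild writes/year per disk: {rebuild_write_tb_per_disk:.0f} TB")
print(f"heat score: {heat_rsync} vs {heat_rebuild}, risk ratio: {risk_ratio:.0f}x")
```

A single full read at 30 MiB/s comes out just under 3 days, so the ~6 day figure is consistent with source and destination being hashed one after the other at that rate. That sequential ordering is an assumption here, not something stated above.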
Thoughts? (on THIS comparison)
Also, does anyone know any other subs I can ask this in, or maybe Discord servers?
I’m looking into buying a new external drive to store a sizable music collection and a small amount of pictures. I’m leaning towards grabbing a HDD because of the large storage capacity but I’m concerned about supposed fragility after doing some research.
The external drive would serve as a physical backup for those same items, which I’ll also be putting into a cloud-based service.
Any advice or drive suggestions are appreciated.
I know X22 means it’s the generation where the top capacity was 22TB. So you can have an X22 22TB, X22 20TB, etc., but not an X22 24TB.
But now I see tons of Exos 22TB drives with no “X” branding at all. What are these drives exactly? What is the difference between an X22 22TB Exos drive and a 22TB unbranded Exos drive? They often don’t seem all that different in price. But to me these unbranded ones seem like something to avoid like the plague, because I have no fucking clue why they don’t have the X moniker. What series are they from? No clue. Are they Barracudas put into Exos containers? No clue. Are they 5-year-old drives that broke, then got remade with the broken platters taken off, so now it’s a shitty 22TB drive that used to be 24TB? No clue.
I am deciding between a Toshiba MG10 20TB, Toshiba N300 22TB, and a Seagate Exos X22 22TB. They're all the same $/TB.
I was going to buy an Ultrastar HC560 22TB but the price shot up over $100 where I live while I was thinking about getting it.
It will be a PC under my desk, I play games and watch films and stuff in the same room so want to keep it quiet-ish.
Has anyone had at least two of those drives and can tell me if one is noticeably louder than the other? I care less about noise when writing/reading, since it will be for backups. It doesn't have to be at the same capacity, though I'm not sure whether capacity matters.
I'm using a regular PC case, Fractal Design Define XL R2. The soundproofing isn't all that amazing, with two 4TB HGSTs and a 10TB WD Black it's already fairly loud.
I’m trying to collect and archive tweets from about 40–60 specific accounts spanning 2020–2025 for a research project. The goal is to analyze the accuracy of political pundits’ predictions over time (study preregistered here: ).
I’ve tested snscrape, nitter-scraper, and Playwright, but none have been reliable for full-history pulls — especially with the ongoing API and site changes.
I’m looking for advice on:
- Any current tools or scripts that still work for bulk/historical scraping
- Whether archived datasets or mirrors (e.g., from Internet Archive, pushshift-like projects, etc.) exist for Twitter
- Whether it’s still possible to get academic-level API access or a good alternative
- Recommended data formats or storage methods for large tweet collections
Open to creative or gray-area but legal solutions — goal is reproducible research, not redistribution.
Would love to hear what’s working for others lately.
I had a media server that I think the CPU died in. I didn't panic: I have backups, and it was Plex on Windows. I thought I could just take the hard drives from that system and move them to another system. The Seagate Exos X18 16TB drives are completely unrecognized by the new system, and they are not recognized when connected as external drives through USB either.
The drives don't show up in File Explorer, and there's no pop-up asking what to do with the drive. They do show up in Device Manager under Disk drives, and in Disk Management as unallocated. It looks like I have to reformat the drives and restore from backups, but I haven't hit this snag when swapping hard drives before, so how do I prevent it next time?
Long story short, I’m at a loss finding a relatively quiet replacement for aging 14TB WD Red Pluses. WD has halted production of their helium-filled non-Pro HDDs, and there is no 12TB option either. These were 20/29 dB.
The only ones I see in contention are WD Red Pros at 20/32 dB, but they have prominent 5-second PWL clicks.
The Toshiba N300 has come up in Backblaze as fairly reliable in comparison to Seagate. I can find no seek noise dB level posted, only 20 dB idle. It is also unclear whether there is any seek noise difference between the N300 and N300 Pro.
Can anybody provide me with info on the >20TB Toshiba N300 and N300 Pro, especially in comparison to WD Red Pros, in idle, seek, and PWL noise?