{"id":244,"date":"2024-03-03T19:11:27","date_gmt":"2024-03-03T19:11:27","guid":{"rendered":"https:\/\/www.ogselfhosting.com\/?p=244"},"modified":"2024-03-17T11:53:01","modified_gmt":"2024-03-17T11:53:01","slug":"troubleshooting-my-offline-zpool","status":"publish","type":"post","link":"https:\/\/www.ogselfhosting.com\/index.php\/2024\/03\/03\/troubleshooting-my-offline-zpool\/","title":{"rendered":"Troubleshooting my offline Zpool"},"content":{"rendered":"\n<p>It&#8217;s a quiet Sunday, and I wasn&#8217;t planning on writing an article.<\/p>\n\n\n\n<p>There I was copying files and doing some maintenance, and my network drive was offline.  I figured I must have done something dumb, so I logged into my server and checked.   My 8 x 6TB iron wolf raid-z2 zfs array was offline.  So much for a quiet day.<\/p>\n\n\n\n<p>Four of the eight disks were showing errors.  And the &#8216;lsblk&#8217; command could only find four of the eight disks:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"607\" height=\"221\" src=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-31-02.png\" alt=\"\" class=\"wp-image-246\" srcset=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-31-02.png 607w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-31-02-300x109.png 300w\" sizes=\"auto, (max-width: 607px) 100vw, 607px\" \/><\/figure>\n\n\n\n<p>Where have my drives gone?<\/p>\n\n\n\n<p>In fact, I was a little relieved &#8211; one drive error might be real, but I thought 4 is probably a glitch.  Hopefully software, but I have to troubleshot to find out.  Here&#8217;s what I did.  Firstly, server reboot &#8211; that should fix software issues, if any.  It almost worked too:   The drives reappeared, and the raid away came back to life.  But then it died a few minutes later during a scrub I initiated.  Again, FOUR disks gave errors.   It&#8217;s probably not the software.<\/p>\n\n\n\n<p>So I rebooted the server, logged into the IPMI interface and spammed the delete key a few times so I could check interrupt the reboot and enter the bios setup screen of my H12SSi-NT motherboard.  I wanted to see what the motherboard could detect.  The H12 motherboard has a pair of slim-SAS connectors, and I was using all of one of them:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1169\" height=\"788\" src=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-14-03-44.png\" alt=\"\" class=\"wp-image-251\" srcset=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-14-03-44.png 1169w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-14-03-44-300x202.png 300w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-14-03-44-768x518.png 768w\" sizes=\"auto, (max-width: 1169px) 100vw, 1169px\" \/><\/figure>\n\n\n\n<p>Both 8-port SATA connectors showed up, but I still wondered if the port I was using was somehow at fault (it&#8217;s a new motherboard&#8230; and wouldn&#8217;t make me smile if it was dead already).  So I powered off, switched SAS port connectors and rebooted.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"817\" height=\"618\" src=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-58-42.png\" alt=\"\" class=\"wp-image-252\" srcset=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-58-42.png 817w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-58-42-300x227.png 300w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-58-42-768x581.png 768w\" sizes=\"auto, (max-width: 817px) 100vw, 817px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"862\" height=\"542\" src=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-44-59.png\" alt=\"\" class=\"wp-image-249\" srcset=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-44-59.png 862w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-44-59-300x189.png 300w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-44-59-768x483.png 768w\" sizes=\"auto, (max-width: 862px) 100vw, 862px\" \/><\/figure>\n\n\n\n<p>At power-up, however, the zpool array was still dead with four drives not showing.<\/p>\n\n\n\n<p>Believe it or not, I felt BETTER: the chances of both SAS ports faulting is&#8230;low.  And if the SATA ports were both working properly then it&#8217;s probably NOT the motherboard:  remember that I said four drives were dead?  Well each pair of four-drives is powered by a separate power cable connected to the single power supply.  Could this be a dodgy power connection?<\/p>\n\n\n\n<p>So I took the cover off and juggled the SATA power leads a little on each drive and on each power connector to the power supply.  All the leads were all clicked-in-place, so I couldn&#8217;t easily see a problem.  But I rebooted anyway as it&#8217;s an easy check.  Wonder of wonders, on power-up, all eight drives reappeared and the zpool imported without issue.<\/p>\n\n\n\n<p>As I type, I am scrubbing the zpool&#8230;but I am also going to order a new SATA power cable as I can&#8217;t really expect a &#8216;cable-jiggle&#8217; to be a good long-term solution.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"650\" height=\"457\" src=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-36-23.png\" alt=\"\" class=\"wp-image-247\" srcset=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-36-23.png 650w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/Screenshot-from-2024-03-03-13-36-23-300x211.png 300w\" sizes=\"auto, (max-width: 650px) 100vw, 650px\" \/><\/figure>\n\n\n\n<p>I also put my SAS connector back to the original port as the cabling was less stressful (I would have to re-route the cable to use that port permanently):<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1152\" height=\"2048\" src=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-1152x2048.jpg\" alt=\"\" class=\"wp-image-253\" srcset=\"https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-1152x2048.jpg 1152w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-169x300.jpg 169w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-768x1365.jpg 768w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-864x1536.jpg 864w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-1200x2133.jpg 1200w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-1980x3520.jpg 1980w, https:\/\/www.ogselfhosting.com\/wp-content\/uploads\/2024\/03\/PXL_20240303_190149657-scaled.jpg 1440w\" sizes=\"auto, (max-width: 1152px) 100vw, 1152px\" \/><\/figure>\n\n\n\n<p>So the GOOD news is, I think it&#8217;s an inexpensive problem: a power lead.  The BETTER news is that by systematically checking out the potential problems, I have a likely root-cause and a short-term fix (&#8216;jiggling power leads&#8217;).  I also have an executable plan for eliminating this (i.e. buy new (different?) power lead(s) for the drives).<\/p>\n\n\n\n<p>The takeaway?  Check one thing at a time.  \ud83d\ude42<\/p>\n\n\n\n<p>Enjoy your Sunday!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>It&#8217;s a quiet Sunday, and I wasn&#8217;t planning on writing an article. There I was copying files and doing some maintenance, and my network drive was offline. I figured I must have done something dumb, so I logged into my server and checked. My 8 x 6TB iron wolf raid-z2 zfs array was offline. So [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8],"tags":[],"class_list":["post-244","post","type-post","status-publish","format-standard","hentry","category-zfs"],"_links":{"self":[{"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/posts\/244","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/comments?post=244"}],"version-history":[{"count":4,"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/posts\/244\/revisions"}],"predecessor-version":[{"id":256,"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/posts\/244\/revisions\/256"}],"wp:attachment":[{"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/media?parent=244"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/categories?post=244"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.ogselfhosting.com\/index.php\/wp-json\/wp\/v2\/tags?post=244"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}