Strange hard disk problem: errors - Q&A - Alibaba Cloud Developer Community - Alibaba Cloud


kun坤 2020-06-07 20:42:37 113

OS: CentOS 5

Partition: ext3 (a single partition spanning the whole disk)

Disk: ST3250310SV

Interface: SATA

Device: /dev/sdb

Space used: 6%

 

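Problem:

    Reads work and the data is correct; writes are very slow and occasionally hang; deletes hang outright!

    While deleting: iostat -x 1 shows util pinned at 100% and staying there. Deleting a single ordinary small file can take half an hour.

    cat /var/log/messages | grep error  shows no errors.

    smartctl -l error /dev/sdb   had not returned any result after an hour...

 

What could be causing a problem like this?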
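For readers unfamiliar with the util column: it is the share of wall-clock time during which the device had at least one request outstanding. The sketch below approximates that figure from two samples of /proc/diskstats, assuming the standard Linux layout in which the tenth per-device counter is the time spent doing I/O in milliseconds. The function names and the sample lines are hypothetical and made up for illustration; they are not measurements from the machine in the question.

```python
def io_ticks_ms(diskstats_text, device):
    """Return the 'milliseconds spent doing I/O' counter for one device."""
    for line in diskstats_text.splitlines():
        fields = line.split()
        # Layout: major, minor, name, then the per-device counters.
        # The 10th counter (index 12 overall) is time spent doing I/O, in ms.
        if len(fields) >= 13 and fields[2] == device:
            return int(fields[12])
    raise ValueError("device not found: %s" % device)


def util_percent(before_text, after_text, device, interval_ms):
    """Approximate iostat's util from two samples taken interval_ms apart."""
    busy_ms = io_ticks_ms(after_text, device) - io_ticks_ms(before_text, device)
    return min(100.0, 100.0 * busy_ms / interval_ms)


# Made-up samples, one second apart (NOT data from the original post).
BEFORE = """\
   8       0 sda 9000 100 720000 30000 4000 200 320000 80000 0 20000 110000
   8      16 sdb 1200 30 98000 4500 800 60 64000 900000 1 5000 910000
"""
AFTER = """\
   8       0 sda 9010 100 720800 30040 4002 200 320160 80020 0 20050 110060
   8      16 sdb 1200 30 98000 4500 800 60 64000 901000 1 6000 911000
"""

print(util_percent(BEFORE, AFTER, "sdb", 1000))  # 100.0
print(util_percent(BEFORE, AFTER, "sda", 1000))  # 5.0
```

In the made-up sample, sdb completes no requests during the interval yet is busy for all of it, which is the pattern the question describes: the device is saturated by time per request rather than by volume.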

Linux
All answers (1)
  • kun坤
    2020-06-07 20:42:42

    If the machine has essentially no I/O load, my guess is that this disk is reaching the end of its life.

    ######

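    This disk mainly holds web data and Apache logs, so the workload is read-heavy with few writes.

    I wasn't monitoring the disk's load before, so I don't know how hard it has been used.

    Boss, don't scare me. Replacing a disk is a real pain!

    The strange part is: why does it hang specifically on "delete"?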

    ######

    It really is very likely a disk problem. I'd suggest preparing now; if you wait until it actually fails, it will be too late for tears.

    ######

    Ah!...

    So frustrating!

    ######

    # smartctl -l error /dev/sdb
    smartctl version 5.38 [x86_64-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
    Home page is http://smartmontools.sourceforge.net/

    === START OF READ SMART DATA SECTION ===
    SMART Error Log Version: 1
    ATA Error Count: 14813 (device log contains only the most recent five errors)
            CR = Command Register [HEX]
            FR = Features Register [HEX]
            SC = Sector Count Register [HEX]
            SN = Sector Number Register [HEX]
            CL = Cylinder Low Register [HEX]
            CH = Cylinder High Register [HEX]
            DH = Device/Head Register [HEX]
            DC = Device Command Register [HEX]
            ER = Error register [HEX]
            ST = Status register [HEX]
    Powered_Up_Time is measured from power on, and printed as
    DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
    SS=sec, and sss=millisec. It "wraps" after 49.710 days.

    Error 14813 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
      When the command that caused the error occurred, the device was active or idle.

      After command completion occurred, registers were:
      ER ST SC SN CL CH DH
      -- -- -- -- -- -- --
      40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

      Commands leading to the command that caused the error were:
      CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
      -- -- -- -- -- -- -- --  ----------------  --------------------
      c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
      27 00 00 00 00 00 e0 00      02:17:23.503  READ NATIVE MAX ADDRESS EXT
      ec 00 00 00 00 00 a0 00      02:17:23.503  IDENTIFY DEVICE
      ef 03 46 00 00 00 a0 00      02:17:23.503  SET FEATURES [Set transfer mode]
      27 00 00 00 00 00 e0 00      02:17:23.503  READ NATIVE MAX ADDRESS EXT

    Error 14812 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
      When the command that caused the error occurred, the device was active or idle.

      After command completion occurred, registers were:
      ER ST SC SN CL CH DH
      -- -- -- -- -- -- --
      40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

      Commands leading to the command that caused the error were:
      CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
      -- -- -- -- -- -- -- --  ----------------  --------------------
      c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
      27 00 00 00 00 00 e0 00      02:17:21.325  READ NATIVE MAX ADDRESS EXT
      ec 00 00 00 00 00 a0 00      02:17:21.325  IDENTIFY DEVICE
      ef 03 46 00 00 00 a0 00      02:17:21.324  SET FEATURES [Set transfer mode]
      27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

    Error 14811 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
      When the command that caused the error occurred, the device was active or idle.

      After command completion occurred, registers were:
      ER ST SC SN CL CH DH
      -- -- -- -- -- -- --
      40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

      Commands leading to the command that caused the error were:
      CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
      -- -- -- -- -- -- -- --  ----------------  --------------------
      c8 00 08 97 f5 9f ee 00      02:17:21.325  READ DMA
      27 00 00 00 00 00 e0 00      02:17:21.325  READ NATIVE MAX ADDRESS EXT
      ec 00 00 00 00 00 a0 00      02:17:21.325  IDENTIFY DEVICE
      ef 03 46 00 00 00 a0 00      02:17:21.324  SET FEATURES [Set transfer mode]
      27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

    Error 14810 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
      When the command that caused the error occurred, the device was active or idle.

      After command completion occurred, registers were:
      ER ST SC SN CL CH DH
      -- -- -- -- -- -- --
      40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

      Commands leading to the command that caused the error were:
      CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
      -- -- -- -- -- -- -- --  ----------------  --------------------
      c8 00 08 97 f5 9f ee 00      02:17:19.297  READ DMA
      27 00 00 00 00 00 e0 00      02:17:17.982  READ NATIVE MAX ADDRESS EXT
      ec 00 00 00 00 00 a0 00      02:17:17.982  IDENTIFY DEVICE
      ef 03 46 00 00 00 a0 00      02:17:17.981  SET FEATURES [Set transfer mode]
      27 00 00 00 00 00 e0 00      02:17:20.316  READ NATIVE MAX ADDRESS EXT

    Error 14809 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
      When the command that caused the error occurred, the device was active or idle.

      After command completion occurred, registers were:
      ER ST SC SN CL CH DH
      -- -- -- -- -- -- --
      40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119

      Commands leading to the command that caused the error were:
      CR FR SC SN CL CH DH DC   Powered_Up_Time  Command/Feature_Name
      -- -- -- -- -- -- -- --  ----------------  --------------------
      c8 00 08 97 f5 9f ee 00      02:17:19.297  READ DMA
      27 00 00 00 00 00 e0 00      02:17:17.982  READ NATIVE MAX ADDRESS EXT
      ec 00 00 00 00 00 a0 00      02:17:17.982  IDENTIFY DEVICE
      ef 03 46 00 00 00 a0 00      02:17:17.981  SET FEATURES [Set transfer mode]
      27 00 00 00 00 00 e0 00      02:17:17.981  READ NATIVE MAX ADDRESS EXT
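As a reading aid for the register rows in the log above, here is a minimal sketch of how the values map to the reported LBA, assuming 28-bit LBA addressing (the low nibble of DH holds the top four address bits, followed by CH, CL, and SN). The helper names are hypothetical; the bit names come from the ATA error register definition, and only a few of them are listed.

```python
def decode_lba28(sn, cl, ch, dh):
    """Combine ATA task-file registers into a 28-bit LBA."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


def decode_error_register(er):
    """Name the error-register bits that are set (subset of ATA bit names)."""
    names = {0x80: "ICRC", 0x40: "UNC", 0x10: "IDNF", 0x04: "ABRT"}
    return [name for bit, name in sorted(names.items(), reverse=True) if er & bit]


# Register row copied from the log above, in the order ER ST SC SN CL CH DH.
er, st, sc, sn, cl, ch, dh = (int(x, 16) for x in "40 51 00 97 f5 9f ee".split())

lba = decode_lba28(sn, cl, ch, dh)
print(hex(lba), lba)              # 0xe9ff597 245364119
print(decode_error_register(er))  # ['UNC']
```

All five logged entries carry the same register row, so they decode to the same sector: an uncorrectable (UNC) read at LBA 245364119.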

    ######

    Error 14809

    Is the number after it the number of errors?

    ######

    ATA Error Count: 14813 definitely means errors.

    This disk really can't be kept in service. Back up your data right away.
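On the question of what the numbers mean: in smartctl's output, "ATA Error Count" is the running total the drive has recorded, while the number after each "Error" is that entry's sequence number; the log itself keeps only the five most recent entries. The sketch below pulls both out of saved smartctl -l error text. The function name is hypothetical, the regular expressions assume the line formats shown in this thread, and the sample is an excerpt of the posted log.

```python
import re


def summarize_error_log(text):
    """Return (total error count, set of failing LBAs) from smartctl -l error text."""
    total = re.search(r"ATA Error Count:\s*(\d+)", text)
    lbas = set(
        int(m)
        for m in re.findall(r"Error: \w+ at LBA = 0x[0-9a-fA-F]+ = (\d+)", text)
    )
    return (int(total.group(1)) if total else 0), lbas


# Excerpt of the log posted in this thread.
sample = """\
ATA Error Count: 14813 (device log contains only the most recent five errors)
Error 14813 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119
Error 14812 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
  40 51 00 97 f5 9f ee  Error: UNC at LBA = 0x0e9ff597 = 245364119
"""

count, lbas = summarize_error_log(sample)
print(count, sorted(lbas))  # 14813 [245364119]
```

A single distinct LBA across the logged entries means the recent errors are repeated failures on one sector; the log cannot say where the earlier errors occurred.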

    ######

    ATA Error Count: 14813

    ######

    14813 errors?

    ######

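    Good grief... two out of three disks have failed....

    If I take a disk like this home and put it in a PC, I wonder whether it would still be usable?

    For example, after a low-level format or something like that.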
