System: CentOS 5
Partition: ext3 (a single partition spanning the whole disk)
Disk: ST3250310SV
Interface: SATA
Device: /dev/sdb
Space used: 6%
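Problem:
Reads work fine and the data comes back correct; writes are very slow and occasionally hang; deletes hang outright!
While deleting: iostat -x 1 shows util pinned at 100% and staying there, and deleting a single (ordinary, small) file can take half an hour.
cat /var/log/messages | grep error shows no errors.
smartctl -l error /dev/sdb had not returned anything after an hour...
Can anyone tell me what might cause a problem like this?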
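For readers who want to watch the same figure without iostat, here is a rough sketch of how a utilisation percentage like the one above can be derived from /proc/diskstats. It assumes Python 3 is available (CentOS 5 does not ship it by default) and that the tenth per-device counter is the time spent doing I/O in milliseconds, as the kernel's iostats documentation describes. The function names are made up for illustration, and the result only approximates what iostat prints.

```python
import time


def read_io_ticks(device, path="/proc/diskstats"):
    """Return the 'milliseconds spent doing I/O' counter for one block device."""
    with open(path) as f:
        for line in f:
            fields = line.split()
            # Layout: major, minor, name, then the per-device counters.
            # Index 12 is the tenth counter: time spent doing I/O, in ms.
            if len(fields) >= 13 and fields[2] == device:
                return int(fields[12])
    raise ValueError("device not found: " + device)


def util_percent(ticks_before, ticks_after, elapsed_ms):
    """Share of the interval during which the device had I/O in flight."""
    if elapsed_ms <= 0:
        raise ValueError("elapsed_ms must be positive")
    return min(100.0, 100.0 * (ticks_after - ticks_before) / elapsed_ms)


def sample_util(device="sdb", interval=1.0):
    """Take two samples `interval` seconds apart and return the utilisation."""
    start = time.monotonic()
    before = read_io_ticks(device)
    time.sleep(interval)
    after = read_io_ticks(device)
    elapsed_ms = (time.monotonic() - start) * 1000.0
    return util_percent(before, after, elapsed_ms)
```

Calling sample_util("sdb") on the affected machine during a delete should then report a value near 100 if the disk is busy for the whole interval. A disk that stays saturated while almost nothing is being asked of it points at the device rather than at the workload.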
If the machine has essentially no I/O load, I'd guess this disk is about to die.
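######This disk mainly holds web data and Apache logs, so it is read-heavy and write-light.
We were not monitoring disk load before, so I don't know how hard it has been worked.
Boss, don't scare me; replacing a disk is a real pain!
The strange part: why does it hang specifically on deletes?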
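######It really is quite likely a disk problem. I'd suggest preparing now; otherwise, by the time it actually fails, it will be too late for tears.
######Ah!...
How depressing!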
####### smartctl -l error /dev/sdb
smartctl version 5.38 [x86_64-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Error Log Version: 1
ATA Error Count: 14813 (device log contains only the most recent five errors)
CR = Command Register [HEX]
FR = Features Register [HEX]
SC = Sector Count Register [HEX]
SN = Sector Number Register [HEX]
CL = Cylinder Low Register [HEX]
CH = Cylinder High Register [HEX]
DH = Device/Head Register [HEX]
DC = Device Command Register [HEX]
ER = Error register [HEX]
ST = Status register [HEX]
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.
Error 14813 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 97 f5 9f ee 00 02:17:21.325 READ DMA
27 00 00 00 00 00 e0 00 02:17:23.503 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 02:17:23.503 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 02:17:23.503 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 02:17:23.503 READ NATIVE MAX ADDRESS EXT
Error 14812 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 97 f5 9f ee 00 02:17:21.325 READ DMA
27 00 00 00 00 00 e0 00 02:17:21.325 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 02:17:21.325 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 02:17:21.324 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 02:17:20.316 READ NATIVE MAX ADDRESS EXT
Error 14811 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 97 f5 9f ee 00 02:17:21.325 READ DMA
27 00 00 00 00 00 e0 00 02:17:21.325 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 02:17:21.325 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 02:17:21.324 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 02:17:20.316 READ NATIVE MAX ADDRESS EXT
Error 14810 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 97 f5 9f ee 00 02:17:19.297 READ DMA
27 00 00 00 00 00 e0 00 02:17:17.982 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 02:17:17.982 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 02:17:17.981 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 02:17:20.316 READ NATIVE MAX ADDRESS EXT
Error 14809 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
When the command that caused the error occurred, the device was active or idle.
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 08 97 f5 9f ee 00 02:17:19.297 READ DMA
27 00 00 00 00 00 e0 00 02:17:17.982 READ NATIVE MAX ADDRESS EXT
ec 00 00 00 00 00 a0 00 02:17:17.982 IDENTIFY DEVICE
ef 03 46 00 00 00 a0 00 02:17:17.981 SET FEATURES [Set transfer mode]
27 00 00 00 00 00 e0 00 02:17:17.981 READ NATIVE MAX ADDRESS EXT
Error 14809
Is the number after "Error" the error count?
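As a side note on reading the log: the line "Error: UNC at LBA = 0x0e9ff597 = 245364119" can be checked against the register dump printed just above it. READ DMA (c8) is a 28-bit LBA command, so the address is assembled from SN, CL, CH and the low four bits of DH. The sketch below does that arithmetic; the function name is made up for illustration.

```python
def lba28_from_registers(sn, cl, ch, dh):
    """Assemble a 28-bit LBA from ATA task-file register values.

    SN holds bits 0-7, CL bits 8-15, CH bits 16-23, and the low
    nibble of DH holds bits 24-27.
    """
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Register values taken from the log: SN=97 CL=f5 CH=9f DH=ee
lba = lba28_from_registers(0x97, 0xF5, 0x9F, 0xEE)
print(hex(lba), lba)  # prints: 0xe9ff597 245364119
```

All five logged entries show the same registers, so every one of them is a failed read of the same sector.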
######ATA Error Count: 14813 definitely means errors.
This disk really can't be kept; back up your data right away.
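To make the distinction between the two numbers concrete: "ATA Error Count" is the cumulative total, while the log itself keeps only the most recent five entries, each numbered by its position in that total. Below is a minimal sketch that summarises pasted smartctl -l error output. It assumes the line formats shown in this thread (smartctl 5.38); other versions may print them differently, and the function name and the returned keys are hypothetical.

```python
import re

COUNT_RE = re.compile(r"^ATA Error Count: (\d+)")
ERROR_RE = re.compile(r"^Error (\d+) occurred at disk power-on lifetime: (\d+) hours")
LBA_RE = re.compile(r"Error: (\w+) at LBA = 0x[0-9a-fA-F]+ = (\d+)")


def summarize_error_log(text):
    """Summarise 'smartctl -l error' output: total, logged entries, distinct LBAs."""
    total = None
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        m = COUNT_RE.match(line)
        if m:
            total = int(m.group(1))
            continue
        m = ERROR_RE.match(line)
        if m:
            entries.append({"number": int(m.group(1)),
                            "hours": int(m.group(2)),
                            "kind": None, "lba": None})
            continue
        m = LBA_RE.search(line)
        if m and entries:
            # The register line belongs to the most recent "Error N" header.
            entries[-1]["kind"] = m.group(1)
            entries[-1]["lba"] = int(m.group(2))
    lbas = sorted({e["lba"] for e in entries if e["lba"] is not None})
    return {"total_errors": total, "logged": len(entries), "distinct_lbas": lbas}


# Excerpt copied from the log posted above (two of the five entries).
SAMPLE = """\
ATA Error Count: 14813 (device log contains only the most recent five errors)
Error 14813 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
Error 14812 occurred at disk power-on lifetime: 13861 hours (577 days + 13 hours)
40 51 00 97 f5 9f ee Error: UNC at LBA = 0x0e9ff597 = 245364119
"""

print(summarize_error_log(SAMPLE))
```

On this excerpt the summary shows a cumulative count of 14813, two logged entries, and a single distinct LBA. That matches the picture of one uncorrectable sector being retried over and over.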
######ATA Error Count: 14813
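######14813 errors?
######Good grief... two out of three disks have failed....
If I take a disk like this home and put it in a PC, would it still be usable?
Say, after a fresh low-level format or something like that.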