ks311804 - Hard disk defective


peterdoo
04.01.14, 20:01
Thank you. Since all the data had already been backed up, I tried a reinstallation with Debian 6 32-bit as a test. There, smartctl works, unlike in the OVH rescue system. Perhaps OVH can adapt the rescue system at some point so that smartctl also works there with the 3ware 8006 controller.

Tommi
04.01.14, 13:28
A fault ticket has been created for your server. You have received an e-mail about it.
Please confirm in the ticket that you have a current backup and that the intervention can be carried out.

peterdoo
02.01.14, 21:05
Hello,

I have now also found out that tw_cli show diag shows why a rebuild was aborted. The reason given is:
Code:
---------Error---------
  Status: 006C
    Code: 4051
    Time: 00D1A44D msec
   ReqID: 0100
TFR Out 00 40 C0 89 3C E0 25
Aport error 00 00D1AF65 A520
TFR In  40 00 F5 89 3C 00 51
According to 3ware, this is a problem (4051 = ECC) when reading from port 0. If that is the disk being read from (in this case it is connected to port 0), one should start another rebuild with the ignoreECC parameter. I did that as well. This time it took somewhat longer before the rebuild was aborted again. The log shows:

Code:
//rescue> show diag

### CLI Version:      x86_64 (64 bit)
### Time Stamp:       03:29.04 28-Mar-1970
### Host Name:        rescue.ovh.net
### OS Version:       Linux 3.10.23-xxxx-std-ipv6-64-rescue
### Driver Version:   1.26.02.003
### Controller ID:    0
### Model:            8006-2LP
### Firmware:         FE8S 1.05.00.068
### BIOS:             BE7X 1.08.00.048
### Serial #:         L018501C8170109
### Memory Installed: 512KB

==========================================================================
Diagnostic Information on Controller //rescue.ovh.net/c0 ...
--------------------------------------------------------------------------

00 03644447 A520
 TFR In  40 00 B6 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03644447 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 03644EEF A520
 TFR In  40 00 B6 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 B6 F9 AB E0 25
 Aport error 00 03645A0F A520
 TFR In  40 00 B6 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03645A0F msec
   ReqID: 0100
 TFR Out 00 01 B6 F9 AB E0 25
 Aport error 00 036464CC A520
 TFR In  40 00 B6 F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 036470FA A520
 TFR In  40 00 B7 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 036470FA msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 03647BED A520
 TFR In  40 00 B7 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 B7 F9 AB E0 25
 Aport error 00 036486DB A520
 TFR In  40 00 B7 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 036486DB msec
   ReqID: 0100
 TFR Out 00 01 B7 F9 AB E0 25
 Aport error 00 036491E2 A520
 TFR In  40 00 B7 F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 03649CA4 A520
 TFR In  40 00 B8 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03649CA4 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0364A7A3 A520
 TFR In  40 00 B8 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 B8 F9 AB E0 25
 Aport error 00 0364B291 A520
 TFR In  40 00 B8 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0364B291 msec
   ReqID: 0100
 TFR Out 00 01 B8 F9 AB E0 25
 Aport error 00 0364BD77 A520
 TFR In  40 00 B8 F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0364C9B5 A520
 TFR In  40 00 B9 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0364C9B5 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0364D4B9 A520
 TFR In  40 00 B9 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 B9 F9 AB E0 25
 Aport error 00 0364DF96 A520
 TFR In  40 00 B9 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0364DF96 msec
   ReqID: 0100
 TFR Out 00 01 B9 F9 AB E0 25
 Aport error 00 0364EA95 A520
 TFR In  40 00 B9 F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0364F6CB A520
 TFR In  40 00 BA F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0364F6CB msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0365065D A520
 TFR In  40 00 BA F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 BA F9 AB E0 25
 Aport error 00 03651779 A520
 TFR In  40 00 BA F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03651779 msec
   ReqID: 0100
 TFR Out 00 01 BA F9 AB E0 25
 Aport error 00 036526E5 A520
 TFR In  40 00 BA F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 036536B7 A520
 TFR In  40 00 BB F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 036536B7 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 03654547 A520
 TFR In  40 00 BB F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 BB F9 AB E0 25
 Aport error 00 0365551E A520
 TFR In  40 00 BB F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0365551E msec
   ReqID: 0100
 TFR Out 00 01 BB F9 AB E0 25
 Aport error 00 03656517 A520
 TFR In  40 00 BB F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 036575E2 A520
 TFR In  40 00 BC F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 036575E2 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 03658564 A520
 TFR In  40 00 BC F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 BC F9 AB E0 25
 Aport error 00 03659680 A520
 TFR In  40 00 BC F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03659680 msec
   ReqID: 0100
 TFR Out 00 01 BC F9 AB E0 25
 Aport error 00 0365A7B6 A520
 TFR In  40 00 BC F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0365B2D0 A520
 TFR In  40 00 BD F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0365B2D0 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0365BD81 A520
 TFR In  40 00 BD F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 BD F9 AB E0 25
 Aport error 00 0365C8B2 A520
 TFR In  40 00 BD F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0365C8B2 msec
   ReqID: 0100
 TFR Out 00 01 BD F9 AB E0 25
 Aport error 00 0365D38F A520
 TFR In  40 00 BD F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0365DFB4 A520
 TFR In  40 00 BE F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0365DFB4 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 0365EAA7 A520
 TFR In  40 00 BE F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 BE F9 AB E0 25
 Aport error 00 0365F59E A520
 TFR In  40 00 BE F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0365F59E msec
   ReqID: 0100
 TFR Out 00 01 BE F9 AB E0 25
 Aport error 00 036600A5 A520
 TFR In  40 00 BE F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 03660CD2 A520
 TFR In  40 00 BF F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03660CD2 msec
   ReqID: 0100
 TFR Out 00 40 80 F9 AB E0 25
 Aport error 00 036617BD A520
 TFR In  40 00 BF F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 BF F9 AB E0 25
 Aport error 00 036622A4 A520
 TFR In  40 00 BF F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 036622A4 msec
   ReqID: 0100
 TFR Out 00 01 BF F9 AB E0 25
 Aport error 00 03662DAB A520
 TFR In  40 00 BF F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 C0 F9 AB E0 25
 Aport error 00 0366399B A520
 TFR In  40 00 C0 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 0366399B msec
   ReqID: 0100
 TFR Out 00 40 C0 F9 AB E0 25
 Aport error 00 036644A3 A520
 TFR In  40 00 C0 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 C0 F9 AB E0 25
 Aport error 00 03664F6F A520
 TFR In  40 00 C0 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03664F6F msec
   ReqID: 0100
 TFR Out 00 01 C0 F9 AB E0 25
 Aport error 00 03665A7F A520
 TFR In  40 00 C0 F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 C0 F9 AB E0 25
 Aport error 00 03666A50 A520
 TFR In  40 00 C1 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 03666A50 msec
   ReqID: 0100
 TFR Out 00 40 C0 F9 AB E0 25
 Aport error 00 036678E8 A520
 TFR In  40 00 C1 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 01 C1 F9 AB E0 25
 Aport error 00 036688AE A520
 TFR In  40 00 C1 F9 AB 00 51

---------Error---------
  Status: 006C
    Code: 4051
    Time: 036688AE msec
   ReqID: 0100
 TFR Out 00 01 C1 F9 AB E0 25
 Aport error 00 03669896 A520
 TFR In  40 00 C1 F9 AB 00 51

AEN sent to host: 002C

 TFR Out 00 40 C0 F9 AB E0 25
 Aport timeout 00 0366E82A 99C9
 TFR In  00 40 C0 F9 AB E0 D0
 Reset link ...
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
 TFR In  00 00 00 00 00 00 80
AEN sent to host: 000A

AEN sent to host: 0004

        bkgrnd tasks stopped
         Unit 00: Degraded TwinStor[0:1x] of a CBOD[0] and a CBOD[1]

//rescue> show alarms

Ctl  Date                        Severity  Alarm Message
------------------------------------------------------------------------------
c0   -                           INFO      (0x0F:0x000B): Rebuild started: Unit #0
c0   -                           ERROR     (0x0F:0x0004): Controller error
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0
c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0
There were several ECC errors (4051) when reading from port 0, but this time they did not abort the rebuild. Then the disk on port 0 took too long to respond to a read request and a timeout occurred. Going by the example on page 5, it was about 6.9 seconds. The rebuild was aborted again and the state remains DEGRADED.
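For anyone who wants to condense such a long show diag dump, here is a minimal Python sketch that only counts what is visible in the text: the "Code:" values inside the Error blocks, the "AEN sent to host" codes, and the "Aport timeout" lines. The function name summarize_diag and the shortened sample are my own assumptions for illustration; the sketch does not interpret the codes or the timestamps.

```python
import re
from collections import Counter


def summarize_diag(text):
    """Tally error codes, AEN codes and port timeouts found in `show diag` text."""
    # "Code: 4051" lines inside the ---------Error--------- blocks
    codes = Counter(
        re.findall(r"^[ \t]*Code:[ \t]*([0-9A-Fa-f]{4})[ \t]*$", text, re.MULTILINE)
    )
    # "AEN sent to host: 000A" lines
    aens = Counter(re.findall(r"AEN sent to host:[ \t]*([0-9A-Fa-f]{4})", text))
    # "Aport timeout ..." lines (as opposed to "Aport error ...")
    timeouts = len(re.findall(r"Aport timeout", text))
    return {"codes": dict(codes), "aens": dict(aens), "timeouts": timeouts}


# Shortened excerpt in the same layout as the log above (illustration only).
sample = """\
---------Error---------
  Status: 006C
    Code: 4051
    Time: 0366399B msec
   ReqID: 0100
 TFR Out 00 40 C0 F9 AB E0 25
 Aport error 00 036644A3 A520
 TFR In  40 00 C0 F9 AB 00 51

AEN sent to host: 000A

 TFR Out 00 40 C0 F9 AB E0 25
 Aport timeout 00 0366E82A 99C9
AEN sent to host: 0004
"""

summary = summarize_diag(sample)
print(summary)
```

Run against the full dump, this would show at a glance that every logged error carries code 4051 and that a single timeout precedes the final abort.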

I ask support to comment. The disk on port 0 appears to have many bad blocks and cannot be read completely even when ECC errors are ignored (timeout). The disk on port 1 is marked as DEGRADED in the RAID1 and cannot be integrated into the RAID1 because of the problems with disk 0. The system cannot run like this.
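To put a number on "many bad blocks", the show alarms listing can be tallied by severity and message. The following is a rough Python sketch under the assumption that each alarm line has the layout shown above (controller, date column, severity, code in parentheses, message). The names ALARM_RE and tally_alarms are hypothetical, and the four sample lines are copied from the listing.

```python
import re
from collections import Counter

# One alarm line: "c0   -   ERROR     (0x0F:0x000A): Drive error: Port #0"
ALARM_RE = re.compile(
    r"^c(?P<ctl>\d+)\s+(?P<date>.*?)\s+(?P<severity>INFO|WARNING|ERROR)\s+"
    r"\((?P<code>0x[0-9A-Fa-f]+:0x[0-9A-Fa-f]+)\):\s*(?P<message>.*)$"
)


def tally_alarms(lines):
    """Count alarm lines per (severity, message); non-matching lines are skipped."""
    counts = Counter()
    for line in lines:
        match = ALARM_RE.match(line.strip())
        if match:
            counts[(match["severity"], match["message"])] += 1
    return counts


sample_lines = [
    "c0   -                           INFO      (0x0F:0x000B): Rebuild started: Unit #0",
    "c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0",
    "c0   -                           WARNING   (0x0F:0x002C): Overwrote bad sector during rebuild: Port #0",
    "c0   -                           ERROR     (0x0F:0x000A): Drive error: Port #0",
]

alarm_counts = tally_alarms(sample_lines)
for (severity, message), count in alarm_counts.most_common():
    print(f"{count:4d}  {severity:8s} {message}")
```

Since the message text already names the port, grouping by message is enough here to see that all drive errors point at Port #0.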

The tools for reading SMART data do not seem to work with the built-in 3ware 8006-2LP RAID controller, even though only OVH components are involved (OVH hardware and OVH rescue system).

Please either check/replace the disks or tell me how the SMART data can be read.

peterdoo
31.12.13, 15:00
Hello,

thank you for the link. That is how I tried it too. None of the suggested options produces anything useful:

Code:
root@rescue:~# sudo smartctl -A /dev/sda
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.10.23-xxxx-std-ipv6-64-rescue] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Smartctl open device: /dev/sda failed: AMCC/3ware controller, please try adding '-d 3ware,N',
you may need to replace /dev/sda with /dev/twlN, /dev/twaN or /dev/tweN

root@rescue:~# smartctl -a -d 3ware,0 -a /dev/sda
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.10.23-xxxx-std-ipv6-64-rescue] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Device Model:     [No Information Found]
Serial Number:    [No Information Found]
Firmware Version: [No Information Found]
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   [No Information Found]
ATA Standard is:  [No Information Found]
Local Time is:    Tue Dec 31 15:50:29 2013 CET
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 82-83 don't show if SMART supported.
SMART support is: Ambiguous - ATA IDENTIFY DEVICE words 85-87 don't show if SMART is enabled.
A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.

root@rescue:~# smartctl -a -d 3ware,0 /dev/twe0
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.10.23-xxxx-std-ipv6-64-rescue] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Smartctl: Device Read Identity Failed: Input/output error

A mandatory SMART command failed: exiting. To continue, add one or more '-T permissive' options.
root@rescue:~# smartctl -a -d 3ware,0 /dev/twa0
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.10.23-xxxx-std-ipv6-64-rescue] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

No major number for /dev/twa listed in /proc/devices. Is the 3w-9xxx driver loaded?
Smartctl open device: /dev/twa0 [3ware_disk_00] failed: setup_3ware_nodes("twa", "3w-9xxx") failed

root@rescue:~# smartctl -a -d 3ware,0 /dev/twl0
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.10.23-xxxx-std-ipv6-64-rescue] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

No major number for /dev/twl listed in /proc/devices. Is the 3w-sas driver loaded?
Smartctl open device: /dev/twl0 [3ware_disk_00] failed: setup_3ware_nodes("twl", "3w-sas") failed

root@rescue:~# smartctl -a -d 3ware,0 /dev/tws0
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.10.23-xxxx-std-ipv6-64-rescue] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

Smartctl open device: /dev/tws0 [3ware_disk_00] failed: No such device
cat /var/log/syslog shows:

Code:
Dec 31 15:50:29 rescue kernel: program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
Dec 31 15:50:29 rescue kernel: 3w-xxxx: SCSI_IOCTL_SEND_COMMAND deprecated, please update your 3ware tools.
Dec 31 15:50:29 rescue kernel: 3w-xxxx: SCSI_IOCTL_SEND_COMMAND deprecated, please update your 3ware tools.
Dec 31 15:50:29 rescue kernel: 3w-xxxx: SCSI_IOCTL_SEND_COMMAND deprecated, please update your 3ware tools.
Dec 31 15:50:29 rescue kernel: 3w-xxxx: SCSI_IOCTL_SEND_COMMAND deprecated, please update your 3ware tools.
Dec 31 15:50:29 rescue kernel: 3w-xxxx: SCSI_IOCTL_SEND_COMMAND deprecated, please update your 3ware tools.
Dec 31 15:50:29 rescue kernel: 3w-xxxx: SCSI_IOCTL_SEND_COMMAND deprecated, please update your 3ware tools.
Apparently the versions of smartctl and the 3ware tools included in the rescue system are not compatible with each other. An update in rescue mode fails, though, apparently because there is no write access. The main system installed is Windows Core, so unfortunately I cannot try other Linux versions there.

EDIT: Besides 5.41, which is included in the rescue system, I also tried the following versions of smartmontools: 6.2, 6.1, 5.26 and 5.0-42. Same result with all of them. No difference with or without sudo. Exactly the same results with the parameter "3ware,0" (disk 0, OK) or "3ware,1" (disk 1, DEGRADED).

EDIT2: tw_cli returns the following hex values of the SMART data:

Code:
//rescue> /c0/p1 show smart

/c0/p1 Drive Smart Data:
0A 00 01 0F 00 63 63 55 86 D3 0D 00 00 00 03 03
00 5F 5E 00 00 00 00 00 00 00 04 32 00 64 64 66
00 00 00 00 00 00 05 33 00 64 64 12 00 00 00 00
00 00 07 0F 00 4E 3C 3A EE 8A 04 00 00 00 09 32
00 41 41 C1 7A 00 00 00 00 00 0A 13 00 64 64 00
00 00 00 00 00 00 0C 32 00 64 64 33 00 00 00 00
00 00 B7 32 00 64 64 00 00 00 00 00 00 00 B8 32
00 64 64 00 00 00 00 00 00 00 BB 32 00 01 01 64
00 00 00 00 00 00 BC 32 00 64 63 02 00 01 00 01
00 00 BD 3A 00 63 63 01 00 00 00 00 00 00 BE 22
00 3B 28 29 00 27 3C 5E 00 00 C2 22 00 29 3C 29
00 00 00 10 00 00 C3 1A 00 17 06 55 86 D3 0D 00
00 00 C5 12 00 63 63 36 00 00 00 00 00 00 C6 10
00 63 63 36 00 00 00 00 00 00 C7 3E 00 C8 C8 00
00 00 00 00 00 00 F0 00 00 64 FD 25 3F 00 00 43
B1 A7 F1 00 00 64 FD B7 5D 31 60 00 00 00 F2 00
00 64 FD D2 AA B8 0F 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 82 00 58 02 00 7B
03 00 01 00 01 B4 02 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 03 06 04 06 06 06 06 06
06 00 00 00 00 00 00 00 00 01 00 00 00 00 00 00
00 00 00 00 00 00 0F 00 3A 93 1D F2 F0 66 00 00
00 00 00 00 01 00 FF FF B7 5D 31 60 59 00 00 00
D2 AA B8 0F 22 00 00 00 00 00 00 00 FF FF FF FF
00 00 00 62 62 00 00 00 2D 21 00 00 36 00 0E 00
00 00 00 00 44 1E 00 00 00 00 00 00 00 00 00 3A
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 43


//rescue> /c0/p0 show smart

/c0/p0 Drive Smart Data:
0A 00 01 0F 00 6C 5D 0F 4C D7 01 00 00 00 03 03
00 64 64 00 00 00 00 00 00 00 04 32 00 64 64 23
00 00 00 00 00 00 05 33 00 64 64 1F 00 00 00 00
00 00 07 0F 00 52 3C 6B 8E 02 0B 00 00 00 09 32
00 43 43 ED 70 00 00 00 00 00 0A 13 00 64 64 00
00 00 00 00 00 00 0C 32 00 64 64 23 00 00 00 00
00 00 B8 32 00 64 64 00 00 00 00 00 00 00 BB 32
00 01 01 91 02 00 00 00 00 00 BC 32 00 64 64 00
00 00 00 00 00 00 BD 3A 00 2B 2B 39 00 00 00 00
00 00 BE 22 00 3A 24 2A 00 23 40 B1 02 00 C2 22
00 2A 40 2A 00 00 00 1C 00 00 C3 1A 00 31 11 0F
4C D7 01 00 00 00 C5 12 00 64 64 3F 00 00 00 00
00 00 C6 10 00 64 64 3F 00 00 00 00 00 00 C7 3E
00 C8 C8 00 00 00 00 00 00 00 F0 00 00 64 FD E1
70 00 00 85 07 15 F1 00 00 64 FD 1E 66 D9 92 00
00 00 F2 00 00 64 FD D0 6B 83 17 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 00 82 00 69 02 00 7B
03 00 01 00 01 CE 02 00 00 00 00 00 00 00 00 00
00 00 00 00 54 04 00 00 03 00 00 00 00 00 00 00
00 00 00 00 00 00 00 00 00 01 00 00 00 00 00 00
00 00 00 00 00 00 15 00 9A F4 88 8C AA 5E 00 00
00 00 00 00 01 00 FF FF 1E 66 D9 92 9E 01 00 00
D0 6B 83 17 3A 14 00 00 00 00 00 00 C3 D4 1A 00
00 00 00 00 02 00 00 00 54 22 00 00 30 00 03 00
00 00 00 00 0E D6 02 00 00 00 00 00 00 00 00 17
00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 D5
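
Since smartctl will not talk to the controller here, the hex dumps above can be decoded by hand. The following is only a minimal sketch: it assumes the dump is the standard ATA SMART data block (2-byte revision, then 12-byte attribute entries), and the function name `parse_smart_hex` and the `NAMES` table are my own, not part of tw_cli or smartmontools.

```python
# Sketch: decode a raw SMART data dump as printed by "tw_cli /c0/pN show smart".
# Assumed layout (standard ATA SMART READ DATA): a 2-byte revision, then up to
# 30 entries of 12 bytes: id(1) flags(2) value(1) worst(1) raw(6) reserved(1).

# Attribute names as smartmontools prints them, for a few attributes of interest.
NAMES = {
    5: "Reallocated_Sector_Ct",
    187: "Reported_Uncorrect",
    197: "Current_Pending_Sector",
    198: "Offline_Uncorrectable",
}


def parse_smart_hex(hex_text):
    """Return {attribute_id: {flags, value, worst, raw}} from a hex dump."""
    data = bytes.fromhex("".join(hex_text.split()))
    attrs = {}
    for i in range(30):
        off = 2 + 12 * i
        entry = data[off:off + 12]
        if len(entry) < 12:
            break  # dump was truncated, stop at the last complete entry
        attr_id = entry[0]
        if attr_id == 0:
            continue  # unused slot
        attrs[attr_id] = {
            "flags": int.from_bytes(entry[1:3], "little"),
            "value": entry[3],
            "worst": entry[4],
            "raw": int.from_bytes(entry[5:11], "little"),
        }
    return attrs


# First four rows of the /c0/p1 dump; paste all 32 rows to decode everything.
SAMPLE = """
0A 00 01 0F 00 63 63 55 86 D3 0D 00 00 00 03 03
00 5F 5E 00 00 00 00 00 00 00 04 32 00 64 64 66
00 00 00 00 00 00 05 33 00 64 64 12 00 00 00 00
00 00 07 0F 00 4E 3C 3A EE 8A 04 00 00 00 09 32
"""

attrs = parse_smart_hex(SAMPLE)
for attr_id, a in sorted(attrs.items()):
    name = NAMES.get(attr_id, "")
    print(f"{attr_id:3d} {name:24s} value={a['value']:3d} "
          f"worst={a['worst']:3d} raw={a['raw']}")
```

Under that layout assumption, attribute 5 comes out with a raw value of 18 (0x12) on p1 and 31 (0x1F) on p0. How a raw value is to be read is vendor-specific, so this is a rough look at the data and not a replacement for a proper smartctl report.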

Marcel40625
31.12.13, 14:45
http://wiki.ubuntuusers.de/Festplattenstatus

sudo smartctl -A /dev/sda

peterdoo
31.12.13, 14:01
Hello,

I tried to get at the SMART data, but I cannot manage it with the parameters I found in various guides.

Neither -a -d 3ware,0 /dev/sda nor the other 3 variants ending in a0, e0 and l0 returned the data. Which command should I run?

Tommi
31.12.13, 13:22
Hello peterdoo,

to create a ticket for the disk replacement we still need an excerpt from the SMART logs that confirms the disk failure.

Thomas

peterdoo
31.12.13, 11:47
Hello,

on ks311804 the RAID-1 was shown as DEGRADED (the disk on port 1, p1). I started a rebuild. After about 60% the status changed back to DEGRADED:

Code:
root@rescue:~# tw_cli info c0

Unit  UnitType  Status         %Cmpl  Stripe  Size(GB)  Cache  AVerify  IgnECC
------------------------------------------------------------------------------
u0    RAID-1    DEGRADED       -      -       931.512   ON     -        -       

Port   Status           Unit   Size        Blocks        Serial
---------------------------------------------------------------
p0     OK               u0     931.51 GB   1953525168    9TE0CW8V
p1     DEGRADED         u0     931.51 GB   1953525168    9VP4JZ53
The server is in Rescue-Pro mode and the data is backed up. Please check the disk.
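
For anyone who wants to check this state from a script, here is a rough sketch that picks the non-OK ports out of the `tw_cli info c0` output shown above. The helper name `degraded_ports` is hypothetical, and the parsing assumes the port rows keep exactly the column layout from this listing (port, status, unit, size with unit, blocks, serial).

```python
def degraded_ports(info_text):
    """Return (port, status, serial) for every port row whose status is not OK."""
    rows = []
    for line in info_text.splitlines():
        fields = line.split()
        # A port row splits into 7 fields, e.g.:
        # p1  DEGRADED  u0  931.51 GB  1953525168  9VP4JZ53
        if len(fields) == 7 and fields[0].startswith("p") and fields[0][1:].isdigit():
            port, status, serial = fields[0], fields[1], fields[6]
            if status != "OK":
                rows.append((port, status, serial))
    return rows


# Output copied from the listing above.
INFO = """
Unit  UnitType  Status         %Cmpl  Stripe  Size(GB)  Cache  AVerify  IgnECC
------------------------------------------------------------------------------
u0    RAID-1    DEGRADED       -      -       931.512   ON     -        -

Port   Status           Unit   Size        Blocks        Serial
---------------------------------------------------------------
p0     OK               u0     931.51 GB   1953525168    9TE0CW8V
p1     DEGRADED         u0     931.51 GB   1953525168    9VP4JZ53
"""

result = degraded_ports(INFO)
for port, status, serial in result:
    print(port, status, serial)
```

In practice the text would come from running tw_cli itself; it is pasted in here so the sketch runs without the controller.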