Steps for checking problems on a SUN server


  1. If a problem occurs on a SUN server (HLR, BSM, etc.), for example the operating system crashes or the server suddenly shuts down abnormally, run the “explorer” command immediately after the incident to capture the system configuration and the Unix logs. Send the log data to Jakarta 🙂

Below is a screen capture of the “explorer” command being run:

[Screen capture: running explorer; taken 24/06/2009, 14:46]

[Screen capture: explorer output; taken 24/06/2009, 14:54]

  2. Check the log in /var/adm/messages

# more /var/adm/messages

Jun 15 00:00:00 master sendmail[17990]: [ID 702911 mail.crit] My unqualified host name (master) unknown; sleeping for retry

Jun 15 00:01:00 master sendmail[17990]: [ID 702911 mail.alert] unable to qualify my own domain name (master) -- using short name

Jun 15 00:30:00 master sendmail[19299]: [ID 702911 mail.crit] My unqualified host name (master) unknown; sleeping for retry

Jun 15 00:31:00 master sendmail[19299]: [ID 702911 mail.alert] unable to qualify my own domain name (master) -- using short name

Jun 15 09:22:14 master pcipsy: [ID 819770 kern.warning] WARNING: pci: Thermal warning detected!

Jun 15 09:22:28 master pseudo: [ID 129642 kern.info] pseudo-device: tod0

Jun 15 09:22:28 master genunix: [ID 936769 kern.info] tod0 is /pseudo/tod@0

Jun 15 09:22:29 master syslogd: going down on signal 15

Jun 15 09:22:34 master Array Monitor stopped

Jun 15 09:25:17 master RDAC support disabled

Jun 15 09:22:51 master genunix: [ID 672855 kern.notice] syncing file systems...

Jun 15 09:22:51 master genunix: [ID 904073 kern.notice]  done

Jun 15 09:25:07 master genunix: [ID 540533 kern.notice] ^MSunOS Release 5.8 Version Generic_108528-19 64-bit

Jun 15 09:25:07 master genunix: [ID 913631 kern.notice] Copyright 1983-2001 Sun Microsystems, Inc.  All rights reserved.

Jun 15 09:25:07 master genunix: [ID 678236 kern.info] Ethernet address = 8:0:20:b2:35:54

Jun 15 09:25:07 master unix: [ID 389951 kern.info] mem = 2097152K (0x80000000)
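
With a long messages file it can help to pull out only the higher-severity entries first. The following is a minimal Python sketch, assuming the log lines carry the "[ID number facility.level]" tag seen in the excerpt above. The function name `serious_entries` and the set of levels treated as serious are illustrative choices, not something from the original procedure.

```python
import re

# Matches syslog lines that carry a "[ID <n> <facility>.<level>]" tag,
# as in the /var/adm/messages excerpt above.
TAGGED = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s[\d:]{8})\s(?P<host>\S+)\s(?P<source>[^:]+):\s"
    r"\[ID\s\d+\s(?P<facility>\w+)\.(?P<level>\w+)\]\s(?P<text>.*)$"
)

# Assumption: which levels count as "serious" is a local policy choice.
SERIOUS = {"emerg", "alert", "crit", "err", "error", "warning"}


def serious_entries(lines):
    """Return (timestamp, source, level, text) for each tagged serious line."""
    found = []
    for line in lines:
        match = TAGGED.match(line.strip())
        if match and match.group("level") in SERIOUS:
            found.append(
                (
                    match.group("stamp"),
                    match.group("source"),
                    match.group("level"),
                    match.group("text"),
                )
            )
    return found


sample = [
    "Jun 15 00:00:00 master sendmail[17990]: [ID 702911 mail.crit] My unqualified host name (master) unknown; sleeping for retry",
    "Jun 15 09:22:14 master pcipsy: [ID 819770 kern.warning] WARNING: pci: Thermal warning detected!",
    "Jun 15 09:22:28 master pseudo: [ID 129642 kern.info] pseudo-device: tod0",
    "Jun 15 09:22:29 master syslogd: going down on signal 15",
]

for entry in serious_entries(sample):
    print(entry)
```

On this sample the thermal warning logged at 09:22:14, shortly before syslogd goes down, is one of the two lines kept. Untagged lines such as the syslogd shutdown message are skipped by this sketch and still need to be read by hand.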

  3. Commands for checking disk status

# format

Searching for disks...done

AVAILABLE DISK SELECTIONS:

0. c0t0d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>

/pci@1f,4000/scsi@3/sd@0,0

1. c2t5d0 <Symbios-StorEDGEA1000-0003 cyl 34690 alt 2 hd 64 sec 64>

/pci@1f,4000/scsi@5/sd@5,0

Specify disk (enter its number): ^D

#

# ls -l /dev/rdsk/ | grep /pci@1f,4000/scsi@5

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s0 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:a,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s1 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:b,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s2 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:c,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s3 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:d,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s4 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:e,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s5 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:f,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s6 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:g,raw

lrwxrwxrwx   1 root     root          47 May 20  2003 c1t5d0s7 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:h,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s0 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:a,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s1 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:b,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s2 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:c,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s3 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:d,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s4 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:e,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s5 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:f,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s6 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:g,raw

lrwxrwxrwx   1 root     root          45 May 20  2003 c2t5d0s7 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:h,raw

#

# df -k

Filesystem            kbytes    used   avail capacity  Mounted on

/dev/dsk/c0t0d0s0    4129290   99444 3988554     3%    /

/dev/dsk/c0t0d0s5    4129290  908050 3179948    23%    /usr

/proc                      0       0       0     0%    /proc

fd                         0       0       0     0%    /dev/fd

mnttab                     0       0       0     0%    /etc/mnttab

/dev/dsk/c0t0d0s3    4129290 1922446 2165552    48%    /var

swap                 4975032      16 4975016     1%    /var/run

/dev/dsk/c0t0d0s6    16426922 2098158 14164495    13%    /home1

swap                 4975392     376 4975016     1%    /tmp

/dev/dsk/c0t0d0s4    2053605    4579 1987418     1%    /opt

/dev/dsk/c2t5d0s6    69955723 14990925 54265241    22%    /ARRAY

#
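
The capacity column of the `df -k` listing above can also be checked automatically. This is a small Python sketch under the assumption that each data row has exactly six whitespace-separated columns, as in the output shown. The helper name `filesystems_over` and the 40% threshold are hypothetical.

```python
def filesystems_over(df_output, threshold_pct):
    """Return (mount_point, capacity_pct) for rows at or above the threshold."""
    hits = []
    for line in df_output.splitlines():
        fields = line.split()
        # Data rows have six columns and a capacity value ending in "%";
        # the header row ("Mounted on") splits into seven and is skipped.
        if len(fields) != 6 or not fields[4].endswith("%"):
            continue
        pct = int(fields[4].rstrip("%"))
        if pct >= threshold_pct:
            hits.append((fields[5], pct))
    return hits


sample = """\
Filesystem            kbytes    used   avail capacity  Mounted on
/dev/dsk/c0t0d0s0    4129290   99444 3988554     3%    /
/dev/dsk/c0t0d0s3    4129290 1922446 2165552    48%    /var
swap                 4975392     376 4975016     1%    /tmp
/dev/dsk/c2t5d0s6    69955723 14990925 54265241    22%    /ARRAY
"""

print(filesystems_over(sample, 40))  # [('/var', 48)]
```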

# iostat -x

extended device statistics

device       r/s    w/s   kr/s   kw/s wait actv  svc_t  %w  %b

fd0          0.0    0.0    0.0    0.0  0.0  0.0    0.0   0   0

sd0         19.1    5.6  434.5   40.9  0.0  0.4   17.9   0   7

sd6          0.0    0.0    0.0    0.0  0.0  0.0    0.0   0   0

sd133       26.7   20.6 1043.9  949.5  2.5  1.1   77.0   4  21

st12         0.0    0.0    0.0    0.0  0.0  0.0    0.0   0   0

nfs1         0.0    0.0    0.0    0.0  0.0  0.0    0.0   0   0

#

#

#

# iostat -E

sd0      Soft Errors: 0 Hard Errors: 0 Transport Errors: 0

Vendor: SEAGATE  Product: ST336607LSUN36G  Revision: 0207 Serial No: 3JA1H8HR00007342

Size: 36.42GB <36418595328 bytes>

Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0

Illegal Request: 0 Predictive Failure Analysis: 0

sd6      Soft Errors: 0 Hard Errors: 0 Transport Errors: 0

Vendor: TOSHIBA  Product: DVD-ROM SD-M1401 Revision: 1009 Serial No: 12/20/00

Size: 18446744073.71GB <-1 bytes>

Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0

Illegal Request: 0 Predictive Failure Analysis: 0

sd133    Soft Errors: 0 Hard Errors: 1 Transport Errors: 0

Vendor: Symbios  Product: StorEDGE A1000   Revision: 0003 Serial No: 1T02699221

Size: 72.75GB <72752300032 bytes>

Media Error: 0 Device Not Ready: 0 No Device: 1 Recoverable: 0

Illegal Request: 0 Predictive Failure Analysis: 0

st12     Soft Errors: 0 Hard Errors: 0 Transport Errors: 0

Vendor: HP       Product: C1537A           Revision: L007 Serial No:   62

#
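
In the `iostat -E` output above, sd133 (the StorEDGE A1000) is the only device with a non-zero counter (Hard Errors: 1, No Device: 1). A minimal Python sketch for picking out such devices, assuming the summary line format shown above; the function name `devices_with_errors` is illustrative.

```python
import re

# Matches the per-device summary line printed by "iostat -E".
SUMMARY = re.compile(
    r"^(?P<dev>\S+)\s+Soft Errors:\s*(?P<soft>\d+)\s+"
    r"Hard Errors:\s*(?P<hard>\d+)\s+Transport Errors:\s*(?P<transport>\d+)"
)


def devices_with_errors(iostat_text):
    """Map device name -> error counters, keeping only devices with errors."""
    result = {}
    for line in iostat_text.splitlines():
        match = SUMMARY.match(line.strip())
        if not match:
            continue
        counts = {key: int(match.group(key)) for key in ("soft", "hard", "transport")}
        if any(counts.values()):
            result[match.group("dev")] = counts
    return result


sample = """\
sd0      Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: SEAGATE  Product: ST336607LSUN36G  Revision: 0207 Serial No: 3JA1H8HR00007342
sd133    Soft Errors: 0 Hard Errors: 1 Transport Errors: 0
Vendor: Symbios  Product: StorEDGE A1000   Revision: 0003 Serial No: 1T02699221
"""

print(devices_with_errors(sample))
```

The sketch reads only the summary line of each device. The detail counters (Media Error, Device Not Ready, No Device, and so on) would need a second pattern.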

  4. Check the system configuration

# prtdiag -v

System Configuration:  Sun Microsystems  sun4u Sun Fire V890

System clock frequency: 150 MHz

Memory size: 16384 Megabytes

========================= CPUs ===============================================

Run   E$  CPU    CPU

Brd  CPU   MHz   MB Impl.   Mask

--- ----- ---- ---- ------- ----

A  0, 16 1500 32.0 US-IV+   2.2

B  1, 17 1500 32.0 US-IV+   2.2

A  2, 18 1500 32.0 US-IV+   2.2

B  3, 19 1500 32.0 US-IV+   2.2

========================= Memory Configuration ===============================

Logical  Logical  Logical

MC   Bank     Bank     Bank         DIMM    Interleave  Interleaved

Brd  ID   num      size     Status       Size    Factor      with

----  ---  ----     ------   -----------  ------  ----------  -----------

A    0     0      1024MB   no_status     512MB     8-way        0

A    0     1      1024MB   no_status     512MB     8-way        0

A    0     2      1024MB   no_status     512MB     8-way        0

A    0     3      1024MB   no_status     512MB     8-way        0

B    1     0      1024MB   no_status     512MB     8-way        1

B    1     1      1024MB   no_status     512MB     8-way        1

B    1     2      1024MB   no_status     512MB     8-way        1

B    1     3      1024MB   no_status     512MB     8-way        1

A    2     0      1024MB   no_status     512MB     8-way        0

A    2     1      1024MB   no_status     512MB     8-way        0

A    2     2      1024MB   no_status     512MB     8-way        0

A    2     3      1024MB   no_status     512MB     8-way        0

B    3     0      1024MB   no_status     512MB     8-way        1

B    3     1      1024MB   no_status     512MB     8-way        1

B    3     2      1024MB   no_status     512MB     8-way        1

B    3     3      1024MB   no_status     512MB     8-way        1

========================= IO Cards =========================

Bus  Max

IO   Port Bus       Freq Bus  Dev,

Brd  Type  ID  Side Slot MHz  Freq Func State Name                              Model

---- ---- ---- ---- ---- ---- ---- ---- ----- --------------------------------  ----------------------

I/O  PCI   8    B    3    33   33  2,0  ok    pci-pci8086,b154.0/pci108e,1000   PCI-BRIDGE

I/O  PCI   8    B    3    33   33  0,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   8    B    3    33   33  0,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   8    B    3    33   33  1,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   8    B    3    33   33  1,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   8    B    3    33   33  2,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   8    B    3    33   33  2,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   8    B    3    33   33  3,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   8    B    3    33   33  3,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   8    B    0    33   33  5,0  ok    scsi-pci1000,f.1000.1000.14/disk+

I/O  PCI   8    B    0    33   33  5,1  ok    scsi-pci1000,f.1000.1000.14/disk+

I/O  PCI   9    B    4    33   33  4,0  ok    pci-pci1011,25.4/pci108e,1000     PCI-BRIDGE

I/O  PCI   9    B    4    33   33  0,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    B    4    33   33  0,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    B    4    33   33  1,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    B    4    33   33  1,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    B    4    33   33  2,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    B    4    33   33  2,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    B    4    33   33  3,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    B    4    33   33  3,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    A    7    66   66  2,0  ok    pci-pci8086,b154.0/pci108e,1000   PCI-BRIDGE

I/O  PCI   9    A    7    66   66  0,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    A    7    66   66  0,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    A    7    66   66  1,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    A    7    66   66  1,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    A    7    66   66  2,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    A    7    66   66  2,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

I/O  PCI   9    A    7    66   66  3,0  ok    pci108e,1000-pci108e,1000.1       device on pci-bridge

I/O  PCI   9    A    7    66   66  3,1  ok    SUNW,qfe-pci108e,1001             SUNW,pci-qfe/pci-bridg+

No failures found in System

===========================

========================= Environmental Status =========================

System Temperatures (Celsius):

-------------------------------

Device          Temperature     Status

---------------------------------------

CPU0             58             OK

CPU1             53             OK

CPU2             54             OK

CPU3             51             OK

MB               28             OK

IOB              23             OK

DBP0             22             OK

=================================

Front Status Panel:

-------------------

Keyswitch position: LOCKED

System LED Status:

GEN FAULT                REMOVE

[OFF]                    [OFF]

DISK FAULT               POWER FAULT

[OFF]                    [OFF]

LEFT THERMAL FAULT       RIGHT THERMAL FAULT

[OFF]                    [OFF]

LEFT DOOR                RIGHT DOOR

[OFF]                    [OFF]

=================================

Disk Status:

Presence      Fault LED       Remove LED

DISK   0: [PRESENT]        [OFF]           [OFF]

DISK   1: [PRESENT]        [OFF]           [OFF]

DISK   2: [PRESENT]        [OFF]           [OFF]

DISK   3: [PRESENT]        [OFF]           [OFF]

DISK   4: [  EMPTY]

DISK   5: [  EMPTY]

DISK   6: [  EMPTY]

DISK   7: [  EMPTY]

DISK   8: [  EMPTY]

DISK   9: [  EMPTY]

DISK  10: [  EMPTY]

DISK  11: [  EMPTY]

=================================

Fan Bank :

----------

Bank                        Speed         Status        Fan State

( RPMS )

----                        --------      ---------      ---------

CPU0_PRIM_FAN                2040        [ENABLED]          OK

CPU1_PRIM_FAN                2173        [ENABLED]          OK

CPU0_SEC_FAN                    0        [DISABLED]         OK

CPU1_SEC_FAN                    0        [DISABLED]         OK

IO0_PRIM_FAN                 3000        [ENABLED]          OK

IO1_PRIM_FAN                 2941        [ENABLED]          OK

IO0_SEC_FAN                     0        [DISABLED]         OK

IO1_SEC_FAN                     0        [DISABLED]         OK

IO_BRIDGE_PRIM_FAN           3658        [ENABLED]          OK

IO_BRIDGE_SEC_FAN               0        [DISABLED]         OK

=================================

Power Supplies:

---------------

Current Drain:

Supply     Status     Fan Fail  Temp Fail  CS Fail  3.3V   5V   12V   48V

------  ------------  --------  ---------  -------  ----   ---   ---   ---

PS0      GOOD                                         6     4     2     3

PS1      GOOD                                         6     4     2     3

PS2      GOOD                                         6     4     2     3

========================= HW Revisions =======================================

System PROM revisions:

----------------------

OBP 4.18.11 2006/05/03 07:41

IO ASIC revisions:

------------------

Port

Model     ID  Status Version

-------- ---- ------ -------

Schizo    8     ok      7

Schizo    9     ok      7
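
The temperature table in the `prtdiag -v` output above can be extracted for a quick check. The following Python sketch assumes the section starts at the "System Temperatures" line and ends at the next line of "=" characters, as in the listing shown. The function name `temperature_readings` is hypothetical.

```python
def temperature_readings(prtdiag_text):
    """Return {device: (celsius, status)} from the System Temperatures table."""
    readings = {}
    in_section = False
    for line in prtdiag_text.splitlines():
        stripped = line.strip()
        if stripped.startswith("System Temperatures"):
            in_section = True
            continue
        if in_section and stripped.startswith("="):
            break  # a row of "=" closes the section
        fields = stripped.split()
        # Data rows look like "CPU0  58  OK"; header and rule lines are skipped.
        if in_section and len(fields) == 3 and fields[1].isdigit():
            readings[fields[0]] = (int(fields[1]), fields[2])
    return readings


sample = """\
System Temperatures (Celsius):
-------------------------------
Device          Temperature     Status
---------------------------------------
CPU0             58             OK
CPU1             53             OK
MB               28             OK
=================================
Front Status Panel:
"""

readings = temperature_readings(sample)
not_ok = [dev for dev, (_, status) in readings.items() if status != "OK"]
print(readings)
print("devices not OK:", not_ok)
```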

  5. Check processor status

The example below is taken from the HLR HA-1 server, which had a processor problem.

# psrinfo

0       on-line   since 05/06/2008 03:30:34

1       faulted   since 09/23/2008 16:32:20

2       on-line   since 05/06/2008 03:30:34

3       on-line   since 05/06/2008 03:30:26

16      on-line   since 05/06/2008 03:30:34

17      faulted   since 09/23/2008 16:19:40

18      on-line   since 05/06/2008 03:30:34

19      on-line   since 05/06/2008 03:30:34

# psrinfo -v

Status of virtual processor 0 as of: 06/16/2009 15:15:17

on-line since 05/06/2008 03:30:34.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 1 as of: 06/16/2009 15:15:17

faulted since 09/23/2008 16:32:20.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 2 as of: 06/16/2009 15:15:17

on-line since 05/06/2008 03:30:34.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 3 as of: 06/16/2009 15:15:17

on-line since 05/06/2008 03:30:26.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 16 as of: 06/16/2009 15:15:17

on-line since 05/06/2008 03:30:34.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 17 as of: 06/16/2009 15:15:17

faulted since 09/23/2008 16:19:40.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 18 as of: 06/16/2009 15:15:17

on-line since 05/06/2008 03:30:34.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.

Status of virtual processor 19 as of: 06/16/2009 15:15:17

on-line since 05/06/2008 03:30:34.

The sparcv9 processor operates at 1500 MHz,

and has a sparcv9 floating point processor.
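
The faulted processors can be listed directly from the short `psrinfo` output shown above. A minimal Python sketch, assuming each line starts with the processor ID followed by its state; the function name `faulted_processors` is illustrative.

```python
def faulted_processors(psrinfo_output):
    """Return the IDs of processors that psrinfo reports as faulted."""
    faulted = []
    for line in psrinfo_output.splitlines():
        fields = line.split()
        if len(fields) >= 2 and fields[0].isdigit() and fields[1] == "faulted":
            faulted.append(int(fields[0]))
    return faulted


sample = """\
0       on-line   since 05/06/2008 03:30:34
1       faulted   since 09/23/2008 16:32:20
16      on-line   since 05/06/2008 03:30:34
17      faulted   since 09/23/2008 16:19:40
"""

print(faulted_processors(sample))  # [1, 17]
```

The `prtdiag -v` listing earlier pairs IDs 1 and 17 on the same CPU row, which suggests that both faulted entries may belong to a single CPU module. That is an inference from the output and would need to be confirmed on the affected server.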

  6. Check the server serial number:

B_inniha1[/]# cd /opt/SUNWsneep/bin

B_inniha1[/opt/SUNWsneep/bin]# ./sneep

0637AM1615

  7. In principle, the log produced in step 1 is enough to analyze the problem, although the analysis does require careful reading. Hope this is useful.
  8. If you have any questions, please email sriyono.basuki@mobile-8.com or sbasuki_tech@yahoo.com
