Step checking problem pada server SUN
Step checking problem pada server SUN
- Jika terjadi problem di Server SUN (HLR, BSM dll) misal Sistem Operasi Crash atau tiba2 Server mati secara abnormal, setelah kejadian segera lakukan command “explorer” untuk mencapture konfigurasi system dan log di unix. Data Log kirim ke Jakarta
Berikut Hasil Capture eksekusi command “explorer”:
Screen clipping taken: 24/06/2009, 14:46
Capture : Output Hasil explorer
Screen clipping taken: 24/06/2009, 14:54
- Cek Log di /var/adm/messages
# more /var/adm/mess ages
Jun 15 00:00:00 master sendmail[17990]: [ID 702911 mail.crit] My unqualified host name (master) unknown; sleeping for retry
Jun 15 00:01:00 master sendmail[17990]: [ID 702911 mail.alert] unable to qualify my own domain name (master) — using short name
Jun 15 00:30:00 master sendmail[19299]: [ID 702911 mail.crit] My unqualified host name (master) unknown; sleeping for retry
Jun 15 00:31:00 master sendmail[19299]: [ID 702911 mail.alert] unable to qualify my own domain name (master) — using short name
Jun 15 09:22:14 master pcipsy: [ID 819770 kern.warning] WARNING: pci: Thermal warning detected!
Jun 15 09:22:28 master pseudo: [ID 129642 kern.info] pseudo-device: tod0
Jun 15 09:22:28 master genunix: [ID 936769 kern.info] tod0 is /pseudo/tod@0
Jun 15 09:22:29 master syslogd: going down on signal 15
Jun 15 09:22:34 master Array Monitor stopped
Jun 15 09:25:17 master RDAC support disabled
Jun 15 09:22:51 master genunix: [ID 672855 kern.notice] syncing file systems…
Jun 15 09:22:51 master genunix: [ID 904073 kern.notice] done
Jun 15 09:25:07 master genunix: [ID 540533 kern.notice] ^MSunOS Release 5.8 Version Generic_108528-19 64-bit
Jun 15 09:25:07 master genunix: [ID 913631 kern.notice] Copyright 1983-2001 Sun Microsystems, Inc. All rights reserved.
Jun 15 09:25:07 master genunix: [ID 678236 kern.info] Ethernet address = 8:0:20:b2:35:54
Jun 15 09:25:07 master unix: [ID 389951 kern.info] mem = 2097152K (0×80000000)
- Command untuk cek Status DISK
# format
Searching for disks…done
AVAILABLE DISK SELECTIONS:
0. c0t0d0 <SUN36G cyl 24620 alt 2 hd 27 sec 107>
/pci@1f,4000/scsi@3/sd@0,0
1. c2t5d0 <Symbios-StorEDGEA1000-0003 cyl 34690 alt 2 hd 64 sec 64>
/pci@1f,4000/scsi@5/sd@5,0
Specify disk (enter its number): ^D
#
# ls -l /dev / rd sk/ | grep /pci@1f,4000/scsi@5
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s0 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:a,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s1 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:b,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s2 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:c,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s3 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:d,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s4 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:e,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s5 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:f,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s6 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:g,raw
lrwxrwxrwx 1 root root 47 May 20 2003 c1t5d0s7 -> ../../devices/pci@1f,4000/scsi@5,1/sd@5,0:h,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s0 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:a,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s1 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:b,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s2 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:c,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s3 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:d,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s4 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:e,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s5 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:f,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s6 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:g,raw
lrwxrwxrwx 1 root root 45 May 20 2003 c2t5d0s7 -> ../../devices/pci@1f,4000/scsi@5/sd@5,0:h,raw
#
# df -k
Filesystem kbytes used avail capacity Mounted on
/dev/dsk/c0t0d0s0 4129290 99444 3988554 3% /
/dev/dsk/c0t0d0s5 4129290 908050 3179948 23% /usr
/proc 0 0 0 0% /proc
fd 0 0 0 0% /dev/fd
mnttab 0 0 0 0% /etc/mnttab
/dev/dsk/c0t0d0s3 4129290 1922446 2165552 48% /var
swap 4975032 16 4975016 1% /var/run
/dev/dsk/c0t0d0s6 16426922 2098158 14164495 13% /home1
swap 4975392 376 4975016 1% /tmp
/dev/dsk/c0t0d0s4 2053605 4579 1987418 1% /opt
/dev/dsk/c2t5d0s6 69955723 14990925 54265241 22% /ARRAY
#
# iostat -x
extended device statistics
device r/s w/s kr/s kw/s wait actv svc_t %w %b
fd0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0 0
sd0 19.1 5.6 434.5 40.9 0.0 0.4 17.9 0 7
sd6 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0 0
sd133 26.7 20.6 1043.9 949.5 2.5 1.1 77.0 4 21
st12 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0 0
nfs1 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0 0
#
#
#
# iostat -E
sd0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: SEAGATE Product: ST336607LSUN36G Revision: 0207 Serial No: 3JA1H8HR00007342
Size: 36.42GB <36418595328 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd6 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: TOSHIBA Product: DVD-ROM SD-M1401 Revision: 1009 Serial No: 12/20/00
Size: 18446744073.71GB <-1 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
sd133 Soft Errors: 0 Hard Errors: 1 Transport Errors: 0
Vendor: Symbios Product: StorEDGE A1000 Revision: 0003 Serial No: 1T02699221
Size: 72.75GB <72752300032 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 1 Recoverable: 0
Illegal Request: 0 Predictive Failure Analysis: 0
st12 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
Vendor: HP Product: C1537A Revision: L007 Serial No: 62
#
- Cek System Configuration
# prtdiag -v
System Configuration: Sun Microsystems sun4u Sun Fire V890
System clock frequency: 150 MHz
Memory size: 16384 Megabytes
========================= CPUs ===============================================
Run E$ CPU CPU
Brd CPU MHz MB Impl. Mask
— —– —- —- ——- —-
A 0, 16 1500 32.0 US-IV+ 2.2
B 1, 17 1500 32.0 US-IV+ 2.2
A 2, 18 1500 32.0 US-IV+ 2.2
B 3, 19 1500 32.0 US-IV+ 2.2
========================= Memory Configuration ===============================
Logical Logical Logical
MC Bank Bank Bank DIMM Interleave Interleaved
Brd ID num size Status Size Factor with
—- — —- —— ———– —— ———- ———–
A 0 0 1024MB no_status 512MB 8-way 0
A 0 1 1024MB no_status 512MB 8-way 0
A 0 2 1024MB no_status 512MB 8-way 0
A 0 3 1024MB no_status 512MB 8-way 0
B 1 0 1024MB no_status 512MB 8-way 1
B 1 1 1024MB no_status 512MB 8-way 1
B 1 2 1024MB no_status 512MB 8-way 1
B 1 3 1024MB no_status 512MB 8-way 1
A 2 0 1024MB no_status 512MB 8-way 0
A 2 1 1024MB no_status 512MB 8-way 0
A 2 2 1024MB no_status 512MB 8-way 0
A 2 3 1024MB no_status 512MB 8-way 0
B 3 0 1024MB no_status 512MB 8-way 1
B 3 1 1024MB no_status 512MB 8-way 1
B 3 2 1024MB no_status 512MB 8-way 1
B 3 3 1024MB no_status 512MB 8-way 1
========================= IO Cards =========================
Bus Max
IO Port Bus Freq Bus Dev,
Brd Type ID Side Slot MHz Freq Func State Name Model
—- —- —- —- —- —- —- —- —– ——————————– ———————-
I/O PCI 8 B 3 33 33 2,0 ok pci-pci8086,b154.0/pci108e,1000 PCI-BRIDGE
I/O PCI 8 B 3 33 33 0,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 8 B 3 33 33 0,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 8 B 3 33 33 1,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 8 B 3 33 33 1,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 8 B 3 33 33 2,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 8 B 3 33 33 2,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 8 B 3 33 33 3,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 8 B 3 33 33 3,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 8 B 0 33 33 5,0 ok scsi-pci1000,f.1000.1000.14/disk+
I/O PCI 8 B 0 33 33 5,1 ok scsi-pci1000,f.1000.1000.14/disk+
I/O PCI 9 B 4 33 33 4,0 ok pci-pci1011,25.4/pci108e,1000 PCI-BRIDGE
I/O PCI 9 B 4 33 33 0,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 B 4 33 33 0,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 B 4 33 33 1,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 B 4 33 33 1,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 B 4 33 33 2,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 B 4 33 33 2,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 B 4 33 33 3,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 B 4 33 33 3,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 A 7 66 66 2,0 ok pci-pci8086,b154.0/pci108e,1000 PCI-BRIDGE
I/O PCI 9 A 7 66 66 0,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 A 7 66 66 0,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 A 7 66 66 1,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 A 7 66 66 1,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 A 7 66 66 2,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 A 7 66 66 2,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
I/O PCI 9 A 7 66 66 3,0 ok pci108e,1000-pci108e,1000.1 device on pci-bridge
I/O PCI 9 A 7 66 66 3,1 ok SUNW,qfe-pci108e,1001 SUNW,pci-qfe/pci-bridg+
No failures found in System
===========================
========================= Environmental Status =========================
System Temperatures (Celsius):
——————————-
Device Temperature Status
—————————————
CPU0 58 OK
CPU1 53 OK
CPU2 54 OK
CPU3 51 OK
MB 28 OK
IOB 23 OK
DBP0 22 OK
=================================
Front Status Panel:
——————-
Keyswitch position: LOCKED
System LED Status:
GEN FAULT REMOVE
[OFF] [OFF]
DISK FAULT POWER FAULT
[OFF] [OFF]
LEFT THERMAL FAULT RIGHT THERMAL FAULT
[OFF] [OFF]
LEFT DOOR RIGHT DOOR
[OFF] [OFF]
=================================
Disk Status:
Presence Fault LED Remove LED
DISK 0: [PRESENT] [OFF] [OFF]
DISK 1: [PRESENT] [OFF] [OFF]
DISK 2: [PRESENT] [OFF] [OFF]
DISK 3: [PRESENT] [OFF] [OFF]
DISK 4: [ EMPTY]
DISK 5: [ EMPTY]
DISK 6: [ EMPTY]
DISK 7: [ EMPTY]
DISK 8: [ EMPTY]
DISK 9: [ EMPTY]
DISK 10: [ EMPTY]
DISK 11: [ EMPTY]
=================================
Fan Bank :
———-
Bank Speed Status Fan State
( RPMS )
—- ——– ——— ———
CPU0_PRIM_FAN 2040 [ENABLED] OK
CPU1_PRIM_FAN 2173 [ENABLED] OK
CPU0_SEC_FAN 0 [DISABLED] OK
CPU1_SEC_FAN 0 [DISABLED] OK
IO0_PRIM_FAN 3000 [ENABLED] OK
IO1_PRIM_FAN 2941 [ENABLED] OK
IO0_SEC_FAN 0 [DISABLED] OK
IO1_SEC_FAN 0 [DISABLED] OK
IO_BRIDGE_PRIM_FAN 3658 [ENABLED] OK
IO_BRIDGE_SEC_FAN 0 [DISABLED] OK
=================================
Power Supplies:
—————
Current Drain:
Supply Status Fan Fail Temp Fail CS Fail 3.3V 5V 12V 48V
—— ———— ——– ——— ——- —- – — —
PS0 GOOD 6 4 2 3
PS1 GOOD 6 4 2 3
PS2 GOOD 6 4 2 3
========================= HW Revisions =======================================
System PROM revisions:
———————-
OBP 4.18.11 2006/05/03 07:41
IO ASIC revisions:
——————
Port
Model ID Status Version
——– —- —— ——-
Schizo 8 ok 7
Schizo 9 ok 7
- Cek Status Processor
Diambil contoh Processor HLR HA-1 Problem
# psrinfo
0 on-line since 05/06/2008 03:30:34
1 faulted since 09/23/2008 16:32:20
2 on-line since 05/06/2008 03:30:34
3 on-line since 05/06/2008 03:30:26
16 on-line since 05/06/2008 03:30:34
17 faulted since 09/23/2008 16:19:40
18 on-line since 05/06/2008 03:30:34
19 on-line since 05/06/2008 03:30:34
# psrinfo -v
Status of virtual processor 0 as of: 06/16/2009 15:15:17
on-line since 05/06/2008 03:30:34.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 1 as of: 06/16/2009 15:15:17
faulted since 09/23/2008 16:32:20.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 2 as of: 06/16/2009 15:15:17
on-line since 05/06/2008 03:30:34.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 3 as of: 06/16/2009 15:15:17
on-line since 05/06/2008 03:30:26.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 16 as of: 06/16/2009 15:15:17
on-line since 05/06/2008 03:30:34.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 17 as of: 06/16/2009 15:15:17
faulted since 09/23/2008 16:19:40.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 18 as of: 06/16/2009 15:15:17
on-line since 05/06/2008 03:30:34.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
Status of virtual processor 19 as of: 06/16/2009 15:15:17
on-line since 05/06/2008 03:30:34.
The sparcv9 processor operates at 1500 MHz,
and has a sparcv9 floating point processor.
- Cek SERIAL NUMBER Server :
B_inniha1[/]# cd /opt/SUNWse/opt/SUNWsneep/bin
B_inniha1[/opt/SUNWsneep/bin]# ./sneep
0637AM1615
- Pada dasarnya dengan log hasil step nomor 1, sudah cukup untuk menganalisa problem, memang dibutuhkan ketelitian untuk menganalisanya. Semoga Bermanfaat
- Jika ada yang perlu ditanyakan silahkan email ke sriyono.basuki@mobile-8.com atau sbasuki_tech@yahoo.com