20 Linux System Monitoring Tool
Sumber: http://www.cyberciti.biz/tips/top-linux-monitoring-tools.html
Anda butuh untuk memonitor performance Server Linux? Coba perintah berikut. Kebanyakan distribusi Linux akan di lengkapi dengan banyak monitoring tools. Tool ini akan mengukur aktifitas system. Dengan tool ini kita dapat mengetahui penyebab masalah yang terjadi. Perintah dasar yang di jelaskan berikut ini menjadi bermanfaat terutama untuk analisa dan debugging server untuk masalah,
- Mencari bottleneck.
- Disk (storage) bottleneck.
- CPU dan memory bottleneck.
- Network bottleneck.
#1: top - Perintah untuk melihat aktifitas proses
Top memberikan gambaran real time sebuah sistem yang sedang berjalan, proses yang ada. Defaultnya, dia akan menampilkan proses yang sedang menggunakan CPU dan mengupdate-nya setiap 5 detik.
top - 11:42:04 up 7:11, 8 users, load average: 5.22, 5.42, 5.91 Tasks: 250 total, 2 running, 248 sleeping, 0 stopped, 0 zombie Cpu(s): 2.9%us, 0.6%sy, 0.0%ni, 96.0%id, 0.0%wa, 0.0%hi, 0.4%si, 0.0%st Mem: 2021012k total, 1558636k used, 462376k free, 218932k buffers Swap: 976892k total, 0k used, 976892k free, 762492k cached PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 1477 root 20 0 84456 40m 7664 S 5 2.1 28:36.33 Xorg 3777 onno 20 0 499m 117m 23m S 4 5.9 28:17.09 firefox-bin 2281 onno 20 0 109m 16m 10m S 1 0.9 1:01.13 gnome-terminal 9961 root 20 0 4308 992 712 D 1 0.0 0:25.50 chown 9965 root 20 0 4072 708 476 D 1 0.0 0:24.45 chmod 10970 root 20 0 2624 1204 844 R 1 0.1 0:00.03 top 78 root 20 0 0 0 0 R 0 0.0 0:08.87 kondemand/0 10380 root 20 0 4136 900 476 D 0 0.0 0:05.40 chmod
Perintah top mempunyai beberapa hot key yang sangat berguna seperti
Hot Key Penggunaan t Tampilkan rangkuman informasi off dan on m Tampilkan informasi memory off dan on. A Urutkan tampilan berdasarkan pengguna paling besar dari sumber daya system. f Masuk ke konfigurasi interaktif layar untuk top. o Memungkinkan kita untuk secara interaktif memilih urutan didalam top r Mengeluarkan perintah renice. k Mengeluarkan perintah kill / bunuh sebuah proses. z Menyalakan / mematikan color / mono.
=> Related: How do I Find Out Linux CPU Utilization?
#2: vmstat - Aktivitas Sistem, Informasi Perangkat Keras dan Hardware
Perintah vmstat melaporkan informasi tentang proses, memori, paging, block IO, traps dan aktifitas cpu.
# vmstat 3
Sample Outputs:
procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu---- r b swpd free buff cache si so bi bo in cs us sy id wa 0 0 0 434272 221012 774744 0 0 59 42 253 393 4 2 85 10 0 0 0 434560 221104 774776 0 0 32 132 829 1426 2 2 95 0 0 0 0 435936 221040 773688 0 0 15 108 989 1416 1 1 98 0 1 0 0 436644 221104 774032 0 0 12 167 805 1112 1 1 99 0 0 0 0 507552 221108 703244 0 0 11 0 534 899 1 1 98 0 0 0 0 506700 221176 703648 0 0 17 161 826 1255 1 1 98 0
Menampilkan pemakaian memori Slabinfo
# vmstat -m
Mendapatkan informasi tentang memori aktif / tidak aktif
# vmstat -a
#3: w - Mencari Siapa Yang Login dan Apa Yang Dia Lakukan
perintah w menampilkan informasi tentang user yang sedang berada di dalam mesin dan menampilkan proses dari user tersebut.
# w username # w onno
Contoh Output:
17:58:47 up 5 days, 20:28, 2 users, load average: 0.36, 0.26, 0.24 USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT root pts/0 10.1.3.145 14:55 5.00s 0.04s 0.02s vim /etc/resolv.conf root pts/1 10.1.3.145 17:43 0.00s 0.03s 0.00s w
#4: uptime - Memberitahu Berapa Lama Sistem Telah Berjalan
Perintah uptime dapat digunakan untuk melihat berapa lama server telah berjalan. Lebih lengkapnya perintah ini memberi tahu waktu saat ini, berapa lama sistem berjalan, berapa banyak user yang sedang login dan rata-rata beban server pada 1, 5, dan 15 menit.
# uptime
Output:
18:02:41 up 41 days, 23:42, 1 user, load average: 0.91, 0.58, 0.54
1 dapat dianggap sebagai nilai beban optimal. Beban dapat berubah dari sistem ke sistem. Untuk sistem CPU tunggal 1 - 3 dan sistem SMP 6-10 nilai beban mungkin dapat diterima.
#5: ps - Menampilkan Proses
Perintah ps akan melaporkan snapshot dari proses saat ini. Untuk memilih semua proses gunakan option -A atau -e.
# ps -A
Contoh Output:
PID TTY TIME CMD 1 ? 00:00:02 init 2 ? 00:00:02 migration/0 3 ? 00:00:01 ksoftirqd/0 4 ? 00:00:00 watchdog/0 5 ? 00:00:00 migration/1 6 ? 00:00:15 ksoftirqd/1 .... ..... 4881 ? 00:53:28 java 4885 tty1 00:00:00 mingetty 4886 tty2 00:00:00 mingetty 4887 tty3 00:00:00 mingetty 4888 tty4 00:00:00 mingetty 4891 tty5 00:00:00 mingetty 4892 tty6 00:00:00 mingetty 4893 ttyS1 00:00:00 agetty 12853 ? 00:00:00 cifsoplockd 12854 ? 00:00:00 cifsdnotifyd 14231 ? 00:10:34 lighttpd 14232 ? 00:00:00 php-cgi 54981 pts/0 00:00:00 vim 55465 ? 00:00:00 php-cgi 55546 ? 00:00:00 bind9-snmp-stat 55704 pts/1 00:00:00 ps
ps seperti top tetapi memberikan lebih banyak informasi.
Melihat output dalam format panjang
# ps -Al
Menghidupkan mode extra full (memperlihatkan argumen baris perintah pada proses):
# ps -AlF
Melihat thread ( LWP dan NLWP)
# ps -AlFH
Melihat thread setelah proses
# ps -AlLm
Menampilkan semua proses pada server
# ps ax # ps axu
Menampilkan pohon proses
# ps -ejH # ps axjf # pstree
Menampilkan informasi keamanan
# ps -eo euser,ruser,suser,fuser,f,comm,label # ps axZ # ps -eM
Melihat setiap proses yang berjalan sebagai user onno
# ps -U onno -u onno u
Set output sesuai dengan yang kita inginkan
# ps -eo pid,tid,class,rtprio,ni,pri,psr,pcpu,stat,wchan:14,comm # ps axo stat,euid,ruid,tty,tpgid,sess,pgrp,ppid,pid,pcpu,comm # ps -eopid,tt,user,fname,tmout,f,wchan
Hanya menampilkan ID proses dari Lighttpd
# ps -C lighttpd -o pid=
atau
# pgrep lighttpd
atau
# pgrep -u vivek php-cgi
Menampilkan nama dari PID 55977
# ps -p 55977 -o comm=
Mencari tahu 10 proses yang memakan memori paling banyak
# ps -auxf | sort -nr -k 4 | head -10
Mencari tahu 10 proses yang memakan CPU paling banyak
# ps -auxf | sort -nr -k 3 | head -10
#6: free - Pemakaian Memory
Perintah free menampilkan jumlah total dari physical dan swap memory yang dipakai dan tidak dipakai di dalam sistem serta buffer yang digunakan oleh kernel.
# free
Contoh Output:
total used free shared buffers cached Mem: 12302896 9739664 2563232 0 523124 5154740 -/+ buffers/cache: 4061800 8241096 Swap: 1052248 0 1052248
#7: iostat - Rata-rata load CPU dan Aktifitas Disk
Perintah iostat melaporkan statistik Central Processing Unit (CPU) dan statistik input/output dari device, partisi dan network filesystems (NFS).
# iostat
Contoh Output:
Linux 2.6.18-128.1.14.el5 (www03.nixcraft.in) 06/26/2009 avg-cpu: %user %nice %system %iowait %steal %idle 3.50 0.09 0.51 0.03 0.00 95.86 Device: tps Blk_read/s Blk_wrtn/s Blk_read Blk_wrtn sda 22.04 31.88 512.03 16193351 260102868 sda1 0.00 0.00 0.00 2166 180 sda2 22.04 31.87 512.03 16189010 260102688 sda3 0.00 0.00 0.00 1615 0
#8: sar - Collect and Report System Activity
The sar command is used to collect, report, and save system activity information. To see network counter, enter:
# sar -n DEV | more
To display the network counters from the 24th:
# sar -n DEV -f /var/log/sa/sa24 | more
You can also display real time usage using sar:
# sar 4 5
Sample Outputs:
Linux 2.6.18-128.1.14.el5 (www03.nixcraft.in) 06/26/2009 06:45:12 PM CPU %user %nice %system %iowait %steal %idle 06:45:16 PM all 2.00 0.00 0.22 0.00 0.00 97.78 06:45:20 PM all 2.07 0.00 0.38 0.03 0.00 97.52 06:45:24 PM all 0.94 0.00 0.28 0.00 0.00 98.78 06:45:28 PM all 1.56 0.00 0.22 0.00 0.00 98.22 06:45:32 PM all 3.53 0.00 0.25 0.03 0.00 96.19 Average: all 2.02 0.00 0.27 0.01 0.00 97.70
=> Related: : How to collect Linux system utilization data into a file
#9: mpstat - Multiprocessor Usage
The mpstat command displays activities for each available processor, processor 0 being the first one. mpstat -P ALL to display average CPU utilization per processor:
# mpstat -P ALL
Sample Output:
Linux 2.6.18-128.1.14.el5 (www03.nixcraft.in) 06/26/2009 06:48:11 PM CPU %user %nice %sys %iowait %irq %soft %steal %idle intr/s 06:48:11 PM all 3.50 0.09 0.34 0.03 0.01 0.17 0.00 95.86 1218.04 06:48:11 PM 0 3.44 0.08 0.31 0.02 0.00 0.12 0.00 96.04 1000.31 06:48:11 PM 1 3.10 0.08 0.32 0.09 0.02 0.11 0.00 96.28 34.93 06:48:11 PM 2 4.16 0.11 0.36 0.02 0.00 0.11 0.00 95.25 0.00 06:48:11 PM 3 3.77 0.11 0.38 0.03 0.01 0.24 0.00 95.46 44.80 06:48:11 PM 4 2.96 0.07 0.29 0.04 0.02 0.10 0.00 96.52 25.91 06:48:11 PM 5 3.26 0.08 0.28 0.03 0.01 0.10 0.00 96.23 14.98 06:48:11 PM 6 4.00 0.10 0.34 0.01 0.00 0.13 0.00 95.42 3.75 06:48:11 PM 7 3.30 0.11 0.39 0.03 0.01 0.46 0.00 95.69 76.89
=> Related: : Linux display each multiple SMP CPU processors utilization individually.
#10: pmap - Process Memory Usage
The command pmap report memory map of a process. Use this command to find out causes of memory bottlenecks.
# pmap -d PID
To display process memory information for pid # 47394, enter:
# pmap -d 47394
Sample Outputs:
47394: /usr/bin/php-cgi Address Kbytes Mode Offset Device Mapping 0000000000400000 2584 r-x-- 0000000000000000 008:00002 php-cgi 0000000000886000 140 rw--- 0000000000286000 008:00002 php-cgi 00000000008a9000 52 rw--- 00000000008a9000 000:00000 [ anon ] 0000000000aa8000 76 rw--- 00000000002a8000 008:00002 php-cgi 000000000f678000 1980 rw--- 000000000f678000 000:00000 [ anon ] 000000314a600000 112 r-x-- 0000000000000000 008:00002 ld-2.5.so 000000314a81b000 4 r---- 000000000001b000 008:00002 ld-2.5.so 000000314a81c000 4 rw--- 000000000001c000 008:00002 ld-2.5.so 000000314aa00000 1328 r-x-- 0000000000000000 008:00002 libc-2.5.so 000000314ab4c000 2048 ----- 000000000014c000 008:00002 libc-2.5.so ..... ...... .. 00002af8d48fd000 4 rw--- 0000000000006000 008:00002 xsl.so 00002af8d490c000 40 r-x-- 0000000000000000 008:00002 libnss_files-2.5.so 00002af8d4916000 2044 ----- 000000000000a000 008:00002 libnss_files-2.5.so 00002af8d4b15000 4 r---- 0000000000009000 008:00002 libnss_files-2.5.so 00002af8d4b16000 4 rw--- 000000000000a000 008:00002 libnss_files-2.5.so 00002af8d4b17000 768000 rw-s- 0000000000000000 000:00009 zero (deleted) 00007fffc95fe000 84 rw--- 00007ffffffea000 000:00000 [ stack ] ffffffffff600000 8192 ----- 0000000000000000 000:00000 [ anon ] mapped: 933712K writeable/private: 4304K shared: 768000K
The last line is very important:
* mapped: 933712K total amount of memory mapped to files * writeable/private: 4304K the amount of private address space * shared: 768000K the amount of address space this process is sharing with others
=> Related: : Linux find the memory used by a program / process using pmap command
#11 and #12: netstat and ss - Network Statistics
The command netstat displays network connections, routing tables, interface statistics, masquerade connections, and multicast memberships. ss command is used to dump socket statistics. It allows showing information similar to netstat. See the following resources about ss and netstat commands:
- ss: Display Linux TCP / UDP Network and Socket Information
- Get Detailed Information About Particular IP address Connections Using netstat Command
#13: iptraf - Real-time Network Statistics
The iptraf command is interactive colorful IP LAN monitor. It is an ncurses-based IP LAN monitor that generates various network statistics including TCP info, UDP counts, ICMP and OSPF information, Ethernet load info, node stats, IP checksum errors, and others. It can provide the following info in easy to read format:
- Network traffic statistics by TCP connection
- IP traffic statistics by network interface
- Network traffic statistics by protocol
- Network traffic statistics by TCP/UDP port and by packet size
- Network traffic statistics by Layer2 address
Fig.02: General interface statistics: IP traffic statistics by network interface
Fig.02: General interface statistics: IP traffic statistics by network interface Fig.03 Network traffic statistics by TCP connection
Fig.03 Network traffic statistics by TCP connection
#14: tcpdump - Detailed Network Traffic Analysis
The tcpdump is simple command that dump traffic on a network. However, you need good understanding of TCP/IP protocol to utilize this tool. For.e.g to display traffic info about DNS, enter:
# tcpdump -i eth1 'udp port 53'
To display all IPv4 HTTP packets to and from port 80, i.e. print only packets that contain data, not, for example, SYN and FIN packets and ACK-only packets, enter:
# tcpdump 'tcp port 80 and (((ip[2:2] - ((ip[0]&0xf)<<2)) - ((tcp[12]&0xf0)>>2)) != 0)'
To display all FTP session to 202.54.1.5, enter:
# tcpdump -i eth1 'dst 202.54.1.5 and (port 21 or 20'
To display all HTTP session to 192.168.1.5:
# tcpdump -ni eth0 'dst 192.168.1.5 and tcp and port http'
Use wireshark to view detailed information about files, enter:
# tcpdump -n -i eth1 -s 0 -w output.txt src or dst port 80
#15: strace - System Calls
Trace system calls and signals. This is useful for debugging webserver and other server problems. See how to use to trace the process and see What it is doing.
#16: /Proc file system - Various Kernel Statistics
/proc file system provides detailed information about various hardware devices and other Linux kernel information. See Linux kernel /proc documentations for further details. Common /proc examples:
# cat /proc/cpuinfo # cat /proc/meminfo # cat /proc/zoneinfo # cat /proc/mounts
17#: Nagios - Server And Network Monitoring
Nagios is a popular open source computer system and network monitoring application software. You can easily monitor all your hosts, network equipment and services. It can send alert when things go wrong and again when they get better. FAN is "Fully Automated Nagios". FAN goals are to provide a Nagios installation including most tools provided by the Nagios Community. FAN provides a CDRom image in the standard ISO format, making it easy to easilly install a Nagios server. Added to this, a wide bunch of tools are including to the distribution, in order to improve the user experience around Nagios.
18#: Cacti - Web-based Monitoring Tool
Cacti is a complete network graphing solution designed to harness the power of RRDTool's data storage and graphing functionality. Cacti provides a fast poller, advanced graph templating, multiple data acquisition methods, and user management features out of the box. All of this is wrapped in an intuitive, easy to use interface that makes sense for LAN-sized installations up to complex networks with hundreds of devices. It can provide data about network, CPU, memory, logged in users, Apache, DNS servers and much more. See how to install and configure Cacti network graphing tool under CentOS / RHEL.
#19: KDE System Guard - Real-time Systems Reporting and Graphing
KSysguard is a network enabled task and system monitor application for KDE desktop. This tool can be run over ssh session. It provides lots of features such as a client/server architecture that enables monitoring of local and remote hosts. The graphical front end uses so-called sensors to retrieve the information it displays. A sensor can return simple values or more complex information like tables. For each type of information, one or more displays are provided. Displays are organized in worksheets that can be saved and loaded independently from each other. So, KSysguard is not only a simple task manager but also a very powerful tool to control large server farms. Fig.05 KDE System Guard
Fig.05 KDE System Guard {Image credit: Wikipedia}
See the KSysguard handbook for detailed usage.
#20: Gnome System Monitor - Real-time Systems Reporting and Graphing
The System Monitor application enables you to display basic system information and monitor system processes, usage of system resources, and file systems. You can also use System Monitor to modify the behavior of your system. Although not as powerful as the KDE System Guard, it provides the basic information which may be useful for new users:
- Displays various basic information about the computer's hardware and software.
- Linux Kernel version
- GNOME version
- Hardware
- Installed memory
- Processors and speeds
- System Status
- Currently available disk space
- Processes
- Memory and swap space
- Network usage
- File Systems
- Lists all mounted filesystems along with basic information about each.
Fig.06 The Gnome System Monitor application
Fig.06 The Gnome System Monitor application
Bonus: Tools Tambahan
Beberapa tool lainnya:
- nmap - scan port yang terbuka pada server anda.
- lsof - daftar file yang terbuka, koneksi jaringan dan banyak lagi.
- ntop web based tool - ntop adalah tool terbaik untuk melihat penggunaan jaringan, mirip dengan perintah top tetapi ini adalah aplikasi untuk memonitor lalu lintas jaringan. Anda dapat melihat status jaringan, lalu lintas jaringan pada protokol UDP, TCP, DNS, HTTP dan protokol lainnya.
adalah alat terbaik untuk melihat penggunaan jaringan dalam cara yang mirip dengan apa perintah atas tidak untuk proses yaitu membutuhkan software monitoring lalu lintas jaringan. Anda dapat melihat status jaringan, protokol distribusi bijaksana lalu lintas untuk UDP, TCP, DNS, HTTP dan protokol lainnya. is the best tool to see network usage in a way similar to what top command does for processes i.e. it is network traffic monitoring software. You can see network status, protocol wise distribution of traffic for UDP, TCP, DNS, HTTP and other protocols.
- Conky - Monitoring tool yang bagus untuk X Window System. Conky sangat konfigurable dan bisa memonitor banyak variable sistem termasuk status CPU, memori, sisa swap, disk penyimpanan, temperature, proses, interface jaringan, power batre, system messages, inbox e-mail dan lain-lain.
- GKrellM - Dapat digunakan untuk memonitor status CPU, memori utama, hard disk, interface network, mailbox lokal dan remote dan banyak hal lainnya.
- vnstat - vnStat adalah network traffic monitor berbasis console. Tool ini menjaga log lalu lintas jaringan per jam, harian dan bulanan dari interface yang dipilih.
- htop - htop adalah peningkatan versi dari top, melihat proses secara interaktif yang dapat menampilkan proses menggunakan bentuk pohon.
- mtr - mtr menggabungkan fungsi traceroute dan program ping pada sebuah tool diagnosa jaringan.