BCC观测工具的使用
观测工具下载地址:https://github.com/iovisor/bcc
bcc观测工具的安装方式:https://github.com/iovisor/bcc/blob/master/INSTALL.md
工具简介:
BCC makes BPF programs easier to write, with kernel instrumentation in C (and includes a C wrapper around LLVM), and front-ends in Python and lua. It is suited for many tasks, including performance analysis and network traffic control.
BCC 是用于创建高效内核跟踪和操作程序的工具包,包括一些有用的工具和示例。它利用了扩展的BPF(Berkeley Packet Filters),正式名称为 eBPF。
使用要求:
1)Linux 内核版本最低 4.1
2)kernel-devel 版本必须和内核版本一致
[root@yang-01 ~]# rpm -qa|grep -i kernel | grep -i devel
kernel-devel-3.10.0-1127.18.2.el7.x86_64
[root@yang-01 ~]# uname -r
3.10.0-1127.el7.x86_64
3)kernel 配置文件相关参数必须按照下面的要求
[root@yang-01 ~]# grep -i BPF /boot/config-`uname -r`
CONFIG_BPF=y
CONFIG_BPF_SYSCALL=y
CONFIG_BPF_JIT_ALWAYS_ON=y
CONFIG_NETFILTER_XT_MATCH_BPF=m
CONFIG_NET_CLS_BPF=m
CONFIG_BPF_JIT=y
CONFIG_HAVE_EBPF_JIT=y
CONFIG_BPF_EVENTS=y
CONFIG_BPF_KPROBE_OVERRIDE=y
4)对于 CentOS 建议升级到 3.6 及以上版本
5)使用默认的 Python 2.7 即可,不需要升级 Python 3
[root@yang-01 ~]# cat /etc/redhat-releaseCentOS Linux release 7.8.2003 (Core)[root@yang-01 ~]# uname -r3.10.0-1127.el7.x86_64 [root@yang-01 ~]# yum update kernelLoaded plugins: fastestmirrorLoading mirror speeds from cached hostfile * base: mirrors.cn99.com * epel: mirrors.yun-idc.com * extras: mirrors.aliyun.com * updates: mirrors.aliyun.comResolving Dependencies--> Running transaction check---> Package kernel.x86_64 0:3.10.0-1127.18.2.el7 will be installed--> Finished Dependency ResolutionDependencies Resolved=========================================================================================================================================================================== Package Arch Version Repository Size===========================================================================================================================================================================Installing: kernel x86_64 3.10.0-1127.18.2.el7 updates 50 MTransaction Summary===========================================================================================================================================================================Install 1 PackageTotal download size: 50 MInstalled size: 64 MIs this ok [y/d/N]: yDownloading packages:Delta RPMs disabled because /usr/bin/applydeltarpm not installed.kernel-3.10.0-1127.18.2.el7.x86_64.rpm | 50 MB 00:00:08 Running transaction checkRunning transaction testTransaction test succeededRunning transaction Installing : kernel-3.10.0-1127.18.2.el7.x86_64 1/1 Verifying : kernel-3.10.0-1127.18.2.el7.x86_64 1/1Installed: kernel.x86_64 0:3.10.0-1127.18.2.el7 Complete![root@yang-01 ~]# yum install bcc -yLoaded plugins: fastestmirrorLoading mirror speeds from cached hostfile * base: mirrors.cn99.com * epel: mirrors.yun-idc.com * extras: mirrors.aliyun.com * updates: mirrors.aliyun.comResolving Dependencies--> Running transaction check---> Package bcc.x86_64 0:0.10.0-1.el7 will be installed--> Processing Dependency: bcc-tools = 0.10.0-1.el7 for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: llvm-private >= 6.0.1-0.3 for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libLLVM-7-rhel.so(LLVM_7)(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangSerialization.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangSema.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangRewrite.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangParse.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangLex.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangFrontend.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangEdit.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangDriver.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangCodeGen.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangBasic.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangAnalysis.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libclangAST.so.7()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Processing Dependency: libLLVM-7-rhel.so()(64bit) for package: bcc-0.10.0-1.el7.x86_64--> Running transaction check---> Package bcc-tools.x86_64 0:0.10.0-1.el7 will be installed--> Processing Dependency: python-bcc = 0.10.0-1.el7 for package: bcc-tools-0.10.0-1.el7.x86_64--> Processing Dependency: python-netaddr for package: bcc-tools-0.10.0-1.el7.x86_64--> Processing Dependency: kernel-devel for package: bcc-tools-0.10.0-1.el7.x86_64---> Package llvm-private.x86_64 0:7.0.1-1.el7 will be installed--> Running transaction check---> Package kernel-devel.x86_64 0:3.10.0-1127.18.2.el7 will be installed---> Package python-bcc.x86_64 0:0.10.0-1.el7 will be installed---> Package python-netaddr.noarch 0:0.7.5-9.el7 will be installed--> Finished Dependency ResolutionDependencies Resolved=========================================================================================================================================================================== Package Arch Version Repository Size===========================================================================================================================================================================Installing: bcc x86_64 0.10.0-1.el7 base 567 kInstalling for dependencies: bcc-tools x86_64 0.10.0-1.el7 base 394 k kernel-devel x86_64 3.10.0-1127.18.2.el7 updates 18 M llvm-private x86_64 7.0.1-1.el7 base 23 M python-bcc x86_64 0.10.0-1.el7 base 74 k python-netaddr noarch 0.7.5-9.el7 base 983 kTransaction Summary===========================================================================================================================================================================Install 1 Package (+5 Dependent packages)Total download size: 42 MInstalled size: 126 MDownloading packages:Delta RPMs disabled because /usr/bin/applydeltarpm not installed.(1/6): bcc-tools-0.10.0-1.el7.x86_64.rpm | 394 kB 00:00:05 (2/6): bcc-0.10.0-1.el7.x86_64.rpm | 567 kB 00:00:05 (3/6): python-bcc-0.10.0-1.el7.x86_64.rpm | 74 kB 00:00:00 (4/6): python-netaddr-0.7.5-9.el7.noarch.rpm | 983 kB 00:00:00 (5/6): kernel-devel-3.10.0-1127.18.2.el7.x86_64.rpm | 18 MB 00:00:06 (6/6): llvm-private-7.0.1-1.el7.x86_64.rpm | 23 MB 00:00:30 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------Total 1.4 MB/s | 42 MB 00:00:30 Running transaction checkRunning transaction testTransaction test succeededRunning transaction Installing : llvm-private-7.0.1-1.el7.x86_64 1/6 Installing : kernel-devel-3.10.0-1127.18.2.el7.x86_64 2/6 Installing : python-netaddr-0.7.5-9.el7.noarch 3/6 Installing : python-bcc-0.10.0-1.el7.x86_64 4/6 Installing : bcc-tools-0.10.0-1.el7.x86_64 5/6 Installing : bcc-0.10.0-1.el7.x86_64 6/6 Verifying : python-netaddr-0.7.5-9.el7.noarch 1/6 Verifying : python-bcc-0.10.0-1.el7.x86_64 2/6 Verifying : bcc-0.10.0-1.el7.x86_64 3/6 Verifying : kernel-devel-3.10.0-1127.18.2.el7.x86_64 4/6 Verifying : bcc-tools-0.10.0-1.el7.x86_64 5/6 Verifying : llvm-private-7.0.1-1.el7.x86_64 6/6Installed: bcc.x86_64 0:0.10.0-1.el7 Dependency Installed: bcc-tools.x86_64 0:0.10.0-1.el7 kernel-devel.x86_64 0:3.10.0-1127.18.2.el7 llvm-private.x86_64 0:7.0.1-1.el7 python-bcc.x86_64 0:0.10.0-1.el7 python-netaddr.noarch 0:0.7.5-9.el7 Complete![root@yang-01 tools]# pwd/usr/share/bcc/tools[root@yang-01 tools]# lsargdist cachestat dcstat fileslower javagc mountsnoop opensnoop pythoncalls rubystat sslsniff tcpaccept tplistbashreadline cachetop deadlock filetop javaobjnew mysqld_qslower perlcalls pythonflow runqlat stackcount tcpconnect tracebiolatency capable deadlock.c funccount javastat nfsdist perlflow pythongc runqlen statsnoop tcpconnlat ttysnoopbiosnoop cobjnew doc funclatency javathreads nfsslower perlstat pythonstat runqslower syncsnoop tcpdrop vfscountbiotop cpudist drsnoop funcslower killsnoop nodegc phpcalls reset-trace shmsnoop syscount tcplife vfsstatbitesize cpuunclaimed execsnoop gethostlatency lib nodestat phpflow rubycalls slabratetop tclcalls tcpretrans wakeuptimebpflist dbslower ext4dist hardirqs llcstat offcputime phpstat rubyflow sofdsnoop tclflow tcpsubnet xfsdistbtrfsdist dbstat ext4slower javacalls mdflush offwaketime pidpersec rubygc softirqs tclobjnew tcptop xfsslowerbtrfsslower dcsnoop filelife javaflow memleak oomkill profile rubyobjnew solisten tclstat tcptracer |
# 报错举例[root@yang-01 ~]# cachestat 1 3modprobe: FATAL: Module kheaders not found.chdir(/lib/modules/3.10.0-1127.el7.x86_64/build): No such file or directoryTraceback (most recent call last): File "/usr/share/bcc/tools/cachestat", line 96, in <module> b = BPF(text=bpf_text) File "/usr/lib/python2.7/site-packages/bcc/__init__.py", line 325, in __init__ raise Exception("Failed to compile BPF text")Exception: Failed to compile BPF text# 发现报错,这时需要进行内核版本的查看,确保 kernel-devel 与系统上当前安装的内核版本匹配[root@yang-01 tools]# rpm -qa|grep -i kernel | grep -i develkernel-devel-3.10.0-1127.18.2.el7.x86_64[root@yang-01 tools]# uname -r3.10.0-1127.el7.x86_64[root@yang-01 tools]# yum install "kernel-devel-uname-r == $(uname -r)"Loaded plugins: fastestmirrorLoading mirror speeds from cached hostfile * base: mirrors.cn99.com * epel: mirrors.yun-idc.com * extras: mirrors.aliyun.com * updates: mirrors.aliyun.comResolving Dependencies--> Running transaction check---> Package kernel-devel.x86_64 0:3.10.0-1127.el7 will be installed--> Finished Dependency ResolutionDependencies Resolved=========================================================================================================================================================================== Package Arch Version Repository Size===========================================================================================================================================================================Installing: kernel-devel x86_64 3.10.0-1127.el7 base 18 MTransaction Summary===========================================================================================================================================================================Install 1 PackageTotal download size: 18 MInstalled size: 38 MIs this ok [y/d/N]: yDownloading packages:Delta RPMs disabled because /usr/bin/applydeltarpm not installed.kernel-devel-3.10.0-1127.el7.x86_64.rpm | 18 MB 00:00:06 Running transaction checkRunning transaction testTransaction test succeededRunning transaction Installing : kernel-devel-3.10.0-1127.el7.x86_64 1/1 Verifying : kernel-devel-3.10.0-1127.el7.x86_64 1/1Installed: kernel-devel.x86_64 0:3.10.0-1127.el7 Complete! |
报错: 在安装kernel-devel显示 No package No package kernel-devel-uname-r == 5.6.2-1.el7.elrepo.x86_64 available. 解决办法: 启用[elrepo-kerne]存储库,# vim /etc/yum.repos.d/elrepo.repo[elrepo-kernel]name=ELRepo.org Community Enterprise Linux Kernel Repository - el7baseurl=http://elrepo.org/linux/kernel/el7/$basearch/ http://mirrors.coreix.net/elrepo/kernel/el7/$basearch/ http://mirror.rackspace.com/elrepo/kernel/el7/$basearch/ http://repos.lax-noc.com/elrepo/kernel/el7/$basearch/ http://mirror.ventraip.net.au/elrepo/kernel/el7/$basearch/mirrorlist=http://mirrors.elrepo.org/mirrors-elrepo-kernel.el7# 修改这一行将0改为1enabled=1然后重新执行:# sudo yum install "kernel-devel-uname-r == $(uname -r)" |
[root@yang-01 tools]# cachestat 1 5 HITS MISSES DIRTIES HITRATIO BUFFERS_MB CACHED_MB 1409 0 77 100.00% 1 2440 1438 0 101 100.00% 1 2440 1413 0 68 100.00% 1 2440 1412 0 117 100.00% 1 2440 1411 0 72 100.00% 1 2440第一个参数为间隔秒,第二个参数为输出次数TOTAL ,表示总的 I/O 次数MISSES ,表示缓存未命中的次数HITS ,表示缓存命中的次数DIRTIES, 表示新增到缓存中的脏页数BUFFERS_MB 表示 Buffers 的大小,以 MB 为单位CACHED_MB 表示 Cache 的大小,以 MB 为单位 [root@yang-01 tools]# cachetop16:24:15 Buffers MB: 1 / Cached MB: 2443 / Sort: HITS / Order: ascendingPID UID CMD HITS MISSES DIRTIES READ_HIT% WRITE_HIT% 1653 actionte ustats 1 0 0 100.0% 0.0% 1778 actionte umon-mgr 1 0 0 100.0% 0.0% 903 mysql jsvc 1 0 0 100.0% 0.0% 10909 actionte mysqld 2 0 0 100.0% 0.0% 22506 actionte uproxy 2 0 1 50.0% 0.0% 869 mysql jsvc 2 2 2 0.0% 0.0% 10788 actionte mysqld 3 0 0 100.0% 0.0% 1724 actionte urman-agent 4 2 2 33.3% 0.0% 21780 actionte uguard-agent 4 2 2 33.3% 0.0% 12798 actionte ustats 6 0 0 100.0% 0.0% 12799 actionte ustats 6 0 0 100.0% 0.0% 12800 actionte ustats 6 0 0 100.0% 0.0%这个命令的输出和top类似,默认按照缓存的命中次数(HITS)排序,展示了每个进程的缓存命中情况。这里的 HITS、MISSES 和 DIRTIES ,跟 cachestat 里的含义一样,分别代表间隔时间内的缓存命中次数、未命中次数以及新增到缓存中的脏页数。而 READ_HIT 和 WRITE_HIT ,分别表示读和写的缓存命中率。 |
# 查看新的进程。那些会消耗系统资源,但很短暂的进程,它们甚至不会出现在 top(1) 命令或其它工具中的显示之中。这些新进程可以使用 execsnoop 进行检测(或使用行业术语说,可以被追踪traced)。[root@yang-01 tools]# execsnoop -husage: execsnoop [-h] [-t] [-x] [-q] [-n NAME] [-l LINE] [--max-args MAX_ARGS]Trace exec() syscallsoptional arguments: -h, --help show this help message and exit -t, --timestamp include timestamp on output -x, --fails include failed exec()s -q, --quote Add quotemarks (") around arguments. -n NAME, --name NAME only print commands matching this name (regex), any arg -l LINE, --line LINE only print commands where arg contains this line (regex) --max-args MAX_ARGS maximum number of arguments parsed and displayed, defaults to 20examples: ./execsnoop # trace all exec() syscalls ./execsnoop -x # include failed exec()s ./execsnoop -t # include timestamps ./execsnoop -q # add "quotemarks" around arguments ./execsnoop -n main # only print command lines containing "main" ./execsnoop -l tpkg # only print command where arguments contains "tpkg"[root@yang-01 tools]# execsnoopPCOMM PID PPID RET ARGSbash 17436 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_5690.servicebash 17437 1642 0systemctl 17437 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_3307.service /usr/bin/systemctl is-active mysqld_3307.servicesystemctl 17436 1642 0 /usr/bin/systemctl is-active mysqld_5690.servicebash 17439 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_3307.servicesystemctl 17439 1642 0 /usr/bin/systemctl is-active mysqld_3307.servicebash 17440 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_5690.servicesystemctl 17440 1642 0 /usr/bin/systemctl is-active mysqld_5690.servicebash 17442 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_3307.servicesystemctl 17442 1642 0 /usr/bin/systemctl is-active mysqld_3307.servicebash 17443 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_5690.servicesystemctl 17443 1642 0 /usr/bin/systemctl is-active mysqld_5690.servicebash 17445 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_3307.servicebash 17446 1642 0 /usr/bin/bash --noprofile --norc -c systemctl is-active mysqld_5690.service################################################################################################################################################################## # opensnoop 通过跟踪 open() 系统调用来工作。opensnoop 的一些优点在于它能在系统范围内工作,并且跟踪所有进程的 open() 系统调用。[root@yang-01 tools]# opensnoop -husage: opensnoop [-h] [-T] [-U] [-x] [-p PID] [-t TID] [-u UID] [-d DURATION] [-n NAME] [-e] [-f FLAG_FILTER]Trace open() syscallsoptional arguments: -h, --help show this help message and exit -T, --timestamp include timestamp on output -U, --print-uid print UID column -x, --failed only show failed opens -p PID, --pid PID trace this PID only -t TID, --tid TID trace this TID only -u UID, --uid UID trace this UID only -d DURATION, --duration DURATION total duration of trace in seconds -n NAME, --name NAME only print process names containing this name -e, --extended_fields show extended fields -f FLAG_FILTER, --flag_filter FLAG_FILTER filter on flags argument (e.g., O_WRONLY)examples: ./opensnoop # trace all open() syscalls ./opensnoop -T # include timestamps ./opensnoop -U # include UID ./opensnoop -x # only show failed opens ./opensnoop -p 181 # only trace PID 181 ./opensnoop -t 123 # only trace TID 123 ./opensnoop -u 1000 # only trace UID 1000 ./opensnoop -d 10 # trace for 10 seconds only ./opensnoop -n main # only print process names containing "main" ./opensnoop -e # show extended fields ./opensnoop -f O_WRONLY -f O_RDWR # only print calls for writing[root@yang-01 tools]# opensnoopPID COMM FD ERR PATH1642 ustats 14 0 ./logs18699 opensnoop -1 2 /usr/lib64/python2.7/encodings/ascii.so18699 opensnoop -1 2 /usr/lib64/python2.7/encodings/asciimodule.so18699 opensnoop 16 0 /usr/lib64/python2.7/encodings/ascii.py18699 opensnoop 17 0 /usr/lib64/python2.7/encodings/ascii.pyc1309 uagent 16 0 /action-dmp/ustats/ustats.pid1309 uagent 16 0 /proc/1642/cmdline1309 uagent 16 0 /action-dmp/umon-mgr/umon-mgr.pid1309 uagent 16 0 /proc/1684/cmdline1309 uagent 16 0 /action-dmp/urman-agent/urman-agent.pid1309 uagent 16 0 /proc/1672/cmdline1309 uagent 16 0 /action-dmp/ucore/ucore.pid1309 uagent 16 0 /proc/1538/cmdline1309 uagent 16 0 /action-dmp/uguard-agent/uguard-agent.pid1309 uagent 16 0 /proc/1664/cmdline1309 uagent 16 0 /action-dmp/uguard-mgr/uguard-mgr.pid1309 uagent 16 0 /proc/1636/cmdline1309 uagent 16 0 /action-dmp/umon/umon.pid1309 uagent 16 0 /proc/1734/cmdline1309 uagent 16 0 /action-dmp/udeploy/udeploy.pid1309 uagent 16 0 /proc/1644/cmdline1309 uagent 16 0 /action-dmp/umc/umc.pid1309 uagent 16 0 /proc/1638/cmdline1309 uagent 16 0 /action-dmp/uproxy/uproxy.pid1309 uagent 16 0 /proc/22496/cmdline1309 uagent 16 0 /action-dmp/urman-mgr/urman-mgr.pid1309 uagent 16 0 /proc/1640/cmdline1664 uguard-agent 18 0 ./logs1642 ustats 14 0 /dev/null1642 ustats 40 0 /dev/null18703 bash 3 0 /etc/ld.so.cache18703 bash 3 0 /lib64/libtinfo.so.518703 bash 3 0 /lib64/libdl.so.218703 bash 3 0 /lib64/libc.so.618703 bash -1 6 /dev/tty18703 bash 3 0 /usr/lib/locale/locale-archive18703 bash 3 0 /proc/meminfo18703 bash 3 0 /usr/lib64/gconv/gconv-modules.cache18703 bash 3 0 /etc/nsswitch.conf18703 bash 3 0 /etc/ld.so.cache18703 bash 3 0 /lib64/libnss_files.so.218703 bash 3 0 /etc/passwd5448 mysqld -1 2 ./universe/u_delay.frm18703 systemctl 3 0 /etc/ld.so.cache18703 systemctl 3 0 /lib64/librt.so.118703 systemctl 3 0 /lib64/libselinux.so.118703 systemctl 3 0 /lib64/liblzma.so.518703 systemctl 3 0 /lib64/liblz4.so.118703 systemctl 3 0 /lib64/libgcrypt.so.1118703 systemctl 3 0 /lib64/libcap.so.218703 systemctl 3 0 /lib64/libgcc_s.so.118703 systemctl 3 0 /lib64/libpthread.so.018703 systemctl 3 0 /lib64/libc.so.618703 systemctl 3 0 /lib64/libpcre.so.118703 systemctl 3 0 /lib64/libdl.so.218703 systemctl 3 0 /lib64/libgpg-error.so.018703 systemctl 3 0 /lib64/libattr.so.118704 bash 3 0 /etc/ld.so.cache18704 bash 3 0 /lib64/libtinfo.so.5################################################################################################################################################################## # xfsslower 工具可以跟踪大于 1 毫秒(参数)延迟的常见 XFS 文件系统操作。# 在 xfsslower 工具运行的时候,通过在 XFS 中动态地检测内核函数实现的,并当它结束的时候解除该检测。# 这个 bcc 工具也有其它文件系统的版本:ext4slower、btrfsslower、zfsslower 和 nfsslower。[root@yang-01 tools]# xfsslower -husage: xfsslower [-h] [-j] [-p PID] [min_ms]Trace common XFS file operations slower than a thresholdpositional arguments: min_ms minimum I/O duration to trace, in ms (default 10)optional arguments: -h, --help show this help message and exit -j, --csv just print fields: comma-separated values -p PID, --pid PID trace this PID onlyexamples: ./xfsslower # trace operations slower than 10 ms (default) ./xfsslower 1 # trace operations slower than 1 ms ./xfsslower -j 1 # ... 1 ms, parsable output (csv) ./xfsslower 0 # trace all operations (warning: verbose) ./xfsslower -p 185 # trace PID 185 only[root@yang-01 tools]# xfsslower 1Tracing XFS operations slower than 1 msTIME COMM PID T BYTES OFF_KB LAT(ms) FILENAME17:02:44 umon-mgr 1684 S 0 0 5.93 detail.log17:02:44 umc 1638 S 0 0 1.05 detail.log17:02:45 mysqld 10763 W 81920 0 8.12 temp_16.ibt17:02:45 mysqld 10763 W 16384 80 2.16 temp_16.ibt17:02:45 ustats 1642 S 0 0 3.12 brief.log17:02:46 mysqld 10763 W 835584 1024 7.81 ibdata117:02:46 mysqld 10763 S 0 0 31.14 ib_logfile017:02:46 ucore 1538 S 0 0 5.89 raft.db17:02:46 ustats 1642 S 0 0 4.04 detail.log17:02:46 mysqld 10763 S 0 0 3.58 ib_logfile017:02:46 ustats 1642 S 0 0 1.49 brief.log17:02:46 umc 1638 S 0 0 13.70 brief.log17:02:46 uguard-mgr 1636 S 0 0 9.62 detail.log17:02:46 ucore 1538 S 0 0 8.87 detail.log17:02:48 ustats 1642 S 0 0 2.16 brief.log17:02:49 ustats 1642 S 0 0 1.41 brief.log17:02:49 uguard-agent 1664 S 0 0 26.30 brief.log17:02:49 uguard-agent 1664 S 0 0 6.56 detail.log17:02:49 mysqld 10763 S 0 0 1.89 ib_logfile017:02:50 ustats 1642 S 0 0 1.01 brief.log17:02:50 mysqld 10763 S 0 0 1.18 ib_logfile0################################################################################################################################################################## # 可以用来实现高效的内核摘要统计。从内核层到用户层的数据传输仅仅是“计数”列。 用户级程序生成其余的。[root@yang-01 tools]# biolatency -husage: biolatency [-h] [-T] [-Q] [-m] [-D] [-F] [interval] [count]Summarize block device I/O latency as a histogrampositional arguments: interval output interval, in seconds count number of outputsoptional arguments: -h, --help show this help message and exit -T, --timestamp include timestamp on output -Q, --queued include OS queued time in I/O time -m, --milliseconds millisecond histogram -D, --disks print a histogram per disk device -F, --flags print a histogram per set of I/O flagsexamples: ./biolatency # summarize block I/O latency as a histogram ./biolatency 1 10 # print 1 second summaries, 10 times ./biolatency -mT 1 # 1s summaries, milliseconds, and timestamps ./biolatency -Q # include OS queued time in I/O time ./biolatency -D # show each disk device separately ./biolatency -F # show I/O flags separately[root@yang-01 tools]# biolatencyTracing block device I/O... Hit Ctrl-C to end.^C usecs : count distribution 0 -> 1 : 0 | | 2 -> 3 : 0 | | 4 -> 7 : 0 | | 8 -> 15 : 0 | | 16 -> 31 : 13 |* | 32 -> 63 : 306 |********************************* | 64 -> 127 : 367 |****************************************| 128 -> 255 : 254 |*************************** | 256 -> 511 : 12 |* | 512 -> 1023 : 1 | | 1024 -> 2047 : 1 | | 2048 -> 4095 : 9 | | 4096 -> 8191 : 10 |* | 8192 -> 16383 : 9 | | 16384 -> 32767 : 3 | | 32768 -> 65535 : 2 | |################################################################################################################################################################## # tcplife显示 TCP 会话的生命周期和吞吐量统计[root@yang-01 tools]# tcplife -husage: tcplife [-h] [-T] [-t] [-w] [-s] [-p PID] [-L LOCALPORT] [-D REMOTEPORT]Trace the lifespan of TCP sessions and summarizeoptional arguments: -h, --help show this help message and exit -T, --time include time column on output (HH:MM:SS) -t, --timestamp include timestamp on output (seconds) -w, --wide wide column output (fits IPv6 addresses) -s, --csv comma separated values output -p PID, --pid PID trace this PID only -L LOCALPORT, --localport LOCALPORT comma-separated list of local ports to trace. -D REMOTEPORT, --remoteport REMOTEPORT comma-separated list of remote ports to trace.examples: ./tcplife # trace all TCP connect()s ./tcplife -t # include time column (HH:MM:SS) ./tcplife -w # wider colums (fit IPv6) ./tcplife -stT # csv output, with times & timestamps ./tcplife -p 181 # only trace PID 181 ./tcplife -L 80 # only trace local port 80 ./tcplife -L 80,81 # only trace local ports 80 and 81 ./tcplife -D 80 # only trace remote port 80[root@yang-01 tools]# tcplifePID COMM LADDR LPORT RADDR RPORT TX_KB RX_KB MS854 jsvc ::ffff:127.0.0.1 33222 ::ffff:127.0.0.1 3307 0 0 1.9110763 mysqld ::ffff:127.0.0.1 3307 ::ffff:127.0.0.1 33222 0 0 1.91854 jsvc ::ffff:127.0.0.1 33224 ::ffff:127.0.0.1 3307 0 0 2.0210763 mysqld ::ffff:127.0.0.1 3307 ::ffff:127.0.0.1 33224 0 0 2.031638 umc 10.186.60.24 48697 10.186.60.24 5704 0 0 3.011636 uguard-mgr ::ffff:10.186.60.24 5704 ::ffff:10.186.60.24 48697 0 0 3.03854 jsvc ::ffff:127.0.0.1 33228 ::ffff:127.0.0.1 3307 0 0 1.7610763 mysqld ::ffff:127.0.0.1 3307 ::ffff:127.0.0.1 33228 0 0 1.771684 umon-mgr ::1 37504 ::1 8200 0 0 0.031684 umon-mgr 127.0.0.1 48056 127.0.0.1 8200 0 0 0.02854 jsvc ::ffff:127.0.0.1 33234 ::ffff:127.0.0.1 3307 0 0 2.0210763 mysqld ::ffff:127.0.0.1 3307 ::ffff:127.0.0.1 33234 0 0 2.041642 ustats 127.0.0.1 33236 127.0.0.1 3307 0 0 3.1310763 mysqld ::ffff:127.0.0.1 3307 ::ffff:127.0.0.1 33236 0 0 3.151642 ustats 127.0.0.1 45738 127.0.0.1 5690 0 0 3.335448 mysqld ::ffff:127.0.0.1 5690 ::ffff:127.0.0.1 45738 0 0 3.35854 jsvc ::ffff:127.0.0.1 33240 ::ffff:127.0.0.1 3307 0 0 1.6510763 mysqld ::ffff:127.0.0.1 3307 ::ffff:127.0.0.1 33240 0 0 1.66#################################################################################################################################################################### # 检测用于名称解析的 gethostbyname(3) 和相关的库调用[root@yang-01 tools]# gethostlatency -husage: gethostlatency [-h] [-p PID]Show latency for getaddrinfo/gethostbyname[2] callsoptional arguments: -h, --help show this help message and exit -p PID, --pid PID trace this PID onlyexamples: ./gethostlatency # trace all TCP accept()s ./gethostlatency -p 181 # only trace PID 181 [root@yang-01 tools]# gethostlatencyTIME PID COMM LATms HOST17:37:55 903 jsvc 0.28 localhost17:38:03 22079 umon 0.28 localhost17:38:08 14308 umon-mgr 0.23 localhost17:38:16 919 jsvc 0.17 yang-0117:38:25 903 jsvc 0.28 localhost17:38:37 24347 umon 0.25 localhost17:38:38 1780 umon-mgr 0.34 localhost17:38:38 22085 umon 0.22 localhost17:38:43 1780 umon-mgr 0.22 localhost17:38:45 893 jsvc 0.08 yang-01##################################################################################################################################################################### # trace 工具由 Sasha Goldshtein 提供,并提供了一些基本的 printf(1) 功能和自定义探针。[root@yang-01 tools]# trace -husage: trace [-h] [-b BUFFER_PAGES] [-p PID] [-L TID] [-v] [-Z STRING_SIZE] [-S] [-M MAX_EVENTS] [-t] [-T] [-C] [-B] [-s SYM_FILE_LIST] [-K] [-U] [-a] [-I header] probe [probe ...]Attach to functions and print trace messages.positional arguments: probe probe specifier (see examples)optional arguments: -h, --help show this help message and exit -b BUFFER_PAGES, --buffer-pages BUFFER_PAGES number of pages to use for perf_events ring buffer (default: 64) -p PID, --pid PID id of the process to trace (optional) -L TID, --tid TID id of the thread to trace (optional) -v, --verbose print resulting BPF program code before executing -Z STRING_SIZE, --string-size STRING_SIZE maximum size to read from strings -S, --include-self do not filter trace's own pid from the trace -M MAX_EVENTS, --max-events MAX_EVENTS number of events to print before quitting -t, --timestamp print timestamp column (offset from trace start) -T, --time print time column -C, --print_cpu print CPU id -B, --bin_cmp allow to use STRCMP with binary values -s SYM_FILE_LIST, --sym_file_list SYM_FILE_LIST coma separated list of symbol files to use for symbol resolution -K, --kernel-stack output kernel stack trace -U, --user-stack output user stack trace -a, --address print virtual address in stacks -I header, --include header additional header files to include in the BPF program as either full path, or relative to current working directory, or relative to default kernel header search pathEXAMPLES:trace do_sys_open Trace the open syscall and print a default trace message when enteredtrace 'do_sys_open "%s", arg2' Trace the open syscall and print the filename being openedtrace 'sys_read (arg3 > 20000) "read %d bytes", arg3' Trace the read syscall and print a message for reads >20000 bytestrace 'r::do_sys_open "%llx", retval' Trace the return from the open syscall and print the return valuetrace 'c:open (arg2 == 42) "%s %d", arg1, arg2' Trace the open() call from libc only if the flags (arg2) argument is 42trace 'c:malloc "size = %d", arg1' Trace malloc calls and print the size being allocatedtrace 'p:c:write (arg1 == 1) "writing %d bytes to STDOUT", arg3' Trace the write() call from libc to monitor writes to STDOUTtrace 'r::__kmalloc (retval == 0) "kmalloc failed!"' Trace returns from __kmalloc which returned a null pointertrace 'r:c:malloc (retval) "allocated = %x", retval' Trace returns from malloc and print non-NULL allocated bufferstrace 't:block:block_rq_complete "sectors=%d", args->nr_sector' Trace the block_rq_complete kernel tracepoint and print # of tx sectorstrace 'u:pthread:pthread_create (arg4 != 0)' Trace the USDT probe pthread_create when its 4th argument is non-zerotrace 'p::SyS_nanosleep(struct timespec *ts) "sleep for %lld ns", ts->tv_nsec' Trace the nanosleep syscall and print the sleep duration in nstrace -I 'linux/fs.h' \ 'p::uprobe_register(struct inode *inode) "a_ops = %llx", inode->i_mapping->a_ops' Trace the uprobe_register inode mapping ops, and the symbol can be found in /proc/kallsymstrace -I 'kernel/sched/sched.h' \ 'p::__account_cfs_rq_runtime(struct cfs_rq *cfs_rq) "%d", cfs_rq->runtime_remaining' Trace the cfs scheduling runqueue remaining runtime. The struct cfs_rq is defined in kernel/sched/sched.h which is in kernel source tree and not in kernel-devel package. So this command needs to run at the kernel source tree root directory so that the added header file can be found by the compiler.trace -I 'net/sock.h' \ 'udpv6_sendmsg(struct sock *sk) (sk->sk_dport == 13568)' Trace udpv6 sendmsg calls only if socket's destination port is equal to 53 (DNS; 13568 in big endian order)trace -I 'linux/fs_struct.h' 'mntns_install "users = %d", $task->fs->users' Trace the number of users accessing the file system of the current task |
# bcc 基于 eBPF 开发(需要 Linux 3.15 及更高版本)。bcc 使用的大部分内容都需要 Linux 4.1 及更高版本。 [root@yang-01 tools]# dbstat mysql -p 10763 -uTraceback (most recent call last): File "/usr/share/bcc/tools/dbstat", line 88, in <module> usdt.enable_probe("query__start", "probe_start") File "/usr/lib/python2.7/site-packages/bcc/usdt.py", line 154, in enable_probe probebcc.usdt.USDTException: failed to enable probe 'query__start'; a possible cause can be that the probe requires a pid to enable[root@yang-01 tools]# dbslower mysql -p 10763 -m 30Traceback (most recent call last): File "/usr/share/bcc/tools/dbslower", line 193, in <module> usdt.enable_probe("query__start", "query_start") File "/usr/lib/python2.7/site-packages/bcc/usdt.py", line 154, in enable_probe probebcc.usdt.USDTException: failed to enable probe 'query__start'; a possible cause can be that the probe requires a pid to enable # OS内核版本, >= 4.4 存在统计Bug, 部分功能需要>= 4.13# "bcc.usdt.USDTException: failed to enable probe 'query__start'; a possible cause can be that the probe requires a pid to enable” 需要有Dtrace tracepoint的MySQL, 需要重新编译. https://dev.mysql.com/doc/refman/5.7/en/dba-dtrace-server.html# 使用bcc需要root权限 |
[root@yang-01 tools]# biolatency -D 2Tracing block device I/O... Hit Ctrl-C to end.disk = 'vda' usecs : count distribution 0 -> 1 : 0 | | 2 -> 3 : 0 | | 4 -> 7 : 0 | | 8 -> 15 : 0 | | 16 -> 31 : 0 | | 32 -> 63 : 63 |*********************************** | 64 -> 127 : 71 |****************************************| 128 -> 255 : 46 |************************* | 256 -> 511 : 4 |** | 512 -> 1023 : 0 | | 1024 -> 2047 : 0 | | 2048 -> 4095 : 3 |* | 4096 -> 8191 : 3 |* |disk = '' usecs : count distribution 0 -> 1 : 0 | | 2 -> 3 : 0 | | 4 -> 7 : 0 | | 8 -> 15 : 0 | | 16 -> 31 : 0 | | 32 -> 63 : 0 | | 64 -> 127 : 0 | | 128 -> 255 : 0 | | 256 -> 511 : 0 | | 512 -> 1023 : 0 | | 1024 -> 2047 : 0 | | 2048 -> 4095 : 1 |****************************************|disk = 'vda' usecs : count distribution 0 -> 1 : 0 | | 2 -> 3 : 0 | | 4 -> 7 : 0 | | 8 -> 15 : 0 | | 16 -> 31 : 10 |*** | 32 -> 63 : 84 |******************************** | 64 -> 127 : 103 |****************************************| 128 -> 255 : 43 |**************** | 256 -> 511 : 4 |* | 512 -> 1023 : 0 | | 1024 -> 2047 : 0 | | 2048 -> 4095 : 7 |** | 4096 -> 8191 : 17 |****** | 8192 -> 16383 : 1 | | 16384 -> 32767 : 1 | | 32768 -> 65535 : 1 | | 65536 -> 131071 : 1 | | 131072 -> 262143 : 44 |***************** |disk = '' usecs : count distribution 0 -> 1 : 0 | | 2 -> 3 : 0 | | 4 -> 7 : 0 | | 8 -> 15 : 0 | | 16 -> 31 : 0 | | 32 -> 63 : 0 | | 64 -> 127 : 0 | | 128 -> 255 : 0 | | 256 -> 511 : 0 | | 512 -> 1023 : 0 | | 1024 -> 2047 : 0 | | 2048 -> 4095 : 1 |****************************************| [root@yang-01 tools]# ps aux | grep mysql | grep optactiont+ 10763 3.3 18.0 4570952 1070628 ? Sl Jul16 1887:06 /opt/mysql/base/8.0.19/bin/mysqld --defaults-file=/opt/mysql/etc/3307/my.cnf --daemonize --pid-file=/opt/mysql/data/3307/mysqld.pid --user=actiontech-mysql --socket=/opt/mysql/data/3307/mysqld.sock --port=3307[root@yang-01 tools]# filetop -p 10763 -C 5Tracing... Output every 5 secs. Hit Ctrl-C to end18:01:40 loadavg: 0.14 0.15 0.20 2/1119 31195TID COMM READS WRITES R_Kb W_Kb T FILE10788 mysqld 0 1 0 832 R ibdata110909 mysqld 19 0 117 0 R mysql-relay.00004410794 mysqld 0 99 0 67 R ib_logfile010977 mysqld 0 60 0 4 R mysql-relay.00004410910 mysqld 0 10 0 4 R mysql-bin.00001510789 mysqld 0 1 0 0 R ib_logfile018:01:45 loadavg: 0.21 0.16 0.21 2/1119 31212TID COMM READS WRITES R_Kb W_Kb T FILE10909 mysqld 19 0 116 0 R mysql-relay.00004410794 mysqld 0 98 0 64 R ib_logfile010977 mysqld 0 60 0 4 R mysql-relay.00004410910 mysqld 0 10 0 4 R mysql-bin.00001518:01:50 loadavg: 0.19 0.16 0.20 2/1119 31226TID COMM READS WRITES R_Kb W_Kb T FILE10909 mysqld 15 0 88 0 R mysql-relay.00004410794 mysqld 0 106 0 70 R ib_logfile019780 mysqld 2 2 8 7 R ibzMa2hy10977 mysqld 0 60 0 4 R mysql-relay.00004410910 mysqld 0 10 0 4 R mysql-bin.00001519780 mysqld 1 0 0 0 R mysql-bin.00000919780 mysqld 1 0 0 0 R mysql-bin.00001219780 mysqld 1 0 0 0 R mysql-bin.00001419780 mysqld 1 0 0 0 R mysql-bin.00001019780 mysqld 1 0 0 0 R mysql-bin.00001319780 mysqld 1 0 0 0 R mysql-bin.000011[root@yang-01 tools]# filelifeTIME PID COMM AGE(s) FILE18:03:26 1823 ustats 0.00 disk-read-write-test.log18:03:26 1830 ustats 0.00 disk-read-write-test.log18:03:29 1736 umon-mgr 0.01 grafana.db-journal18:03:41 1823 ustats 0.00 disk-read-write-test.log18:03:41 1784 ustats 0.01 disk-read-write-test.log |


浙公网安备 33010602011771号