early kdump

简介: early kdump

参考

正文

Early Kdump HOWTO

Introduction

Early kdump is a mechanism to make kdump operational earlier than normal kdump

service. The kdump service starts early enough for general crash cases, but

there are some cases where it has no chance to make kdump operational in boot

sequence, such as detecting devices and starting early services. If you hit

such a case, early kdump may allow you to get more information of it.

Early kdump is implemented as a dracut module. It adds a kernel (vmlinuz) and

initramfs for kdump to your system's initramfs in order to load them as early

as possible. After that, if you provide "rd.earlykdump" in kernel command line,

then in the initramfs, early kdump will load those files like the normal kdump

service. This is disabled by default.

For the normal kdump service, it can check whether the early kdump has loaded

the crash kernel and initramfs. It has no conflict with the early kdump.

How to configure early kdump

We assume if you're reading this document, you should already have kexec-tools

installed.

You can rebuild the initramfs with earlykdump support with below steps:

  1. start kdump service to make sure kdump initramfs is created.
    # systemctl start kdump
    NOTE: If a crash occurs during boot process, early kdump captures a vmcore
    and reboot the system by default, so the system might go into crash loop.
    You can avoid such a crash loop by adding the following settings, which
    power off the system after dump capturing, to kdump.conf in advance:
final_action poweroff
 failure_action poweroff
  1. For the failure_action, you can choose anything other than "reboot".
  2. rebuild system initramfs with earlykdump support.
    # dracut --force --add earlykdump
    NOTE: Recommend to backup the original system initramfs before performing
    this step to put it back if something happens during boot-up.
  3. add rd.earlykdump in grub kernel command line.

After making said changes, reboot your system to take effect. Of course, if you

want to disable early kdump, you can simply remove "rd.earlykdump" from kernel

boot parameters in grub, and reboot system like above.

Once the boot is completed, you can check the status of the early kdump support

on the command prompt:

# journalctl -b | grep early-kdump

Then, you will see some useful logs, for example:

  • if early kdump is successful.

Mar 09 09:57:56 localhost dracut-cmdline[190]: early-kdump is enabled.

Mar 09 09:57:56 localhost dracut-cmdline[190]: kexec: loaded early-kdump kernel

  • if early kdump is disabled.

Mar 09 10:02:47 localhost dracut-cmdline[189]: early-kdump is disabled.

Notes

  • The size of early kdump initramfs will be large because it includes vmlinuz
    and kdump initramfs.
    (所以可能需要将内核启动参数里的crashkernel的大小增大,否则在第二个内核启动时可能会出现内存不足而导致无法保存宕机信息)
  • Early kdump inherits the settings of normal kdump, so any changes that
    caused normal kdump rebuilding also require rebuilding the system initramfs
    to make sure that the changes take effect for early kdump. Therefore, after
    the rebuilding of kdump initramfs is completed, provide a prompt message to
    tell the fact.
  • If you install an updated kernel and reboot the system with it, the early
    kdump will be disabled by default. To enable it with the new kernel, you
    need to take the above steps again.

Limitation

  • At present, early kdump doesn't support fadump.
  • Early kdump loads a crash kernel and initramfs at the beginning of the
    process in system's initramfs, so a crash at earlier than that (e.g. in
    kernel initialization) cannot be captured even with the early kdump.
相关文章
|
2月前
|
机器学习/深度学习 监控
早停法(Early Stopping)
早停法(Early Stopping)
|
1月前
crash命令 —— irq
crash命令 —— irq
|
1月前
|
NoSQL
crash —— 查看内核配置
crash —— 查看内核配置
|
1月前
crash命令 —— mach
crash命令 —— mach
启动报错“No bootable device”
三个步骤排查启动报错“No bootable device”
启动报错“No bootable device”
|
Linux Windows
胆战心惊-Kernel Panic -- not syncing: attempted to kill init
关闭了Redhat,重新启动,就遇到Kernel Panic -- not syncing: attempted to kill init,并有这样的提示:CPU has been disabled by guest os. Power off or resset。
1960 0
|
传感器 算法 Linux
Perf Subsystem —— 基于PMI实现的NMI Watchdog
## 背景 任务能否被及时响应,对内核来说,至关重用。Linux kernel实现了softlockup和hardlockup,用于检测系统是否出现了长时间无响应。 > A ‘softlockup’ is defined as a bug that causes the kernel to loop in kernel mode for more than 20 seconds, with
2043 1
|
机器学习/深度学习
|
C语言
1003 Emergency (25)
#include #include using namespace std; int main(int argc, const char * argv[]) { //基本初始化 const int ...
874 0