cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
575
Views
0
Helpful
2
Replies

IOS XRv 9000 fails to boot in DevNet CML

msanghera6
Level 1
Level 1

The IOS XRv 9000 node fails to boot / gets stuck in a boot loop.

It does get to this point in the boot process, but doesn't progress further, even if you wait (30mins+)

I understand this is a resource intensive node, but its been months since this was a last able to boot successfully.

I believe it is using v7.4.1 and runs with the default allocated resources.

################################################################################
# #
# Welcome to the Cisco IOS XRv9k platform #
# #
# Please wait for Cisco IOS XR to start. #
# #
# Copyright (c) 2014-2019 by Cisco Systems, Inc. #
# #
################################################################################

Cisco IOS XR console will start on the 1st serial port
Cisco IOS XR aux console will start on the 2nd serial port
Cisco Calvados console will start on the 3rd serial port
Cisco Calvados aux will start on the 4th serial port
[ 108.267032] BUG: unable to handle kernel paging request at ffffffff8a0449b1
[ 108.267032] IP: [<ffffffff8a0449b1>] kvm_unlock_kick+0x81/0x90
[ 108.267032] PGD aa0f067 PUD aa10063 PMD a0001e1
[ 108.267032] Oops: 0003 [#1] SMP
[ 108.267032] Modules linked in: xts gf128mul dm_crypt veth tun bridge ip6table_filter ip6_tables iptable_filter ip_tables 8021q garp stp mrp llc ext3 jbd cisco_nb_vrf(O) lcndklm_vrf(O) kvm_amd kvm sr_mod virtio_rng cdrom rng_core virtio_net virtio_balloon button crc32_pclmul i2c_piix4 uhci_hcd i2c_core virtio_console vmw_pvscsi megaraid_sas(O) ata_piix virtio_blk virtio_pci virtio_ring virtio sd_mod crc_t10dif crct10dif_common ahci libahci ext4 jbd2 dm_mod
[ 108.267032] CPU: 1 PID: 15398 Comm: tune2fs Tainted: G O 3.14.23-WR7.0.0.2_standard #1
[ 108.267032] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1ubuntu1.1 04/01/2014
[ 108.267032] task: ffff880037aed090 ti: ffff8800b7ddc000 task.ti: ffff8800b7ddc000
[ 108.267032] RIP: 0010:[<ffffffff8a0449b1>] [<ffffffff8a0449b1>] kvm_unlock_kick+0x81/0x90
[ 108.267032] RSP: 0018:ffff8800b7ddfbb0 EFLAGS: 00010046
[ 108.267032] RAX: 0000000000000005 RBX: 0000000000000000 RCX: 0000000000000002
[ 108.267032] RDX: 0000000000000002 RSI: 0000000000000004 RDI: ffff88052491a358
[ 108.267032] RBP: ffff8800b7ddfbc8 R08: ffffffff8adafb00 R09: 0000000000000000
[ 108.267032] R10: ffff88050a536270 R11: 0000000000000000 R12: ffff88052491a358
[ 108.267032] R13: 000000000000540a R14: 0000000000000246 R15: 0000000000000000
[ 108.267032] FS: 00007fed19989740(0000) GS:ffff88053fc80000(0000) knlGS:0000000000000000
[ 108.267032] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 108.267032] CR2: ffffffff8a0449b1 CR3: 00000000b7efb000 CR4: 00000000003407e0
[ 108.267032] Stack:
[ 108.267032] 0000000000000246 ffff88052491a340 ffff88052491a358 ffff8800b7ddfbe0
[ 108.267032] ffffffff8a5c815a ffffea000c395588 ffff8800b7ddfc18 ffffffff8a1d5406
[ 108.267032] ffffea000c395588 ffff88050b8005e0 0000000000000000 0000000000000001
[ 108.267032] Call Trace:
[ 108.267032] [<ffffffff8a5c815a>] _raw_spin_unlock_irqrestore+0x5a/0x70
[ 108.267032] [<ffffffff8a1d5406>] __set_page_dirty+0x66/0xc0
[ 108.267032] [<ffffffff8a1d5581>] mark_buffer_dirty+0x61/0xd0
[ 108.267032] [<ffffffff8a1d6623>] __block_commit_write.isra.21+0x83/0xc0
[ 108.267032] [<ffffffff8a1d6693>] block_write_end+0x33/0x80
[ 108.267032] [<ffffffff8a1d7cf6>] ? block_write_begin+0x46/0x90
[ 108.267032] [<ffffffff8a1dcfe0>] blkdev_write_end+0x20/0x40
[ 108.267032] [<ffffffff8a1318f0>] generic_file_buffered_write+0x150/0x280
[ 108.267032] [<ffffffff8a132e76>] __generic_file_aio_write+0x176/0x3c0
[ 108.267032] [<ffffffff8a1dd52a>] blkdev_aio_write+0x4a/0xe0
[ 108.267032] [<ffffffff8a1a2cea>] do_sync_write+0x5a/0x90
[ 108.267032] [<ffffffff8a1a33ea>] vfs_write+0xba/0x1e0
[ 108.267032] [<ffffffff8a1a3f9a>] SyS_pwrite64+0x7a/0xa0
[ 108.267032] [<ffffffff8a0e9346>] ? __audit_syscall_exit+0x236/0x2e0
[ 108.267032] [<ffffffff8a5d1189>] system_call_fastpath+0x16/0x1b
[ 108.267032] Code: 9e b1 8a 48 8d 04 0b 48 8b 38 4c 39 e7 75 cb 0f b7 40 08 66 44 39 e8 75 c1 48 c7 c0 32 a0 00 00 31 db 0f b7 0c 08 b8 05 00 00 00 <0f> 01 c1 0f 1f 40 00 5b 41 5c 41 5d 5d c3 90 66 66 66 66 90 48
[ 108.267032] RIP [<ffffffff8a0449b1>] kvm_unlock_kick+0x81/0x90
[ 108.267032] RSP <ffff8800b7ddfbb0>
[ 108.267032] CR2: ffffffff8a0449b1
[ 108.267032] ---[ end trace 65b19dfa9527b0dd ]---

 

1 Accepted Solution

Accepted Solutions

From the error this is a a kernel panic issue with the OS XRv 9000 node, which is causing it to fail to boot or get stuck in a boot loop. The kernel panic is happening due to an issue with the kernel paging request at the memory location ffffffff8a0449b1 and also shown by the "Oops: 0003 [#1] SMP" message in the log. This is a critical error that causes the CML system to halt, often due to invalid memory access or other issues.

I am no up to date on the image versions, and if this is fixed in a later version in CML or XRv image.

Please mark this as helpful or solution accepted to help others
Connect with me https://bigevilbeard.github.io

View solution in original post

2 Replies 2

From the error this is a a kernel panic issue with the OS XRv 9000 node, which is causing it to fail to boot or get stuck in a boot loop. The kernel panic is happening due to an issue with the kernel paging request at the memory location ffffffff8a0449b1 and also shown by the "Oops: 0003 [#1] SMP" message in the log. This is a critical error that causes the CML system to halt, often due to invalid memory access or other issues.

I am no up to date on the image versions, and if this is fixed in a later version in CML or XRv image.

Please mark this as helpful or solution accepted to help others
Connect with me https://bigevilbeard.github.io

Guilherme Lyra
Level 1
Level 1

Just a heads up, I managed to boot the XRv 9000 on DevNet CML by increasing the node CPUs to 8 (by default it uses 4 CPUs)