05-06-2022 07:41 AM
Hello everybody,
on our network we are using a pair of Cisco Nexus 9K (9300) in vPC mode; the switches are configured with some SVIs, VRRP enabled, and make some static routing.
These switches have been running for more than six year without a reboot, and starting from two months ago I noticed that the memory utilization in increasing, ranging around 90% of the total available memory.
I'm planning to reboot both switches (once authorized), and would like to know if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Apart from this, can someone confirm that I should reboot the "master" unit first and then the other one, as we do when installing a software upgrade?
Thank you in advance!
Solved! Go to Solution.
05-06-2022 08:08 AM
Hi,
I'm planning to reboot both switches (once authorized), and would like to know if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Before you reboot, gather all the logs for reference. Also, if you have a support contract on the switch, I recommend opening a ticket with TAC and having them investigate further before rebooting.
Apart from this, can someone confirm that I should reboot the "master" unit first and then the other one, as we do when installing a software upgrade?
If you are upgrading, I would reboot the master first, and then the secondary. This way, once you reboot the secondary, the old master will be the primary switch again.
HTH
05-06-2022 08:08 AM
Hi,
I'm planning to reboot both switches (once authorized), and would like to know if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Before you reboot, gather all the logs for reference. Also, if you have a support contract on the switch, I recommend opening a ticket with TAC and having them investigate further before rebooting.
Apart from this, can someone confirm that I should reboot the "master" unit first and then the other one, as we do when installing a software upgrade?
If you are upgrading, I would reboot the master first, and then the secondary. This way, once you reboot the secondary, the old master will be the primary switch again.
HTH
05-06-2022 09:11 AM
>.... if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Connect to device with https://cway.cisco.com/cli , at the top left run or press 'System Diagnostics'
M.
05-06-2022 12:21 PM
can you share this
NSK# show processes memory
05-06-2022 11:27 PM
Hi, this is the output of the "show processes memory". Firmware version is 7.0(3)I4(2).
PID MemAlloc MemLimit MemUsed StackBase/Ptr Process
----- -------- ---------- ---------- ----------------- ----------------
1 176128 0 4325376 7c0b4530/7c0b3b58 init
2 0 0 0 0/0 kthreadd
3 0 0 0 0/0 ksoftirqd/0
6 0 0 0 0/0 migration/0
7 0 0 0 0/0 watchdog/0
8 0 0 0 0/0 migration/1
10 0 0 0 0/0 ksoftirqd/1
12 0 0 0 0/0 watchdog/1
13 0 0 0 0/0 migration/2
15 0 0 0 0/0 ksoftirqd/2
16 0 0 0 0/0 watchdog/2
17 0 0 0 0/0 migration/3
19 0 0 0 0/0 ksoftirqd/3
20 0 0 0 0/0 watchdog/3
21 0 0 0 0/0 cpuset
22 0 0 0 0/0 khelper
23 0 0 0 0/0 kdevtmpfs
24 0 0 0 0/0 netns
25 0 0 0 0/0 sync_supers
26 0 0 0 0/0 bdi-default
27 0 0 0 0/0 kblockd
28 0 0 0 0/0 ata_sff
29 0 0 0 0/0 khubd
30 0 0 0 0/0 rpciod
39 0 0 0 0/0 kworker/u:1
51 0 0 0 0/0 khungtaskd
52 0 0 0 0/0 kswapd0
53 0 0 0 0/0 ksmd
54 0 0 0 0/0 fsnotify_mark
55 0 0 0 0/0 unionfs_siod
56 0 0 0 0/0 nfsiod
57 0 0 0 0/0 crypto
72 0 0 0 0/0 scsi_eh_0
73 0 0 0 0/0 scsi_eh_1
74 0 0 0 0/0 scsi_eh_2
75 0 0 0 0/0 scsi_eh_3
76 0 0 0 0/0 scsi_eh_4
78 0 0 0 0/0 kworker/u:3
79 0 0 0 0/0 cnic_wq
80 0 0 0 0/0 bnx2x
81 0 0 0 0/0 edac-poller
82 0 0 0 0/0 deferwq
138 0 0 0 0/0 loop0
646 0 0 0 0/0 loop1
648 0 0 0 0/0 loop2
2094 0 0 0 0/0 kworker/0:2
2255 0 0 0 0/0 jbd2/sda4-8
2256 0 0 0 0/0 ext4-dio-unwrit
2597 0 0 0 0/0 jbd2/sda5-8
2598 0 0 0 0/0 ext4-dio-unwrit
2603 0 0 0 0/0 jbd2/sda6-8
2604 0 0 0 0/0 ext4-dio-unwrit
2682 0 0 0 0/0 flush-8:0
4932 0 0 0 0/0 loop3
4968 0 0 0 0/0 jbd2/sda3-8
4969 0 0 0 0/0 ext4-dio-unwrit
4978 0 0 0 0/0 jbd2/sda2-8
4979 0 0 0 0/0 ext4-dio-unwrit
4992 0 0 0 0/0 jbd2/sda7-8
4993 0 0 0 0/0 ext4-dio-unwrit
5016 184320 0 8638464 7d7611e0/7d761000 portmap
5029 0 0 0 0/0 lockd
5030 0 0 0 0/0 nfsd
5032 323584 0 15368192 875e35d0/875e3288 rpc.mountd
5034 208896 0 15093760 ddbacbb0/ddbac948 rpc.statd
5214 0 0 0 0/0 kworker/3:0
5332 0 0 0 0/0 ppm
5400 0 0 0 0/0 loop4
5401 0 0 0 0/0 ext4-dio-unwrit
5647 167936 0 2269184 ffb50c00/ffb50ab0 mcelog
5898 94208 0 3166208 ffb5c1d0/ffb5b958 sh
5899 14209024 0 446779392 ffdfabb0/ffdfa910 sysmgr
5900 13127680 0 420933632 ffcd6780/ffcd64e0 sysmgr
5914 14118912 0 20221952 ffdaa9b0/ffdaa3b0 libvirtd
5920 0 0 0 0/0 kworker/3:1
6118 0 0 0 0/0 ppm
6258 0 0 0 0/0 mping-thread
6259 0 0 0 0/0 mping-thread
6291 0 0 0 0/0 cctrl_kthread
6339 0 0 0 0/0 redun_kthread
6360 0 0 0 0/0 usd_mts_kthread
6365 0 0 0 0/0 ls-notify-mts-t
6582 389120 735279001 2994176 ff8ed900/ff8ed750 xinetd
6583 434176 317279001 2740224 ffe96130/ffe95f40 tftpd
6584 2306048 640189510 336601088 ffa99bf0/ffa98f20 sdwrapd
6586 503808 0 280035328 ffdd3710/ffdd333c dme_proxy
6587 51171328 0 496615424 ff9abaa0/ff9aa9c0 platform
6590 167215104 0 758538240 ffe00820/ffe0075c event_manager
6591 141025280 0 650620928 ffaac7f0/ffaac72c policyelem
6595 454656 0 280031232 ffee6240/ffee6070 sdwrapd
6597 2154496 0 285880320 ffdabdd0/ffdaada0 pfmclnt
6628 10780672 0 420995072 ffee1cd0/ffee1b40 dme_asap_backend
6629 20910080 812320115 455356416 ffc54430/ffc4ff0c syslogd
6631 12181504 972782131 445616128 ffab2250/ffab1b78 vshd
6632 7892992 942540684 345169920 ffdcfc70/ffdcf6b0 smm
6633 2031616 667833689 293507072 ff90c240/ff90bf10 psshelper
6634 53411840 924999680 394579968 ffa2dad0/ffa2cec0 pixm_vl
6635 53796864 1045957651 490364928 ff9d5880/ff9d4c70 pixm_gl
6636 1007616 0 10735616 ff87e390/ff87e0e0 nxapi
6637 200749056 0 707039232 fff9daf0/fff9d860 nginx
6638 3170304 837235520 400834560 ff940640/ff93da8c mmode
6639 286720 0 3432448 ff9a5540/ff9a41b0 lmgrd
6640 1388544 0 337625088 fff284d0/fff27c50 fs-daemon
6641 14647296 0 448626688 ffdbadd0/ffdb9c8c feature-mgr
6642 1249280 0 335589376 fff1bc90/fff1ba50 confcheck
6643 3166208 623743577 337465344 fffdaeb0/fffda2d0 capability
6644 5251072 665594899 342671360 ffe0fa40/ffe0e21c bloggerd
6645 1773568 667675993 291012608 ff814650/ff814320 psshelper_gsvc
6653 33542144 1436260531 558047232 ffe7a550/ffe799dc clis
6654 1589248 666127756 339013632 ffb0a540/ffb0a078 licmgr
6663 307200 0 3461120 ff8c0b50/ff8c0990 cisco
6675 991232 0 336314368 ffa27250/ffa27080 xmlma
6676 1916928 636699008 337272832 ffa5a410/ffa591d0 vmm
6677 4780032 700537516 352092160 ffdf3810/ffdf3308 vman
6678 13340672 866243276 445472768 ffb5e7b0/ffb5dc60 vdc_mgr
6679 9949184 0 289624064 ffcf76a0/ffcf546c usbhsd
6683 1835008 680295193 339578880 ffaf6cd0/ffaf65a0 ttyd
6684 946176 604999084 280567808 ff8f3d20/ff8f3888 sysinfo
6685 1921024 669840371 338915328 ffe857f0/ffe847b0 snmpmib_proc
6686 1228800 639148716 338038784 ffdfd2e0/ffdfcc40 sksd
6688 1720320 622414425 338542592 ffc50070/ffc4fb9c res_mgr
6689 1318912 427279001 332603392 ffbf5b30/ffbf4cec pyproxy
6690 3620864 1987316531 292134912 ffbc5fc0/ffbc4f8c plugin
6691 548864 700675008 318537728 ffc48d30/ffc48890 plog_sup
6692 892928 1484169420 280805376 ffd25860/ffd252a8 patch-installer
6693 10022912 4294967295 759083008 ffaa9750/ffaa91a0 nbproxy
6694 1683456 783802272 336756736 ffd6d810/ffd6c590 mvsh
6695 512000 0 280899584 ffa0b380/ffa0af30 mping_server
6696 2772992 685810048 343531520 ffa6cc10/ffa6b880 module
6697 13205504 805347065 355770368 fffb0e50/fffb08d0 kim
6698 1941504 626239680 337960960 ffc79710/ffc786e0 evms
6700 9408512 775625804 347131904 fffcad10/fffcaac8 epld_upgrade_stdby
6701 4415488 668934745 339902464 ff8288f0/ff827880 diagmgr
6702 4177920 1198766784 752046080 ff91f700/ff91db7c dhclient
6703 12541952 1054536665 351719424 ffd70df0/ffd70550 crdcfg_server
6704 1003520 0 337055744 ff83d1f0/ff83cb90 core-dmon
6705 119828480 0 659779584 ff92c800/ff92c73c confelem
6706 2007040 669069913 339845120 ff855390/ff854360 clk_mgr
6707 638976 460279001 287543296 ffa4f1a0/ffa4e6c8 bios_daemon
6708 12419072 1476094963 533876736 ff9d4fb0/ff9d43a8 ascii-cfg
6709 13058048 0 446046208 ff931de0/ff930b18 securityd
6710 2297856 0 342016000 ff91d3b0/ff91b5fc cert_enroll
6711 12890112 0 446472192 ffc9c440/ffc9afe0 aaa
6715 1396736 714083673 340463616 ffa180e0/ffa17798 obfl
6716 21348352 1645618585 462168064 ffb237f0/ffb22400 aclmgr
6722 18247680 1790424268 650776576 ff835410/ff835030 urib
6724 1859584 888500518 337047552 ff8b9f40/ff8b8f10 evmc
6727 5943296 669538496 342913024 fffd6320/fffd4fd0 diagclient
6737 5001216 0 340529152 fff8d700/fff8c6a0 xbar
6738 17399808 768682675 354586624 ffc82a40/ffc81fcc device_test
6742 1544192 715124467 340545536 ffb5f4b0/ffb5e5a0 ExceptionLog
6743 11509760 807881254 445349888 ffa1a240/ffa19af0 bootvar
6744 1114112 0 334548992 ffedf490/ffedf030 cardclient
6745 34770944 802365414 473804800 ffc172d0/ffc16290 ifmgr
6746 25300992 1076438105 469577728 ffdba390/ffdb9f60 l3vm
6765 13570048 685384691 353710080 ffe7a310/ffe79e70 statsclient
6800 266240 0 12836864 78f33750/78f2b2d0 incrond
6851 8413184 949506342 354549760 ffeb5360/ffeb4dd0 npacl
6870 33746944 1224169356 770908160 ffcf4e80/ffcf4930 adjmgr
6871 21569536 1205192588 749711360 ffb571f0/ffb570b0 u6rib
6879 75345920 1415698188 726204416 ffafc230/ffafc0e0 arp
6881 24104960 1301720563 884776960 ff84d040/ff84cad0 icmpv6
6882 24584192 978306137 416235520 ffc06b60/ffc06a30 pktmgr
6898 54153216 1574836096 944336896 ffcac710/ffcac150 netstack
6922 13422592 1176875974 723795968 fff21940/fff1c89c radius
6924 13369344 1154296089 729210880 fff1f3e0/fff1e20c cdp
6925 5947392 997301888 343101440 ff9efe00/ff9ee28c cfs
6926 507904 625317862 280035328 ffb71310/ffb710d8 ip_dummy
6927 507904 625317862 280035328 ff831940/ff831708 ipv6_dummy
6928 4218880 1198536998 751808512 ffede610/ffedca9c otm
6929 37040128 1154853939 786464768 ffe570c0/ffe51f0c snmpd
6930 507904 625317862 280035328 ff81fcf0/ff81fab8 tcpudp_dummy
6940 2019328 735279001 332345344 ffeacc00/ffea8e0c dcos-xinetd
6975 2150400 771340979 338505728 ffb5d590/ffb5d28c callhome
7498 0 0 0 0/0 plugin
7551 56455168 1624293580 470491136 ffbd6a40/ffbd4e5c port-profile
7561 27406336 1310221811 883138560 ff88b600/ff88b4e0 rpm
7563 4292608 708863865 353136640 ffc422e0/ffc4073c pltfm_config
7564 5931008 948154662 350060544 ffea0820/ffe9f800 plcmgr
7566 14667776 1061362368 448364544 ffd3e4b0/ffd3c92c pfstat
7568 13434880 1320674803 856924160 ffc82c30/ffc7e95c ntp
7569 4603904 1046202240 621010944 ffa90920/ffa8edac monitor
7570 14360576 1208111398 775880704 ff9154f0/ff9153d0 m6rib
7571 11730944 770674995 353128448 ff8d3c80/ff8d20dc lim
7572 36610048 985861324 405618688 ffb65fd0/ffb65a80 l2rib
7573 77631488 1117101568 435101696 ffd52320/ffd514bc ipfib
7574 40394752 1541774988 823087104 ffa94c80/ffa94740 igmp
7575 28921856 808713804 464568320 ff9f61e0/ff9f441c eth_port_channel
7576 3035136 897520588 347672576 ffd14120/ffd125ac adbm
7577 2297856 942482112 338509824 ffc180c0/ffc170a0 acllog
7592 15802368 712472851 355172352 ffbf9210/ffbf766c eltm
7596 21975040 854437619 467484672 ffb4e5a0/ffb4cd1c vlan_mgr
7610 1511424 0 333488128 ffd1bdb0/ffd17fdc ntpd
7612 1531904 697346214 336756736 ffb348f0/ffb34510 eth_dstats
7613 24256512 991083878 468684800 ffc1b2d0/ffc19c30 ipqosmgr
7614 15085568 1062831193 451735552 ffb810f0/ffb800c0 lacp
7622 35782656 906373318 476381184 ffa24b80/ffa22dac ethpm
7623 44072960 1388452384 500416512 ffe7c290/ffe7a4ec l2fm
7624 33292288 688687296 378732544 ff9f0400/ff9ee66c aclqos
7625 19730432 1340354329 461086720 ffea4830/ffea3d10 stp
7626 4366336 678792998 341712896 ff90f1f0/ff90e1a0 stripcl
7642 16130048 989745715 456417280 ffa38a60/ffa377b0 copp
7652 4702208 1264318899 634081280 ffeb1ef0/ffead4fc vpc
7653 2072576 678815526 338681856 ffe86f40/ffe85f10 u2
7654 4923392 1331122636 344227840 ffdf07b0/ffdeebbc spm
7655 2371584 1202677644 722636800 ffe58700/ffe56b8c sal
7656 18264064 1486729395 719826944 fff3ede0/fff3ecc0 mrib
7657 16396288 1075082201 365699072 ffbe9c60/ffbe809c mfdm
7658 2367488 680149184 342294528 ffafeb90/ffafdb10 mcm
7659 2572288 943009267 342224896 ffce61e0/ffce464c l2pt
7660 19718144 818388313 458969088 ffc73470/ffc7188c interface-vlan
7668 21983232 962635660 415424512 ffc7dc60/ffc7c0ec ufdm
7669 8130560 1078978700 345919488 ffb1a360/ffb196b0 m2rib
7672 11886592 1173634406 705236992 ffd32390/ffd322b0 mcastfwd
7697 0 0 0 0/0 wdpunch_thread
7742 0 0 0 0/0 bkncmd
7743 0 0 0 0/0 bknevt
7783 3575808 0 287432704 ffa940f0/ffa9341c bloggerd
7863 1732608 0 286425088 ffff64a0/ffff6170 psshelper
7864 1732608 0 286425088 ff840850/ff840520 psshelper
7865 71057408 0 469352448 ffef8c60/ffef87c0 plog_lc
7866 843776 0 280580096 fff4fc70/fff4f6e8 patch_installer
7867 466944 0 280535040 ff8ec980/ff8ec038 obfl_lc
7868 1183744 0 284327936 ffbe1af0/ffbe0870 mvsh
7869 1253376 0 284467200 ffe03660/ffe02630 evmc
7870 80203776 0 482349056 ff820ca0/ff82052c dt_helper
7871 3031040 0 348794880 ffca7d90/ffca6a40 diagclient
7872 17182720 0 360239104 ffca8350/ffca7ac0 crdcfg_server
7873 499712 0 280338432 ffa34260/ffa33ff8 capability
7884 76320768 0 478597120 ffda3ed0/ffda346c device_test
7919 811008 0 345006080 ffcafbf0/ffcaf710 crdclient
7921 200384512 0 634863616 ffd6ac20/ffd6a78c t2usd
7923 21573632 0 743190528 ffa62e00/ffa62160 nsusd
8131 72155136 0 474206208 ffd3fb90/ffd3f6d0 dc3_sensor
8141 91025408 0 499462144 ff9c1380/ff9c0e50 bfdc
8142 143904768 0 556765184 ffa193b0/ffa1833c iftmc
8143 115154944 0 525111296 ffcb4bc0/ffcb3fbc pixc
8144 78970880 0 482656256 ffba7d20/ffba7120 port_client
8145 83636224 0 489664512 ff8b66d0/ff8b6230 stats_client
8146 81498112 0 490598400 ffe64520/ffe634ec vntagc
8147 113397760 0 524271616 ff80d3f0/ff80c8cc mtm
8153 175144960 0 597024768 ff8f8160/ff8f72fc ipfib
8156 118140928 0 537980928 ffea77d0/ffea658c aclqos
8157 81772544 0 505847808 ffeb2440/ffeb140c ptplc
8162 82186240 0 492183552 ff96d1a0/ff96c16c monc
8163 92790784 0 496664576 ffb8c260/ffb8b1cc xbar_client
8235 0 0 0 0/0 ppm
8935 0 0 0 0/0 kworker/0:1
8963 184320 0 4325376 7cc43360/7cc43248 klogd
9835 0 0 0 0/0 ppm
9967 0 0 0 0/0 ppm
12079 0 0 0 0/0 ppm
12143 0 0 0 0/0 ppm
12160 5550080 1210387545 767406080 fff523b0/fff4d74c hsrp_engine
12539 0 0 0 0/0 kworker/2:2
13039 188416 0 2572288 ff8fdf30/ff8fb888 getty
13364 0 0 0 0/0 ppm
13381 0 0 0 0/0 ppm
18573 153915392 758766694 496168960 ffe79940/ffe78b30 diag_port_lb
22872 1777664 0 335474688 ff9988f0/ff9941ac dcos_sshd
22877 13152256 0 495513600 ffa5b610/ffa52f58 vsh
22930 184320 0 6586368 979b4a90/979b48c8 more
22931 13287424 0 495706112 ffa5b610/ffa52838 vsh
22932 0 0 0 f735960/f735658 ps
26436 0 0 0 0/0 kworker/2:0
27712 0 0 0 0/0 kworker/1:0
29466 0 0 0 0/0 kworker/1:2
31800 0 0 0 0/0 ppm
All processes: MemAlloc = 4278689792
05-27-2022 09:17 AM
As the Cisco TAC confirmed, there were some bugs causing memory leak. Rebooting the switches solved and now we are upgrading to a fixed version.
05-27-2022 09:33 AM
Yeah, memory leaks are notorious for crashing the switch, and if you don't reboot in a planned maintenance window, it will get rebooted at a very bad time.
Good Luck with the upgrade!
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide