05-06-2022 07:41 AM
Hello everybody,
on our network we are using a pair of Cisco Nexus 9K (9300) in vPC mode; the switches are configured with some SVIs, VRRP enabled, and make some static routing.
These switches have been running for more than six year without a reboot, and starting from two months ago I noticed that the memory utilization in increasing, ranging around 90% of the total available memory.
I'm planning to reboot both switches (once authorized), and would like to know if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Apart from this, can someone confirm that I should reboot the "master" unit first and then the other one, as we do when installing a software upgrade?
Thank you in advance!
Solved! Go to Solution.
05-06-2022 08:08 AM
Hi,
I'm planning to reboot both switches (once authorized), and would like to know if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Before you reboot, gather all the logs for reference. Also, if you have a support contract on the switch, I recommend opening a ticket with TAC and having them investigate further before rebooting.
Apart from this, can someone confirm that I should reboot the "master" unit first and then the other one, as we do when installing a software upgrade?
If you are upgrading, I would reboot the master first, and then the secondary. This way, once you reboot the secondary, the old master will be the primary switch again.
HTH
05-06-2022 08:08 AM
Hi,
I'm planning to reboot both switches (once authorized), and would like to know if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Before you reboot, gather all the logs for reference. Also, if you have a support contract on the switch, I recommend opening a ticket with TAC and having them investigate further before rebooting.
Apart from this, can someone confirm that I should reboot the "master" unit first and then the other one, as we do when installing a software upgrade?
If you are upgrading, I would reboot the master first, and then the secondary. This way, once you reboot the secondary, the old master will be the primary switch again.
HTH
05-06-2022 09:11 AM
>.... if there is any check I can do before the reboot to gather some information about why the ram started to fill.
Connect to device with https://cway.cisco.com/cli , at the top left run or press 'System Diagnostics'
M.
05-06-2022 12:21 PM
can you share this
NSK# show processes memory
05-06-2022 11:27 PM
Hi, this is the output of the "show processes memory". Firmware version is 7.0(3)I4(2).
PID MemAlloc MemLimit MemUsed StackBase/Ptr Process ----- -------- ---------- ---------- ----------------- ---------------- 1 176128 0 4325376 7c0b4530/7c0b3b58 init 2 0 0 0 0/0 kthreadd 3 0 0 0 0/0 ksoftirqd/0 6 0 0 0 0/0 migration/0 7 0 0 0 0/0 watchdog/0 8 0 0 0 0/0 migration/1 10 0 0 0 0/0 ksoftirqd/1 12 0 0 0 0/0 watchdog/1 13 0 0 0 0/0 migration/2 15 0 0 0 0/0 ksoftirqd/2 16 0 0 0 0/0 watchdog/2 17 0 0 0 0/0 migration/3 19 0 0 0 0/0 ksoftirqd/3 20 0 0 0 0/0 watchdog/3 21 0 0 0 0/0 cpuset 22 0 0 0 0/0 khelper 23 0 0 0 0/0 kdevtmpfs 24 0 0 0 0/0 netns 25 0 0 0 0/0 sync_supers 26 0 0 0 0/0 bdi-default 27 0 0 0 0/0 kblockd 28 0 0 0 0/0 ata_sff 29 0 0 0 0/0 khubd 30 0 0 0 0/0 rpciod 39 0 0 0 0/0 kworker/u:1 51 0 0 0 0/0 khungtaskd 52 0 0 0 0/0 kswapd0 53 0 0 0 0/0 ksmd 54 0 0 0 0/0 fsnotify_mark 55 0 0 0 0/0 unionfs_siod 56 0 0 0 0/0 nfsiod 57 0 0 0 0/0 crypto 72 0 0 0 0/0 scsi_eh_0 73 0 0 0 0/0 scsi_eh_1 74 0 0 0 0/0 scsi_eh_2 75 0 0 0 0/0 scsi_eh_3 76 0 0 0 0/0 scsi_eh_4 78 0 0 0 0/0 kworker/u:3 79 0 0 0 0/0 cnic_wq 80 0 0 0 0/0 bnx2x 81 0 0 0 0/0 edac-poller 82 0 0 0 0/0 deferwq 138 0 0 0 0/0 loop0 646 0 0 0 0/0 loop1 648 0 0 0 0/0 loop2 2094 0 0 0 0/0 kworker/0:2 2255 0 0 0 0/0 jbd2/sda4-8 2256 0 0 0 0/0 ext4-dio-unwrit 2597 0 0 0 0/0 jbd2/sda5-8 2598 0 0 0 0/0 ext4-dio-unwrit 2603 0 0 0 0/0 jbd2/sda6-8 2604 0 0 0 0/0 ext4-dio-unwrit 2682 0 0 0 0/0 flush-8:0 4932 0 0 0 0/0 loop3 4968 0 0 0 0/0 jbd2/sda3-8 4969 0 0 0 0/0 ext4-dio-unwrit 4978 0 0 0 0/0 jbd2/sda2-8 4979 0 0 0 0/0 ext4-dio-unwrit 4992 0 0 0 0/0 jbd2/sda7-8 4993 0 0 0 0/0 ext4-dio-unwrit 5016 184320 0 8638464 7d7611e0/7d761000 portmap 5029 0 0 0 0/0 lockd 5030 0 0 0 0/0 nfsd 5032 323584 0 15368192 875e35d0/875e3288 rpc.mountd 5034 208896 0 15093760 ddbacbb0/ddbac948 rpc.statd 5214 0 0 0 0/0 kworker/3:0 5332 0 0 0 0/0 ppm 5400 0 0 0 0/0 loop4 5401 0 0 0 0/0 ext4-dio-unwrit 5647 167936 0 2269184 ffb50c00/ffb50ab0 mcelog 5898 94208 0 3166208 ffb5c1d0/ffb5b958 sh 5899 14209024 0 446779392 ffdfabb0/ffdfa910 sysmgr 5900 13127680 0 420933632 ffcd6780/ffcd64e0 sysmgr 5914 14118912 0 20221952 ffdaa9b0/ffdaa3b0 libvirtd 5920 0 0 0 0/0 kworker/3:1 6118 0 0 0 0/0 ppm 6258 0 0 0 0/0 mping-thread 6259 0 0 0 0/0 mping-thread 6291 0 0 0 0/0 cctrl_kthread 6339 0 0 0 0/0 redun_kthread 6360 0 0 0 0/0 usd_mts_kthread 6365 0 0 0 0/0 ls-notify-mts-t 6582 389120 735279001 2994176 ff8ed900/ff8ed750 xinetd 6583 434176 317279001 2740224 ffe96130/ffe95f40 tftpd 6584 2306048 640189510 336601088 ffa99bf0/ffa98f20 sdwrapd 6586 503808 0 280035328 ffdd3710/ffdd333c dme_proxy 6587 51171328 0 496615424 ff9abaa0/ff9aa9c0 platform 6590 167215104 0 758538240 ffe00820/ffe0075c event_manager 6591 141025280 0 650620928 ffaac7f0/ffaac72c policyelem 6595 454656 0 280031232 ffee6240/ffee6070 sdwrapd 6597 2154496 0 285880320 ffdabdd0/ffdaada0 pfmclnt 6628 10780672 0 420995072 ffee1cd0/ffee1b40 dme_asap_backend 6629 20910080 812320115 455356416 ffc54430/ffc4ff0c syslogd 6631 12181504 972782131 445616128 ffab2250/ffab1b78 vshd 6632 7892992 942540684 345169920 ffdcfc70/ffdcf6b0 smm 6633 2031616 667833689 293507072 ff90c240/ff90bf10 psshelper 6634 53411840 924999680 394579968 ffa2dad0/ffa2cec0 pixm_vl 6635 53796864 1045957651 490364928 ff9d5880/ff9d4c70 pixm_gl 6636 1007616 0 10735616 ff87e390/ff87e0e0 nxapi 6637 200749056 0 707039232 fff9daf0/fff9d860 nginx 6638 3170304 837235520 400834560 ff940640/ff93da8c mmode 6639 286720 0 3432448 ff9a5540/ff9a41b0 lmgrd 6640 1388544 0 337625088 fff284d0/fff27c50 fs-daemon 6641 14647296 0 448626688 ffdbadd0/ffdb9c8c feature-mgr 6642 1249280 0 335589376 fff1bc90/fff1ba50 confcheck 6643 3166208 623743577 337465344 fffdaeb0/fffda2d0 capability 6644 5251072 665594899 342671360 ffe0fa40/ffe0e21c bloggerd 6645 1773568 667675993 291012608 ff814650/ff814320 psshelper_gsvc 6653 33542144 1436260531 558047232 ffe7a550/ffe799dc clis 6654 1589248 666127756 339013632 ffb0a540/ffb0a078 licmgr 6663 307200 0 3461120 ff8c0b50/ff8c0990 cisco 6675 991232 0 336314368 ffa27250/ffa27080 xmlma 6676 1916928 636699008 337272832 ffa5a410/ffa591d0 vmm 6677 4780032 700537516 352092160 ffdf3810/ffdf3308 vman 6678 13340672 866243276 445472768 ffb5e7b0/ffb5dc60 vdc_mgr 6679 9949184 0 289624064 ffcf76a0/ffcf546c usbhsd 6683 1835008 680295193 339578880 ffaf6cd0/ffaf65a0 ttyd 6684 946176 604999084 280567808 ff8f3d20/ff8f3888 sysinfo 6685 1921024 669840371 338915328 ffe857f0/ffe847b0 snmpmib_proc 6686 1228800 639148716 338038784 ffdfd2e0/ffdfcc40 sksd 6688 1720320 622414425 338542592 ffc50070/ffc4fb9c res_mgr 6689 1318912 427279001 332603392 ffbf5b30/ffbf4cec pyproxy 6690 3620864 1987316531 292134912 ffbc5fc0/ffbc4f8c plugin 6691 548864 700675008 318537728 ffc48d30/ffc48890 plog_sup 6692 892928 1484169420 280805376 ffd25860/ffd252a8 patch-installer 6693 10022912 4294967295 759083008 ffaa9750/ffaa91a0 nbproxy 6694 1683456 783802272 336756736 ffd6d810/ffd6c590 mvsh 6695 512000 0 280899584 ffa0b380/ffa0af30 mping_server 6696 2772992 685810048 343531520 ffa6cc10/ffa6b880 module 6697 13205504 805347065 355770368 fffb0e50/fffb08d0 kim 6698 1941504 626239680 337960960 ffc79710/ffc786e0 evms 6700 9408512 775625804 347131904 fffcad10/fffcaac8 epld_upgrade_stdby 6701 4415488 668934745 339902464 ff8288f0/ff827880 diagmgr 6702 4177920 1198766784 752046080 ff91f700/ff91db7c dhclient 6703 12541952 1054536665 351719424 ffd70df0/ffd70550 crdcfg_server 6704 1003520 0 337055744 ff83d1f0/ff83cb90 core-dmon 6705 119828480 0 659779584 ff92c800/ff92c73c confelem 6706 2007040 669069913 339845120 ff855390/ff854360 clk_mgr 6707 638976 460279001 287543296 ffa4f1a0/ffa4e6c8 bios_daemon 6708 12419072 1476094963 533876736 ff9d4fb0/ff9d43a8 ascii-cfg 6709 13058048 0 446046208 ff931de0/ff930b18 securityd 6710 2297856 0 342016000 ff91d3b0/ff91b5fc cert_enroll 6711 12890112 0 446472192 ffc9c440/ffc9afe0 aaa 6715 1396736 714083673 340463616 ffa180e0/ffa17798 obfl 6716 21348352 1645618585 462168064 ffb237f0/ffb22400 aclmgr 6722 18247680 1790424268 650776576 ff835410/ff835030 urib 6724 1859584 888500518 337047552 ff8b9f40/ff8b8f10 evmc 6727 5943296 669538496 342913024 fffd6320/fffd4fd0 diagclient 6737 5001216 0 340529152 fff8d700/fff8c6a0 xbar 6738 17399808 768682675 354586624 ffc82a40/ffc81fcc device_test 6742 1544192 715124467 340545536 ffb5f4b0/ffb5e5a0 ExceptionLog 6743 11509760 807881254 445349888 ffa1a240/ffa19af0 bootvar 6744 1114112 0 334548992 ffedf490/ffedf030 cardclient 6745 34770944 802365414 473804800 ffc172d0/ffc16290 ifmgr 6746 25300992 1076438105 469577728 ffdba390/ffdb9f60 l3vm 6765 13570048 685384691 353710080 ffe7a310/ffe79e70 statsclient 6800 266240 0 12836864 78f33750/78f2b2d0 incrond 6851 8413184 949506342 354549760 ffeb5360/ffeb4dd0 npacl 6870 33746944 1224169356 770908160 ffcf4e80/ffcf4930 adjmgr 6871 21569536 1205192588 749711360 ffb571f0/ffb570b0 u6rib 6879 75345920 1415698188 726204416 ffafc230/ffafc0e0 arp 6881 24104960 1301720563 884776960 ff84d040/ff84cad0 icmpv6 6882 24584192 978306137 416235520 ffc06b60/ffc06a30 pktmgr 6898 54153216 1574836096 944336896 ffcac710/ffcac150 netstack 6922 13422592 1176875974 723795968 fff21940/fff1c89c radius 6924 13369344 1154296089 729210880 fff1f3e0/fff1e20c cdp 6925 5947392 997301888 343101440 ff9efe00/ff9ee28c cfs 6926 507904 625317862 280035328 ffb71310/ffb710d8 ip_dummy 6927 507904 625317862 280035328 ff831940/ff831708 ipv6_dummy 6928 4218880 1198536998 751808512 ffede610/ffedca9c otm 6929 37040128 1154853939 786464768 ffe570c0/ffe51f0c snmpd 6930 507904 625317862 280035328 ff81fcf0/ff81fab8 tcpudp_dummy 6940 2019328 735279001 332345344 ffeacc00/ffea8e0c dcos-xinetd 6975 2150400 771340979 338505728 ffb5d590/ffb5d28c callhome 7498 0 0 0 0/0 plugin 7551 56455168 1624293580 470491136 ffbd6a40/ffbd4e5c port-profile 7561 27406336 1310221811 883138560 ff88b600/ff88b4e0 rpm 7563 4292608 708863865 353136640 ffc422e0/ffc4073c pltfm_config 7564 5931008 948154662 350060544 ffea0820/ffe9f800 plcmgr 7566 14667776 1061362368 448364544 ffd3e4b0/ffd3c92c pfstat 7568 13434880 1320674803 856924160 ffc82c30/ffc7e95c ntp 7569 4603904 1046202240 621010944 ffa90920/ffa8edac monitor 7570 14360576 1208111398 775880704 ff9154f0/ff9153d0 m6rib 7571 11730944 770674995 353128448 ff8d3c80/ff8d20dc lim 7572 36610048 985861324 405618688 ffb65fd0/ffb65a80 l2rib 7573 77631488 1117101568 435101696 ffd52320/ffd514bc ipfib 7574 40394752 1541774988 823087104 ffa94c80/ffa94740 igmp 7575 28921856 808713804 464568320 ff9f61e0/ff9f441c eth_port_channel 7576 3035136 897520588 347672576 ffd14120/ffd125ac adbm 7577 2297856 942482112 338509824 ffc180c0/ffc170a0 acllog 7592 15802368 712472851 355172352 ffbf9210/ffbf766c eltm 7596 21975040 854437619 467484672 ffb4e5a0/ffb4cd1c vlan_mgr 7610 1511424 0 333488128 ffd1bdb0/ffd17fdc ntpd 7612 1531904 697346214 336756736 ffb348f0/ffb34510 eth_dstats 7613 24256512 991083878 468684800 ffc1b2d0/ffc19c30 ipqosmgr 7614 15085568 1062831193 451735552 ffb810f0/ffb800c0 lacp 7622 35782656 906373318 476381184 ffa24b80/ffa22dac ethpm 7623 44072960 1388452384 500416512 ffe7c290/ffe7a4ec l2fm 7624 33292288 688687296 378732544 ff9f0400/ff9ee66c aclqos 7625 19730432 1340354329 461086720 ffea4830/ffea3d10 stp 7626 4366336 678792998 341712896 ff90f1f0/ff90e1a0 stripcl 7642 16130048 989745715 456417280 ffa38a60/ffa377b0 copp 7652 4702208 1264318899 634081280 ffeb1ef0/ffead4fc vpc 7653 2072576 678815526 338681856 ffe86f40/ffe85f10 u2 7654 4923392 1331122636 344227840 ffdf07b0/ffdeebbc spm 7655 2371584 1202677644 722636800 ffe58700/ffe56b8c sal 7656 18264064 1486729395 719826944 fff3ede0/fff3ecc0 mrib 7657 16396288 1075082201 365699072 ffbe9c60/ffbe809c mfdm 7658 2367488 680149184 342294528 ffafeb90/ffafdb10 mcm 7659 2572288 943009267 342224896 ffce61e0/ffce464c l2pt 7660 19718144 818388313 458969088 ffc73470/ffc7188c interface-vlan 7668 21983232 962635660 415424512 ffc7dc60/ffc7c0ec ufdm 7669 8130560 1078978700 345919488 ffb1a360/ffb196b0 m2rib 7672 11886592 1173634406 705236992 ffd32390/ffd322b0 mcastfwd 7697 0 0 0 0/0 wdpunch_thread 7742 0 0 0 0/0 bkncmd 7743 0 0 0 0/0 bknevt 7783 3575808 0 287432704 ffa940f0/ffa9341c bloggerd 7863 1732608 0 286425088 ffff64a0/ffff6170 psshelper 7864 1732608 0 286425088 ff840850/ff840520 psshelper 7865 71057408 0 469352448 ffef8c60/ffef87c0 plog_lc 7866 843776 0 280580096 fff4fc70/fff4f6e8 patch_installer 7867 466944 0 280535040 ff8ec980/ff8ec038 obfl_lc 7868 1183744 0 284327936 ffbe1af0/ffbe0870 mvsh 7869 1253376 0 284467200 ffe03660/ffe02630 evmc 7870 80203776 0 482349056 ff820ca0/ff82052c dt_helper 7871 3031040 0 348794880 ffca7d90/ffca6a40 diagclient 7872 17182720 0 360239104 ffca8350/ffca7ac0 crdcfg_server 7873 499712 0 280338432 ffa34260/ffa33ff8 capability 7884 76320768 0 478597120 ffda3ed0/ffda346c device_test 7919 811008 0 345006080 ffcafbf0/ffcaf710 crdclient 7921 200384512 0 634863616 ffd6ac20/ffd6a78c t2usd 7923 21573632 0 743190528 ffa62e00/ffa62160 nsusd 8131 72155136 0 474206208 ffd3fb90/ffd3f6d0 dc3_sensor 8141 91025408 0 499462144 ff9c1380/ff9c0e50 bfdc 8142 143904768 0 556765184 ffa193b0/ffa1833c iftmc 8143 115154944 0 525111296 ffcb4bc0/ffcb3fbc pixc 8144 78970880 0 482656256 ffba7d20/ffba7120 port_client 8145 83636224 0 489664512 ff8b66d0/ff8b6230 stats_client 8146 81498112 0 490598400 ffe64520/ffe634ec vntagc 8147 113397760 0 524271616 ff80d3f0/ff80c8cc mtm 8153 175144960 0 597024768 ff8f8160/ff8f72fc ipfib 8156 118140928 0 537980928 ffea77d0/ffea658c aclqos 8157 81772544 0 505847808 ffeb2440/ffeb140c ptplc 8162 82186240 0 492183552 ff96d1a0/ff96c16c monc 8163 92790784 0 496664576 ffb8c260/ffb8b1cc xbar_client 8235 0 0 0 0/0 ppm 8935 0 0 0 0/0 kworker/0:1 8963 184320 0 4325376 7cc43360/7cc43248 klogd 9835 0 0 0 0/0 ppm 9967 0 0 0 0/0 ppm 12079 0 0 0 0/0 ppm 12143 0 0 0 0/0 ppm 12160 5550080 1210387545 767406080 fff523b0/fff4d74c hsrp_engine 12539 0 0 0 0/0 kworker/2:2 13039 188416 0 2572288 ff8fdf30/ff8fb888 getty 13364 0 0 0 0/0 ppm 13381 0 0 0 0/0 ppm 18573 153915392 758766694 496168960 ffe79940/ffe78b30 diag_port_lb 22872 1777664 0 335474688 ff9988f0/ff9941ac dcos_sshd 22877 13152256 0 495513600 ffa5b610/ffa52f58 vsh 22930 184320 0 6586368 979b4a90/979b48c8 more 22931 13287424 0 495706112 ffa5b610/ffa52838 vsh 22932 0 0 0 f735960/f735658 ps 26436 0 0 0 0/0 kworker/2:0 27712 0 0 0 0/0 kworker/1:0 29466 0 0 0 0/0 kworker/1:2 31800 0 0 0 0/0 ppm All processes: MemAlloc = 4278689792
05-27-2022 09:17 AM
As the Cisco TAC confirmed, there were some bugs causing memory leak. Rebooting the switches solved and now we are upgrading to a fixed version.
05-27-2022 09:33 AM
Yeah, memory leaks are notorious for crashing the switch, and if you don't reboot in a planned maintenance window, it will get rebooted at a very bad time.
Good Luck with the upgrade!
Find answers to your questions by entering keywords or phrases in the Search bar above. New here? Use these resources to familiarize yourself with the community: