<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic AMP+kubernetes/rancher memory problem in Endpoint Security</title>
    <link>https://community.cisco.com/t5/endpoint-security/amp-kubernetes-rancher-memory-problem/m-p/4595703#M6817</link>
    <description>&lt;P&gt;Hi!&lt;BR /&gt;&lt;BR /&gt;We have 2 on-prem cloud infrastructures running kubernetes/rancher at the moment for dev and test, but the prod is coming along fast and we ran into the following problem:&lt;BR /&gt;the infrastructures consists of the following nodes:&lt;BR /&gt;1 "control" node&lt;/P&gt;&lt;P&gt;3 "master" nodes&lt;/P&gt;&lt;P&gt;3-4 "worker" nodes&amp;nbsp;&lt;/P&gt;&lt;P&gt;3-4 "infra" nodes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;All of them have AMP installed, latest version.&amp;nbsp;&lt;/P&gt;&lt;P&gt;On the worker and infra nodes, the memory usage of the ampdaemon process starts to ramp up after a while, which triggers the oom killer, which starts to kill the k8s/rancher processes and the infra and worker servers become unavailable.&lt;BR /&gt;This doesnt happen on the control or master servers ever.&lt;/P&gt;&lt;P&gt;I imagine that there might be some problem with the exclusion list, or its a memory leak somewhere between amp and k8s.&amp;nbsp;&lt;BR /&gt;This happened just half an hour again on of our infra nodes and the ampdaemon process was using 77% of memory of the server.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have disabled the amp service on the test nodes, leaving the dev nodes for troubleshooting for now.&amp;nbsp;&lt;/P&gt;&lt;P&gt;The exclusion list should be sound, based on the documentation, and we dont have this issue with our other server, which doesnt run k8s either.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can't post logs for now, but will do later if needed.&amp;nbsp;&lt;/P&gt;&lt;P&gt;All mentioned systems are running RHEL8.5, swap off, autoupdate for the connector is set, clam-av linux-only.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 19 Apr 2022 20:01:37 GMT</pubDate>
    <dc:creator>cxir</dc:creator>
    <dc:date>2022-04-19T20:01:37Z</dc:date>
    <item>
      <title>AMP+kubernetes/rancher memory problem</title>
      <link>https://community.cisco.com/t5/endpoint-security/amp-kubernetes-rancher-memory-problem/m-p/4595703#M6817</link>
      <description>&lt;P&gt;Hi!&lt;BR /&gt;&lt;BR /&gt;We have 2 on-prem cloud infrastructures running kubernetes/rancher at the moment for dev and test, but the prod is coming along fast and we ran into the following problem:&lt;BR /&gt;the infrastructures consists of the following nodes:&lt;BR /&gt;1 "control" node&lt;/P&gt;&lt;P&gt;3 "master" nodes&lt;/P&gt;&lt;P&gt;3-4 "worker" nodes&amp;nbsp;&lt;/P&gt;&lt;P&gt;3-4 "infra" nodes.&amp;nbsp;&lt;/P&gt;&lt;P&gt;All of them have AMP installed, latest version.&amp;nbsp;&lt;/P&gt;&lt;P&gt;On the worker and infra nodes, the memory usage of the ampdaemon process starts to ramp up after a while, which triggers the oom killer, which starts to kill the k8s/rancher processes and the infra and worker servers become unavailable.&lt;BR /&gt;This doesnt happen on the control or master servers ever.&lt;/P&gt;&lt;P&gt;I imagine that there might be some problem with the exclusion list, or its a memory leak somewhere between amp and k8s.&amp;nbsp;&lt;BR /&gt;This happened just half an hour again on of our infra nodes and the ampdaemon process was using 77% of memory of the server.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I have disabled the amp service on the test nodes, leaving the dev nodes for troubleshooting for now.&amp;nbsp;&lt;/P&gt;&lt;P&gt;The exclusion list should be sound, based on the documentation, and we dont have this issue with our other server, which doesnt run k8s either.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I can't post logs for now, but will do later if needed.&amp;nbsp;&lt;/P&gt;&lt;P&gt;All mentioned systems are running RHEL8.5, swap off, autoupdate for the connector is set, clam-av linux-only.&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 19 Apr 2022 20:01:37 GMT</pubDate>
      <guid>https://community.cisco.com/t5/endpoint-security/amp-kubernetes-rancher-memory-problem/m-p/4595703#M6817</guid>
      <dc:creator>cxir</dc:creator>
      <dc:date>2022-04-19T20:01:37Z</dc:date>
    </item>
  </channel>
</rss>

