03-27-2023 02:57 PM
I'm trying to understand the cause of the high latency that I'm seeing in the latency graph on vManage.
I have a few questions about how vManage collects statistics from devices:
Please see the attachment.
03-27-2023 03:07 PM
Hi,
1) What vManage shows is based on BFD. By default, the poll interval is 10 minutes and six poll intervals are used (60 minutes); the values shown are means over that window. Your picture shows tunnel statistics, so BFD between the tunnel endpoints (the respective TLOCs of the routers) is used.
2) How do you test? Are you sure which exit interface the router uses in your tests?
3) vManage does not run a separate test.
See the "Bidirectional Forwarding Detection (BFD)" section of the SD-WAN CVD:
https://www.cisco.com/c/en/us/td/docs/solutions/CVD/SDWAN/cisco-sdwan-design-guide.html
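To illustrate point 1, here is a rough Python sketch of the averaging window. It only models the arithmetic described above (10-minute poll interval, six intervals, mean value); the function names and the sample latencies are made up for illustration and are not taken from vManage or from any device.

```python
from statistics import mean

POLL_INTERVAL_MIN = 10  # default poll interval mentioned above, in minutes
MULTIPLIER = 6          # number of poll intervals considered


def window_minutes(poll_interval_min=POLL_INTERVAL_MIN, multiplier=MULTIPLIER):
    """Length of the averaging window in minutes."""
    return poll_interval_min * multiplier


def mean_latency(per_interval_latency_ms, multiplier=MULTIPLIER):
    """Mean latency over the most recent `multiplier` poll intervals."""
    recent = per_interval_latency_ms[-multiplier:]
    return mean(recent)


# Hypothetical per-interval latencies in ms, oldest first.
# One bad interval (500 ms) pulls the whole 60-minute mean up.
samples = [20, 22, 21, 19, 23, 21, 500]

print(window_minutes())       # 60
print(mean_latency(samples))  # 101
```

The point of the example: a single poor interval is enough to raise the mean for the whole window, so a spike can stay visible in the graph for a while after the underlying problem has passed.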
03-27-2023 04:51 PM
Thank you for your reply.
I opened a ticket with the service provider and they informed me that they did not find any issues in the transport. However, I am still seeing that the latency values remain high.
Because of this, I am investigating the reason why the latency in the overlay is so high even though the transport is not showing the same behavior.
Sincerely.
03-27-2023 05:12 PM
03-27-2023 06:13 PM
Thank you for your reply.
What I am trying to understand is whether the values that vManage shows in the UI are collected from all BFD sessions or per transport.
When I go to Monitor -> Network -> Select Device -> WAN -> TLOC, the table below shows the statistics by color. Are these values collected from BFD on each transport?
Please see the attachment: custom2 is showing a high latency value (2,814.59 ms). The question is why?
Sincerely.
03-28-2023 12:42 AM
Hi,
It is based on the aggregated average loss, latency, and jitter for the TLOC color. The time interval in the graph is determined by the BFD application-aware routing poll interval. Check the subsection below.
As for why you have higher latency for custom2, check the statistics of all tunnels to see exactly which tunnel (initiated by the custom2 TLOC) has the higher latency.
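As a rough illustration of why the per-color value can hide a single bad tunnel, here is a small Python sketch. It assumes a plain (unweighted) mean per color, which is my assumption and not something confirmed for vManage; the tunnel list, site names, and latency values are hypothetical.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical per-tunnel latencies (ms), keyed by the local TLOC color.
tunnels = [
    {"local_color": "custom2", "remote": "siteA", "latency_ms": 35.0},
    {"local_color": "custom2", "remote": "siteB", "latency_ms": 40.0},
    {"local_color": "custom2", "remote": "siteC", "latency_ms": 3000.0},
    {"local_color": "mpls", "remote": "siteA", "latency_ms": 20.0},
    {"local_color": "mpls", "remote": "siteB", "latency_ms": 30.0},
]


def mean_latency_by_color(tunnels):
    """Aggregate tunnel latencies into one mean value per local color."""
    by_color = defaultdict(list)
    for t in tunnels:
        by_color[t["local_color"]].append(t["latency_ms"])
    return {color: mean(values) for color, values in by_color.items()}


def worst_tunnel(tunnels, color):
    """Return the tunnel with the highest latency for the given color."""
    candidates = [t for t in tunnels if t["local_color"] == color]
    return max(candidates, key=lambda t: t["latency_ms"])


print(mean_latency_by_color(tunnels))             # {'custom2': 1025.0, 'mpls': 25.0}
print(worst_tunnel(tunnels, "custom2")["remote"])  # siteC
```

In this made-up data, two of the three custom2 tunnels are healthy, yet the color-level mean is over one second because of the single tunnel to siteC. That is why looking at the individual tunnels is the useful next step.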
03-28-2023 12:41 PM
I was checking the vEdge's events and found some entries related to high CPU (system + user). The vEdge 1000 has only one CPU for both the control and data planes. Could this affect the tunnels?
03-28-2023 01:07 PM
03-28-2023 04:41 PM - edited 03-28-2023 04:42 PM
03-28-2023 05:38 PM
03-28-2023 05:53 PM - edited 03-28-2023 06:33 PM
Good to know.
I have observed some critical events related to CPU usage in the vEdge device, but I am not sure if they are causing the high latency in a tunnel.
There is a tunnel showing latency of more than 2000 ms, but in only one direction.
How many tunnels does this vEdge model support?
Could high CPU usage be causing the one-way latency?
03-28-2023 09:59 PM
Hi,
The latency for a TLOC is an aggregated value. That's why I asked you to search the tunnel list.
What do you mean by one-way, and how did you check? If there is latency from A to B, there should normally be latency from B to A as well.
03-29-2023 12:26 PM
Exactly, the latency is from A to B, but not from B to A in the same tunnel.
03-29-2023 01:13 PM
Hi,
Could you share a screenshot? Realistically, I don't see a reason for such a scenario. BFD is calculated two-way (I didn't find an explicit doc, but I believe it works that way), and I checked in our environment: the values from A to B and from B to A are the same (give or take a very small difference). Maybe you are looking at the wrong tunnel?