Hello Carlos, HSRP traffic is classified as "important" traffic under the default CoPP settings. If your CoPP profile is set to "strict", the default policer for that class is:
set cos 6
police cir 3000 pps bc 128 packets conform transmit violate drop
Your HSRP active router is likely receiving HSRP packets in excess of the allowed CoPP limit, so the excess is dropped, which would contribute to your high latency. You can raise the rate in this configuration and see whether latency improves.
This is the Security Config Guide for NX-OS Release 7.x; the step-by-step instructions are under "Configure Control Plane Policy Map." I'm not sure which release you are running. https://www.cisco.com/c/en/us/td/docs/switches/datacenter/nexus9000/sw/7-x/security/configuration/guide/b_Cisco_Nexus_90….
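As a rough sketch of what "dialing it up" might look like: the default CoPP profiles cannot be edited directly, so the usual approach is to copy the strict profile and change the copy. The prefix "custom" and the 6000 pps value below are placeholders I made up for illustration, not recommendations, and the lines starting with "!" are annotations rather than commands. The copied policy and class names should follow the prefix, but confirm them with "show running-config copp" on your release before applying anything.

```
! Check the active profile and whether the class carrying HSRP shows violated (dropped) packets
N9K-A# show copp status
N9K-A# show policy-map interface control-plane

! Copy the strict profile so it can be modified ("custom" is a placeholder prefix)
N9K-A# copp copy profile strict prefix custom

! Raise the policer on the copied "important" class (6000 pps is a placeholder value)
N9K-A# configure terminal
N9K-A(config)# policy-map type control-plane custom-copp-policy-strict
N9K-A(config-pmap)# class custom-copp-class-important
N9K-A(config-pmap-c)# police cir 6000 pps bc 128 packets conform transmit violate drop

! Attach the copied policy to the control plane
N9K-A(config-pmap-c)# control-plane
N9K-A(config-cp)# service-policy input custom-copp-policy-strict
```

If the violated counters for that class are not increasing in the first place, CoPP is probably not the cause and raising the rate will not help.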
Hope this helps. MM
The only time I've encountered this was when dmirror was enabled on a couple of 9Ks at sites that we acquired. TAC sometimes enables dmirror as a troubleshooting step to send traffic from a specific line card to the CPU so that the traffic appears in the debugs, and then forgets to disable it after the session. To verify whether it is enabled, run this:
9K-1# bcm-shell module <MODULE #> "dmirror show"
xe44: Mirror all to local port cpu0
If the output looks like the above, then dmirror is enabled, and it would cause latency because traffic is mirrored and then punted to the CPU. This is how to disable it, assuming that the port your switch sends ICMP replies out of is on Module 1:
9K-1# bcm-shell module 1
Warning: BCM shell access should be used with caution
Entering bcm shell on module 1
bcm-shell.0> dmirror xe44 mode=off
dmirror xe44 mode=off
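To confirm that the change took effect, the same show command from earlier can be run again against that module. This is only a sketch reusing the commands above; module 1 and xe44 are the example values from this thread, so substitute your own, and I have not shown any output because I am not certain what it looks like once mirroring is off. Presumably the port should no longer be listed as mirroring to cpu0.

```
9K-1# bcm-shell module 1 "dmirror show"
```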
That was it: dmirror was enabled, and the latency returned to normal once we disabled it. Someone on our team probably enabled it over the weekend. Thanks.
N9K-A# ping 10.X.X.12
PING 10.X.X.12 (10.X.X.12): 56 data bytes
64 bytes from 10.X.X.12: icmp_seq=0 ttl=254 time=0.748 ms
64 bytes from 10.X.X.12: icmp_seq=1 ttl=254 time=0.57 ms
64 bytes from 10.X.X.12: icmp_seq=2 ttl=254 time=0.565 ms
64 bytes from 10.X.X.12: icmp_seq=3 ttl=254 time=0.587 ms
64 bytes from 10.X.X.12: icmp_seq=4 ttl=254 time=0.625 ms