Cisco Bug: CSCvt57817 - High network load on DR link causing remote NFS timeout may affect the cluster status

Last Modified

Jul 18, 2020

Products (1)

  • Cisco HyperFlex HX-Series

Known Affected Releases

4.0(1a), 4.0(2a)

Description (partial)

Symptom:
The Springpath filesystem service (storfs) restarts on the controller VM, which could impact data availability on the cluster.
A core file is generated in /var/core.

Logs similar to the following can be seen in the debug log:
2019-12-08-12:39:32.775+0000: storfs[2941:3184]: SUPPORT: PANIC: Segmentation fault @ 0xfffffffffffffff9 (source pid -7, uid -1 (info: 0x7fde5ca82cf0, ctxt: 0x7fde5ca82bc0)).
2019-12-08-12:39:32.777+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca81090:0x558c7b919a8e] Util_Backtrace+0x4e(0x558c7b919a8e, 0x7fde5ca81090, 0x7fde5ca81460, 0x7fde5ca81090, 0x7fde5ca81150, 0x3000000010)
2019-12-08-12:39:32.777+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca818a0:0x558c7b8b2272] LogPanicInternal+0x122(0x558c7bb436f8, 0x7fde5ca818b8, 0xfa1, 0x3000000028, 0x7fde5ca82a60, 0x7fde5ca82980)
2019-12-08-12:39:32.777+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca82a60:0x558c7b8bec1e] Module_HeapIdToModId+0xfe(0x2f6638363400383d, 0xfffffffffffffff9, 0x61746e656d676553, 0x756166206e6f6974, 0x667830204020746c, 0x6666666666666666)
2019-12-08-12:39:32.778+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca82bc0:0x7fe3f93ba390] __restore_rt+0(0x7, 0x0, 0x0, 0x2, 0x0, 0x0)
2019-12-08-12:39:32.778+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca838c8:0x558c7b8bc559] Heap_Free+0x9(0x558c7b38e869, 0xfb, 0x7fde5ca83a00, 0x7fe3611a3120, 0x558c7b42d7f2, 0x7fe3611a335f)
2019-12-08-12:39:32.779+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca838d0:0x558c7b38e869] ClusterPropertiesFree+0x19(0xfb, 0x7fde5ca83a00, 0x7fe3611a3120, 0x558c7b42d7f2, 0x7fe3611a335f, 0x3c61198bf0)
2019-12-08-12:39:32.779+0000: storfs[2941:3184]: SUPPORT: PANIC: [0x7fde5ca838f0:0x558c7b42d7f2] NRNFS_ConnectSync+0x242(0x7fe3611a335f, 0x3c61198bf0, 0x100000001, 0x7fe3611a3490, 0x558c7bb00b68e00e, 0x500fb0000)

Conditions:
The issue occurs when the NFS clusterproperty query fails after the NFS mount. This happens only if severe network load on the DR link is causing failures, a new connection is initiated, and a timeout occurs after the mount has failed.