This discussion has been locked.
You can no longer post new replies to this discussion. If you have a question you can start a new discussion

XG HA: Kernel Panic on Auxiliary Appliance 18.0.1 MR-1-Build396 Tainted Module winbindd

Hello Community,

this is my first Post here. We updated our Cluster this Weekend to 18.0.1 MR-1-Build396.

After the update both Devices restarted. One took Master Role and other Aux all fine... for about a minute.

Auxilliary Device dropped to Faulty State. Upon check of the local console following came up:

BUG: unable to handle kernel NULL pointer dereference at
[  163.527682] IP:           (null)
[  163.537399] PGD 800000039b846067 P4D 800000039b846067 PUD 0
[  163.554381] Oops: 0010 [#1] SMP PTI
[  163.564855] Modules linked in: nf_conntrack_ipslb nfnetmap_queue(O) xt_master                                                                                                                                                                                                                                                                                                           YN_DATA ip6t_ADVERTISEMENT ip6t_SOLICITATION xt_LBS ip6table_filter iptable_filt                                                                                                                                                                                                                                                                                                           onntrack_tftp nf_nat_h323 nf_conntrack_h323 nf_nat_pptp
[  163.777028]  nf_conntrack_pptp cfg80211 usbhid hid_generic hid ohci_pci ohci_                                                                                                                                                                                                                                                                                                           ort xfrm4_mode_tunnel xfrm4_tunnel xfrm_user af_key xfrm_algo aesni_intel glue_h                                                                                                                                                                                                                                                                                                           er ipt_rpfilter ebt_nflog ebt_pkttype xt_serviceset
[  163.988262]  xt_appset xt_hostset xt_pkttype xt_recent xt_state xt_status xt_                                                                                                                                                                                                                                                                                                           et ip_set_bitmap_fwrule ip_set_bitmap_ctrxss ip_set_bitmap_user sp2fp_api ip_set                                                                                                                                                                                                                                                                                                           ip6_udp_tunnel ptp pps_core mdio i2c_i801 i2c_dev i2c_core
[  164.201353]  netmap(O) ip6table_nat nf_nat_ipv6 ip6table_mangle ip6table_raw                                                                                                                                                                                                                                                                                                            es nfnetlink button evdev [last unloaded: nfnetmap_queue]
[  164.315505] CPU: 5 PID: 21751 Comm: winbindd Tainted: G           O    4.14.3
[  164.337970] Hardware name: Sophos XG/XG, BIOS 5.11 06/01/2018
[  164.355246] task: ffff8803b0257080 task.stack: ffffc90008304000
[  164.373031] RIP: 0010:          (null)
[  164.384308] RSP: 0000:ffff88046dd43e18 EFLAGS: 00010202
[  164.400016] RAX: ffffffffa083f700 RBX: ffff8803e5c0e780 RCX: ffff88044dde0400
[  164.421468] RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff8803e5c0e780
[  164.442919] RBP: ffff88044dde0410 R08: 0000000000000001 R09: 0000000000000001
[  164.464343] R10: 0000000000000000 R11: ffffc90008307bf0 R12: ffff8804546b2000
[  164.485767] R13: ffff8804546b2078 R14: ffff8804546b20a0 R15: 0000000000000008
[  164.507190] FS:  0000000000000000(0000) GS:ffff88046dd40000(0063) knlGS:00000
[  164.531487] CS:  0010 DS: 002b ES: 002b CR0: 0000000080050033
[  164.548737] CR2: 0000000000000000 CR3: 000000039b924003 CR4: 00000000001606e0
[  164.570164] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  164.591585] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  164.613049] Call Trace:
[  164.620439]  <IRQ>
[  164.626523]  ? ip_rcv+0x316/0x4c0
[  164.636509]  ? ip_local_deliver_finish+0x1d0/0x1d0
[  164.650941]  ? __netif_receive_skb_core+0x3ec/0xac0
[  164.665632]  ? enqueue_task_fair+0x320/0x440
[  164.678475]  ? process_backlog+0x86/0x120
[  164.690537]  ? process_backlog+0x86/0x120
[  164.702601]  ? net_rx_action+0xcc/0x270
[  164.714148]  ? __do_softirq+0xc5/0x1ec
[  164.725428]  ? do_softirq_own_stack+0x2a/0x40
[  164.738533]  </IRQ>
[  164.744901]  ? do_softirq.part.2+0x3c/0x40
[  164.757227]  ? netif_rx_ni+0x1d/0x30
[  164.767992]  ? dev_loopback_xmit+0xa3/0xc0
[  164.780317]  ? ip_mc_output+0x176/0x240
[  164.791860]  ? ip_finish_output2+0x3b0/0x3b0
[  164.804704]  ? ip_send_skb+0x10/0x40
[  164.815469]  ? udp_send_skb+0x94/0x240
[  164.826750]  ? udp_sendmsg+0x2f8/0x8c0
[  164.838037]  ? release_sock+0x3b/0x90
[  164.849059]  ? sock_sendmsg+0xe/0x20
[  164.859822]  ? SyS_sendto+0xad/0x150
[  164.870587]  ? ep_poll_wakeup_proc+0x20/0x20
[  164.883432]  ? compat_SyS_socketcall+0x12c/0x210
[  164.897319]  ? do_int80_syscall_32+0x58/0x110
[  164.910421]  ? entry_INT80_compat+0x48/0x50
[  164.923002] Code:  Bad RIP value.
[  164.933012] RIP:           (null) RSP: ffff88046dd43e18
[  164.948719] CR2: 0000000000000000
[  164.958727] ---[ end trace 0c3cc4f11b5d6136 ]---
[  164.958728] BUG: unable to handle kernel NULL pointer dereference at
[  164.958729] IP:           (null)
[  164.958729] PGD 80000003a7ba4067 P4D 80000003a7ba4067 PUD 0
[  164.958731] Oops: 0010 [#2] SMP PTI
[  164.958732] Modules linked in: nf_conntrack_ipslb nfnetmap_queue(O) xt_master                                                                                                                                                                                                                                                                                                           YN_DATA ip6t_ADVERTISEMENT ip6t_SOLICITATION xt_LBS ip6table_filter iptable_filt                                                                                                                                                                                                                                                                                                           onntrack_tftp nf_nat_h323 nf_conntrack_h323 nf_nat_pptp
[  164.958744]  nf_conntrack_pptp cfg80211 usbhid hid_generic hid ohci_pci ohci_                                                                                                                                                                                                                                                                                                           ort xfrm4_mode_tunnel xfrm4_tunnel xfrm_user af_key xfrm_algo aesni_intel glue_h                                                                                                                                                                                                                                                                                                           er ipt_rpfilter ebt_nflog ebt_pkttype xt_serviceset
[  164.958758]  xt_appset xt_hostset xt_pkttype xt_recent xt_state xt_status xt_                                                                                                                                                                                                                                                                                                           et ip_set_bitmap_fwrule ip_set_bitmap_ctrxss ip_set_bitmap_user sp2fp_api ip_set                                                                                                                                                                                                                                                                                                           ip6_udp_tunnel ptp pps_core mdio i2c_i801 i2c_dev i2c_core
[  164.958771]  netmap(O) ip6table_nat nf_nat_ipv6 ip6table_mangle ip6table_raw                                                                                                                                                                                                                                                                                                            es nfnetlink button evdev [last unloaded: nfnetmap_queue]
[  164.958780] CPU: 6 PID: 21752 Comm: winbindd Tainted: G      D    O    4.14.3
[  164.958780] Hardware name: Sophos XG/XG, BIOS 5.11 06/01/2018
[  164.958780] task: ffff8803b0250000 task.stack: ffffc9000830c000
[  164.958781] RIP: 0010:          (null)
[  164.958781] RSP: 0000:ffff88046dd83e18 EFLAGS: 00010202
[  164.958782] RAX: ffffffffa083f700 RBX: ffff88039b9a03c0 RCX: ffff8803ae590200
[  164.958782] RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff88039b9a03c0
[  164.958782] RBP: ffff8803ae590210 R08: 0000000000000001 R09: 0000000000000001
[  164.958783] R10: 0000000000000000 R11: ffffc9000830fbf0 R12: ffff8804546b2000
[  164.958783] R13: ffff8804546b2078 R14: ffff8804546b20a0 R15: 0000000000000008
[  164.958784] FS:  0000000000000000(0000) GS

console> system ha show details
HA status : Enabled
Current Appliance Key : C430-------------
Peer Appliance Key : C430-----------
Current HA state : Standalone
Peer HA state : Fault
HA Config Mode : Active-Passive
Load Balancing : Not Applicable
Dedicated Port : Port4
Current Dedicated IP : 5.5.5.1
Peer Dedicated IP : 5.5.5.2
Monitoring Port :
Auxiliary Admin Port : bond1
Auxiliary Admin IP : 10.0.5.28
Auxiliary Admin IPv6 :
HA Cluster ID : 10
Keepalive request interval : 250
Keepalive attempts : 16
Hypervisor assigned MAC addresses : Disabled
HA preemption : Disabled

less /log/applog.log | grep ha:

Sep 29 13:07:27 ha_port_down_notification: message_id : log_data : Interface Port4 went down. Appliance HA state MASTSep 29 13:07:28 ha: handle_stat_change: 2:3 [ NA=0 AUX=1 STAND=2 PRIM=3 FAULT=4 READY=5 GOTO_PRIM=6 ]
Sep 29 13:07:28 ha: handle_stat_change: g_ha_hsc=1 is set.
Sep 29 13:07:28 ha: g_ha_transmode=0 [ CONFIG=1 INIT=2 EVENT=0 ]
Sep 29 13:07:28 ha: start tracking the device
Sep 29 13:07:28 ha: fwm:disablearpha successfully done
Sep 29 13:07:28 ha: msync:applyha: no network changes reqd
Sep 29 13:07:28 ha: fwm:applyha successfully done
Sep 29 13:07:28 ha: fwm:enablearpha successfully done
Sep 29 13:07:29 ha: mail sent successfully
Sep 29 13:07:30 ha: syncing conntracks
Sep 29 13:07:30 ha: handle_stat_change: 2:3 done.
Sep 29 13:07:30 ha: handle_stat_change: g_ha_hsc=0 is set.
Sep 29 13:08:07 ha: handle_stat_change: 3:2 [ NA=0 AUX=1 STAND=2 PRIM=3 FAULT=4 READY=5 GOTO_PRIM=6 ]
Sep 29 13:08:07 ha: handle_stat_change: g_ha_hsc=1 is set.
Sep 29 13:08:07 ha: g_ha_transmode=0 [ CONFIG=1 INIT=2 EVENT=0 ]
Sep 29 13:08:07 ha: start tracking the device
Sep 29 13:08:07 ha: fwm:disablearpha successfully done
Sep 29 13:08:07 ha: ctsyncd commited
Sep 29 13:08:07 ha: ctsyncd external cache flushed
Sep 29 13:08:07 ha: msync:applyha: prim->stand, so no network changes reqd
Sep 29 13:08:08 ha: fwm:applyha successfully done
Sep 29 13:08:10 ha: msync:garpha: send_arp 

<lots of arps here>
Sep 29 13:08:10 ha: fwm:enablearpha successfully done
Sep 29 13:08:11 ha: mail sent successfully
Sep 29 13:08:11 ha: syncing conntracks
Sep 29 13:08:11 ha: handle_stat_change: 3:2 done.
Sep 29 13:08:11 ha: handle_stat_change: g_ha_hsc=0 is set.
Sep 29 13:09:15 ha: appcached_ha_sync function is called...!!!!
Sep 29 13:13:20 ha: redis DB dump file sync is done !!

If someone managed to fixed that on his cluster any help would be appreciated.

Kind regards,

Sascha



This thread was automatically locked due to age.
  • Helo Sascha,

    Thank you for contacting the Sophos Community!

    I think the issue as you pointed out in the title is with winbindd Tainted: G

    Are you able to SSH to the AUX device or it is still in this failed status?

    In the master can you see if there is any coredump

    # cd /var/cores 

    # ls -lh

    Regards,

  • Hello Emmanuel,

    thank you for replying.

    Unfortunatly the Aux Device is totally frozen after the Kernel Panic. Not even the HW-Buttons or LCD responds to input.

    Only Chance to get Access for like 1 Minute is to cold boot it.

    Found some Coredumps on the Master:

    XG450_WP02_SFOS 18.0.1 MR-1-Build396# cd /var/cores
    XG450_WP02_SFOS 18.0.1 MR-1-Build396# ls -lh
    -rw-------    1 root     0          35.3M Jun 27 20:28 core.awed
    -rw-------    1 root     0          61.1M Sep 26 14:17 core.garner
    -rw-------    1 root     nasm       21.7M Sep 26 11:55 core.nasm
    -rw-------    1 root     0           2.2M Jan 27  2020 core.syncfile
    XG450_WP02_SFOS 18.0.1 MR-1-Build396#

    Makes sense to me, we actually had Problems with Logging and Wireless too. Syncfile makes sense too. Don´t know what nasm does.

    Regards,

    Sascha

  • Hello Community,

    meanwhile also the Standalone Applinace reboots everyday, probably from Kernel Panic.

    We will attach a Serial Cable with a logging TTY-Session to confirm the device also hits Kernel Panic Mode.

    Hope anyone can give me some insight/help on this.

    Logs are as usually empty. Looks like no deamon is able to write something helpfull before the System Crashes.

    I am really annoyed atm.

    Kind regards,

    Sascha

  • Hi Emmanuel,

    any Idea what is going on?

    Kind regards,

    Sascha

  • Hello Community,

    found this in /log/syslog.log on my Standalone...

    Oct  8 17:28:45 (none) user.warn kernel: [49916.831466] ------------[ cut here ]------------
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831470] WARNING: CPU: 0 PID: 0 at ./include/net/dst.h:256 0xffffffffa081876d
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831470] Modules linked in: nfnetmap_queue(O) nf_conntrack_ipslb xt_masterconn xt_svp drbg ansi_cprng echainiv xfrm6_mode_tunnel xt_xfrmpolicy pppox ppp_generic slhc ah4 arpt_arpreply nf_nat_ftp nf_conntrack_ftp xt_CT xt_nat xt_addrtype ebt_vlan ebt_arp arpt_arpreply_proxy arpt_arpreq_proxy arpt_arpspoof ebtable_filter ebtable_nat ebtables ip6t_MASQUERADE xt_muser xt_conntrack xt_RCV_AUX_DATA ip6t_ADVERTISEMENT ip6t_SOLICITATION xt_LBS ip6table_filter iptable_filter xt_DNAT xt_SNAT nf_nat_masquerade_ipv6 xt_nat_lookup xt_UST xt_ust xt_firewall nat_rules sfos_rules_framework firewall ip_set_hash_mlmwsticky ip_set_hash_sslvpn iptable_mangle ip_set_hash_mac ip_set_hash_bw nf_conntrack_dns nf_nat_sip nf_conntrack_sip nf_nat_irc nf_conntrack_irc nf_nat_tftp nf_conntrack_tftp nf_nat_h323 nf_conntrack_h323
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831487]  nf_nat_pptp nf_conntrack_pptp cfg80211 usbhid hid_generic hid ohci_pci ohci_hcd xhci_pci xhci_hcd uhci_hcd ehci_pci ehci_hcd fw_handle_ngfw_notification fp2sp_api fp_notifier bonding lzo lzo_compress lzo_decompress cifs red red2 appdev nf_conntrack_netlink nf_nat_proto_gre nf_conntrack_proto_gre set_sessiontbl sessiontbl ip_gre gre ipcomp xfrm_ipcomp esp4 xfrm4_mode_transport xfrm4_mode_tunnel xfrm4_tunnel xfrm_user af_key xfrm_algo aesni_intel glue_helper aes_x86_64 crypto_simd cryptd cls_u32 act_mirred sch_ingress ifb sch_hfsc sch_leafprio sch_headprio sch_sfq sch_htb xt_MULTISET xt_MLM xt_SRCNETMAP xt_MARKROUTE xt_CONTINUE xt_LOGDROP xt_ULOG xt_TCPMSS xt_REDIRECT nf_nat_redirect ipt_MASQUERADE nf_nat_masquerade_ipv4 xt_OUT_OUTDEV ip6t_rpfilter ipt_rpfilter ebt_nflog ebt_pkttype xt_serviceset
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831507]  xt_appset xt_hostset xt_pkttype xt_recent xt_state xt_status xt_cet xt_OUTDEV xt_iprange xt_limit xt_hashlimit xt_tcpudp xt_multiport nf_conntrack_relate xt_IPMACFILTER xt_RANGENAT xt_VHDNAT ip_set_bitmap_vhost xt_FWSET xt_set ip_set_hash_maciface_fp ip_set_hash_ipiface_fp ip_set_bitmap_hotspotuser ip_set_hash_hotspotmac ip_set_bitmap_tlsrule ip_set_bitmap_appset ip_set_bitmap_fwrule ip_set_bitmap_ctrxss ip_set_bitmap_user sp2fp_api ip_set_bitmap_userpolicy ip_set_hash_ipuser ip_set_bitmap_service ip_set_bitmap_host ip_set_hash_ipmaciface ip_set_hash_l2mac ip_set_hash_ipmac ip_set_hash_ip ip_set arptable_filter arp_tables caswell_bpgen3(O) network_bypass(O) e1000e_nm(O) igb_nm(O) i2c_algo_bit ixgbe_nm(O) i40e_nm(O) vxlan udp_tunnel ip6_udp_tunnel ptp pps_core mdio i2c_i801 i2c_dev i2c_core
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831524]  netmap(O) ip6table_nat nf_nat_ipv6 ip6table_mangle ip6table_raw iptable_nat iptable_raw nf_nat_ipv4 xt_dscp nf_nat ip6_tables ip_tables tun af_packet 8021q nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 ip6_tunnel tunnel6 sit ip_tunnel tunnel4 ppdev parport_pc parport nf_conntrack lineartable bitmap_api br_netfilter bridge nf_defrag_ipv4 ipv6 stp llc x_tables nfnetlink button evdev [last unloaded: nfnetmap_queue]
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831536] CPU: 0 PID: 0 Comm: swapper/0 Tainted: G           O    4.14.38 #2
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831536] Hardware name: Sophos XG/XG, BIOS 5.11 06/01/2018
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831537] task: ffffffff81c104c0 task.stack: ffffffff81c00000
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831538] RIP: 0010:0xffffffffa081876d
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831538] RSP: 0018:ffff88046dc03c88 EFLAGS: 00010246
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831539] RAX: 0000000000000000 RBX: ffff8804321c0780 RCX: 000000000000001b
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831539] RDX: ffff8803534f4f00 RSI: 000000000000007d RDI: ffff8803534f4f01
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831540] RBP: ffff88044e379cc0 R08: 0000000000000001 R09: 0000000000000001
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831540] R10: 0000000000000000 R11: ffff880432843200 R12: 0000000000000001
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831541] R13: ffff88044dc0cf01 R14: ffff88046dc03cf0 R15: 0000000000000001
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831541] FS:  0000000000000000(0000) GS:ffff88046dc00000(0000) knlGS:0000000000000000
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831542] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831542] CR2: 00007f120803a858 CR3: 0000000001c0a006 CR4: 00000000001606f0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831543] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831543] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831543] Call Trace:
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831545]  <IRQ>
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831547]  nf_hook_slow+0x38/0xd0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831549]  ip_forward+0x39a/0x400
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831550]  ? ip_frag_mem+0x10/0x10
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831550]  ip_rcv+0x316/0x4c0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831551]  ? ip_local_deliver_finish+0x1d0/0x1d0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831553]  __netif_receive_skb_core+0x3ec/0xac0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831556]  ? check_preempt_curr+0x6a/0x80
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831557]  ? netif_receive_skb_internal+0x1f/0xa0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831558]  netif_receive_skb_internal+0x1f/0xa0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831559]  napi_gro_receive+0x6a/0x80
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831563]  ixgbe_poll+0x6be/0x1250 [ixgbe_nm]
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831565]  net_rx_action+0xcc/0x270
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831566]  __do_softirq+0xc5/0x1ec
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831569]  irq_exit+0x75/0x80
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831570]  do_IRQ+0x76/0xc0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831572]  common_interrupt+0x77/0x77
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831572]  </IRQ>
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831574] RIP: 0010:mwait_idle+0x49/0x70
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831574] RSP: 0018:ffffffff81c03ee8 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff4d
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831575] RAX: 0000000000000000 RBX: ffffffff81c775b0 RCX: 0000000000000000
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831576] RDX: 0000000000000000 RSI: ffff88046dc19300 RDI: 0000000000000000
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831576] RBP: ffffffff81c104c0 R08: 0000000000000001 R09: 0000000000000005
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831576] R10: ffffc900003b3db0 R11: 0000000000000001 R12: ffffffff81f5a920
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831577] R13: ffffffff81f620a0 R14: 0000000000000000 R15: 0000000000000002
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831579]  ? arch_cpu_idle_enter+0x7/0x10
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831580]  do_idle+0x85/0xd0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831581]  cpu_startup_entry+0x5a/0x60
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831582]  start_kernel+0x3de/0x3e9
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831583]  secondary_startup_64+0xa5/0xb0
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831584] Code: 45 89 f8 ba 01 00 00 00 48 8b 3d 3f 1c 00 00 0f b7 48 18 e8 d6 a8 bc ff 41 83 fc 02 0f 86 b7 fe ff ff e9 b4 f9 ff ff 48 8b 7d 60 <0f> 0b e9 f5 fb ff ff f6 c5 02 0f 84 f7 fa ff ff c6 42 38 00 48 
    Oct  8 17:28:45 (none) user.warn kernel: [49916.831598] ---[ end trace 8a1120486f728025 ]---
    Oct  8 17:29:04 (none) user.err kernel: [49936.216216] no peer (tx)
    Oct  8 17:29:05 (none) user.err kernel: [49937.218802] no peer (tx)
    Oct  8 17:29:06 (none) user.err kernel: [49938.221333] no peer (tx)
    Oct  8 17:29:13 (none) user.err kernel: [49945.004063] no peer (tx)
    Oct  8 17:29:14 (none) user.err kernel: [49946.006102] no peer (tx)
    Oct  8 17:29:15 (none) user.err kernel: [49947.007879] no peer (tx)
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        
    Oct  8 17:30:28 (none) syslog.info syslogd started: BusyBox v1.21.1
    Oct  8 17:30:28 (none) user.notice kernel: klogd started: BusyBox v1.21.1 (2020-06-05 20:04:01 UTC)
    Oct  8 17:30:28 (none) user.notice kernel: [    0.000000] Linux version 4.14.38 (jenkins@ci-36) (gcc version 7.3.0 (OpenWrt GCC 7.3.0 7340-gf2d738297)) #2 SMP Fri Jun 5 22:28:42 UTC 2020

    Seems to me SMP-Kernel 4.14.38 has some Problems is GA 18.0.1 MR-1.

    Can finally someone say something to this Problem?

    Kind regards,

    Sascha Schaal

  • Hello Sascha,

    Thank you for the follow-up, I thought I replied to you.

    To exactly know what is happening, you would need to open a case with support.

    Please collect the following logs:

    csc.log, applog.log syslog.log msync.log and networkd.log

    If possible take a screenshot of the Memory and CPU graph (Monitor & Analyze >> Diagnostics >> System Graphs

    Additionally, provide the output of the following command:

    less applog.log | grep "Tainted"

    less syslog.log  | grep "NMI" 

    And 

    ls -lh /var/cores

    This would be for both instances.

    And please send me the Case ID.

    Regards,

  • Hi Emmanuel,

    thank you. We were able to solve this on our own.

    The Problem seems that Active Directory SSO seems to be incomptabile with HA from 17.5 up to 18. MR-3.

    We disabled Active Directory SSO on Appliance Access for all Networks and were able to build HA.

    Hope this helps someone.

    Kind regards,

    Sascha