Guest User!

You are not Sophos Staff.

[9.194-5][BUG]Intel NIC crashes under load

This possibly a duplicate report.
I have been trying to do some performance reporting using two different sets of hardware built with the latest ISO.

One the NIC goes off line under load, kernel log shows a problem. BarryG advises there is an issue with the current driver.

I would provide the log entries, but that period of testing in now full of *, wrong UTM. Will need to bring the other UTM up to extract the log entries.

I changed the the NIC for another of the same type and model and got the same results. 

Ian


2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   TDH                  
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   TDT                  
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   next_to_use          
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   next_to_clean        
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] buffer_info[next_to_clean]:
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   time_stamp           
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   next_to_watch        
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   jiffies              
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536]   next_to_watch.status 
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] MAC Status             
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] PHY Status             
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] PHY 1000BASE-T Status  
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] PHY Extended Status    
2014:02:08-14:37:56 Cats-speed kernel: [ 2178.936536] PCI Status             
2014:02:08-14:37:57 Cats-speed kernel: [ 2179.944779] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
2014:02:08-14:38:00 Cats-speed kernel: [ 2183.235199] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: Rx/Tx

  • Hi Bianca,
    I have run those commands and lro cannot be changed.

    I do not anticipate being able to run the tests for a couple of days. Those that pay want attention, those that pay intermittently also want attention, those that want money also want attention which leaves a little time for me  and those that purr a lot besides computers.

    Ian
  • Guys, small question.

    So far the adapters under question are:
    -Intel 82572EI NIC (barry)
    -Intel Corporation 82579LM Gigabit Network Connection (martin)
    -Intel Ethernet Connection I217-V (ian)

    Did u have this problem in 9.1? Thanks,
    Bianca


    Hi, this is new hardware for me, and I never installed another version on it.

    However, BruceKConvergent seems to be having a similar problem in 9.1; did you see his posts?

    Barry
  • Hi Bianca,
    that report I submitted might not reflect what was happening at the time of the errors. I changed NICs and the order was changed, caused large quantities of grief. I was lazy and did a rebuild.

    I will try load test tomorrow night when I get home if not too late.

    Ian

  • I have run those commands and lro cannot be changed.


    Maybe its already off. [:)]
  • Hi, this is new hardware for me, and I never installed another version on it.

    However, BruceKConvergent seems to be having a similar problem in 9.1; did you see his posts?

    Barry


    The kernel / NIC driver patch they put out in 9.108 fixed the issue I was having in 9.10x.

    CTO, Convergent Information Security Solutions, LLC

    https://www.convergesecurity.com

    Advice given as posted on this forum does not construe a support relationship or other relationship with Convergent Information Security Solutions, LLC or its subsidiaries.  Use the advice given at your own risk.

  • Hi, 
    Several of the offloading features were ON on my NIC; I'll update to current beta, disable the offloading, and try to re-test this weekend.

    Barry

    Intel Corporation 82572EI Gigabit Ethernet Controller (rev 06)

    fw:/root # /sbin/ethtool -k eth1
    Features for eth1:
    rx-checksumming: on
    tx-checksumming: on
            tx-checksum-ipv4: off [fixed]
            tx-checksum-ip-generic: on
            tx-checksum-ipv6: off [fixed]
            tx-checksum-fcoe-crc: off [fixed]
            tx-checksum-sctp: off [fixed]
    scatter-gather: on
            tx-scatter-gather: on
            tx-scatter-gather-fraglist: off [fixed]
    tcp-segmentation-offload: on
            tx-tcp-segmentation: on
            tx-tcp-ecn-segmentation: off [fixed]
            tx-tcp6-segmentation: on
    udp-fragmentation-offload: off [fixed]
    generic-segmentation-offload: on
    generic-receive-offload: on
    large-receive-offload: off [fixed]
    rx-vlan-offload: on
    tx-vlan-offload: on
    ntuple-filters: off [fixed]
    receive-hashing: on
    highdma: on [fixed]
    rx-vlan-filter: on [fixed]
    vlan-challenged: off [fixed]
    tx-lockless: off [fixed]
    netns-local: off [fixed]
    tx-gso-robust: off [fixed]
    tx-fcoe-segmentation: off [fixed]
    fcoe-mtu: off [fixed]
    tx-nocache-copy: on
    loopback: off [fixed]
    rx-fcs: off
    rx-all: off



    fw:/root # ethtool -K eth1 tso off gso off lro off gro off
    Cannot change large-receive-offload


    fw:/root # /sbin/ethtool -k eth1
    Features for eth1:
    rx-checksumming: on
    tx-checksumming: on
            tx-checksum-ipv4: off [fixed]
            tx-checksum-ip-generic: on
            tx-checksum-ipv6: off [fixed]
            tx-checksum-fcoe-crc: off [fixed]
            tx-checksum-sctp: off [fixed]
    scatter-gather: on
            tx-scatter-gather: on
            tx-scatter-gather-fraglist: off [fixed]
    tcp-segmentation-offload: off
            tx-tcp-segmentation: off
            tx-tcp-ecn-segmentation: off [fixed]
            tx-tcp6-segmentation: off
    udp-fragmentation-offload: off [fixed]
    generic-segmentation-offload: off
    generic-receive-offload: off
    large-receive-offload: off [fixed]
    rx-vlan-offload: on
    tx-vlan-offload: on
    ntuple-filters: off [fixed]
    receive-hashing: on
    highdma: on [fixed]
    rx-vlan-filter: on [fixed]
    vlan-challenged: off [fixed]
    tx-lockless: off [fixed]
    netns-local: off [fixed]
    tx-gso-robust: off [fixed]
    tx-fcoe-segmentation: off [fixed]
    fcoe-mtu: off [fixed]
    tx-nocache-copy: on
    loopback: off [fixed]
    rx-fcs: off
    rx-all: off
    fw:/root #
  • Hi, 
    Several of the offloading features were ON on my NIC; I'll update to current beta, disable the offloading, and try to re-test this weekend.

    Barry

    Intel Corporation 82572EI Gigabit Ethernet Controller (rev 06)

    fw:/root # /sbin/ethtool -k eth1
    
    Features for eth1:
    rx-checksumming: on
    tx-checksumming: on
            tx-checksum-ipv4: off [fixed]
            tx-checksum-ip-generic: on
            tx-checksum-ipv6: off [fixed]
            tx-checksum-fcoe-crc: off [fixed]
            tx-checksum-sctp: off [fixed]
    scatter-gather: on
            tx-scatter-gather: on
            tx-scatter-gather-fraglist: off [fixed]
    tcp-segmentation-offload: on
            tx-tcp-segmentation: on
            tx-tcp-ecn-segmentation: off [fixed]
            tx-tcp6-segmentation: on
    udp-fragmentation-offload: off [fixed]
    generic-segmentation-offload: on
    generic-receive-offload: on
    large-receive-offload: off [fixed]
    rx-vlan-offload: on
    tx-vlan-offload: on
    ntuple-filters: off [fixed]
    receive-hashing: on
    highdma: on [fixed]
    rx-vlan-filter: on [fixed]
    vlan-challenged: off [fixed]
    tx-lockless: off [fixed]
    netns-local: off [fixed]
    tx-gso-robust: off [fixed]
    tx-fcoe-segmentation: off [fixed]
    fcoe-mtu: off [fixed]
    tx-nocache-copy: on
    loopback: off [fixed]
    rx-fcs: off
    rx-all: off



    fw:/root # ethtool -K eth1 tso off gso off lro off gro off
    Cannot change large-receive-offload


    fw:/root # /sbin/ethtool -k eth1
    Features for eth1:
    rx-checksumming: on
    tx-checksumming: on
            tx-checksum-ipv4: off [fixed]
            tx-checksum-ip-generic: on
            tx-checksum-ipv6: off [fixed]
            tx-checksum-fcoe-crc: off [fixed]
            tx-checksum-sctp: off [fixed]
    scatter-gather: on
            tx-scatter-gather: on
            tx-scatter-gather-fraglist: off [fixed]
    tcp-segmentation-offload: off
            tx-tcp-segmentation: off
            tx-tcp-ecn-segmentation: off [fixed]
            tx-tcp6-segmentation: off
    udp-fragmentation-offload: off [fixed]
    generic-segmentation-offload: off
    generic-receive-offload: off
    large-receive-offload: off [fixed]
    rx-vlan-offload: on
    tx-vlan-offload: on
    ntuple-filters: off [fixed]
    receive-hashing: on
    highdma: on [fixed]
    rx-vlan-filter: on [fixed]
    vlan-challenged: off [fixed]
    tx-lockless: off [fixed]
    netns-local: off [fixed]
    tx-gso-robust: off [fixed]
    tx-fcoe-segmentation: off [fixed]
    fcoe-mtu: off [fixed]
    tx-nocache-copy: on
    loopback: off [fixed]
    rx-fcs: off
    rx-all: off
    fw:/root #
  • Guys, small question.

    So far the adapters under question are:
    -Intel 82572EI NIC (barry)
    -Intel Corporation 82579LM Gigabit Network Connection (martin)
    -Intel Ethernet Connection I217-V (ian)

    Did u have this problem in 9.1? Thanks,
    Bianca


    We are running 9.108-23 on a ASG120 (Intel 82583V), facing the same problem. Not much load on this Interface (100 Mbit max)

    e1000e 0000:04:00.0 eth1: Detected Hardware Unit Hang:
    
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   TDH                  
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   TDT                  
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   next_to_use          
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   next_to_clean        
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811] buffer_info[next_to_clean]:
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   time_stamp           
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   next_to_watch        
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   jiffies              
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811]   next_to_watch.status 
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811] MAC Status             
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811] PHY Status             
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811] PHY 1000BASE-T Status  
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811] PHY Extended Status    
    2014:02:15-14:00:41 ruhraue-1 kernel: [315204.820811] PCI Status             
  • Hi folks,
    tonight I tried to load the box up with iperf, a failure. Maximum throughput I could get was 800kb/s that was with most functions disabled. The i-217v NIC after being changed from PPPoE to static IP never showed the link as being up. 

    Not sure which box was not being friendly, that will be a series of tests for another day. Last series was using to Dell laptops, this time is was one Dell and a MAC.

    Ian
  • Hi Barry,

    Thanks for the info. Did running traffic work for u after disabling the offloading parameters? 
    Best,
    Bianca

    @Ian, sorry but your posts are confusing. Please just do the steps mentioned in the thread and post the result (the purpose here is not to have the adapter crash). Note that disabling the TSO might lead to a lower throughput.
Share Feedback
×

Submitted a Tech Support Case lately from the Support Portal?