linux_dsm_epyc7002/include/linux/mlx5
Shay Agroskin c2273219ba net/mlx5e: XDP, Inline small packets into the TX MPWQE in XDP xmit flow
Upon high packet rate with multiple CPUs TX workloads, much of the HCA's
resources are spent on prefetching TX descriptors, thus affecting
transmission rates.
This patch comes to mitigate this problem by moving some workload to the
CPU and reducing the HW data prefetch overhead for small packets (<= 256B).

When forwarding packets with XDP, a packet that is smaller
than a certain size (set to ~256 bytes) would be sent inline within
its WQE TX descrptor (mem-copied), when the hardware tx queue is congested
beyond a pre-defined water-mark.

This is added to better utilize the HW resources (which now makes
one less packet data prefetch) and allow better scalability, on the
account of CPU usage (which now 'memcpy's the packet into the WQE).

To load balance between HW and CPU and get max packet rate, we use
watermarks to detect how much the HW is congested and move the work
loads back and forth between HW and CPU.

Performance:
Tested packet rate for UDP 64Byte multi-stream
over two dual port ConnectX-5 100Gbps NICs.
CPU: Intel(R) Xeon(R) CPU E5-2680 v3 @ 2.50GHz

* Tested with hyper-threading disabled

XDP_TX:

|          | before | after   |       |
| 24 rings | 51Mpps | 116Mpps | +126% |
| 1 ring   | 12Mpps | 12Mpps  | same  |

XDP_REDIRECT:

** Below is the transmit rate, not the redirection rate
which might be larger, and is not affected by this patch.

|          | before  | after   |      |
| 32 rings | 64Mpps  | 92Mpps  | +43% |
| 1 ring   | 6.4Mpps | 6.4Mpps | same |

As we can see, feature significantly improves scaling, without
hurting single ring performance.

Signed-off-by: Shay Agroskin <shayag@mellanox.com>
Signed-off-by: Tariq Toukan <tariqt@mellanox.com>
Signed-off-by: Saeed Mahameed <saeedm@mellanox.com>
2019-04-23 12:09:20 -07:00
..
accel.h net/mlx5: IPSec, Add support for ESN 2018-03-07 15:54:36 -08:00
cmd.h
cq.h net/mlx5: Remove spinlock support from mlx5_write64 2019-04-02 12:49:37 -07:00
device.h net/mlx5: Handle event of power detection in the PCIE slot 2019-04-05 14:10:32 -07:00
doorbell.h net/mlx5: Remove spinlock support from mlx5_write64 2019-04-02 12:49:37 -07:00
driver.h Linux 5.1-rc1 2019-04-22 15:25:39 -07:00
eq.h net/mlx5: EQ, Introduce atomic notifier chain subscription API 2018-11-26 13:39:33 -08:00
eswitch.h net/mlx5: E-Switch, Centralize repersentor reg/unreg to eswitch driver 2019-02-15 17:25:58 -08:00
fs_helpers.h {net,IB}/mlx5: Add ipsec helper 2018-04-04 12:06:27 -06:00
fs.h net/mlx5: E-Switch, add a new prio to be used by the RDMA side 2019-04-10 09:29:11 +03:00
mlx5_ifc_fpga.h net/mlx5: Accel, add TLS rx offload routines 2018-07-16 00:13:11 -07:00
mlx5_ifc.h Linux 5.1-rc1 2019-04-22 15:25:39 -07:00
port.h net/mlx5e: Fix GRE key by controlling port tunnel entropy calculation 2019-02-22 13:38:23 -08:00
qp.h net/mlx5e: XDP, Inline small packets into the TX MPWQE in XDP xmit flow 2019-04-23 12:09:20 -07:00
transobj.h net/mlx5: Move SRQ functions to RDMA part 2018-12-04 09:14:30 +02:00
vport.h net/mlx5: E-Switch, Consider ECPF vport depends on eswitch ownership 2019-02-15 17:25:58 -08:00