2008-10-23 05:47:49 +07:00
|
|
|
/*
|
|
|
|
* Copyright (c) 2007 Mellanox Technologies. All rights reserved.
|
|
|
|
*
|
|
|
|
* This software is available to you under a choice of one of two
|
|
|
|
* licenses. You may choose to be licensed under the terms of the GNU
|
|
|
|
* General Public License (GPL) Version 2, available from the file
|
|
|
|
* COPYING in the main directory of this source tree, or the
|
|
|
|
* OpenIB.org BSD license below:
|
|
|
|
*
|
|
|
|
* Redistribution and use in source and binary forms, with or
|
|
|
|
* without modification, are permitted provided that the following
|
|
|
|
* conditions are met:
|
|
|
|
*
|
|
|
|
* - Redistributions of source code must retain the above
|
|
|
|
* copyright notice, this list of conditions and the following
|
|
|
|
* disclaimer.
|
|
|
|
*
|
|
|
|
* - Redistributions in binary form must reproduce the above
|
|
|
|
* copyright notice, this list of conditions and the following
|
|
|
|
* disclaimer in the documentation and/or other materials
|
|
|
|
* provided with the distribution.
|
|
|
|
*
|
|
|
|
* THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
|
|
|
|
* EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
|
|
|
|
* MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
|
|
|
|
* NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS
|
|
|
|
* BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN
|
|
|
|
* ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN
|
|
|
|
* CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
|
|
* SOFTWARE.
|
|
|
|
*
|
|
|
|
*/
|
|
|
|
|
2016-07-20 02:16:50 +07:00
|
|
|
#include <linux/bpf.h>
|
2008-10-23 05:47:49 +07:00
|
|
|
#include <linux/etherdevice.h>
|
|
|
|
#include <linux/tcp.h>
|
|
|
|
#include <linux/if_vlan.h>
|
|
|
|
#include <linux/delay.h>
|
include cleanup: Update gfp.h and slab.h includes to prepare for breaking implicit slab.h inclusion from percpu.h
percpu.h is included by sched.h and module.h and thus ends up being
included when building most .c files. percpu.h includes slab.h which
in turn includes gfp.h making everything defined by the two files
universally available and complicating inclusion dependencies.
percpu.h -> slab.h dependency is about to be removed. Prepare for
this change by updating users of gfp and slab facilities include those
headers directly instead of assuming availability. As this conversion
needs to touch large number of source files, the following script is
used as the basis of conversion.
http://userweb.kernel.org/~tj/misc/slabh-sweep.py
The script does the followings.
* Scan files for gfp and slab usages and update includes such that
only the necessary includes are there. ie. if only gfp is used,
gfp.h, if slab is used, slab.h.
* When the script inserts a new include, it looks at the include
blocks and try to put the new include such that its order conforms
to its surrounding. It's put in the include block which contains
core kernel includes, in the same order that the rest are ordered -
alphabetical, Christmas tree, rev-Xmas-tree or at the end if there
doesn't seem to be any matching order.
* If the script can't find a place to put a new include (mostly
because the file doesn't have fitting include block), it prints out
an error message indicating which .h file needs to be added to the
file.
The conversion was done in the following steps.
1. The initial automatic conversion of all .c files updated slightly
over 4000 files, deleting around 700 includes and adding ~480 gfp.h
and ~3000 slab.h inclusions. The script emitted errors for ~400
files.
2. Each error was manually checked. Some didn't need the inclusion,
some needed manual addition while adding it to implementation .h or
embedding .c file was more appropriate for others. This step added
inclusions to around 150 files.
3. The script was run again and the output was compared to the edits
from #2 to make sure no file was left behind.
4. Several build tests were done and a couple of problems were fixed.
e.g. lib/decompress_*.c used malloc/free() wrappers around slab
APIs requiring slab.h to be added manually.
5. The script was run on all .h files but without automatically
editing them as sprinkling gfp.h and slab.h inclusions around .h
files could easily lead to inclusion dependency hell. Most gfp.h
inclusion directives were ignored as stuff from gfp.h was usually
wildly available and often used in preprocessor macros. Each
slab.h inclusion directive was examined and added manually as
necessary.
6. percpu.h was updated not to include slab.h.
7. Build test were done on the following configurations and failures
were fixed. CONFIG_GCOV_KERNEL was turned off for all tests (as my
distributed build env didn't work with gcov compiles) and a few
more options had to be turned off depending on archs to make things
build (like ipr on powerpc/64 which failed due to missing writeq).
* x86 and x86_64 UP and SMP allmodconfig and a custom test config.
* powerpc and powerpc64 SMP allmodconfig
* sparc and sparc64 SMP allmodconfig
* ia64 SMP allmodconfig
* s390 SMP allmodconfig
* alpha SMP allmodconfig
* um on x86_64 SMP allmodconfig
8. percpu.h modifications were reverted so that it could be applied as
a separate patch and serve as bisection point.
Given the fact that I had only a couple of failures from tests on step
6, I'm fairly confident about the coverage of this conversion patch.
If there is a breakage, it's likely to be something in one of the arch
headers which should be easily discoverable easily on most builds of
the specific arch.
Signed-off-by: Tejun Heo <tj@kernel.org>
Guess-its-ok-by: Christoph Lameter <cl@linux-foundation.org>
Cc: Ingo Molnar <mingo@redhat.com>
Cc: Lee Schermerhorn <Lee.Schermerhorn@hp.com>
2010-03-24 15:04:11 +07:00
|
|
|
#include <linux/slab.h>
|
2012-07-19 05:33:52 +07:00
|
|
|
#include <linux/hash.h>
|
|
|
|
#include <net/ip.h>
|
2013-07-10 21:13:17 +07:00
|
|
|
#include <net/busy_poll.h>
|
2014-03-27 19:02:04 +07:00
|
|
|
#include <net/vxlan.h>
|
2016-02-26 23:32:24 +07:00
|
|
|
#include <net/devlink.h>
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
#include <linux/mlx4/driver.h>
|
|
|
|
#include <linux/mlx4/device.h>
|
|
|
|
#include <linux/mlx4/cmd.h>
|
|
|
|
#include <linux/mlx4/cq.h>
|
|
|
|
|
|
|
|
#include "mlx4_en.h"
|
|
|
|
#include "en_port.h"
|
|
|
|
|
2012-12-02 10:49:23 +07:00
|
|
|
int mlx4_en_setup_tc(struct net_device *dev, u8 up)
|
2012-04-05 04:33:27 +07:00
|
|
|
{
|
2012-05-17 07:58:10 +07:00
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
int i;
|
2012-12-02 10:49:23 +07:00
|
|
|
unsigned int offset = 0;
|
2012-05-17 07:58:10 +07:00
|
|
|
|
|
|
|
if (up && up != MLX4_EN_NUM_UP)
|
2012-04-05 04:33:27 +07:00
|
|
|
return -EINVAL;
|
|
|
|
|
2012-05-17 07:58:10 +07:00
|
|
|
netdev_set_num_tc(dev, up);
|
|
|
|
|
|
|
|
/* Partition Tx queues evenly amongst UP's */
|
|
|
|
for (i = 0; i < up; i++) {
|
2012-12-02 10:49:23 +07:00
|
|
|
netdev_set_tc_queue(dev, i, priv->num_tx_rings_p_up, offset);
|
|
|
|
offset += priv->num_tx_rings_p_up;
|
2012-05-17 07:58:10 +07:00
|
|
|
}
|
|
|
|
|
2016-06-21 16:43:59 +07:00
|
|
|
#ifdef CONFIG_MLX4_EN_DCB
|
|
|
|
if (!mlx4_is_slave(priv->mdev->dev)) {
|
|
|
|
if (up) {
|
2016-09-11 14:56:19 +07:00
|
|
|
if (priv->dcbx_cap)
|
|
|
|
priv->flags |= MLX4_EN_FLAG_DCB_ENABLED;
|
2016-06-21 16:43:59 +07:00
|
|
|
} else {
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_DCB_ENABLED;
|
2016-09-11 14:56:19 +07:00
|
|
|
priv->cee_config.pfc_state = false;
|
2016-06-21 16:43:59 +07:00
|
|
|
}
|
|
|
|
}
|
|
|
|
#endif /* CONFIG_MLX4_EN_DCB */
|
|
|
|
|
2012-04-05 04:33:27 +07:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2016-02-17 12:16:43 +07:00
|
|
|
static int __mlx4_en_setup_tc(struct net_device *dev, u32 handle, __be16 proto,
|
|
|
|
struct tc_to_netdev *tc)
|
2016-02-17 12:16:15 +07:00
|
|
|
{
|
2016-03-01 02:26:13 +07:00
|
|
|
if (tc->type != TC_SETUP_MQPRIO)
|
2016-02-17 12:16:15 +07:00
|
|
|
return -EINVAL;
|
|
|
|
|
2016-02-17 12:16:43 +07:00
|
|
|
return mlx4_en_setup_tc(dev, tc->tc);
|
2016-02-17 12:16:15 +07:00
|
|
|
}
|
|
|
|
|
2012-07-19 05:33:52 +07:00
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
|
|
|
|
|
|
|
struct mlx4_en_filter {
|
|
|
|
struct list_head next;
|
|
|
|
struct work_struct work;
|
|
|
|
|
2013-11-07 17:19:49 +07:00
|
|
|
u8 ip_proto;
|
2012-07-19 05:33:52 +07:00
|
|
|
__be32 src_ip;
|
|
|
|
__be32 dst_ip;
|
|
|
|
__be16 src_port;
|
|
|
|
__be16 dst_port;
|
|
|
|
|
|
|
|
int rxq_index;
|
|
|
|
struct mlx4_en_priv *priv;
|
|
|
|
u32 flow_id; /* RFS infrastructure id */
|
|
|
|
int id; /* mlx4_en driver id */
|
|
|
|
u64 reg_id; /* Flow steering API id */
|
|
|
|
u8 activated; /* Used to prevent expiry before filter
|
|
|
|
* is attached
|
|
|
|
*/
|
|
|
|
struct hlist_node filter_chain;
|
|
|
|
};
|
|
|
|
|
|
|
|
static void mlx4_en_filter_rfs_expire(struct mlx4_en_priv *priv);
|
|
|
|
|
2013-11-07 17:19:49 +07:00
|
|
|
static enum mlx4_net_trans_rule_id mlx4_ip_proto_to_trans_rule_id(u8 ip_proto)
|
|
|
|
{
|
|
|
|
switch (ip_proto) {
|
|
|
|
case IPPROTO_UDP:
|
|
|
|
return MLX4_NET_TRANS_RULE_ID_UDP;
|
|
|
|
case IPPROTO_TCP:
|
|
|
|
return MLX4_NET_TRANS_RULE_ID_TCP;
|
|
|
|
default:
|
2014-05-14 16:15:16 +07:00
|
|
|
return MLX4_NET_TRANS_RULE_NUM;
|
2013-11-07 17:19:49 +07:00
|
|
|
}
|
|
|
|
};
|
|
|
|
|
2012-07-19 05:33:52 +07:00
|
|
|
static void mlx4_en_filter_work(struct work_struct *work)
|
|
|
|
{
|
|
|
|
struct mlx4_en_filter *filter = container_of(work,
|
|
|
|
struct mlx4_en_filter,
|
|
|
|
work);
|
|
|
|
struct mlx4_en_priv *priv = filter->priv;
|
2013-11-07 17:19:49 +07:00
|
|
|
struct mlx4_spec_list spec_tcp_udp = {
|
|
|
|
.id = mlx4_ip_proto_to_trans_rule_id(filter->ip_proto),
|
2012-07-19 05:33:52 +07:00
|
|
|
{
|
|
|
|
.tcp_udp = {
|
|
|
|
.dst_port = filter->dst_port,
|
|
|
|
.dst_port_msk = (__force __be16)-1,
|
|
|
|
.src_port = filter->src_port,
|
|
|
|
.src_port_msk = (__force __be16)-1,
|
|
|
|
},
|
|
|
|
},
|
|
|
|
};
|
|
|
|
struct mlx4_spec_list spec_ip = {
|
|
|
|
.id = MLX4_NET_TRANS_RULE_ID_IPV4,
|
|
|
|
{
|
|
|
|
.ipv4 = {
|
|
|
|
.dst_ip = filter->dst_ip,
|
|
|
|
.dst_ip_msk = (__force __be32)-1,
|
|
|
|
.src_ip = filter->src_ip,
|
|
|
|
.src_ip_msk = (__force __be32)-1,
|
|
|
|
},
|
|
|
|
},
|
|
|
|
};
|
|
|
|
struct mlx4_spec_list spec_eth = {
|
|
|
|
.id = MLX4_NET_TRANS_RULE_ID_ETH,
|
|
|
|
};
|
|
|
|
struct mlx4_net_trans_rule rule = {
|
|
|
|
.list = LIST_HEAD_INIT(rule.list),
|
|
|
|
.queue_mode = MLX4_NET_TRANS_Q_LIFO,
|
|
|
|
.exclusive = 1,
|
|
|
|
.allow_loopback = 1,
|
2013-04-24 20:58:45 +07:00
|
|
|
.promisc_mode = MLX4_FS_REGULAR,
|
2012-07-19 05:33:52 +07:00
|
|
|
.port = priv->port,
|
|
|
|
.priority = MLX4_DOMAIN_RFS,
|
|
|
|
};
|
|
|
|
int rc;
|
|
|
|
__be64 mac_mask = cpu_to_be64(MLX4_MAC_MASK << 16);
|
|
|
|
|
2014-05-14 16:15:16 +07:00
|
|
|
if (spec_tcp_udp.id >= MLX4_NET_TRANS_RULE_NUM) {
|
2013-11-07 17:19:49 +07:00
|
|
|
en_warn(priv, "RFS: ignoring unsupported ip protocol (%d)\n",
|
|
|
|
filter->ip_proto);
|
|
|
|
goto ignore;
|
|
|
|
}
|
2012-07-19 05:33:52 +07:00
|
|
|
list_add_tail(&spec_eth.list, &rule.list);
|
|
|
|
list_add_tail(&spec_ip.list, &rule.list);
|
2013-11-07 17:19:49 +07:00
|
|
|
list_add_tail(&spec_tcp_udp.list, &rule.list);
|
2012-07-19 05:33:52 +07:00
|
|
|
|
|
|
|
rule.qpn = priv->rss_map.qps[filter->rxq_index].qpn;
|
2013-02-07 09:25:20 +07:00
|
|
|
memcpy(spec_eth.eth.dst_mac, priv->dev->dev_addr, ETH_ALEN);
|
2012-07-19 05:33:52 +07:00
|
|
|
memcpy(spec_eth.eth.dst_mac_msk, &mac_mask, ETH_ALEN);
|
|
|
|
|
|
|
|
filter->activated = 0;
|
|
|
|
|
|
|
|
if (filter->reg_id) {
|
|
|
|
rc = mlx4_flow_detach(priv->mdev->dev, filter->reg_id);
|
|
|
|
if (rc && rc != -ENOENT)
|
|
|
|
en_err(priv, "Error detaching flow. rc = %d\n", rc);
|
|
|
|
}
|
|
|
|
|
|
|
|
rc = mlx4_flow_attach(priv->mdev->dev, &rule, &filter->reg_id);
|
|
|
|
if (rc)
|
|
|
|
en_err(priv, "Error attaching flow. err = %d\n", rc);
|
|
|
|
|
2013-11-07 17:19:49 +07:00
|
|
|
ignore:
|
2012-07-19 05:33:52 +07:00
|
|
|
mlx4_en_filter_rfs_expire(priv);
|
|
|
|
|
|
|
|
filter->activated = 1;
|
|
|
|
}
|
|
|
|
|
|
|
|
static inline struct hlist_head *
|
|
|
|
filter_hash_bucket(struct mlx4_en_priv *priv, __be32 src_ip, __be32 dst_ip,
|
|
|
|
__be16 src_port, __be16 dst_port)
|
|
|
|
{
|
|
|
|
unsigned long l;
|
|
|
|
int bucket_idx;
|
|
|
|
|
|
|
|
l = (__force unsigned long)src_port |
|
|
|
|
((__force unsigned long)dst_port << 2);
|
|
|
|
l ^= (__force unsigned long)(src_ip ^ dst_ip);
|
|
|
|
|
|
|
|
bucket_idx = hash_long(l, MLX4_EN_FILTER_HASH_SHIFT);
|
|
|
|
|
|
|
|
return &priv->filter_hash[bucket_idx];
|
|
|
|
}
|
|
|
|
|
|
|
|
static struct mlx4_en_filter *
|
|
|
|
mlx4_en_filter_alloc(struct mlx4_en_priv *priv, int rxq_index, __be32 src_ip,
|
2013-11-07 17:19:49 +07:00
|
|
|
__be32 dst_ip, u8 ip_proto, __be16 src_port,
|
|
|
|
__be16 dst_port, u32 flow_id)
|
2012-07-19 05:33:52 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_filter *filter = NULL;
|
|
|
|
|
|
|
|
filter = kzalloc(sizeof(struct mlx4_en_filter), GFP_ATOMIC);
|
|
|
|
if (!filter)
|
|
|
|
return NULL;
|
|
|
|
|
|
|
|
filter->priv = priv;
|
|
|
|
filter->rxq_index = rxq_index;
|
|
|
|
INIT_WORK(&filter->work, mlx4_en_filter_work);
|
|
|
|
|
|
|
|
filter->src_ip = src_ip;
|
|
|
|
filter->dst_ip = dst_ip;
|
2013-11-07 17:19:49 +07:00
|
|
|
filter->ip_proto = ip_proto;
|
2012-07-19 05:33:52 +07:00
|
|
|
filter->src_port = src_port;
|
|
|
|
filter->dst_port = dst_port;
|
|
|
|
|
|
|
|
filter->flow_id = flow_id;
|
|
|
|
|
2012-07-26 04:21:16 +07:00
|
|
|
filter->id = priv->last_filter_id++ % RPS_NO_FILTER;
|
2012-07-19 05:33:52 +07:00
|
|
|
|
|
|
|
list_add_tail(&filter->next, &priv->filters);
|
|
|
|
hlist_add_head(&filter->filter_chain,
|
|
|
|
filter_hash_bucket(priv, src_ip, dst_ip, src_port,
|
|
|
|
dst_port));
|
|
|
|
|
|
|
|
return filter;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_filter_free(struct mlx4_en_filter *filter)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = filter->priv;
|
|
|
|
int rc;
|
|
|
|
|
|
|
|
list_del(&filter->next);
|
|
|
|
|
|
|
|
rc = mlx4_flow_detach(priv->mdev->dev, filter->reg_id);
|
|
|
|
if (rc && rc != -ENOENT)
|
|
|
|
en_err(priv, "Error detaching flow. rc = %d\n", rc);
|
|
|
|
|
|
|
|
kfree(filter);
|
|
|
|
}
|
|
|
|
|
|
|
|
static inline struct mlx4_en_filter *
|
|
|
|
mlx4_en_filter_find(struct mlx4_en_priv *priv, __be32 src_ip, __be32 dst_ip,
|
2013-11-07 17:19:49 +07:00
|
|
|
u8 ip_proto, __be16 src_port, __be16 dst_port)
|
2012-07-19 05:33:52 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_filter *filter;
|
|
|
|
struct mlx4_en_filter *ret = NULL;
|
|
|
|
|
hlist: drop the node parameter from iterators
I'm not sure why, but the hlist for each entry iterators were conceived
list_for_each_entry(pos, head, member)
The hlist ones were greedy and wanted an extra parameter:
hlist_for_each_entry(tpos, pos, head, member)
Why did they need an extra pos parameter? I'm not quite sure. Not only
they don't really need it, it also prevents the iterator from looking
exactly like the list iterator, which is unfortunate.
Besides the semantic patch, there was some manual work required:
- Fix up the actual hlist iterators in linux/list.h
- Fix up the declaration of other iterators based on the hlist ones.
- A very small amount of places were using the 'node' parameter, this
was modified to use 'obj->member' instead.
- Coccinelle didn't handle the hlist_for_each_entry_safe iterator
properly, so those had to be fixed up manually.
The semantic patch which is mostly the work of Peter Senna Tschudin is here:
@@
iterator name hlist_for_each_entry, hlist_for_each_entry_continue, hlist_for_each_entry_from, hlist_for_each_entry_rcu, hlist_for_each_entry_rcu_bh, hlist_for_each_entry_continue_rcu_bh, for_each_busy_worker, ax25_uid_for_each, ax25_for_each, inet_bind_bucket_for_each, sctp_for_each_hentry, sk_for_each, sk_for_each_rcu, sk_for_each_from, sk_for_each_safe, sk_for_each_bound, hlist_for_each_entry_safe, hlist_for_each_entry_continue_rcu, nr_neigh_for_each, nr_neigh_for_each_safe, nr_node_for_each, nr_node_for_each_safe, for_each_gfn_indirect_valid_sp, for_each_gfn_sp, for_each_host;
type T;
expression a,c,d,e;
identifier b;
statement S;
@@
-T b;
<+... when != b
(
hlist_for_each_entry(a,
- b,
c, d) S
|
hlist_for_each_entry_continue(a,
- b,
c) S
|
hlist_for_each_entry_from(a,
- b,
c) S
|
hlist_for_each_entry_rcu(a,
- b,
c, d) S
|
hlist_for_each_entry_rcu_bh(a,
- b,
c, d) S
|
hlist_for_each_entry_continue_rcu_bh(a,
- b,
c) S
|
for_each_busy_worker(a, c,
- b,
d) S
|
ax25_uid_for_each(a,
- b,
c) S
|
ax25_for_each(a,
- b,
c) S
|
inet_bind_bucket_for_each(a,
- b,
c) S
|
sctp_for_each_hentry(a,
- b,
c) S
|
sk_for_each(a,
- b,
c) S
|
sk_for_each_rcu(a,
- b,
c) S
|
sk_for_each_from
-(a, b)
+(a)
S
+ sk_for_each_from(a) S
|
sk_for_each_safe(a,
- b,
c, d) S
|
sk_for_each_bound(a,
- b,
c) S
|
hlist_for_each_entry_safe(a,
- b,
c, d, e) S
|
hlist_for_each_entry_continue_rcu(a,
- b,
c) S
|
nr_neigh_for_each(a,
- b,
c) S
|
nr_neigh_for_each_safe(a,
- b,
c, d) S
|
nr_node_for_each(a,
- b,
c) S
|
nr_node_for_each_safe(a,
- b,
c, d) S
|
- for_each_gfn_sp(a, c, d, b) S
+ for_each_gfn_sp(a, c, d) S
|
- for_each_gfn_indirect_valid_sp(a, c, d, b) S
+ for_each_gfn_indirect_valid_sp(a, c, d) S
|
for_each_host(a,
- b,
c) S
|
for_each_host_safe(a,
- b,
c, d) S
|
for_each_mesh_entry(a,
- b,
c, d) S
)
...+>
[akpm@linux-foundation.org: drop bogus change from net/ipv4/raw.c]
[akpm@linux-foundation.org: drop bogus hunk from net/ipv6/raw.c]
[akpm@linux-foundation.org: checkpatch fixes]
[akpm@linux-foundation.org: fix warnings]
[akpm@linux-foudnation.org: redo intrusive kvm changes]
Tested-by: Peter Senna Tschudin <peter.senna@gmail.com>
Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: Sasha Levin <sasha.levin@oracle.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-28 08:06:00 +07:00
|
|
|
hlist_for_each_entry(filter,
|
2012-07-19 05:33:52 +07:00
|
|
|
filter_hash_bucket(priv, src_ip, dst_ip,
|
|
|
|
src_port, dst_port),
|
|
|
|
filter_chain) {
|
|
|
|
if (filter->src_ip == src_ip &&
|
|
|
|
filter->dst_ip == dst_ip &&
|
2013-11-07 17:19:49 +07:00
|
|
|
filter->ip_proto == ip_proto &&
|
2012-07-19 05:33:52 +07:00
|
|
|
filter->src_port == src_port &&
|
|
|
|
filter->dst_port == dst_port) {
|
|
|
|
ret = filter;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int
|
|
|
|
mlx4_en_filter_rfs(struct net_device *net_dev, const struct sk_buff *skb,
|
|
|
|
u16 rxq_index, u32 flow_id)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(net_dev);
|
|
|
|
struct mlx4_en_filter *filter;
|
|
|
|
const struct iphdr *ip;
|
|
|
|
const __be16 *ports;
|
2013-11-07 17:19:49 +07:00
|
|
|
u8 ip_proto;
|
2012-07-19 05:33:52 +07:00
|
|
|
__be32 src_ip;
|
|
|
|
__be32 dst_ip;
|
|
|
|
__be16 src_port;
|
|
|
|
__be16 dst_port;
|
|
|
|
int nhoff = skb_network_offset(skb);
|
|
|
|
int ret = 0;
|
|
|
|
|
|
|
|
if (skb->protocol != htons(ETH_P_IP))
|
|
|
|
return -EPROTONOSUPPORT;
|
|
|
|
|
|
|
|
ip = (const struct iphdr *)(skb->data + nhoff);
|
|
|
|
if (ip_is_fragment(ip))
|
|
|
|
return -EPROTONOSUPPORT;
|
|
|
|
|
2013-11-07 17:19:49 +07:00
|
|
|
if ((ip->protocol != IPPROTO_TCP) && (ip->protocol != IPPROTO_UDP))
|
|
|
|
return -EPROTONOSUPPORT;
|
2012-07-19 05:33:52 +07:00
|
|
|
ports = (const __be16 *)(skb->data + nhoff + 4 * ip->ihl);
|
|
|
|
|
2013-11-07 17:19:49 +07:00
|
|
|
ip_proto = ip->protocol;
|
2012-07-19 05:33:52 +07:00
|
|
|
src_ip = ip->saddr;
|
|
|
|
dst_ip = ip->daddr;
|
|
|
|
src_port = ports[0];
|
|
|
|
dst_port = ports[1];
|
|
|
|
|
|
|
|
spin_lock_bh(&priv->filters_lock);
|
2013-11-07 17:19:49 +07:00
|
|
|
filter = mlx4_en_filter_find(priv, src_ip, dst_ip, ip_proto,
|
|
|
|
src_port, dst_port);
|
2012-07-19 05:33:52 +07:00
|
|
|
if (filter) {
|
|
|
|
if (filter->rxq_index == rxq_index)
|
|
|
|
goto out;
|
|
|
|
|
|
|
|
filter->rxq_index = rxq_index;
|
|
|
|
} else {
|
|
|
|
filter = mlx4_en_filter_alloc(priv, rxq_index,
|
2013-11-07 17:19:49 +07:00
|
|
|
src_ip, dst_ip, ip_proto,
|
2012-07-19 05:33:52 +07:00
|
|
|
src_port, dst_port, flow_id);
|
|
|
|
if (!filter) {
|
|
|
|
ret = -ENOMEM;
|
|
|
|
goto err;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
queue_work(priv->mdev->workqueue, &filter->work);
|
|
|
|
|
|
|
|
out:
|
|
|
|
ret = filter->id;
|
|
|
|
err:
|
|
|
|
spin_unlock_bh(&priv->filters_lock);
|
|
|
|
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2013-11-07 17:19:52 +07:00
|
|
|
void mlx4_en_cleanup_filters(struct mlx4_en_priv *priv)
|
2012-07-19 05:33:52 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_filter *filter, *tmp;
|
|
|
|
LIST_HEAD(del_list);
|
|
|
|
|
|
|
|
spin_lock_bh(&priv->filters_lock);
|
|
|
|
list_for_each_entry_safe(filter, tmp, &priv->filters, next) {
|
|
|
|
list_move(&filter->next, &del_list);
|
|
|
|
hlist_del(&filter->filter_chain);
|
|
|
|
}
|
|
|
|
spin_unlock_bh(&priv->filters_lock);
|
|
|
|
|
|
|
|
list_for_each_entry_safe(filter, tmp, &del_list, next) {
|
|
|
|
cancel_work_sync(&filter->work);
|
|
|
|
mlx4_en_filter_free(filter);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_filter_rfs_expire(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
struct mlx4_en_filter *filter = NULL, *tmp, *last_filter = NULL;
|
|
|
|
LIST_HEAD(del_list);
|
|
|
|
int i = 0;
|
|
|
|
|
|
|
|
spin_lock_bh(&priv->filters_lock);
|
|
|
|
list_for_each_entry_safe(filter, tmp, &priv->filters, next) {
|
|
|
|
if (i > MLX4_EN_FILTER_EXPIRY_QUOTA)
|
|
|
|
break;
|
|
|
|
|
|
|
|
if (filter->activated &&
|
|
|
|
!work_pending(&filter->work) &&
|
|
|
|
rps_may_expire_flow(priv->dev,
|
|
|
|
filter->rxq_index, filter->flow_id,
|
|
|
|
filter->id)) {
|
|
|
|
list_move(&filter->next, &del_list);
|
|
|
|
hlist_del(&filter->filter_chain);
|
|
|
|
} else
|
|
|
|
last_filter = filter;
|
|
|
|
|
|
|
|
i++;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (last_filter && (&last_filter->next != priv->filters.next))
|
|
|
|
list_move(&priv->filters, &last_filter->next);
|
|
|
|
|
|
|
|
spin_unlock_bh(&priv->filters_lock);
|
|
|
|
|
|
|
|
list_for_each_entry_safe(filter, tmp, &del_list, next)
|
|
|
|
mlx4_en_filter_free(filter);
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2013-04-19 09:04:28 +07:00
|
|
|
static int mlx4_en_vlan_rx_add_vid(struct net_device *dev,
|
|
|
|
__be16 proto, u16 vid)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
int err;
|
2010-08-26 21:19:22 +07:00
|
|
|
int idx;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2011-07-20 11:54:22 +07:00
|
|
|
en_dbg(HW, priv, "adding VLAN:%d\n", vid);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2011-07-20 11:54:22 +07:00
|
|
|
set_bit(vid, priv->active_vlans);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/* Add VID to port VLAN filter */
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (mdev->device_up && priv->port_up) {
|
2011-07-20 11:54:22 +07:00
|
|
|
err = mlx4_SET_VLAN_FLTR(mdev->dev, priv);
|
2016-06-21 18:20:02 +07:00
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed configuring VLAN filter\n");
|
2016-06-21 18:20:02 +07:00
|
|
|
goto out;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2016-06-21 18:20:02 +07:00
|
|
|
err = mlx4_register_vlan(mdev->dev, priv->port, vid, &idx);
|
|
|
|
if (err)
|
|
|
|
en_dbg(HW, priv, "Failed adding vlan %d\n", vid);
|
2010-08-26 21:19:22 +07:00
|
|
|
|
2016-06-21 18:20:02 +07:00
|
|
|
out:
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
return err;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2013-04-19 09:04:28 +07:00
|
|
|
static int mlx4_en_vlan_rx_kill_vid(struct net_device *dev,
|
|
|
|
__be16 proto, u16 vid)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
2016-06-21 18:20:02 +07:00
|
|
|
int err = 0;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2011-07-20 11:54:22 +07:00
|
|
|
en_dbg(HW, priv, "Killing VID:%d\n", vid);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2011-07-20 11:54:22 +07:00
|
|
|
clear_bit(vid, priv->active_vlans);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/* Remove VID from port VLAN filter */
|
|
|
|
mutex_lock(&mdev->state_lock);
|
2013-11-03 15:03:19 +07:00
|
|
|
mlx4_unregister_vlan(mdev->dev, priv->port, vid);
|
2010-08-26 21:19:22 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
if (mdev->device_up && priv->port_up) {
|
2011-07-20 11:54:22 +07:00
|
|
|
err = mlx4_SET_VLAN_FLTR(mdev->dev, priv);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err)
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed configuring VLAN filter\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
2011-12-09 07:52:37 +07:00
|
|
|
|
2016-06-21 18:20:02 +07:00
|
|
|
return err;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:20 +07:00
|
|
|
static void mlx4_en_u64_to_mac(unsigned char dst_mac[ETH_ALEN + 2], u64 src_mac)
|
|
|
|
{
|
2013-04-02 20:49:45 +07:00
|
|
|
int i;
|
|
|
|
for (i = ETH_ALEN - 1; i >= 0; --i) {
|
2013-02-07 09:25:20 +07:00
|
|
|
dst_mac[i] = src_mac & 0xff;
|
|
|
|
src_mac >>= 8;
|
|
|
|
}
|
|
|
|
memset(&dst_mac[ETH_ALEN], 0, 2);
|
|
|
|
}
|
|
|
|
|
2013-12-23 21:09:44 +07:00
|
|
|
|
|
|
|
static int mlx4_en_tunnel_steer_add(struct mlx4_en_priv *priv, unsigned char *addr,
|
|
|
|
int qpn, u64 *reg_id)
|
|
|
|
{
|
|
|
|
int err;
|
|
|
|
|
2015-01-15 20:28:54 +07:00
|
|
|
if (priv->mdev->dev->caps.tunnel_offload_mode != MLX4_TUNNEL_OFFLOAD_MODE_VXLAN ||
|
|
|
|
priv->mdev->dev->caps.dmfs_high_steer_mode == MLX4_STEERING_DMFS_A0_STATIC)
|
2013-12-23 21:09:44 +07:00
|
|
|
return 0; /* do nothing */
|
|
|
|
|
2014-08-27 20:47:48 +07:00
|
|
|
err = mlx4_tunnel_steer_add(priv->mdev->dev, addr, priv->port, qpn,
|
|
|
|
MLX4_DOMAIN_NIC, reg_id);
|
2013-12-23 21:09:44 +07:00
|
|
|
if (err) {
|
|
|
|
en_err(priv, "failed to add vxlan steering rule, err %d\n", err);
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
en_dbg(DRV, priv, "added vxlan steering rule, mac %pM reg_id %llx\n", addr, *reg_id);
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
|
2013-02-07 09:25:22 +07:00
|
|
|
static int mlx4_en_uc_steer_add(struct mlx4_en_priv *priv,
|
|
|
|
unsigned char *mac, int *qpn, u64 *reg_id)
|
|
|
|
{
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct mlx4_dev *dev = mdev->dev;
|
|
|
|
int err;
|
|
|
|
|
|
|
|
switch (dev->caps.steering_mode) {
|
|
|
|
case MLX4_STEERING_MODE_B0: {
|
|
|
|
struct mlx4_qp qp;
|
|
|
|
u8 gid[16] = {0};
|
|
|
|
|
|
|
|
qp.qpn = *qpn;
|
|
|
|
memcpy(&gid[10], mac, ETH_ALEN);
|
|
|
|
gid[5] = priv->port;
|
|
|
|
|
|
|
|
err = mlx4_unicast_attach(dev, &qp, gid, 0, MLX4_PROT_ETH);
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
case MLX4_STEERING_MODE_DEVICE_MANAGED: {
|
|
|
|
struct mlx4_spec_list spec_eth = { {NULL} };
|
|
|
|
__be64 mac_mask = cpu_to_be64(MLX4_MAC_MASK << 16);
|
|
|
|
|
|
|
|
struct mlx4_net_trans_rule rule = {
|
|
|
|
.queue_mode = MLX4_NET_TRANS_Q_FIFO,
|
|
|
|
.exclusive = 0,
|
|
|
|
.allow_loopback = 1,
|
2013-04-24 20:58:45 +07:00
|
|
|
.promisc_mode = MLX4_FS_REGULAR,
|
2013-02-07 09:25:22 +07:00
|
|
|
.priority = MLX4_DOMAIN_NIC,
|
|
|
|
};
|
|
|
|
|
|
|
|
rule.port = priv->port;
|
|
|
|
rule.qpn = *qpn;
|
|
|
|
INIT_LIST_HEAD(&rule.list);
|
|
|
|
|
|
|
|
spec_eth.id = MLX4_NET_TRANS_RULE_ID_ETH;
|
|
|
|
memcpy(spec_eth.eth.dst_mac, mac, ETH_ALEN);
|
|
|
|
memcpy(spec_eth.eth.dst_mac_msk, &mac_mask, ETH_ALEN);
|
|
|
|
list_add_tail(&spec_eth.list, &rule.list);
|
|
|
|
|
|
|
|
err = mlx4_flow_attach(dev, &rule, reg_id);
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
default:
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
if (err)
|
|
|
|
en_warn(priv, "Failed Attaching Unicast\n");
|
|
|
|
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_uc_steer_release(struct mlx4_en_priv *priv,
|
|
|
|
unsigned char *mac, int qpn, u64 reg_id)
|
|
|
|
{
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct mlx4_dev *dev = mdev->dev;
|
|
|
|
|
|
|
|
switch (dev->caps.steering_mode) {
|
|
|
|
case MLX4_STEERING_MODE_B0: {
|
|
|
|
struct mlx4_qp qp;
|
|
|
|
u8 gid[16] = {0};
|
|
|
|
|
|
|
|
qp.qpn = qpn;
|
|
|
|
memcpy(&gid[10], mac, ETH_ALEN);
|
|
|
|
gid[5] = priv->port;
|
|
|
|
|
|
|
|
mlx4_unicast_detach(dev, &qp, gid, MLX4_PROT_ETH);
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
case MLX4_STEERING_MODE_DEVICE_MANAGED: {
|
|
|
|
mlx4_flow_detach(dev, reg_id);
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
default:
|
|
|
|
en_err(priv, "Invalid steering mode.\n");
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_en_get_qp(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct mlx4_dev *dev = mdev->dev;
|
|
|
|
int index = 0;
|
|
|
|
int err = 0;
|
|
|
|
int *qpn = &priv->base_qpn;
|
2014-03-02 15:25:01 +07:00
|
|
|
u64 mac = mlx4_mac_to_u64(priv->dev->dev_addr);
|
2013-02-07 09:25:22 +07:00
|
|
|
|
|
|
|
en_dbg(DRV, priv, "Registering MAC: %pM for adding\n",
|
|
|
|
priv->dev->dev_addr);
|
|
|
|
index = mlx4_register_mac(dev, priv->port, mac);
|
|
|
|
if (index < 0) {
|
|
|
|
err = index;
|
|
|
|
en_err(priv, "Failed adding MAC: %pM\n",
|
|
|
|
priv->dev->dev_addr);
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (dev->caps.steering_mode == MLX4_STEERING_MODE_A0) {
|
|
|
|
int base_qpn = mlx4_get_base_qpn(dev, priv->port);
|
|
|
|
*qpn = base_qpn + index;
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
net/mlx4: Add A0 hybrid steering
A0 hybrid steering is a form of high performance flow steering.
By using this mode, mlx4 cards use a fast limited table based steering,
in order to enable fast steering of unicast packets to a QP.
In order to implement A0 hybrid steering we allocate resources
from different zones:
(1) General range
(2) Special MAC-assigned QPs [RSS, Raw-Ethernet] each has its own region.
When we create a rss QP or a raw ethernet (A0 steerable and BF ready) QP,
we try hard to allocate the QP from range (2). Otherwise, we try hard not
to allocate from this range. However, when the system is pushed to its
limits and one needs every resource, the allocator uses every region it can.
Meaning, when we run out of raw-eth qps, the allocator allocates from the
general range (and the special-A0 area is no longer active). If we run out
of RSS qps, the mechanism tries to allocate from the raw-eth QP zone. If that
is also exhausted, the allocator will allocate from the general range
(and the A0 region is no longer active).
Note that if a raw-eth qp is allocated from the general range, it attempts
to allocate the range such that bits 6 and 7 (blueflame bits) in the
QP number are not set.
When the feature is used in SRIOV, the VF has to notify the PF what
kind of QP attributes it needs. In order to do that, along with the
"Eth QP blueflame" bit, we reserve a new "A0 steerable QP". According
to the combination of these bits, the PF tries to allocate a suitable QP.
In order to maintain backward compatibility (with older PFs), the PF
notifies which QP attributes it supports via QUERY_FUNC_CAP command.
Signed-off-by: Matan Barak <matanb@mellanox.com>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2014-12-11 15:57:57 +07:00
|
|
|
err = mlx4_qp_reserve_range(dev, 1, 1, qpn, MLX4_RESERVE_A0_QP);
|
2013-02-07 09:25:22 +07:00
|
|
|
en_dbg(DRV, priv, "Reserved qp %d\n", *qpn);
|
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed to reserve qp for mac registration\n");
|
2015-10-08 21:14:01 +07:00
|
|
|
mlx4_unregister_mac(dev, priv->port, mac);
|
|
|
|
return err;
|
2013-02-07 09:25:22 +07:00
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:25 +07:00
|
|
|
return 0;
|
2013-02-07 09:25:22 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_put_qp(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct mlx4_dev *dev = mdev->dev;
|
|
|
|
int qpn = priv->base_qpn;
|
|
|
|
|
2013-03-07 10:46:56 +07:00
|
|
|
if (dev->caps.steering_mode == MLX4_STEERING_MODE_A0) {
|
2015-10-08 21:14:01 +07:00
|
|
|
u64 mac = mlx4_mac_to_u64(priv->dev->dev_addr);
|
2013-03-07 10:46:56 +07:00
|
|
|
en_dbg(DRV, priv, "Registering MAC: %pM for deleting\n",
|
|
|
|
priv->dev->dev_addr);
|
|
|
|
mlx4_unregister_mac(dev, priv->port, mac);
|
|
|
|
} else {
|
|
|
|
en_dbg(DRV, priv, "Releasing qp: port %d, qpn %d\n",
|
|
|
|
priv->port, qpn);
|
|
|
|
mlx4_qp_release_range(dev, qpn, 1);
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_FORCE_PROMISC;
|
2013-02-07 09:25:22 +07:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_en_replace_mac(struct mlx4_en_priv *priv, int qpn,
|
2013-02-07 09:25:24 +07:00
|
|
|
unsigned char *new_mac, unsigned char *prev_mac)
|
2013-02-07 09:25:22 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct mlx4_dev *dev = mdev->dev;
|
|
|
|
int err = 0;
|
2014-03-02 15:25:01 +07:00
|
|
|
u64 new_mac_u64 = mlx4_mac_to_u64(new_mac);
|
2013-02-07 09:25:22 +07:00
|
|
|
|
|
|
|
if (dev->caps.steering_mode != MLX4_STEERING_MODE_A0) {
|
2013-02-07 09:25:25 +07:00
|
|
|
struct hlist_head *bucket;
|
|
|
|
unsigned int mac_hash;
|
|
|
|
struct mlx4_mac_entry *entry;
|
hlist: drop the node parameter from iterators
I'm not sure why, but the hlist for each entry iterators were conceived
list_for_each_entry(pos, head, member)
The hlist ones were greedy and wanted an extra parameter:
hlist_for_each_entry(tpos, pos, head, member)
Why did they need an extra pos parameter? I'm not quite sure. Not only
they don't really need it, it also prevents the iterator from looking
exactly like the list iterator, which is unfortunate.
Besides the semantic patch, there was some manual work required:
- Fix up the actual hlist iterators in linux/list.h
- Fix up the declaration of other iterators based on the hlist ones.
- A very small amount of places were using the 'node' parameter, this
was modified to use 'obj->member' instead.
- Coccinelle didn't handle the hlist_for_each_entry_safe iterator
properly, so those had to be fixed up manually.
The semantic patch which is mostly the work of Peter Senna Tschudin is here:
@@
iterator name hlist_for_each_entry, hlist_for_each_entry_continue, hlist_for_each_entry_from, hlist_for_each_entry_rcu, hlist_for_each_entry_rcu_bh, hlist_for_each_entry_continue_rcu_bh, for_each_busy_worker, ax25_uid_for_each, ax25_for_each, inet_bind_bucket_for_each, sctp_for_each_hentry, sk_for_each, sk_for_each_rcu, sk_for_each_from, sk_for_each_safe, sk_for_each_bound, hlist_for_each_entry_safe, hlist_for_each_entry_continue_rcu, nr_neigh_for_each, nr_neigh_for_each_safe, nr_node_for_each, nr_node_for_each_safe, for_each_gfn_indirect_valid_sp, for_each_gfn_sp, for_each_host;
type T;
expression a,c,d,e;
identifier b;
statement S;
@@
-T b;
<+... when != b
(
hlist_for_each_entry(a,
- b,
c, d) S
|
hlist_for_each_entry_continue(a,
- b,
c) S
|
hlist_for_each_entry_from(a,
- b,
c) S
|
hlist_for_each_entry_rcu(a,
- b,
c, d) S
|
hlist_for_each_entry_rcu_bh(a,
- b,
c, d) S
|
hlist_for_each_entry_continue_rcu_bh(a,
- b,
c) S
|
for_each_busy_worker(a, c,
- b,
d) S
|
ax25_uid_for_each(a,
- b,
c) S
|
ax25_for_each(a,
- b,
c) S
|
inet_bind_bucket_for_each(a,
- b,
c) S
|
sctp_for_each_hentry(a,
- b,
c) S
|
sk_for_each(a,
- b,
c) S
|
sk_for_each_rcu(a,
- b,
c) S
|
sk_for_each_from
-(a, b)
+(a)
S
+ sk_for_each_from(a) S
|
sk_for_each_safe(a,
- b,
c, d) S
|
sk_for_each_bound(a,
- b,
c) S
|
hlist_for_each_entry_safe(a,
- b,
c, d, e) S
|
hlist_for_each_entry_continue_rcu(a,
- b,
c) S
|
nr_neigh_for_each(a,
- b,
c) S
|
nr_neigh_for_each_safe(a,
- b,
c, d) S
|
nr_node_for_each(a,
- b,
c) S
|
nr_node_for_each_safe(a,
- b,
c, d) S
|
- for_each_gfn_sp(a, c, d, b) S
+ for_each_gfn_sp(a, c, d) S
|
- for_each_gfn_indirect_valid_sp(a, c, d, b) S
+ for_each_gfn_indirect_valid_sp(a, c, d) S
|
for_each_host(a,
- b,
c) S
|
for_each_host_safe(a,
- b,
c, d) S
|
for_each_mesh_entry(a,
- b,
c, d) S
)
...+>
[akpm@linux-foundation.org: drop bogus change from net/ipv4/raw.c]
[akpm@linux-foundation.org: drop bogus hunk from net/ipv6/raw.c]
[akpm@linux-foundation.org: checkpatch fixes]
[akpm@linux-foundation.org: fix warnings]
[akpm@linux-foudnation.org: redo intrusive kvm changes]
Tested-by: Peter Senna Tschudin <peter.senna@gmail.com>
Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: Sasha Levin <sasha.levin@oracle.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-28 08:06:00 +07:00
|
|
|
struct hlist_node *tmp;
|
2014-03-02 15:25:01 +07:00
|
|
|
u64 prev_mac_u64 = mlx4_mac_to_u64(prev_mac);
|
2013-02-07 09:25:25 +07:00
|
|
|
|
|
|
|
bucket = &priv->mac_hash[prev_mac[MLX4_EN_MAC_HASH_IDX]];
|
hlist: drop the node parameter from iterators
I'm not sure why, but the hlist for each entry iterators were conceived
list_for_each_entry(pos, head, member)
The hlist ones were greedy and wanted an extra parameter:
hlist_for_each_entry(tpos, pos, head, member)
Why did they need an extra pos parameter? I'm not quite sure. Not only
they don't really need it, it also prevents the iterator from looking
exactly like the list iterator, which is unfortunate.
Besides the semantic patch, there was some manual work required:
- Fix up the actual hlist iterators in linux/list.h
- Fix up the declaration of other iterators based on the hlist ones.
- A very small amount of places were using the 'node' parameter, this
was modified to use 'obj->member' instead.
- Coccinelle didn't handle the hlist_for_each_entry_safe iterator
properly, so those had to be fixed up manually.
The semantic patch which is mostly the work of Peter Senna Tschudin is here:
@@
iterator name hlist_for_each_entry, hlist_for_each_entry_continue, hlist_for_each_entry_from, hlist_for_each_entry_rcu, hlist_for_each_entry_rcu_bh, hlist_for_each_entry_continue_rcu_bh, for_each_busy_worker, ax25_uid_for_each, ax25_for_each, inet_bind_bucket_for_each, sctp_for_each_hentry, sk_for_each, sk_for_each_rcu, sk_for_each_from, sk_for_each_safe, sk_for_each_bound, hlist_for_each_entry_safe, hlist_for_each_entry_continue_rcu, nr_neigh_for_each, nr_neigh_for_each_safe, nr_node_for_each, nr_node_for_each_safe, for_each_gfn_indirect_valid_sp, for_each_gfn_sp, for_each_host;
type T;
expression a,c,d,e;
identifier b;
statement S;
@@
-T b;
<+... when != b
(
hlist_for_each_entry(a,
- b,
c, d) S
|
hlist_for_each_entry_continue(a,
- b,
c) S
|
hlist_for_each_entry_from(a,
- b,
c) S
|
hlist_for_each_entry_rcu(a,
- b,
c, d) S
|
hlist_for_each_entry_rcu_bh(a,
- b,
c, d) S
|
hlist_for_each_entry_continue_rcu_bh(a,
- b,
c) S
|
for_each_busy_worker(a, c,
- b,
d) S
|
ax25_uid_for_each(a,
- b,
c) S
|
ax25_for_each(a,
- b,
c) S
|
inet_bind_bucket_for_each(a,
- b,
c) S
|
sctp_for_each_hentry(a,
- b,
c) S
|
sk_for_each(a,
- b,
c) S
|
sk_for_each_rcu(a,
- b,
c) S
|
sk_for_each_from
-(a, b)
+(a)
S
+ sk_for_each_from(a) S
|
sk_for_each_safe(a,
- b,
c, d) S
|
sk_for_each_bound(a,
- b,
c) S
|
hlist_for_each_entry_safe(a,
- b,
c, d, e) S
|
hlist_for_each_entry_continue_rcu(a,
- b,
c) S
|
nr_neigh_for_each(a,
- b,
c) S
|
nr_neigh_for_each_safe(a,
- b,
c, d) S
|
nr_node_for_each(a,
- b,
c) S
|
nr_node_for_each_safe(a,
- b,
c, d) S
|
- for_each_gfn_sp(a, c, d, b) S
+ for_each_gfn_sp(a, c, d) S
|
- for_each_gfn_indirect_valid_sp(a, c, d, b) S
+ for_each_gfn_indirect_valid_sp(a, c, d) S
|
for_each_host(a,
- b,
c) S
|
for_each_host_safe(a,
- b,
c, d) S
|
for_each_mesh_entry(a,
- b,
c, d) S
)
...+>
[akpm@linux-foundation.org: drop bogus change from net/ipv4/raw.c]
[akpm@linux-foundation.org: drop bogus hunk from net/ipv6/raw.c]
[akpm@linux-foundation.org: checkpatch fixes]
[akpm@linux-foundation.org: fix warnings]
[akpm@linux-foudnation.org: redo intrusive kvm changes]
Tested-by: Peter Senna Tschudin <peter.senna@gmail.com>
Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: Sasha Levin <sasha.levin@oracle.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-28 08:06:00 +07:00
|
|
|
hlist_for_each_entry_safe(entry, tmp, bucket, hlist) {
|
2013-02-07 09:25:25 +07:00
|
|
|
if (ether_addr_equal_64bits(entry->mac, prev_mac)) {
|
|
|
|
mlx4_en_uc_steer_release(priv, entry->mac,
|
|
|
|
qpn, entry->reg_id);
|
|
|
|
mlx4_unregister_mac(dev, priv->port,
|
|
|
|
prev_mac_u64);
|
|
|
|
hlist_del_rcu(&entry->hlist);
|
|
|
|
synchronize_rcu();
|
|
|
|
memcpy(entry->mac, new_mac, ETH_ALEN);
|
|
|
|
entry->reg_id = 0;
|
|
|
|
mac_hash = new_mac[MLX4_EN_MAC_HASH_IDX];
|
|
|
|
hlist_add_head_rcu(&entry->hlist,
|
|
|
|
&priv->mac_hash[mac_hash]);
|
|
|
|
mlx4_register_mac(dev, priv->port, new_mac_u64);
|
|
|
|
err = mlx4_en_uc_steer_add(priv, new_mac,
|
|
|
|
&qpn,
|
|
|
|
&entry->reg_id);
|
2014-03-12 22:16:31 +07:00
|
|
|
if (err)
|
|
|
|
return err;
|
|
|
|
if (priv->tunnel_reg_id) {
|
|
|
|
mlx4_flow_detach(priv->mdev->dev, priv->tunnel_reg_id);
|
|
|
|
priv->tunnel_reg_id = 0;
|
|
|
|
}
|
|
|
|
err = mlx4_en_tunnel_steer_add(priv, new_mac, qpn,
|
|
|
|
&priv->tunnel_reg_id);
|
2013-02-07 09:25:25 +07:00
|
|
|
return err;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
return -EINVAL;
|
2013-02-07 09:25:22 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
return __mlx4_replace_mac(dev, priv->port, qpn, new_mac_u64);
|
|
|
|
}
|
|
|
|
|
2014-07-08 15:25:24 +07:00
|
|
|
static int mlx4_en_do_set_mac(struct mlx4_en_priv *priv,
|
|
|
|
unsigned char new_mac[ETH_ALEN + 2])
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
int err = 0;
|
|
|
|
|
|
|
|
if (priv->port_up) {
|
|
|
|
/* Remove old MAC and insert the new one */
|
2013-02-07 09:25:22 +07:00
|
|
|
err = mlx4_en_replace_mac(priv, priv->base_qpn,
|
2014-07-08 15:25:24 +07:00
|
|
|
new_mac, priv->current_mac);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err)
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed changing HW MAC address\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
} else
|
2013-02-07 09:25:21 +07:00
|
|
|
en_dbg(HW, priv, "Port is down while registering mac, exiting...\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2014-07-08 15:25:24 +07:00
|
|
|
if (!err)
|
|
|
|
memcpy(priv->current_mac, new_mac, sizeof(priv->current_mac));
|
2014-05-14 16:15:12 +07:00
|
|
|
|
2013-03-07 10:46:55 +07:00
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_en_set_mac(struct net_device *dev, void *addr)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct sockaddr *saddr = addr;
|
2014-07-08 15:25:24 +07:00
|
|
|
unsigned char new_mac[ETH_ALEN + 2];
|
2013-03-07 10:46:55 +07:00
|
|
|
int err;
|
|
|
|
|
|
|
|
if (!is_valid_ether_addr(saddr->sa_data))
|
|
|
|
return -EADDRNOTAVAIL;
|
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
2014-07-08 15:25:24 +07:00
|
|
|
memcpy(new_mac, saddr->sa_data, ETH_ALEN);
|
|
|
|
err = mlx4_en_do_set_mac(priv, new_mac);
|
|
|
|
if (!err)
|
|
|
|
memcpy(dev->dev_addr, saddr->sa_data, ETH_ALEN);
|
2008-10-23 05:47:49 +07:00
|
|
|
mutex_unlock(&mdev->state_lock);
|
2013-03-07 10:46:55 +07:00
|
|
|
|
|
|
|
return err;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_clear_list(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
2012-07-05 11:03:43 +07:00
|
|
|
struct mlx4_en_mc_list *tmp, *mc_to_del;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2012-07-05 11:03:43 +07:00
|
|
|
list_for_each_entry_safe(mc_to_del, tmp, &priv->mc_list, list) {
|
|
|
|
list_del(&mc_to_del->list);
|
|
|
|
kfree(mc_to_del);
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_cache_mclist(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
2010-04-02 04:22:57 +07:00
|
|
|
struct netdev_hw_addr *ha;
|
2012-07-05 11:03:43 +07:00
|
|
|
struct mlx4_en_mc_list *tmp;
|
2010-03-01 12:09:14 +07:00
|
|
|
|
2011-12-19 11:02:58 +07:00
|
|
|
mlx4_en_clear_list(dev);
|
2012-07-05 11:03:43 +07:00
|
|
|
netdev_for_each_mc_addr(ha, dev) {
|
|
|
|
tmp = kzalloc(sizeof(struct mlx4_en_mc_list), GFP_ATOMIC);
|
|
|
|
if (!tmp) {
|
|
|
|
mlx4_en_clear_list(dev);
|
|
|
|
return;
|
|
|
|
}
|
|
|
|
memcpy(tmp->addr, ha->addr, ETH_ALEN);
|
|
|
|
list_add_tail(&tmp->list, &priv->mc_list);
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2012-07-05 11:03:43 +07:00
|
|
|
static void update_mclist_flags(struct mlx4_en_priv *priv,
|
|
|
|
struct list_head *dst,
|
|
|
|
struct list_head *src)
|
|
|
|
{
|
|
|
|
struct mlx4_en_mc_list *dst_tmp, *src_tmp, *new_mc;
|
|
|
|
bool found;
|
|
|
|
|
|
|
|
/* Find all the entries that should be removed from dst,
|
|
|
|
* These are the entries that are not found in src
|
|
|
|
*/
|
|
|
|
list_for_each_entry(dst_tmp, dst, list) {
|
|
|
|
found = false;
|
|
|
|
list_for_each_entry(src_tmp, src, list) {
|
2013-12-30 14:40:55 +07:00
|
|
|
if (ether_addr_equal(dst_tmp->addr, src_tmp->addr)) {
|
2012-07-05 11:03:43 +07:00
|
|
|
found = true;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
if (!found)
|
|
|
|
dst_tmp->action = MCLIST_REM;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Add entries that exist in src but not in dst
|
|
|
|
* mark them as need to add
|
|
|
|
*/
|
|
|
|
list_for_each_entry(src_tmp, src, list) {
|
|
|
|
found = false;
|
|
|
|
list_for_each_entry(dst_tmp, dst, list) {
|
2013-12-30 14:40:55 +07:00
|
|
|
if (ether_addr_equal(dst_tmp->addr, src_tmp->addr)) {
|
2012-07-05 11:03:43 +07:00
|
|
|
dst_tmp->action = MCLIST_NONE;
|
|
|
|
found = true;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
if (!found) {
|
2013-02-07 18:46:27 +07:00
|
|
|
new_mc = kmemdup(src_tmp,
|
|
|
|
sizeof(struct mlx4_en_mc_list),
|
2012-07-05 11:03:43 +07:00
|
|
|
GFP_KERNEL);
|
2013-02-07 18:46:27 +07:00
|
|
|
if (!new_mc)
|
2012-07-05 11:03:43 +07:00
|
|
|
return;
|
2013-02-07 18:46:27 +07:00
|
|
|
|
2012-07-05 11:03:43 +07:00
|
|
|
new_mc->action = MCLIST_ADD;
|
|
|
|
list_add_tail(&new_mc->list, dst);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
static void mlx4_en_set_rx_mode(struct net_device *dev)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
|
|
|
|
if (!priv->port_up)
|
|
|
|
return;
|
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
queue_work(priv->mdev->workqueue, &priv->rx_mode_task);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
static void mlx4_en_set_promisc_mode(struct mlx4_en_priv *priv,
|
|
|
|
struct mlx4_en_dev *mdev)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
2012-07-05 11:03:44 +07:00
|
|
|
int err = 0;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
if (!(priv->flags & MLX4_EN_FLAG_PROMISC)) {
|
2008-10-23 05:47:49 +07:00
|
|
|
if (netif_msg_rx_status(priv))
|
2013-02-07 09:25:23 +07:00
|
|
|
en_warn(priv, "Entering promiscuous mode\n");
|
|
|
|
priv->flags |= MLX4_EN_FLAG_PROMISC;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
/* Enable promiscouos mode */
|
2012-07-05 11:03:44 +07:00
|
|
|
switch (mdev->dev->caps.steering_mode) {
|
2012-07-05 11:03:48 +07:00
|
|
|
case MLX4_STEERING_MODE_DEVICE_MANAGED:
|
2013-02-07 09:25:23 +07:00
|
|
|
err = mlx4_flow_steer_promisc_add(mdev->dev,
|
|
|
|
priv->port,
|
|
|
|
priv->base_qpn,
|
2013-04-24 20:58:45 +07:00
|
|
|
MLX4_FS_ALL_DEFAULT);
|
2012-07-05 11:03:48 +07:00
|
|
|
if (err)
|
2013-02-07 09:25:23 +07:00
|
|
|
en_err(priv, "Failed enabling promiscuous mode\n");
|
|
|
|
priv->flags |= MLX4_EN_FLAG_MC_PROMISC;
|
2012-07-05 11:03:48 +07:00
|
|
|
break;
|
|
|
|
|
2012-07-05 11:03:44 +07:00
|
|
|
case MLX4_STEERING_MODE_B0:
|
2013-02-07 09:25:23 +07:00
|
|
|
err = mlx4_unicast_promisc_add(mdev->dev,
|
|
|
|
priv->base_qpn,
|
|
|
|
priv->port);
|
2012-07-05 11:03:44 +07:00
|
|
|
if (err)
|
2013-02-07 09:25:23 +07:00
|
|
|
en_err(priv, "Failed enabling unicast promiscuous mode\n");
|
|
|
|
|
|
|
|
/* Add the default qp number as multicast
|
|
|
|
* promisc
|
|
|
|
*/
|
|
|
|
if (!(priv->flags & MLX4_EN_FLAG_MC_PROMISC)) {
|
|
|
|
err = mlx4_multicast_promisc_add(mdev->dev,
|
|
|
|
priv->base_qpn,
|
|
|
|
priv->port);
|
2012-07-05 11:03:44 +07:00
|
|
|
if (err)
|
2013-02-07 09:25:23 +07:00
|
|
|
en_err(priv, "Failed enabling multicast promiscuous mode\n");
|
|
|
|
priv->flags |= MLX4_EN_FLAG_MC_PROMISC;
|
2012-07-05 11:03:44 +07:00
|
|
|
}
|
|
|
|
break;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2012-07-05 11:03:44 +07:00
|
|
|
case MLX4_STEERING_MODE_A0:
|
|
|
|
err = mlx4_SET_PORT_qpn_calc(mdev->dev,
|
|
|
|
priv->port,
|
2013-02-07 09:25:23 +07:00
|
|
|
priv->base_qpn,
|
|
|
|
1);
|
2011-03-23 05:38:31 +07:00
|
|
|
if (err)
|
2013-02-07 09:25:23 +07:00
|
|
|
en_err(priv, "Failed enabling promiscuous mode\n");
|
2012-07-05 11:03:44 +07:00
|
|
|
break;
|
2011-03-23 05:38:31 +07:00
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
/* Disable port multicast filter (unconditionally) */
|
|
|
|
err = mlx4_SET_MCAST_FLTR(mdev->dev, priv->port, 0,
|
|
|
|
0, MLX4_MCAST_DISABLE);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed disabling multicast filter\n");
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_clear_promisc_mode(struct mlx4_en_priv *priv,
|
|
|
|
struct mlx4_en_dev *mdev)
|
|
|
|
{
|
|
|
|
int err = 0;
|
|
|
|
|
|
|
|
if (netif_msg_rx_status(priv))
|
|
|
|
en_warn(priv, "Leaving promiscuous mode\n");
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_PROMISC;
|
|
|
|
|
|
|
|
/* Disable promiscouos mode */
|
|
|
|
switch (mdev->dev->caps.steering_mode) {
|
|
|
|
case MLX4_STEERING_MODE_DEVICE_MANAGED:
|
|
|
|
err = mlx4_flow_steer_promisc_remove(mdev->dev,
|
|
|
|
priv->port,
|
2013-04-24 20:58:45 +07:00
|
|
|
MLX4_FS_ALL_DEFAULT);
|
2013-02-07 09:25:23 +07:00
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed disabling promiscuous mode\n");
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_MC_PROMISC;
|
|
|
|
break;
|
|
|
|
|
|
|
|
case MLX4_STEERING_MODE_B0:
|
|
|
|
err = mlx4_unicast_promisc_remove(mdev->dev,
|
|
|
|
priv->base_qpn,
|
|
|
|
priv->port);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed disabling unicast promiscuous mode\n");
|
|
|
|
/* Disable Multicast promisc */
|
|
|
|
if (priv->flags & MLX4_EN_FLAG_MC_PROMISC) {
|
|
|
|
err = mlx4_multicast_promisc_remove(mdev->dev,
|
|
|
|
priv->base_qpn,
|
|
|
|
priv->port);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed disabling multicast promiscuous mode\n");
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_MC_PROMISC;
|
|
|
|
}
|
|
|
|
break;
|
|
|
|
|
|
|
|
case MLX4_STEERING_MODE_A0:
|
|
|
|
err = mlx4_SET_PORT_qpn_calc(mdev->dev,
|
|
|
|
priv->port,
|
|
|
|
priv->base_qpn, 0);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed disabling promiscuous mode\n");
|
|
|
|
break;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2013-02-07 09:25:23 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_do_multicast(struct mlx4_en_priv *priv,
|
|
|
|
struct net_device *dev,
|
|
|
|
struct mlx4_en_dev *mdev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_mc_list *mclist, *tmp;
|
|
|
|
u64 mcast_addr = 0;
|
|
|
|
u8 mc_list[16] = {0};
|
|
|
|
int err = 0;
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Enable/disable the multicast filter according to IFF_ALLMULTI */
|
|
|
|
if (dev->flags & IFF_ALLMULTI) {
|
|
|
|
err = mlx4_SET_MCAST_FLTR(mdev->dev, priv->port, 0,
|
|
|
|
0, MLX4_MCAST_DISABLE);
|
|
|
|
if (err)
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed disabling multicast filter\n");
|
2011-03-23 05:38:31 +07:00
|
|
|
|
|
|
|
/* Add the default qp number as multicast promisc */
|
|
|
|
if (!(priv->flags & MLX4_EN_FLAG_MC_PROMISC)) {
|
2012-07-05 11:03:44 +07:00
|
|
|
switch (mdev->dev->caps.steering_mode) {
|
2012-07-05 11:03:48 +07:00
|
|
|
case MLX4_STEERING_MODE_DEVICE_MANAGED:
|
|
|
|
err = mlx4_flow_steer_promisc_add(mdev->dev,
|
|
|
|
priv->port,
|
|
|
|
priv->base_qpn,
|
2013-04-24 20:58:45 +07:00
|
|
|
MLX4_FS_MC_DEFAULT);
|
2012-07-05 11:03:48 +07:00
|
|
|
break;
|
|
|
|
|
2012-07-05 11:03:44 +07:00
|
|
|
case MLX4_STEERING_MODE_B0:
|
|
|
|
err = mlx4_multicast_promisc_add(mdev->dev,
|
|
|
|
priv->base_qpn,
|
|
|
|
priv->port);
|
|
|
|
break;
|
|
|
|
|
|
|
|
case MLX4_STEERING_MODE_A0:
|
|
|
|
break;
|
|
|
|
}
|
2011-03-23 05:38:31 +07:00
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed entering multicast promisc mode\n");
|
|
|
|
priv->flags |= MLX4_EN_FLAG_MC_PROMISC;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
} else {
|
2011-03-23 05:38:31 +07:00
|
|
|
/* Disable Multicast promisc */
|
|
|
|
if (priv->flags & MLX4_EN_FLAG_MC_PROMISC) {
|
2012-07-05 11:03:44 +07:00
|
|
|
switch (mdev->dev->caps.steering_mode) {
|
2012-07-05 11:03:48 +07:00
|
|
|
case MLX4_STEERING_MODE_DEVICE_MANAGED:
|
|
|
|
err = mlx4_flow_steer_promisc_remove(mdev->dev,
|
|
|
|
priv->port,
|
2013-04-24 20:58:45 +07:00
|
|
|
MLX4_FS_MC_DEFAULT);
|
2012-07-05 11:03:48 +07:00
|
|
|
break;
|
|
|
|
|
2012-07-05 11:03:44 +07:00
|
|
|
case MLX4_STEERING_MODE_B0:
|
|
|
|
err = mlx4_multicast_promisc_remove(mdev->dev,
|
|
|
|
priv->base_qpn,
|
|
|
|
priv->port);
|
|
|
|
break;
|
|
|
|
|
|
|
|
case MLX4_STEERING_MODE_A0:
|
|
|
|
break;
|
|
|
|
}
|
2011-03-23 05:38:31 +07:00
|
|
|
if (err)
|
2011-03-31 08:57:33 +07:00
|
|
|
en_err(priv, "Failed disabling multicast promiscuous mode\n");
|
2011-03-23 05:38:31 +07:00
|
|
|
priv->flags &= ~MLX4_EN_FLAG_MC_PROMISC;
|
|
|
|
}
|
2010-03-01 12:09:14 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
err = mlx4_SET_MCAST_FLTR(mdev->dev, priv->port, 0,
|
|
|
|
0, MLX4_MCAST_DISABLE);
|
|
|
|
if (err)
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed disabling multicast filter\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/* Flush mcast filter and init it with broadcast address */
|
|
|
|
mlx4_SET_MCAST_FLTR(mdev->dev, priv->port, ETH_BCAST,
|
|
|
|
1, MLX4_MCAST_CONFIG);
|
|
|
|
|
|
|
|
/* Update multicast list - we cache all addresses so they won't
|
|
|
|
* change while HW is updated holding the command semaphor */
|
2013-01-24 08:54:16 +07:00
|
|
|
netif_addr_lock_bh(dev);
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_cache_mclist(dev);
|
2013-01-24 08:54:16 +07:00
|
|
|
netif_addr_unlock_bh(dev);
|
2012-07-05 11:03:43 +07:00
|
|
|
list_for_each_entry(mclist, &priv->mc_list, list) {
|
2014-03-02 15:25:01 +07:00
|
|
|
mcast_addr = mlx4_mac_to_u64(mclist->addr);
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_SET_MCAST_FLTR(mdev->dev, priv->port,
|
|
|
|
mcast_addr, 0, MLX4_MCAST_CONFIG);
|
|
|
|
}
|
|
|
|
err = mlx4_SET_MCAST_FLTR(mdev->dev, priv->port, 0,
|
|
|
|
0, MLX4_MCAST_ENABLE);
|
|
|
|
if (err)
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed enabling multicast filter\n");
|
2012-07-05 11:03:43 +07:00
|
|
|
|
|
|
|
update_mclist_flags(priv, &priv->curr_list, &priv->mc_list);
|
|
|
|
list_for_each_entry_safe(mclist, tmp, &priv->curr_list, list) {
|
|
|
|
if (mclist->action == MCLIST_REM) {
|
|
|
|
/* detach this address and delete from list */
|
|
|
|
memcpy(&mc_list[10], mclist->addr, ETH_ALEN);
|
|
|
|
mc_list[5] = priv->port;
|
|
|
|
err = mlx4_multicast_detach(mdev->dev,
|
|
|
|
&priv->rss_map.indir_qp,
|
|
|
|
mc_list,
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
MLX4_PROT_ETH,
|
|
|
|
mclist->reg_id);
|
2012-07-05 11:03:43 +07:00
|
|
|
if (err)
|
|
|
|
en_err(priv, "Fail to detach multicast address\n");
|
|
|
|
|
2013-12-23 21:09:44 +07:00
|
|
|
if (mclist->tunnel_reg_id) {
|
|
|
|
err = mlx4_flow_detach(priv->mdev->dev, mclist->tunnel_reg_id);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed to detach multicast address\n");
|
|
|
|
}
|
|
|
|
|
2012-07-05 11:03:43 +07:00
|
|
|
/* remove from list */
|
|
|
|
list_del(&mclist->list);
|
|
|
|
kfree(mclist);
|
2012-07-11 03:34:07 +07:00
|
|
|
} else if (mclist->action == MCLIST_ADD) {
|
2012-07-05 11:03:43 +07:00
|
|
|
/* attach the address */
|
|
|
|
memcpy(&mc_list[10], mclist->addr, ETH_ALEN);
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
/* needed for B0 steering support */
|
2012-07-05 11:03:43 +07:00
|
|
|
mc_list[5] = priv->port;
|
|
|
|
err = mlx4_multicast_attach(mdev->dev,
|
|
|
|
&priv->rss_map.indir_qp,
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
mc_list,
|
|
|
|
priv->port, 0,
|
|
|
|
MLX4_PROT_ETH,
|
|
|
|
&mclist->reg_id);
|
2012-07-05 11:03:43 +07:00
|
|
|
if (err)
|
|
|
|
en_err(priv, "Fail to attach multicast address\n");
|
|
|
|
|
2013-12-23 21:09:44 +07:00
|
|
|
err = mlx4_en_tunnel_steer_add(priv, &mc_list[10], priv->base_qpn,
|
|
|
|
&mclist->tunnel_reg_id);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed to attach multicast address\n");
|
2012-07-05 11:03:43 +07:00
|
|
|
}
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2013-02-07 09:25:23 +07:00
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:26 +07:00
|
|
|
static void mlx4_en_do_uc_filter(struct mlx4_en_priv *priv,
|
|
|
|
struct net_device *dev,
|
|
|
|
struct mlx4_en_dev *mdev)
|
|
|
|
{
|
|
|
|
struct netdev_hw_addr *ha;
|
|
|
|
struct mlx4_mac_entry *entry;
|
hlist: drop the node parameter from iterators
I'm not sure why, but the hlist for each entry iterators were conceived
list_for_each_entry(pos, head, member)
The hlist ones were greedy and wanted an extra parameter:
hlist_for_each_entry(tpos, pos, head, member)
Why did they need an extra pos parameter? I'm not quite sure. Not only
they don't really need it, it also prevents the iterator from looking
exactly like the list iterator, which is unfortunate.
Besides the semantic patch, there was some manual work required:
- Fix up the actual hlist iterators in linux/list.h
- Fix up the declaration of other iterators based on the hlist ones.
- A very small amount of places were using the 'node' parameter, this
was modified to use 'obj->member' instead.
- Coccinelle didn't handle the hlist_for_each_entry_safe iterator
properly, so those had to be fixed up manually.
The semantic patch which is mostly the work of Peter Senna Tschudin is here:
@@
iterator name hlist_for_each_entry, hlist_for_each_entry_continue, hlist_for_each_entry_from, hlist_for_each_entry_rcu, hlist_for_each_entry_rcu_bh, hlist_for_each_entry_continue_rcu_bh, for_each_busy_worker, ax25_uid_for_each, ax25_for_each, inet_bind_bucket_for_each, sctp_for_each_hentry, sk_for_each, sk_for_each_rcu, sk_for_each_from, sk_for_each_safe, sk_for_each_bound, hlist_for_each_entry_safe, hlist_for_each_entry_continue_rcu, nr_neigh_for_each, nr_neigh_for_each_safe, nr_node_for_each, nr_node_for_each_safe, for_each_gfn_indirect_valid_sp, for_each_gfn_sp, for_each_host;
type T;
expression a,c,d,e;
identifier b;
statement S;
@@
-T b;
<+... when != b
(
hlist_for_each_entry(a,
- b,
c, d) S
|
hlist_for_each_entry_continue(a,
- b,
c) S
|
hlist_for_each_entry_from(a,
- b,
c) S
|
hlist_for_each_entry_rcu(a,
- b,
c, d) S
|
hlist_for_each_entry_rcu_bh(a,
- b,
c, d) S
|
hlist_for_each_entry_continue_rcu_bh(a,
- b,
c) S
|
for_each_busy_worker(a, c,
- b,
d) S
|
ax25_uid_for_each(a,
- b,
c) S
|
ax25_for_each(a,
- b,
c) S
|
inet_bind_bucket_for_each(a,
- b,
c) S
|
sctp_for_each_hentry(a,
- b,
c) S
|
sk_for_each(a,
- b,
c) S
|
sk_for_each_rcu(a,
- b,
c) S
|
sk_for_each_from
-(a, b)
+(a)
S
+ sk_for_each_from(a) S
|
sk_for_each_safe(a,
- b,
c, d) S
|
sk_for_each_bound(a,
- b,
c) S
|
hlist_for_each_entry_safe(a,
- b,
c, d, e) S
|
hlist_for_each_entry_continue_rcu(a,
- b,
c) S
|
nr_neigh_for_each(a,
- b,
c) S
|
nr_neigh_for_each_safe(a,
- b,
c, d) S
|
nr_node_for_each(a,
- b,
c) S
|
nr_node_for_each_safe(a,
- b,
c, d) S
|
- for_each_gfn_sp(a, c, d, b) S
+ for_each_gfn_sp(a, c, d) S
|
- for_each_gfn_indirect_valid_sp(a, c, d, b) S
+ for_each_gfn_indirect_valid_sp(a, c, d) S
|
for_each_host(a,
- b,
c) S
|
for_each_host_safe(a,
- b,
c, d) S
|
for_each_mesh_entry(a,
- b,
c, d) S
)
...+>
[akpm@linux-foundation.org: drop bogus change from net/ipv4/raw.c]
[akpm@linux-foundation.org: drop bogus hunk from net/ipv6/raw.c]
[akpm@linux-foundation.org: checkpatch fixes]
[akpm@linux-foundation.org: fix warnings]
[akpm@linux-foudnation.org: redo intrusive kvm changes]
Tested-by: Peter Senna Tschudin <peter.senna@gmail.com>
Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: Sasha Levin <sasha.levin@oracle.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-28 08:06:00 +07:00
|
|
|
struct hlist_node *tmp;
|
2013-02-07 09:25:26 +07:00
|
|
|
bool found;
|
|
|
|
u64 mac;
|
|
|
|
int err = 0;
|
|
|
|
struct hlist_head *bucket;
|
|
|
|
unsigned int i;
|
|
|
|
int removed = 0;
|
|
|
|
u32 prev_flags;
|
|
|
|
|
|
|
|
/* Note that we do not need to protect our mac_hash traversal with rcu,
|
|
|
|
* since all modification code is protected by mdev->state_lock
|
|
|
|
*/
|
|
|
|
|
|
|
|
/* find what to remove */
|
|
|
|
for (i = 0; i < MLX4_EN_MAC_HASH_SIZE; ++i) {
|
|
|
|
bucket = &priv->mac_hash[i];
|
hlist: drop the node parameter from iterators
I'm not sure why, but the hlist for each entry iterators were conceived
list_for_each_entry(pos, head, member)
The hlist ones were greedy and wanted an extra parameter:
hlist_for_each_entry(tpos, pos, head, member)
Why did they need an extra pos parameter? I'm not quite sure. Not only
they don't really need it, it also prevents the iterator from looking
exactly like the list iterator, which is unfortunate.
Besides the semantic patch, there was some manual work required:
- Fix up the actual hlist iterators in linux/list.h
- Fix up the declaration of other iterators based on the hlist ones.
- A very small amount of places were using the 'node' parameter, this
was modified to use 'obj->member' instead.
- Coccinelle didn't handle the hlist_for_each_entry_safe iterator
properly, so those had to be fixed up manually.
The semantic patch which is mostly the work of Peter Senna Tschudin is here:
@@
iterator name hlist_for_each_entry, hlist_for_each_entry_continue, hlist_for_each_entry_from, hlist_for_each_entry_rcu, hlist_for_each_entry_rcu_bh, hlist_for_each_entry_continue_rcu_bh, for_each_busy_worker, ax25_uid_for_each, ax25_for_each, inet_bind_bucket_for_each, sctp_for_each_hentry, sk_for_each, sk_for_each_rcu, sk_for_each_from, sk_for_each_safe, sk_for_each_bound, hlist_for_each_entry_safe, hlist_for_each_entry_continue_rcu, nr_neigh_for_each, nr_neigh_for_each_safe, nr_node_for_each, nr_node_for_each_safe, for_each_gfn_indirect_valid_sp, for_each_gfn_sp, for_each_host;
type T;
expression a,c,d,e;
identifier b;
statement S;
@@
-T b;
<+... when != b
(
hlist_for_each_entry(a,
- b,
c, d) S
|
hlist_for_each_entry_continue(a,
- b,
c) S
|
hlist_for_each_entry_from(a,
- b,
c) S
|
hlist_for_each_entry_rcu(a,
- b,
c, d) S
|
hlist_for_each_entry_rcu_bh(a,
- b,
c, d) S
|
hlist_for_each_entry_continue_rcu_bh(a,
- b,
c) S
|
for_each_busy_worker(a, c,
- b,
d) S
|
ax25_uid_for_each(a,
- b,
c) S
|
ax25_for_each(a,
- b,
c) S
|
inet_bind_bucket_for_each(a,
- b,
c) S
|
sctp_for_each_hentry(a,
- b,
c) S
|
sk_for_each(a,
- b,
c) S
|
sk_for_each_rcu(a,
- b,
c) S
|
sk_for_each_from
-(a, b)
+(a)
S
+ sk_for_each_from(a) S
|
sk_for_each_safe(a,
- b,
c, d) S
|
sk_for_each_bound(a,
- b,
c) S
|
hlist_for_each_entry_safe(a,
- b,
c, d, e) S
|
hlist_for_each_entry_continue_rcu(a,
- b,
c) S
|
nr_neigh_for_each(a,
- b,
c) S
|
nr_neigh_for_each_safe(a,
- b,
c, d) S
|
nr_node_for_each(a,
- b,
c) S
|
nr_node_for_each_safe(a,
- b,
c, d) S
|
- for_each_gfn_sp(a, c, d, b) S
+ for_each_gfn_sp(a, c, d) S
|
- for_each_gfn_indirect_valid_sp(a, c, d, b) S
+ for_each_gfn_indirect_valid_sp(a, c, d) S
|
for_each_host(a,
- b,
c) S
|
for_each_host_safe(a,
- b,
c, d) S
|
for_each_mesh_entry(a,
- b,
c, d) S
)
...+>
[akpm@linux-foundation.org: drop bogus change from net/ipv4/raw.c]
[akpm@linux-foundation.org: drop bogus hunk from net/ipv6/raw.c]
[akpm@linux-foundation.org: checkpatch fixes]
[akpm@linux-foundation.org: fix warnings]
[akpm@linux-foudnation.org: redo intrusive kvm changes]
Tested-by: Peter Senna Tschudin <peter.senna@gmail.com>
Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: Sasha Levin <sasha.levin@oracle.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-28 08:06:00 +07:00
|
|
|
hlist_for_each_entry_safe(entry, tmp, bucket, hlist) {
|
2013-02-07 09:25:26 +07:00
|
|
|
found = false;
|
|
|
|
netdev_for_each_uc_addr(ha, dev) {
|
|
|
|
if (ether_addr_equal_64bits(entry->mac,
|
|
|
|
ha->addr)) {
|
|
|
|
found = true;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
/* MAC address of the port is not in uc list */
|
2014-07-08 15:25:24 +07:00
|
|
|
if (ether_addr_equal_64bits(entry->mac,
|
|
|
|
priv->current_mac))
|
2013-02-07 09:25:26 +07:00
|
|
|
found = true;
|
|
|
|
|
|
|
|
if (!found) {
|
2014-03-02 15:25:01 +07:00
|
|
|
mac = mlx4_mac_to_u64(entry->mac);
|
2013-02-07 09:25:26 +07:00
|
|
|
mlx4_en_uc_steer_release(priv, entry->mac,
|
|
|
|
priv->base_qpn,
|
|
|
|
entry->reg_id);
|
|
|
|
mlx4_unregister_mac(mdev->dev, priv->port, mac);
|
|
|
|
|
|
|
|
hlist_del_rcu(&entry->hlist);
|
|
|
|
kfree_rcu(entry, rcu);
|
|
|
|
en_dbg(DRV, priv, "Removed MAC %pM on port:%d\n",
|
|
|
|
entry->mac, priv->port);
|
|
|
|
++removed;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
/* if we didn't remove anything, there is no use in trying to add
|
|
|
|
* again once we are in a forced promisc mode state
|
|
|
|
*/
|
|
|
|
if ((priv->flags & MLX4_EN_FLAG_FORCE_PROMISC) && 0 == removed)
|
|
|
|
return;
|
|
|
|
|
|
|
|
prev_flags = priv->flags;
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_FORCE_PROMISC;
|
|
|
|
|
|
|
|
/* find what to add */
|
|
|
|
netdev_for_each_uc_addr(ha, dev) {
|
|
|
|
found = false;
|
|
|
|
bucket = &priv->mac_hash[ha->addr[MLX4_EN_MAC_HASH_IDX]];
|
hlist: drop the node parameter from iterators
I'm not sure why, but the hlist for each entry iterators were conceived
list_for_each_entry(pos, head, member)
The hlist ones were greedy and wanted an extra parameter:
hlist_for_each_entry(tpos, pos, head, member)
Why did they need an extra pos parameter? I'm not quite sure. Not only
they don't really need it, it also prevents the iterator from looking
exactly like the list iterator, which is unfortunate.
Besides the semantic patch, there was some manual work required:
- Fix up the actual hlist iterators in linux/list.h
- Fix up the declaration of other iterators based on the hlist ones.
- A very small amount of places were using the 'node' parameter, this
was modified to use 'obj->member' instead.
- Coccinelle didn't handle the hlist_for_each_entry_safe iterator
properly, so those had to be fixed up manually.
The semantic patch which is mostly the work of Peter Senna Tschudin is here:
@@
iterator name hlist_for_each_entry, hlist_for_each_entry_continue, hlist_for_each_entry_from, hlist_for_each_entry_rcu, hlist_for_each_entry_rcu_bh, hlist_for_each_entry_continue_rcu_bh, for_each_busy_worker, ax25_uid_for_each, ax25_for_each, inet_bind_bucket_for_each, sctp_for_each_hentry, sk_for_each, sk_for_each_rcu, sk_for_each_from, sk_for_each_safe, sk_for_each_bound, hlist_for_each_entry_safe, hlist_for_each_entry_continue_rcu, nr_neigh_for_each, nr_neigh_for_each_safe, nr_node_for_each, nr_node_for_each_safe, for_each_gfn_indirect_valid_sp, for_each_gfn_sp, for_each_host;
type T;
expression a,c,d,e;
identifier b;
statement S;
@@
-T b;
<+... when != b
(
hlist_for_each_entry(a,
- b,
c, d) S
|
hlist_for_each_entry_continue(a,
- b,
c) S
|
hlist_for_each_entry_from(a,
- b,
c) S
|
hlist_for_each_entry_rcu(a,
- b,
c, d) S
|
hlist_for_each_entry_rcu_bh(a,
- b,
c, d) S
|
hlist_for_each_entry_continue_rcu_bh(a,
- b,
c) S
|
for_each_busy_worker(a, c,
- b,
d) S
|
ax25_uid_for_each(a,
- b,
c) S
|
ax25_for_each(a,
- b,
c) S
|
inet_bind_bucket_for_each(a,
- b,
c) S
|
sctp_for_each_hentry(a,
- b,
c) S
|
sk_for_each(a,
- b,
c) S
|
sk_for_each_rcu(a,
- b,
c) S
|
sk_for_each_from
-(a, b)
+(a)
S
+ sk_for_each_from(a) S
|
sk_for_each_safe(a,
- b,
c, d) S
|
sk_for_each_bound(a,
- b,
c) S
|
hlist_for_each_entry_safe(a,
- b,
c, d, e) S
|
hlist_for_each_entry_continue_rcu(a,
- b,
c) S
|
nr_neigh_for_each(a,
- b,
c) S
|
nr_neigh_for_each_safe(a,
- b,
c, d) S
|
nr_node_for_each(a,
- b,
c) S
|
nr_node_for_each_safe(a,
- b,
c, d) S
|
- for_each_gfn_sp(a, c, d, b) S
+ for_each_gfn_sp(a, c, d) S
|
- for_each_gfn_indirect_valid_sp(a, c, d, b) S
+ for_each_gfn_indirect_valid_sp(a, c, d) S
|
for_each_host(a,
- b,
c) S
|
for_each_host_safe(a,
- b,
c, d) S
|
for_each_mesh_entry(a,
- b,
c, d) S
)
...+>
[akpm@linux-foundation.org: drop bogus change from net/ipv4/raw.c]
[akpm@linux-foundation.org: drop bogus hunk from net/ipv6/raw.c]
[akpm@linux-foundation.org: checkpatch fixes]
[akpm@linux-foundation.org: fix warnings]
[akpm@linux-foudnation.org: redo intrusive kvm changes]
Tested-by: Peter Senna Tschudin <peter.senna@gmail.com>
Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Signed-off-by: Sasha Levin <sasha.levin@oracle.com>
Cc: Wu Fengguang <fengguang.wu@intel.com>
Cc: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Gleb Natapov <gleb@redhat.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2013-02-28 08:06:00 +07:00
|
|
|
hlist_for_each_entry(entry, bucket, hlist) {
|
2013-02-07 09:25:26 +07:00
|
|
|
if (ether_addr_equal_64bits(entry->mac, ha->addr)) {
|
|
|
|
found = true;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
if (!found) {
|
|
|
|
entry = kmalloc(sizeof(*entry), GFP_KERNEL);
|
|
|
|
if (!entry) {
|
|
|
|
en_err(priv, "Failed adding MAC %pM on port:%d (out of memory)\n",
|
|
|
|
ha->addr, priv->port);
|
|
|
|
priv->flags |= MLX4_EN_FLAG_FORCE_PROMISC;
|
|
|
|
break;
|
|
|
|
}
|
2014-03-02 15:25:01 +07:00
|
|
|
mac = mlx4_mac_to_u64(ha->addr);
|
2013-02-07 09:25:26 +07:00
|
|
|
memcpy(entry->mac, ha->addr, ETH_ALEN);
|
|
|
|
err = mlx4_register_mac(mdev->dev, priv->port, mac);
|
|
|
|
if (err < 0) {
|
|
|
|
en_err(priv, "Failed registering MAC %pM on port %d: %d\n",
|
|
|
|
ha->addr, priv->port, err);
|
|
|
|
kfree(entry);
|
|
|
|
priv->flags |= MLX4_EN_FLAG_FORCE_PROMISC;
|
|
|
|
break;
|
|
|
|
}
|
|
|
|
err = mlx4_en_uc_steer_add(priv, ha->addr,
|
|
|
|
&priv->base_qpn,
|
|
|
|
&entry->reg_id);
|
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed adding MAC %pM on port %d: %d\n",
|
|
|
|
ha->addr, priv->port, err);
|
|
|
|
mlx4_unregister_mac(mdev->dev, priv->port, mac);
|
|
|
|
kfree(entry);
|
|
|
|
priv->flags |= MLX4_EN_FLAG_FORCE_PROMISC;
|
|
|
|
break;
|
|
|
|
} else {
|
|
|
|
unsigned int mac_hash;
|
|
|
|
en_dbg(DRV, priv, "Added MAC %pM on port:%d\n",
|
|
|
|
ha->addr, priv->port);
|
|
|
|
mac_hash = ha->addr[MLX4_EN_MAC_HASH_IDX];
|
|
|
|
bucket = &priv->mac_hash[mac_hash];
|
|
|
|
hlist_add_head_rcu(&entry->hlist, bucket);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
if (priv->flags & MLX4_EN_FLAG_FORCE_PROMISC) {
|
|
|
|
en_warn(priv, "Forcing promiscuous mode on port:%d\n",
|
|
|
|
priv->port);
|
|
|
|
} else if (prev_flags & MLX4_EN_FLAG_FORCE_PROMISC) {
|
|
|
|
en_warn(priv, "Stop forcing promiscuous mode on port:%d\n",
|
|
|
|
priv->port);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
static void mlx4_en_do_set_rx_mode(struct work_struct *work)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = container_of(work, struct mlx4_en_priv,
|
|
|
|
rx_mode_task);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct net_device *dev = priv->dev;
|
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (!mdev->device_up) {
|
|
|
|
en_dbg(HW, priv, "Card is not up, ignoring rx mode change.\n");
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
if (!priv->port_up) {
|
|
|
|
en_dbg(HW, priv, "Port is down, ignoring rx mode change.\n");
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (!netif_carrier_ok(dev)) {
|
|
|
|
if (!mlx4_en_QUERY_PORT(mdev, priv->port)) {
|
|
|
|
if (priv->port_state.link_state) {
|
|
|
|
priv->last_link_state = MLX4_DEV_EVENT_PORT_UP;
|
|
|
|
netif_carrier_on(dev);
|
|
|
|
en_dbg(LINK, priv, "Link Up\n");
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2013-02-07 09:25:26 +07:00
|
|
|
if (dev->priv_flags & IFF_UNICAST_FLT)
|
|
|
|
mlx4_en_do_uc_filter(priv, dev, mdev);
|
|
|
|
|
2013-02-07 09:25:23 +07:00
|
|
|
/* Promsicuous mode: disable all filters */
|
2013-02-07 09:25:26 +07:00
|
|
|
if ((dev->flags & IFF_PROMISC) ||
|
|
|
|
(priv->flags & MLX4_EN_FLAG_FORCE_PROMISC)) {
|
2013-02-07 09:25:23 +07:00
|
|
|
mlx4_en_set_promisc_mode(priv, mdev);
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Not in promiscuous mode */
|
|
|
|
if (priv->flags & MLX4_EN_FLAG_PROMISC)
|
|
|
|
mlx4_en_clear_promisc_mode(priv, mdev);
|
|
|
|
|
|
|
|
mlx4_en_do_multicast(priv, dev, mdev);
|
2008-10-23 05:47:49 +07:00
|
|
|
out:
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
}
|
|
|
|
|
|
|
|
#ifdef CONFIG_NET_POLL_CONTROLLER
|
|
|
|
static void mlx4_en_netpoll(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_cq *cq;
|
|
|
|
int i;
|
|
|
|
|
2016-06-04 01:52:49 +07:00
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
|
|
|
cq = priv->tx_cq[i];
|
2014-04-16 05:09:24 +07:00
|
|
|
napi_schedule(&cq->napi);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2015-10-08 21:14:01 +07:00
|
|
|
static int mlx4_en_set_rss_steer_rules(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
u64 reg_id;
|
|
|
|
int err = 0;
|
|
|
|
int *qpn = &priv->base_qpn;
|
|
|
|
struct mlx4_mac_entry *entry;
|
|
|
|
|
|
|
|
err = mlx4_en_uc_steer_add(priv, priv->dev->dev_addr, qpn, ®_id);
|
|
|
|
if (err)
|
|
|
|
return err;
|
|
|
|
|
|
|
|
err = mlx4_en_tunnel_steer_add(priv, priv->dev->dev_addr, *qpn,
|
|
|
|
&priv->tunnel_reg_id);
|
|
|
|
if (err)
|
|
|
|
goto tunnel_err;
|
|
|
|
|
|
|
|
entry = kmalloc(sizeof(*entry), GFP_KERNEL);
|
|
|
|
if (!entry) {
|
|
|
|
err = -ENOMEM;
|
|
|
|
goto alloc_err;
|
|
|
|
}
|
|
|
|
|
|
|
|
memcpy(entry->mac, priv->dev->dev_addr, sizeof(entry->mac));
|
|
|
|
memcpy(priv->current_mac, entry->mac, sizeof(priv->current_mac));
|
|
|
|
entry->reg_id = reg_id;
|
|
|
|
hlist_add_head_rcu(&entry->hlist,
|
|
|
|
&priv->mac_hash[entry->mac[MLX4_EN_MAC_HASH_IDX]]);
|
|
|
|
|
|
|
|
return 0;
|
|
|
|
|
|
|
|
alloc_err:
|
|
|
|
if (priv->tunnel_reg_id)
|
|
|
|
mlx4_flow_detach(priv->mdev->dev, priv->tunnel_reg_id);
|
|
|
|
|
|
|
|
tunnel_err:
|
|
|
|
mlx4_en_uc_steer_release(priv, priv->dev->dev_addr, *qpn, reg_id);
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_delete_rss_steer_rules(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
u64 mac;
|
|
|
|
unsigned int i;
|
|
|
|
int qpn = priv->base_qpn;
|
|
|
|
struct hlist_head *bucket;
|
|
|
|
struct hlist_node *tmp;
|
|
|
|
struct mlx4_mac_entry *entry;
|
|
|
|
|
|
|
|
for (i = 0; i < MLX4_EN_MAC_HASH_SIZE; ++i) {
|
|
|
|
bucket = &priv->mac_hash[i];
|
|
|
|
hlist_for_each_entry_safe(entry, tmp, bucket, hlist) {
|
|
|
|
mac = mlx4_mac_to_u64(entry->mac);
|
|
|
|
en_dbg(DRV, priv, "Registering MAC:%pM for deleting\n",
|
|
|
|
entry->mac);
|
|
|
|
mlx4_en_uc_steer_release(priv, entry->mac,
|
|
|
|
qpn, entry->reg_id);
|
|
|
|
|
|
|
|
mlx4_unregister_mac(priv->mdev->dev, priv->port, mac);
|
|
|
|
hlist_del_rcu(&entry->hlist);
|
|
|
|
kfree_rcu(entry, rcu);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
if (priv->tunnel_reg_id) {
|
|
|
|
mlx4_flow_detach(priv->mdev->dev, priv->tunnel_reg_id);
|
|
|
|
priv->tunnel_reg_id = 0;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
static void mlx4_en_tx_timeout(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
2013-06-25 16:09:34 +07:00
|
|
|
int i;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
if (netif_msg_timer(priv))
|
2009-06-02 03:27:13 +07:00
|
|
|
en_warn(priv, "Tx timeout called on port:%d\n", priv->port);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2013-06-25 16:09:34 +07:00
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
|
|
|
if (!netif_tx_queue_stopped(netdev_get_tx_queue(dev, i)))
|
|
|
|
continue;
|
|
|
|
en_warn(priv, "TX timeout on queue: %d, QP: 0x%x, CQ: 0x%x, Cons: 0x%x, Prod: 0x%x\n",
|
2013-11-07 17:19:52 +07:00
|
|
|
i, priv->tx_ring[i]->qpn, priv->tx_ring[i]->cqn,
|
|
|
|
priv->tx_ring[i]->cons, priv->tx_ring[i]->prod);
|
2013-06-25 16:09:34 +07:00
|
|
|
}
|
|
|
|
|
2009-04-20 11:26:05 +07:00
|
|
|
priv->port_stats.tx_timeout++;
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Scheduling watchdog\n");
|
2009-04-20 11:26:05 +07:00
|
|
|
queue_work(mdev->workqueue, &priv->watchdog_task);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
|
2016-05-25 23:50:38 +07:00
|
|
|
static struct rtnl_link_stats64 *
|
|
|
|
mlx4_en_get_stats64(struct net_device *dev, struct rtnl_link_stats64 *stats)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
|
|
|
|
spin_lock_bh(&priv->stats_lock);
|
2016-05-25 23:50:39 +07:00
|
|
|
netdev_stats_to_stats64(stats, &dev->stats);
|
2008-10-23 05:47:49 +07:00
|
|
|
spin_unlock_bh(&priv->stats_lock);
|
|
|
|
|
2016-05-25 23:50:38 +07:00
|
|
|
return stats;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_set_default_moderation(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
struct mlx4_en_cq *cq;
|
|
|
|
int i;
|
|
|
|
|
|
|
|
/* If we haven't received a specific coalescing setting
|
2009-04-22 23:21:29 +07:00
|
|
|
* (module param), we set the moderation parameters as follows:
|
2008-10-23 05:47:49 +07:00
|
|
|
* - moder_cnt is set to the number of mtu sized packets to
|
2012-11-05 23:20:42 +07:00
|
|
|
* satisfy our coalescing target.
|
2008-10-23 05:47:49 +07:00
|
|
|
* - moder_time is set to a fixed value.
|
|
|
|
*/
|
2009-06-02 06:23:13 +07:00
|
|
|
priv->rx_frames = MLX4_EN_RX_COAL_TARGET;
|
2008-12-26 09:19:47 +07:00
|
|
|
priv->rx_usecs = MLX4_EN_RX_COAL_TIME;
|
2012-04-23 09:18:33 +07:00
|
|
|
priv->tx_frames = MLX4_EN_TX_COAL_PKTS;
|
|
|
|
priv->tx_usecs = MLX4_EN_TX_COAL_TIME;
|
2013-02-07 09:25:21 +07:00
|
|
|
en_dbg(INTR, priv, "Default coalesing params for mtu:%d - rx_frames:%d rx_usecs:%d\n",
|
|
|
|
priv->dev->mtu, priv->rx_frames, priv->rx_usecs);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/* Setup cq moderation params */
|
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
cq = priv->rx_cq[i];
|
2008-10-23 05:47:49 +07:00
|
|
|
cq->moder_cnt = priv->rx_frames;
|
|
|
|
cq->moder_time = priv->rx_usecs;
|
2011-10-09 12:38:23 +07:00
|
|
|
priv->last_moder_time[i] = MLX4_EN_AUTO_CONF;
|
|
|
|
priv->last_moder_packets[i] = 0;
|
|
|
|
priv->last_moder_bytes[i] = 0;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
cq = priv->tx_cq[i];
|
2012-04-23 09:18:33 +07:00
|
|
|
cq->moder_cnt = priv->tx_frames;
|
|
|
|
cq->moder_time = priv->tx_usecs;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
/* Reset auto-moderation params */
|
|
|
|
priv->pkt_rate_low = MLX4_EN_RX_RATE_LOW;
|
|
|
|
priv->rx_usecs_low = MLX4_EN_RX_COAL_TIME_LOW;
|
|
|
|
priv->pkt_rate_high = MLX4_EN_RX_RATE_HIGH;
|
|
|
|
priv->rx_usecs_high = MLX4_EN_RX_COAL_TIME_HIGH;
|
|
|
|
priv->sample_interval = MLX4_EN_SAMPLE_INTERVAL;
|
2008-12-26 09:19:47 +07:00
|
|
|
priv->adaptive_rx_coal = 1;
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->last_moder_jiffies = 0;
|
|
|
|
priv->last_moder_tx_packets = 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_auto_moderation(struct mlx4_en_priv *priv)
|
|
|
|
{
|
|
|
|
unsigned long period = (unsigned long) (jiffies - priv->last_moder_jiffies);
|
|
|
|
struct mlx4_en_cq *cq;
|
|
|
|
unsigned long packets;
|
|
|
|
unsigned long rate;
|
|
|
|
unsigned long avg_pkt_size;
|
|
|
|
unsigned long rx_packets;
|
|
|
|
unsigned long rx_bytes;
|
|
|
|
unsigned long rx_pkt_diff;
|
|
|
|
int moder_time;
|
2011-10-09 12:38:23 +07:00
|
|
|
int ring, err;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
if (!priv->adaptive_rx_coal || period < priv->sample_interval * HZ)
|
|
|
|
return;
|
|
|
|
|
2011-10-09 12:38:23 +07:00
|
|
|
for (ring = 0; ring < priv->rx_ring_num; ring++) {
|
|
|
|
spin_lock_bh(&priv->stats_lock);
|
2013-11-07 17:19:52 +07:00
|
|
|
rx_packets = priv->rx_ring[ring]->packets;
|
|
|
|
rx_bytes = priv->rx_ring[ring]->bytes;
|
2011-10-09 12:38:23 +07:00
|
|
|
spin_unlock_bh(&priv->stats_lock);
|
|
|
|
|
|
|
|
rx_pkt_diff = ((unsigned long) (rx_packets -
|
|
|
|
priv->last_moder_packets[ring]));
|
|
|
|
packets = rx_pkt_diff;
|
|
|
|
rate = packets * HZ / period;
|
|
|
|
avg_pkt_size = packets ? ((unsigned long) (rx_bytes -
|
|
|
|
priv->last_moder_bytes[ring])) / packets : 0;
|
|
|
|
|
|
|
|
/* Apply auto-moderation only when packet rate
|
|
|
|
* exceeds a rate that it matters */
|
|
|
|
if (rate > (MLX4_EN_RX_RATE_THRESH / priv->rx_ring_num) &&
|
|
|
|
avg_pkt_size > MLX4_EN_AVG_PKT_SMALL) {
|
2008-10-23 05:47:49 +07:00
|
|
|
if (rate < priv->pkt_rate_low)
|
|
|
|
moder_time = priv->rx_usecs_low;
|
|
|
|
else if (rate > priv->pkt_rate_high)
|
|
|
|
moder_time = priv->rx_usecs_high;
|
|
|
|
else
|
|
|
|
moder_time = (rate - priv->pkt_rate_low) *
|
|
|
|
(priv->rx_usecs_high - priv->rx_usecs_low) /
|
|
|
|
(priv->pkt_rate_high - priv->pkt_rate_low) +
|
|
|
|
priv->rx_usecs_low;
|
2011-10-09 12:38:23 +07:00
|
|
|
} else {
|
|
|
|
moder_time = priv->rx_usecs_low;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2011-10-09 12:38:23 +07:00
|
|
|
if (moder_time != priv->last_moder_time[ring]) {
|
|
|
|
priv->last_moder_time[ring] = moder_time;
|
2013-11-07 17:19:52 +07:00
|
|
|
cq = priv->rx_cq[ring];
|
2008-10-23 05:47:49 +07:00
|
|
|
cq->moder_time = moder_time;
|
2013-06-04 12:13:26 +07:00
|
|
|
cq->moder_cnt = priv->rx_frames;
|
2008-10-23 05:47:49 +07:00
|
|
|
err = mlx4_en_set_cq_moder(priv, cq);
|
2011-10-09 12:38:23 +07:00
|
|
|
if (err)
|
2013-02-07 09:25:21 +07:00
|
|
|
en_err(priv, "Failed modifying moderation for cq:%d\n",
|
|
|
|
ring);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2011-10-09 12:38:23 +07:00
|
|
|
priv->last_moder_packets[ring] = rx_packets;
|
|
|
|
priv->last_moder_bytes[ring] = rx_bytes;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
priv->last_moder_jiffies = jiffies;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_do_get_stats(struct work_struct *work)
|
|
|
|
{
|
2009-04-03 06:56:54 +07:00
|
|
|
struct delayed_work *delay = to_delayed_work(work);
|
2008-10-23 05:47:49 +07:00
|
|
|
struct mlx4_en_priv *priv = container_of(delay, struct mlx4_en_priv,
|
|
|
|
stats_task);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
int err;
|
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (mdev->device_up) {
|
2013-06-25 16:09:30 +07:00
|
|
|
if (priv->port_up) {
|
|
|
|
err = mlx4_en_DUMP_ETH_STATS(mdev, priv->port, 0);
|
|
|
|
if (err)
|
|
|
|
en_dbg(HW, priv, "Could not update stats\n");
|
2013-01-24 08:54:14 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_auto_moderation(priv);
|
2013-06-25 16:09:30 +07:00
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
queue_delayed_work(mdev->workqueue, &priv->stats_task, STATS_DELAY);
|
|
|
|
}
|
2010-08-24 10:46:38 +07:00
|
|
|
if (mdev->mac_removed[MLX4_MAX_PORTS + 1 - priv->port]) {
|
2014-07-08 15:25:24 +07:00
|
|
|
mlx4_en_do_set_mac(priv, priv->current_mac);
|
2010-08-24 10:46:38 +07:00
|
|
|
mdev->mac_removed[MLX4_MAX_PORTS + 1 - priv->port] = 0;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
}
|
|
|
|
|
2013-04-23 13:06:51 +07:00
|
|
|
/* mlx4_en_service_task - Run service task for tasks that needed to be done
|
|
|
|
* periodically
|
|
|
|
*/
|
|
|
|
static void mlx4_en_service_task(struct work_struct *work)
|
|
|
|
{
|
|
|
|
struct delayed_work *delay = to_delayed_work(work);
|
|
|
|
struct mlx4_en_priv *priv = container_of(delay, struct mlx4_en_priv,
|
|
|
|
service_task);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (mdev->device_up) {
|
2013-04-25 12:22:24 +07:00
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_TS)
|
|
|
|
mlx4_en_ptp_overflow_check(mdev);
|
2013-04-23 13:06:51 +07:00
|
|
|
|
2015-04-30 21:32:46 +07:00
|
|
|
mlx4_en_recover_from_oom(priv);
|
2013-04-23 13:06:51 +07:00
|
|
|
queue_delayed_work(mdev->workqueue, &priv->service_task,
|
|
|
|
SERVICE_TASK_DELAY);
|
|
|
|
}
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
static void mlx4_en_linkstate(struct work_struct *work)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = container_of(work, struct mlx4_en_priv,
|
|
|
|
linkstate_task);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
int linkstate = priv->link_state;
|
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
/* If observable port state changed set carrier state and
|
|
|
|
* report to system log */
|
|
|
|
if (priv->last_link_state != linkstate) {
|
|
|
|
if (linkstate == MLX4_DEV_EVENT_PORT_DOWN) {
|
2010-08-24 10:46:01 +07:00
|
|
|
en_info(priv, "Link Down\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
netif_carrier_off(priv->dev);
|
|
|
|
} else {
|
2010-08-24 10:46:01 +07:00
|
|
|
en_info(priv, "Link Up\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
netif_carrier_on(priv->dev);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
priv->last_link_state = linkstate;
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
}
|
|
|
|
|
2014-06-09 14:24:39 +07:00
|
|
|
static int mlx4_en_init_affinity_hint(struct mlx4_en_priv *priv, int ring_idx)
|
|
|
|
{
|
|
|
|
struct mlx4_en_rx_ring *ring = priv->rx_ring[ring_idx];
|
|
|
|
int numa_node = priv->mdev->dev->numa_node;
|
|
|
|
|
|
|
|
if (!zalloc_cpumask_var(&ring->affinity_mask, GFP_KERNEL))
|
|
|
|
return -ENOMEM;
|
|
|
|
|
cpumask_set_cpu_local_first => cpumask_local_spread, lament
da91309e0a7e (cpumask: Utility function to set n'th cpu...) created a
genuinely weird function. I never saw it before, it went through DaveM.
(He only does this to make us other maintainers feel better about our own
mistakes.)
cpumask_set_cpu_local_first's purpose is say "I need to spread things
across N online cpus, choose the ones on this numa node first"; you call
it in a loop.
It can fail. One of the two callers ignores this, the other aborts and
fails the device open.
It can fail in two ways: allocating the off-stack cpumask, or through a
convoluted codepath which AFAICT can only occur if cpu_online_mask
changes. Which shouldn't happen, because if cpu_online_mask can change
while you call this, it could return a now-offline cpu anyway.
It contains a nonsensical test "!cpumask_of_node(numa_node)". This was
drawn to my attention by Geert, who said this causes a warning on Sparc.
It sets a single bit in a cpumask instead of returning a cpu number,
because that's what the callers want.
It could be made more efficient by passing the previous cpu rather than
an index, but that would be more invasive to the callers.
Fixes: da91309e0a7e8966d916a74cce42ed170fde06bf
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au> (then rebased)
Tested-by: Amir Vadai <amirv@mellanox.com>
Acked-by: Amir Vadai <amirv@mellanox.com>
Acked-by: David S. Miller <davem@davemloft.net>
2015-05-09 00:44:13 +07:00
|
|
|
cpumask_set_cpu(cpumask_local_spread(ring_idx, numa_node),
|
|
|
|
ring->affinity_mask);
|
|
|
|
return 0;
|
2014-06-09 14:24:39 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_free_affinity_hint(struct mlx4_en_priv *priv, int ring_idx)
|
|
|
|
{
|
|
|
|
free_cpumask_var(priv->rx_ring[ring_idx]->affinity_mask);
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2016-07-20 02:16:55 +07:00
|
|
|
static void mlx4_en_init_recycle_ring(struct mlx4_en_priv *priv,
|
|
|
|
int tx_ring_idx)
|
|
|
|
{
|
|
|
|
struct mlx4_en_tx_ring *tx_ring = priv->tx_ring[tx_ring_idx];
|
|
|
|
int rr_index;
|
|
|
|
|
|
|
|
rr_index = (priv->xdp_ring_num - priv->tx_ring_num) + tx_ring_idx;
|
|
|
|
if (rr_index >= 0) {
|
|
|
|
tx_ring->free_tx_desc = mlx4_en_recycle_tx_desc;
|
|
|
|
tx_ring->recycle_ring = priv->rx_ring[rr_index];
|
|
|
|
en_dbg(DRV, priv,
|
|
|
|
"Set tx_ring[%d]->recycle_ring = rx_ring[%d]\n",
|
|
|
|
tx_ring_idx, rr_index);
|
|
|
|
} else {
|
|
|
|
tx_ring->recycle_ring = NULL;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2008-12-30 09:39:20 +07:00
|
|
|
int mlx4_en_start_port(struct net_device *dev)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct mlx4_en_cq *cq;
|
|
|
|
struct mlx4_en_tx_ring *tx_ring;
|
|
|
|
int rx_index = 0;
|
|
|
|
int tx_index = 0;
|
|
|
|
int err = 0;
|
|
|
|
int i;
|
|
|
|
int j;
|
2011-03-23 05:38:31 +07:00
|
|
|
u8 mc_list[16] = {0};
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
if (priv->port_up) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "start port called while port already up\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2012-07-05 11:03:43 +07:00
|
|
|
INIT_LIST_HEAD(&priv->mc_list);
|
|
|
|
INIT_LIST_HEAD(&priv->curr_list);
|
2013-01-31 06:07:08 +07:00
|
|
|
INIT_LIST_HEAD(&priv->ethtool_list);
|
|
|
|
memset(&priv->ethtool_rules[0], 0,
|
|
|
|
sizeof(struct ethtool_flow_id) * MAX_NUM_OF_FS_RULES);
|
2012-07-05 11:03:43 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Calculate Rx buf size */
|
|
|
|
dev->mtu = min(dev->mtu, priv->max_mtu);
|
|
|
|
mlx4_en_calc_rx_buf(dev);
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Rx buf size:%d\n", priv->rx_skb_size);
|
2009-05-24 10:17:11 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Configure rx cq's and rings */
|
2009-05-24 10:17:11 +07:00
|
|
|
err = mlx4_en_activate_rx_rings(priv);
|
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed to activate RX rings\n");
|
2009-05-24 10:17:11 +07:00
|
|
|
return err;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
cq = priv->rx_cq[i];
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2014-06-09 14:24:39 +07:00
|
|
|
err = mlx4_en_init_affinity_hint(priv, i);
|
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed preparing IRQ affinity hint\n");
|
|
|
|
goto cq_err;
|
|
|
|
}
|
|
|
|
|
2011-10-09 12:26:31 +07:00
|
|
|
err = mlx4_en_activate_cq(priv, cq, i);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed activating Rx CQ\n");
|
2014-06-09 14:24:39 +07:00
|
|
|
mlx4_en_free_affinity_hint(priv, i);
|
2009-04-27 03:41:34 +07:00
|
|
|
goto cq_err;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2014-12-16 18:28:54 +07:00
|
|
|
|
|
|
|
for (j = 0; j < cq->size; j++) {
|
|
|
|
struct mlx4_cqe *cqe = NULL;
|
|
|
|
|
|
|
|
cqe = mlx4_en_get_cqe(cq->buf, j, priv->cqe_size) +
|
|
|
|
priv->cqe_factor;
|
|
|
|
cqe->owner_sr_opcode = MLX4_CQE_OWNER_MASK;
|
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
err = mlx4_en_set_cq_moder(priv, cq);
|
|
|
|
if (err) {
|
2014-05-08 02:52:57 +07:00
|
|
|
en_err(priv, "Failed setting cq moderation parameters\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_deactivate_cq(priv, cq);
|
2014-06-09 14:24:39 +07:00
|
|
|
mlx4_en_free_affinity_hint(priv, i);
|
2008-10-23 05:47:49 +07:00
|
|
|
goto cq_err;
|
|
|
|
}
|
|
|
|
mlx4_en_arm_cq(priv, cq);
|
2013-11-07 17:19:52 +07:00
|
|
|
priv->rx_ring[i]->cqn = cq->mcq.cqn;
|
2008-10-23 05:47:49 +07:00
|
|
|
++rx_index;
|
|
|
|
}
|
|
|
|
|
2011-12-13 11:16:21 +07:00
|
|
|
/* Set qp number */
|
|
|
|
en_dbg(DRV, priv, "Getting qp number for port %d\n", priv->port);
|
2013-02-07 09:25:22 +07:00
|
|
|
err = mlx4_en_get_qp(priv);
|
2011-03-23 05:38:31 +07:00
|
|
|
if (err) {
|
2011-12-13 11:16:21 +07:00
|
|
|
en_err(priv, "Failed getting eth qp\n");
|
2011-03-23 05:38:31 +07:00
|
|
|
goto cq_err;
|
|
|
|
}
|
|
|
|
mdev->mac_removed[priv->port] = 0;
|
|
|
|
|
2015-06-15 21:59:02 +07:00
|
|
|
priv->counter_index =
|
|
|
|
mlx4_get_default_counter_index(mdev->dev, priv->port);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
err = mlx4_en_config_rss_steer(priv);
|
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed configuring rss steering\n");
|
2011-03-23 05:38:31 +07:00
|
|
|
goto mac_err;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2012-07-05 11:03:50 +07:00
|
|
|
err = mlx4_en_create_drop_qp(priv);
|
|
|
|
if (err)
|
|
|
|
goto rss_err;
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Configure tx cq's and rings */
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
|
|
|
/* Configure cq */
|
2013-11-07 17:19:52 +07:00
|
|
|
cq = priv->tx_cq[i];
|
2011-10-09 12:26:31 +07:00
|
|
|
err = mlx4_en_activate_cq(priv, cq, i);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed allocating Tx CQ\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
goto tx_err;
|
|
|
|
}
|
|
|
|
err = mlx4_en_set_cq_moder(priv, cq);
|
|
|
|
if (err) {
|
2014-05-08 02:52:57 +07:00
|
|
|
en_err(priv, "Failed setting cq moderation parameters\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_deactivate_cq(priv, cq);
|
|
|
|
goto tx_err;
|
|
|
|
}
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Resetting index of collapsed CQ:%d to -1\n", i);
|
2008-10-23 05:47:49 +07:00
|
|
|
cq->buf->wqe_index = cpu_to_be16(0xffff);
|
|
|
|
|
|
|
|
/* Configure ring */
|
2013-11-07 17:19:52 +07:00
|
|
|
tx_ring = priv->tx_ring[i];
|
2012-04-05 04:33:24 +07:00
|
|
|
err = mlx4_en_activate_tx_ring(priv, tx_ring, cq->mcq.cqn,
|
2012-12-02 10:49:23 +07:00
|
|
|
i / priv->num_tx_rings_p_up);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed allocating Tx ring\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_deactivate_cq(priv, cq);
|
|
|
|
goto tx_err;
|
|
|
|
}
|
2012-04-23 09:18:50 +07:00
|
|
|
tx_ring->tx_queue = netdev_get_tx_queue(dev, i);
|
2012-04-23 09:18:39 +07:00
|
|
|
|
2016-07-20 02:16:55 +07:00
|
|
|
mlx4_en_init_recycle_ring(priv, i);
|
|
|
|
|
2012-04-23 09:18:39 +07:00
|
|
|
/* Arm CQ for TX completions */
|
|
|
|
mlx4_en_arm_cq(priv, cq);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Set initial ownership of all Tx TXBBs to SW (1) */
|
|
|
|
for (j = 0; j < tx_ring->buf_size; j += STAMP_STRIDE)
|
|
|
|
*((u32 *) (tx_ring->buf + j)) = 0xffffffff;
|
|
|
|
++tx_index;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Configure port */
|
|
|
|
err = mlx4_SET_PORT_general(mdev->dev, priv->port,
|
|
|
|
priv->rx_skb_size + ETH_FCS_LEN,
|
2008-11-05 11:48:36 +07:00
|
|
|
priv->prof->tx_pause,
|
|
|
|
priv->prof->tx_ppp,
|
|
|
|
priv->prof->rx_pause,
|
|
|
|
priv->prof->rx_ppp);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err) {
|
2013-02-07 09:25:21 +07:00
|
|
|
en_err(priv, "Failed setting port general configurations for port %d, with error %d\n",
|
|
|
|
priv->port, err);
|
2008-10-23 05:47:49 +07:00
|
|
|
goto tx_err;
|
|
|
|
}
|
|
|
|
/* Set default qp number */
|
|
|
|
err = mlx4_SET_PORT_qpn_calc(mdev->dev, priv->port, priv->base_qpn, 0);
|
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed setting default qp numbers\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
goto tx_err;
|
|
|
|
}
|
|
|
|
|
2013-12-23 21:09:44 +07:00
|
|
|
if (mdev->dev->caps.tunnel_offload_mode == MLX4_TUNNEL_OFFLOAD_MODE_VXLAN) {
|
2014-03-27 19:02:04 +07:00
|
|
|
err = mlx4_SET_PORT_VXLAN(mdev->dev, priv->port, VXLAN_STEER_BY_OUTER_MAC, 1);
|
2013-12-23 21:09:44 +07:00
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed setting port L2 tunnel configuration, err %d\n",
|
|
|
|
err);
|
|
|
|
goto tx_err;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Init port */
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(HW, priv, "Initializing port\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
err = mlx4_INIT_PORT(mdev->dev, priv->port);
|
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed Initializing port\n");
|
2011-03-23 05:38:31 +07:00
|
|
|
goto tx_err;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2015-10-08 21:14:01 +07:00
|
|
|
/* Set Unicast and VXLAN steering rules */
|
|
|
|
if (mdev->dev->caps.steering_mode != MLX4_STEERING_MODE_A0 &&
|
|
|
|
mlx4_en_set_rss_steer_rules(priv))
|
|
|
|
mlx4_warn(mdev, "Failed setting steering rules\n");
|
|
|
|
|
2011-03-23 05:38:31 +07:00
|
|
|
/* Attach rx QP to bradcast address */
|
2015-03-03 10:54:47 +07:00
|
|
|
eth_broadcast_addr(&mc_list[10]);
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
mc_list[5] = priv->port; /* needed for B0 steering support */
|
2011-03-23 05:38:31 +07:00
|
|
|
if (mlx4_multicast_attach(mdev->dev, &priv->rss_map.indir_qp, mc_list,
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
priv->port, 0, MLX4_PROT_ETH,
|
|
|
|
&priv->broadcast_id))
|
2011-03-23 05:38:31 +07:00
|
|
|
mlx4_warn(mdev, "Failed Attaching Broadcast\n");
|
|
|
|
|
2011-03-27 08:01:26 +07:00
|
|
|
/* Must redo promiscuous mode setup. */
|
|
|
|
priv->flags &= ~(MLX4_EN_FLAG_PROMISC | MLX4_EN_FLAG_MC_PROMISC);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Schedule multicast task to populate multicast list */
|
2013-02-07 09:25:23 +07:00
|
|
|
queue_work(mdev->workqueue, &priv->rx_mode_task);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2014-11-18 22:51:27 +07:00
|
|
|
if (priv->mdev->dev->caps.tunnel_offload_mode == MLX4_TUNNEL_OFFLOAD_MODE_VXLAN)
|
2016-06-17 02:22:30 +07:00
|
|
|
udp_tunnel_get_rx_info(dev);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->port_up = true;
|
2009-06-21 05:15:46 +07:00
|
|
|
netif_tx_start_all_queues(dev);
|
2013-01-31 06:07:11 +07:00
|
|
|
netif_device_attach(dev);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
return 0;
|
|
|
|
|
|
|
|
tx_err:
|
|
|
|
while (tx_index--) {
|
2013-11-07 17:19:52 +07:00
|
|
|
mlx4_en_deactivate_tx_ring(priv, priv->tx_ring[tx_index]);
|
|
|
|
mlx4_en_deactivate_cq(priv, priv->tx_cq[tx_index]);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2012-07-05 11:03:50 +07:00
|
|
|
mlx4_en_destroy_drop_qp(priv);
|
|
|
|
rss_err:
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_release_rss_steer(priv);
|
2011-03-23 05:38:31 +07:00
|
|
|
mac_err:
|
2013-02-07 09:25:22 +07:00
|
|
|
mlx4_en_put_qp(priv);
|
2008-10-23 05:47:49 +07:00
|
|
|
cq_err:
|
2014-06-09 14:24:39 +07:00
|
|
|
while (rx_index--) {
|
2013-11-07 17:19:52 +07:00
|
|
|
mlx4_en_deactivate_cq(priv, priv->rx_cq[rx_index]);
|
2015-04-30 05:59:35 +07:00
|
|
|
mlx4_en_free_affinity_hint(priv, rx_index);
|
2014-06-09 14:24:39 +07:00
|
|
|
}
|
2009-05-24 10:17:11 +07:00
|
|
|
for (i = 0; i < priv->rx_ring_num; i++)
|
2013-11-07 17:19:52 +07:00
|
|
|
mlx4_en_deactivate_rx_ring(priv, priv->rx_ring[i]);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
return err; /* need to close devices */
|
|
|
|
}
|
|
|
|
|
|
|
|
|
2013-01-31 06:07:11 +07:00
|
|
|
void mlx4_en_stop_port(struct net_device *dev, int detach)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
2012-07-05 11:03:43 +07:00
|
|
|
struct mlx4_en_mc_list *mclist, *tmp;
|
2013-01-31 06:07:08 +07:00
|
|
|
struct ethtool_flow_id *flow, *tmp_flow;
|
2008-10-23 05:47:49 +07:00
|
|
|
int i;
|
2011-03-23 05:38:31 +07:00
|
|
|
u8 mc_list[16] = {0};
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
if (!priv->port_up) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "stop port called while port already down\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
return;
|
|
|
|
}
|
|
|
|
|
2013-06-25 16:09:33 +07:00
|
|
|
/* close port*/
|
|
|
|
mlx4_CLOSE_PORT(mdev->dev, priv->port);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Synchronize with tx routine */
|
|
|
|
netif_tx_lock_bh(dev);
|
2013-01-31 06:07:11 +07:00
|
|
|
if (detach)
|
|
|
|
netif_device_detach(dev);
|
2009-06-21 05:15:52 +07:00
|
|
|
netif_tx_stop_all_queues(dev);
|
2008-10-23 05:47:49 +07:00
|
|
|
netif_tx_unlock_bh(dev);
|
|
|
|
|
2013-01-31 06:07:11 +07:00
|
|
|
netif_tx_disable(dev);
|
|
|
|
|
2010-08-24 10:45:45 +07:00
|
|
|
/* Set port as not active */
|
2009-06-21 05:15:52 +07:00
|
|
|
priv->port_up = false;
|
2015-06-15 21:59:02 +07:00
|
|
|
priv->counter_index = MLX4_SINK_COUNTER_INDEX(mdev->dev);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2013-01-24 08:54:15 +07:00
|
|
|
/* Promsicuous mode */
|
|
|
|
if (mdev->dev->caps.steering_mode ==
|
|
|
|
MLX4_STEERING_MODE_DEVICE_MANAGED) {
|
|
|
|
priv->flags &= ~(MLX4_EN_FLAG_PROMISC |
|
|
|
|
MLX4_EN_FLAG_MC_PROMISC);
|
|
|
|
mlx4_flow_steer_promisc_remove(mdev->dev,
|
|
|
|
priv->port,
|
2013-04-24 20:58:45 +07:00
|
|
|
MLX4_FS_ALL_DEFAULT);
|
2013-01-24 08:54:15 +07:00
|
|
|
mlx4_flow_steer_promisc_remove(mdev->dev,
|
|
|
|
priv->port,
|
2013-04-24 20:58:45 +07:00
|
|
|
MLX4_FS_MC_DEFAULT);
|
2013-01-24 08:54:15 +07:00
|
|
|
} else if (priv->flags & MLX4_EN_FLAG_PROMISC) {
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_PROMISC;
|
|
|
|
|
|
|
|
/* Disable promiscouos mode */
|
|
|
|
mlx4_unicast_promisc_remove(mdev->dev, priv->base_qpn,
|
|
|
|
priv->port);
|
|
|
|
|
|
|
|
/* Disable Multicast promisc */
|
|
|
|
if (priv->flags & MLX4_EN_FLAG_MC_PROMISC) {
|
|
|
|
mlx4_multicast_promisc_remove(mdev->dev, priv->base_qpn,
|
|
|
|
priv->port);
|
|
|
|
priv->flags &= ~MLX4_EN_FLAG_MC_PROMISC;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2011-03-23 05:38:31 +07:00
|
|
|
/* Detach All multicasts */
|
2015-03-03 10:54:47 +07:00
|
|
|
eth_broadcast_addr(&mc_list[10]);
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
mc_list[5] = priv->port; /* needed for B0 steering support */
|
2011-03-23 05:38:31 +07:00
|
|
|
mlx4_multicast_detach(mdev->dev, &priv->rss_map.indir_qp, mc_list,
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
MLX4_PROT_ETH, priv->broadcast_id);
|
2012-07-05 11:03:43 +07:00
|
|
|
list_for_each_entry(mclist, &priv->curr_list, list) {
|
|
|
|
memcpy(&mc_list[10], mclist->addr, ETH_ALEN);
|
2011-03-23 05:38:31 +07:00
|
|
|
mc_list[5] = priv->port;
|
|
|
|
mlx4_multicast_detach(mdev->dev, &priv->rss_map.indir_qp,
|
{NET, IB}/mlx4: Add device managed flow steering firmware API
The driver is modified to support three operation modes.
If supported by firmware use the device managed flow steering
API, that which we call device managed steering mode. Else, if
the firmware supports the B0 steering mode use it, and finally,
if none of the above, use the A0 steering mode.
When the steering mode is device managed, the code is modified
such that L2 based rules set by the mlx4_en driver for Ethernet
unicast and multicast, and the IB stack multicast attach calls
done through the mlx4_ib driver are all routed to use the device
managed API.
When attaching rule using device managed flow steering API,
the firmware returns a 64 bit registration id, which is to be
provided during detach.
Currently the firmware is always programmed during HCA initialization
to use standard L2 hashing. Future work should be done to allow
configuring the flow-steering hash function with common, non
proprietary means.
Signed-off-by: Hadar Hen Zion <hadarh@mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2012-07-05 11:03:46 +07:00
|
|
|
mc_list, MLX4_PROT_ETH, mclist->reg_id);
|
2014-03-13 19:52:15 +07:00
|
|
|
if (mclist->tunnel_reg_id)
|
|
|
|
mlx4_flow_detach(mdev->dev, mclist->tunnel_reg_id);
|
2011-03-23 05:38:31 +07:00
|
|
|
}
|
|
|
|
mlx4_en_clear_list(dev);
|
2012-07-05 11:03:43 +07:00
|
|
|
list_for_each_entry_safe(mclist, tmp, &priv->curr_list, list) {
|
|
|
|
list_del(&mclist->list);
|
|
|
|
kfree(mclist);
|
|
|
|
}
|
|
|
|
|
2011-03-23 05:38:31 +07:00
|
|
|
/* Flush multicast filter */
|
|
|
|
mlx4_SET_MCAST_FLTR(mdev->dev, priv->port, 0, 1, MLX4_MCAST_CONFIG);
|
|
|
|
|
2013-03-21 12:55:53 +07:00
|
|
|
/* Remove flow steering rules for the port*/
|
|
|
|
if (mdev->dev->caps.steering_mode ==
|
|
|
|
MLX4_STEERING_MODE_DEVICE_MANAGED) {
|
|
|
|
ASSERT_RTNL();
|
|
|
|
list_for_each_entry_safe(flow, tmp_flow,
|
|
|
|
&priv->ethtool_list, list) {
|
|
|
|
mlx4_flow_detach(mdev->dev, flow->id);
|
|
|
|
list_del(&flow->list);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2012-07-05 11:03:50 +07:00
|
|
|
mlx4_en_destroy_drop_qp(priv);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Free TX Rings */
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
mlx4_en_deactivate_tx_ring(priv, priv->tx_ring[i]);
|
|
|
|
mlx4_en_deactivate_cq(priv, priv->tx_cq[i]);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
msleep(10);
|
|
|
|
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++)
|
2013-11-07 17:19:52 +07:00
|
|
|
mlx4_en_free_tx_buf(dev, priv->tx_ring[i]);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2015-10-08 21:14:01 +07:00
|
|
|
if (mdev->dev->caps.steering_mode != MLX4_STEERING_MODE_A0)
|
|
|
|
mlx4_en_delete_rss_steer_rules(priv);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Free RSS qps */
|
|
|
|
mlx4_en_release_rss_steer(priv);
|
|
|
|
|
2011-12-13 11:16:21 +07:00
|
|
|
/* Unregister Mac address for the port */
|
2013-02-07 09:25:22 +07:00
|
|
|
mlx4_en_put_qp(priv);
|
2013-10-15 21:55:22 +07:00
|
|
|
if (!(mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_REASSIGN_MAC_EN))
|
2013-01-31 06:07:10 +07:00
|
|
|
mdev->mac_removed[priv->port] = 1;
|
2011-12-13 11:16:21 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Free RX Rings */
|
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
struct mlx4_en_cq *cq = priv->rx_cq[i];
|
2013-06-18 20:18:27 +07:00
|
|
|
|
2014-10-27 16:37:45 +07:00
|
|
|
napi_synchronize(&cq->napi);
|
2013-11-07 17:19:52 +07:00
|
|
|
mlx4_en_deactivate_rx_ring(priv, priv->rx_ring[i]);
|
2013-06-18 20:18:27 +07:00
|
|
|
mlx4_en_deactivate_cq(priv, cq);
|
2014-06-09 14:24:39 +07:00
|
|
|
|
|
|
|
mlx4_en_free_affinity_hint(priv, i);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_restart(struct work_struct *work)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = container_of(work, struct mlx4_en_priv,
|
|
|
|
watchdog_task);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct net_device *dev = priv->dev;
|
|
|
|
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Watchdog task called for port %d\n", priv->port);
|
2009-04-20 11:26:05 +07:00
|
|
|
|
2016-04-19 02:19:44 +07:00
|
|
|
rtnl_lock();
|
2009-04-20 11:26:05 +07:00
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (priv->port_up) {
|
2013-01-31 06:07:11 +07:00
|
|
|
mlx4_en_stop_port(dev, 1);
|
2009-04-20 11:26:05 +07:00
|
|
|
if (mlx4_en_start_port(dev))
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed restarting port %d\n", priv->port);
|
2009-04-20 11:26:05 +07:00
|
|
|
}
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
2016-04-19 02:19:44 +07:00
|
|
|
rtnl_unlock();
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2012-01-19 16:42:37 +07:00
|
|
|
static void mlx4_en_clear_stats(struct net_device *dev)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
int i;
|
|
|
|
|
|
|
|
if (mlx4_en_DUMP_ETH_STATS(mdev, priv->port, 1))
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(HW, priv, "Failed dumping statistics\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
memset(&priv->pstats, 0, sizeof(priv->pstats));
|
2012-01-19 16:42:37 +07:00
|
|
|
memset(&priv->pkstats, 0, sizeof(priv->pkstats));
|
|
|
|
memset(&priv->port_stats, 0, sizeof(priv->port_stats));
|
2015-03-30 21:45:25 +07:00
|
|
|
memset(&priv->rx_flowstats, 0, sizeof(priv->rx_flowstats));
|
|
|
|
memset(&priv->tx_flowstats, 0, sizeof(priv->tx_flowstats));
|
|
|
|
memset(&priv->rx_priority_flowstats, 0,
|
|
|
|
sizeof(priv->rx_priority_flowstats));
|
|
|
|
memset(&priv->tx_priority_flowstats, 0,
|
|
|
|
sizeof(priv->tx_priority_flowstats));
|
2015-06-15 21:59:06 +07:00
|
|
|
memset(&priv->pf_stats, 0, sizeof(priv->pf_stats));
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
priv->tx_ring[i]->bytes = 0;
|
|
|
|
priv->tx_ring[i]->packets = 0;
|
|
|
|
priv->tx_ring[i]->tx_csum = 0;
|
2016-05-25 23:50:36 +07:00
|
|
|
priv->tx_ring[i]->tx_dropped = 0;
|
2016-05-25 23:50:37 +07:00
|
|
|
priv->tx_ring[i]->queue_stopped = 0;
|
|
|
|
priv->tx_ring[i]->wake_queue = 0;
|
|
|
|
priv->tx_ring[i]->tso_packets = 0;
|
|
|
|
priv->tx_ring[i]->xmit_more = 0;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
priv->rx_ring[i]->bytes = 0;
|
|
|
|
priv->rx_ring[i]->packets = 0;
|
|
|
|
priv->rx_ring[i]->csum_ok = 0;
|
|
|
|
priv->rx_ring[i]->csum_none = 0;
|
2014-11-09 18:51:53 +07:00
|
|
|
priv->rx_ring[i]->csum_complete = 0;
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2012-01-19 16:42:37 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_en_open(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
int err = 0;
|
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
|
|
|
|
if (!mdev->device_up) {
|
|
|
|
en_err(priv, "Cannot open - device down/disabled\n");
|
|
|
|
err = -EBUSY;
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Reset HW statistics and SW counters */
|
|
|
|
mlx4_en_clear_stats(dev);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
err = mlx4_en_start_port(dev);
|
|
|
|
if (err)
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed starting port:%d\n", priv->port);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
out:
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
|
|
static int mlx4_en_close(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(IFDOWN, priv, "Close port called\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
|
2013-01-31 06:07:11 +07:00
|
|
|
mlx4_en_stop_port(dev, 0);
|
2008-10-23 05:47:49 +07:00
|
|
|
netif_carrier_off(dev);
|
|
|
|
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2016-07-18 22:35:12 +07:00
|
|
|
static void mlx4_en_free_resources(struct mlx4_en_priv *priv)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
int i;
|
|
|
|
|
2012-07-19 05:33:52 +07:00
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
|
|
|
priv->dev->rx_cpu_rmap = NULL;
|
|
|
|
#endif
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
if (priv->tx_ring && priv->tx_ring[i])
|
2008-10-23 05:47:49 +07:00
|
|
|
mlx4_en_destroy_tx_ring(priv, &priv->tx_ring[i]);
|
2013-11-07 17:19:52 +07:00
|
|
|
if (priv->tx_cq && priv->tx_cq[i])
|
2011-10-09 12:26:46 +07:00
|
|
|
mlx4_en_destroy_cq(priv, &priv->tx_cq[i]);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
2013-11-07 17:19:52 +07:00
|
|
|
if (priv->rx_ring[i])
|
2012-02-06 15:39:49 +07:00
|
|
|
mlx4_en_destroy_rx_ring(priv, &priv->rx_ring[i],
|
|
|
|
priv->prof->rx_ring_size, priv->stride);
|
2013-11-07 17:19:52 +07:00
|
|
|
if (priv->rx_cq[i])
|
2011-10-09 12:26:46 +07:00
|
|
|
mlx4_en_destroy_cq(priv, &priv->rx_cq[i]);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
2012-06-25 07:24:13 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2016-07-18 22:35:12 +07:00
|
|
|
static int mlx4_en_alloc_resources(struct mlx4_en_priv *priv)
|
2008-10-23 05:47:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_port_profile *prof = priv->prof;
|
|
|
|
int i;
|
2013-11-07 17:19:54 +07:00
|
|
|
int node;
|
2011-03-23 05:38:52 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Create tx Rings */
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
2013-11-07 17:19:54 +07:00
|
|
|
node = cpu_to_node(i % num_online_cpus());
|
2008-10-23 05:47:49 +07:00
|
|
|
if (mlx4_en_create_cq(priv, &priv->tx_cq[i],
|
2013-11-07 17:19:54 +07:00
|
|
|
prof->tx_ring_size, i, TX, node))
|
2008-10-23 05:47:49 +07:00
|
|
|
goto err;
|
|
|
|
|
2013-12-20 02:20:14 +07:00
|
|
|
if (mlx4_en_create_tx_ring(priv, &priv->tx_ring[i],
|
|
|
|
prof->tx_ring_size, TXBB_SIZE,
|
|
|
|
node, i))
|
2008-10-23 05:47:49 +07:00
|
|
|
goto err;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* Create rx Rings */
|
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
2013-11-07 17:19:54 +07:00
|
|
|
node = cpu_to_node(i % num_online_cpus());
|
2008-10-23 05:47:49 +07:00
|
|
|
if (mlx4_en_create_cq(priv, &priv->rx_cq[i],
|
2013-11-07 17:19:54 +07:00
|
|
|
prof->rx_ring_size, i, RX, node))
|
2008-10-23 05:47:49 +07:00
|
|
|
goto err;
|
|
|
|
|
|
|
|
if (mlx4_en_create_rx_ring(priv, &priv->rx_ring[i],
|
2013-11-07 17:19:54 +07:00
|
|
|
prof->rx_ring_size, priv->stride,
|
|
|
|
node))
|
2008-10-23 05:47:49 +07:00
|
|
|
goto err;
|
|
|
|
}
|
|
|
|
|
2012-07-19 05:33:52 +07:00
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
2015-05-31 13:30:16 +07:00
|
|
|
priv->dev->rx_cpu_rmap = mlx4_get_cpu_rmap(priv->mdev->dev, priv->port);
|
2012-07-19 05:33:52 +07:00
|
|
|
#endif
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
return 0;
|
|
|
|
|
|
|
|
err:
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed to allocate NIC resources\n");
|
2013-11-07 17:19:52 +07:00
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
|
|
|
if (priv->rx_ring[i])
|
|
|
|
mlx4_en_destroy_rx_ring(priv, &priv->rx_ring[i],
|
|
|
|
prof->rx_ring_size,
|
|
|
|
priv->stride);
|
|
|
|
if (priv->rx_cq[i])
|
|
|
|
mlx4_en_destroy_cq(priv, &priv->rx_cq[i]);
|
|
|
|
}
|
|
|
|
for (i = 0; i < priv->tx_ring_num; i++) {
|
|
|
|
if (priv->tx_ring[i])
|
|
|
|
mlx4_en_destroy_tx_ring(priv, &priv->tx_ring[i]);
|
|
|
|
if (priv->tx_cq[i])
|
|
|
|
mlx4_en_destroy_cq(priv, &priv->tx_cq[i]);
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
return -ENOMEM;
|
|
|
|
}
|
|
|
|
|
2016-06-21 18:20:03 +07:00
|
|
|
static void mlx4_en_shutdown(struct net_device *dev)
|
|
|
|
{
|
|
|
|
rtnl_lock();
|
|
|
|
netif_device_detach(dev);
|
|
|
|
mlx4_en_close(dev);
|
|
|
|
rtnl_unlock();
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2016-07-18 22:35:12 +07:00
|
|
|
static int mlx4_en_copy_priv(struct mlx4_en_priv *dst,
|
|
|
|
struct mlx4_en_priv *src,
|
|
|
|
struct mlx4_en_port_profile *prof)
|
|
|
|
{
|
|
|
|
memcpy(&dst->hwtstamp_config, &prof->hwtstamp_config,
|
|
|
|
sizeof(dst->hwtstamp_config));
|
|
|
|
dst->num_tx_rings_p_up = src->mdev->profile.num_tx_rings_p_up;
|
|
|
|
dst->tx_ring_num = prof->tx_ring_num;
|
|
|
|
dst->rx_ring_num = prof->rx_ring_num;
|
|
|
|
dst->flags = prof->flags;
|
|
|
|
dst->mdev = src->mdev;
|
|
|
|
dst->port = src->port;
|
|
|
|
dst->dev = src->dev;
|
|
|
|
dst->prof = prof;
|
|
|
|
dst->stride = roundup_pow_of_two(sizeof(struct mlx4_en_rx_desc) +
|
|
|
|
DS_SIZE * MLX4_EN_MAX_RX_FRAGS);
|
|
|
|
|
|
|
|
dst->tx_ring = kzalloc(sizeof(struct mlx4_en_tx_ring *) * MAX_TX_RINGS,
|
|
|
|
GFP_KERNEL);
|
|
|
|
if (!dst->tx_ring)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
|
|
|
dst->tx_cq = kzalloc(sizeof(struct mlx4_en_cq *) * MAX_TX_RINGS,
|
|
|
|
GFP_KERNEL);
|
|
|
|
if (!dst->tx_cq) {
|
|
|
|
kfree(dst->tx_ring);
|
|
|
|
return -ENOMEM;
|
|
|
|
}
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_update_priv(struct mlx4_en_priv *dst,
|
|
|
|
struct mlx4_en_priv *src)
|
|
|
|
{
|
|
|
|
memcpy(dst->rx_ring, src->rx_ring,
|
|
|
|
sizeof(struct mlx4_en_rx_ring *) * src->rx_ring_num);
|
|
|
|
memcpy(dst->rx_cq, src->rx_cq,
|
|
|
|
sizeof(struct mlx4_en_cq *) * src->rx_ring_num);
|
|
|
|
memcpy(&dst->hwtstamp_config, &src->hwtstamp_config,
|
|
|
|
sizeof(dst->hwtstamp_config));
|
|
|
|
dst->tx_ring_num = src->tx_ring_num;
|
|
|
|
dst->rx_ring_num = src->rx_ring_num;
|
|
|
|
dst->tx_ring = src->tx_ring;
|
|
|
|
dst->tx_cq = src->tx_cq;
|
|
|
|
memcpy(dst->prof, src->prof, sizeof(struct mlx4_en_port_profile));
|
|
|
|
}
|
|
|
|
|
|
|
|
int mlx4_en_try_alloc_resources(struct mlx4_en_priv *priv,
|
|
|
|
struct mlx4_en_priv *tmp,
|
|
|
|
struct mlx4_en_port_profile *prof)
|
|
|
|
{
|
|
|
|
mlx4_en_copy_priv(tmp, priv, prof);
|
|
|
|
|
|
|
|
if (mlx4_en_alloc_resources(tmp)) {
|
|
|
|
en_warn(priv,
|
|
|
|
"%s: Resource allocation failed, using previous configuration\n",
|
|
|
|
__func__);
|
|
|
|
kfree(tmp->tx_ring);
|
|
|
|
kfree(tmp->tx_cq);
|
|
|
|
return -ENOMEM;
|
|
|
|
}
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
void mlx4_en_safe_replace_resources(struct mlx4_en_priv *priv,
|
|
|
|
struct mlx4_en_priv *tmp)
|
|
|
|
{
|
|
|
|
mlx4_en_free_resources(priv);
|
|
|
|
mlx4_en_update_priv(priv, tmp);
|
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
void mlx4_en_destroy_netdev(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
2016-06-21 18:20:03 +07:00
|
|
|
bool shutdown = mdev->dev->persist->interface_state &
|
|
|
|
MLX4_INTERFACE_STATE_SHUTDOWN;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Destroying netdev on port:%d\n", priv->port);
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/* Unregister device - this will close the port if it was up */
|
2016-02-26 23:32:24 +07:00
|
|
|
if (priv->registered) {
|
|
|
|
devlink_port_type_clear(mlx4_get_devlink_port(mdev->dev,
|
|
|
|
priv->port));
|
2016-06-21 18:20:03 +07:00
|
|
|
if (shutdown)
|
|
|
|
mlx4_en_shutdown(dev);
|
|
|
|
else
|
|
|
|
unregister_netdev(dev);
|
2016-02-26 23:32:24 +07:00
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
if (priv->allocated)
|
|
|
|
mlx4_free_hwq_res(mdev->dev, &priv->res, MLX4_EN_PAGE_SIZE);
|
|
|
|
|
|
|
|
cancel_delayed_work(&priv->stats_task);
|
2013-04-23 13:06:51 +07:00
|
|
|
cancel_delayed_work(&priv->service_task);
|
2008-10-23 05:47:49 +07:00
|
|
|
/* flush any pending task for this netdev */
|
|
|
|
flush_workqueue(mdev->workqueue);
|
|
|
|
|
2015-12-17 20:35:38 +07:00
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_TS)
|
|
|
|
mlx4_en_remove_timestamp(mdev);
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Detach the netdev so tasks would not attempt to access it */
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
mdev->pndev[priv->port] = NULL;
|
2015-02-03 21:48:34 +07:00
|
|
|
mdev->upper[priv->port] = NULL;
|
2008-10-23 05:47:49 +07:00
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
|
2016-07-18 22:35:11 +07:00
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
|
|
|
mlx4_en_cleanup_filters(priv);
|
|
|
|
#endif
|
|
|
|
|
2011-10-09 12:26:46 +07:00
|
|
|
mlx4_en_free_resources(priv);
|
2012-04-05 04:33:26 +07:00
|
|
|
|
2012-05-17 07:58:10 +07:00
|
|
|
kfree(priv->tx_ring);
|
|
|
|
kfree(priv->tx_cq);
|
|
|
|
|
2016-06-21 18:20:03 +07:00
|
|
|
if (!shutdown)
|
|
|
|
free_netdev(dev);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_en_change_mtu(struct net_device *dev, int new_mtu)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
int err = 0;
|
|
|
|
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Change MTU called - current:%d new:%d\n",
|
2008-10-23 05:47:49 +07:00
|
|
|
dev->mtu, new_mtu);
|
|
|
|
|
|
|
|
if ((new_mtu < MLX4_EN_MIN_MTU) || (new_mtu > priv->max_mtu)) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Bad MTU size:%d.\n", new_mtu);
|
2008-10-23 05:47:49 +07:00
|
|
|
return -EPERM;
|
|
|
|
}
|
2016-07-20 02:16:50 +07:00
|
|
|
if (priv->xdp_ring_num && MLX4_EN_EFF_MTU(new_mtu) > FRAG_SZ0) {
|
|
|
|
en_err(priv, "MTU size:%d requires frags but XDP running\n",
|
|
|
|
new_mtu);
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
dev->mtu = new_mtu;
|
|
|
|
|
|
|
|
if (netif_running(dev)) {
|
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (!mdev->device_up) {
|
|
|
|
/* NIC is probably restarting - let watchdog task reset
|
|
|
|
* the port */
|
2009-06-02 03:27:13 +07:00
|
|
|
en_dbg(DRV, priv, "Change MTU called with card down!?\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
} else {
|
2013-01-31 06:07:11 +07:00
|
|
|
mlx4_en_stop_port(dev, 1);
|
2008-10-23 05:47:49 +07:00
|
|
|
err = mlx4_en_start_port(dev);
|
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed restarting port:%d\n",
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->port);
|
|
|
|
queue_work(mdev->workqueue, &priv->watchdog_task);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
|
|
|
}
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2013-11-19 06:13:31 +07:00
|
|
|
static int mlx4_en_hwtstamp_set(struct net_device *dev, struct ifreq *ifr)
|
2013-04-23 13:06:49 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
|
|
|
struct hwtstamp_config config;
|
|
|
|
|
|
|
|
if (copy_from_user(&config, ifr->ifr_data, sizeof(config)))
|
|
|
|
return -EFAULT;
|
|
|
|
|
|
|
|
/* reserved for future extensions */
|
|
|
|
if (config.flags)
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
/* device doesn't support time stamping */
|
|
|
|
if (!(mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_TS))
|
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
/* TX HW timestamp */
|
|
|
|
switch (config.tx_type) {
|
|
|
|
case HWTSTAMP_TX_OFF:
|
|
|
|
case HWTSTAMP_TX_ON:
|
|
|
|
break;
|
|
|
|
default:
|
|
|
|
return -ERANGE;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* RX HW timestamp */
|
|
|
|
switch (config.rx_filter) {
|
|
|
|
case HWTSTAMP_FILTER_NONE:
|
|
|
|
break;
|
|
|
|
case HWTSTAMP_FILTER_ALL:
|
|
|
|
case HWTSTAMP_FILTER_SOME:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V1_L4_EVENT:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V1_L4_SYNC:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V1_L4_DELAY_REQ:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_L4_EVENT:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_L4_SYNC:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_L4_DELAY_REQ:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_L2_EVENT:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_L2_SYNC:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_L2_DELAY_REQ:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_EVENT:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_SYNC:
|
|
|
|
case HWTSTAMP_FILTER_PTP_V2_DELAY_REQ:
|
|
|
|
config.rx_filter = HWTSTAMP_FILTER_ALL;
|
|
|
|
break;
|
|
|
|
default:
|
|
|
|
return -ERANGE;
|
|
|
|
}
|
|
|
|
|
2014-10-27 16:37:42 +07:00
|
|
|
if (mlx4_en_reset_config(dev, config, dev->features)) {
|
2013-04-23 13:06:49 +07:00
|
|
|
config.tx_type = HWTSTAMP_TX_OFF;
|
|
|
|
config.rx_filter = HWTSTAMP_FILTER_NONE;
|
|
|
|
}
|
|
|
|
|
|
|
|
return copy_to_user(ifr->ifr_data, &config,
|
|
|
|
sizeof(config)) ? -EFAULT : 0;
|
|
|
|
}
|
|
|
|
|
2013-11-19 06:13:31 +07:00
|
|
|
static int mlx4_en_hwtstamp_get(struct net_device *dev, struct ifreq *ifr)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
|
|
|
|
return copy_to_user(ifr->ifr_data, &priv->hwtstamp_config,
|
|
|
|
sizeof(priv->hwtstamp_config)) ? -EFAULT : 0;
|
|
|
|
}
|
|
|
|
|
2013-04-23 13:06:49 +07:00
|
|
|
static int mlx4_en_ioctl(struct net_device *dev, struct ifreq *ifr, int cmd)
|
|
|
|
{
|
|
|
|
switch (cmd) {
|
|
|
|
case SIOCSHWTSTAMP:
|
2013-11-19 06:13:31 +07:00
|
|
|
return mlx4_en_hwtstamp_set(dev, ifr);
|
|
|
|
case SIOCGHWTSTAMP:
|
|
|
|
return mlx4_en_hwtstamp_get(dev, ifr);
|
2013-04-23 13:06:49 +07:00
|
|
|
default:
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-07-27 18:46:34 +07:00
|
|
|
static netdev_features_t mlx4_en_fix_features(struct net_device *netdev,
|
|
|
|
netdev_features_t features)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(netdev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
/* Since there is no support for separate RX C-TAG/S-TAG vlan accel
|
|
|
|
* enable/disable make sure S-TAG flag is always in same state as
|
|
|
|
* C-TAG.
|
|
|
|
*/
|
|
|
|
if (features & NETIF_F_HW_VLAN_CTAG_RX &&
|
|
|
|
!(mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_SKIP_OUTER_VLAN))
|
|
|
|
features |= NETIF_F_HW_VLAN_STAG_RX;
|
|
|
|
else
|
|
|
|
features &= ~NETIF_F_HW_VLAN_STAG_RX;
|
|
|
|
|
|
|
|
return features;
|
|
|
|
}
|
|
|
|
|
2011-11-27 02:55:19 +07:00
|
|
|
static int mlx4_en_set_features(struct net_device *netdev,
|
|
|
|
netdev_features_t features)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(netdev);
|
2015-04-02 20:31:21 +07:00
|
|
|
bool reset = false;
|
2014-10-27 16:37:43 +07:00
|
|
|
int ret = 0;
|
|
|
|
|
2015-04-02 20:31:21 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(netdev, features, NETIF_F_RXFCS)) {
|
|
|
|
en_info(priv, "Turn %s RX-FCS\n",
|
|
|
|
(features & NETIF_F_RXFCS) ? "ON" : "OFF");
|
|
|
|
reset = true;
|
|
|
|
}
|
|
|
|
|
2015-04-02 20:31:22 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(netdev, features, NETIF_F_RXALL)) {
|
|
|
|
u8 ignore_fcs_value = (features & NETIF_F_RXALL) ? 1 : 0;
|
|
|
|
|
|
|
|
en_info(priv, "Turn %s RX-ALL\n",
|
|
|
|
ignore_fcs_value ? "ON" : "OFF");
|
|
|
|
ret = mlx4_SET_PORT_fcs_check(priv->mdev->dev,
|
|
|
|
priv->port, ignore_fcs_value);
|
|
|
|
if (ret)
|
|
|
|
return ret;
|
|
|
|
}
|
|
|
|
|
2014-10-27 16:37:43 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(netdev, features, NETIF_F_HW_VLAN_CTAG_RX)) {
|
|
|
|
en_info(priv, "Turn %s RX vlan strip offload\n",
|
|
|
|
(features & NETIF_F_HW_VLAN_CTAG_RX) ? "ON" : "OFF");
|
2015-04-02 20:31:21 +07:00
|
|
|
reset = true;
|
2014-10-27 16:37:43 +07:00
|
|
|
}
|
2011-11-27 02:55:19 +07:00
|
|
|
|
2015-02-03 22:57:21 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(netdev, features, NETIF_F_HW_VLAN_CTAG_TX))
|
|
|
|
en_info(priv, "Turn %s TX vlan strip offload\n",
|
|
|
|
(features & NETIF_F_HW_VLAN_CTAG_TX) ? "ON" : "OFF");
|
|
|
|
|
2015-07-27 18:46:34 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(netdev, features, NETIF_F_HW_VLAN_STAG_TX))
|
|
|
|
en_info(priv, "Turn %s TX S-VLAN strip offload\n",
|
|
|
|
(features & NETIF_F_HW_VLAN_STAG_TX) ? "ON" : "OFF");
|
|
|
|
|
2015-04-02 20:31:07 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(netdev, features, NETIF_F_LOOPBACK)) {
|
|
|
|
en_info(priv, "Turn %s loopback\n",
|
|
|
|
(features & NETIF_F_LOOPBACK) ? "ON" : "OFF");
|
|
|
|
mlx4_en_update_loopback_state(netdev, features);
|
|
|
|
}
|
2013-02-07 09:25:19 +07:00
|
|
|
|
2015-04-02 20:31:21 +07:00
|
|
|
if (reset) {
|
|
|
|
ret = mlx4_en_reset_config(netdev, priv->hwtstamp_config,
|
|
|
|
features);
|
|
|
|
if (ret)
|
|
|
|
return ret;
|
|
|
|
}
|
2011-11-27 02:55:19 +07:00
|
|
|
|
2015-04-02 20:31:21 +07:00
|
|
|
return 0;
|
2011-11-27 02:55:19 +07:00
|
|
|
}
|
|
|
|
|
2013-04-25 12:22:27 +07:00
|
|
|
static int mlx4_en_set_vf_mac(struct net_device *dev, int queue, u8 *mac)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
2014-03-02 15:25:01 +07:00
|
|
|
u64 mac_u64 = mlx4_mac_to_u64(mac);
|
2013-04-25 12:22:27 +07:00
|
|
|
|
2016-03-02 22:47:46 +07:00
|
|
|
if (is_multicast_ether_addr(mac))
|
2013-04-25 12:22:27 +07:00
|
|
|
return -EINVAL;
|
|
|
|
|
|
|
|
return mlx4_set_vf_mac(mdev->dev, en_priv->port, queue, mac_u64);
|
|
|
|
}
|
|
|
|
|
2013-04-25 12:22:28 +07:00
|
|
|
static int mlx4_en_set_vf_vlan(struct net_device *dev, int vf, u16 vlan, u8 qos)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
return mlx4_set_vf_vlan(mdev->dev, en_priv->port, vf, vlan, qos);
|
|
|
|
}
|
|
|
|
|
2015-04-02 20:31:16 +07:00
|
|
|
static int mlx4_en_set_vf_rate(struct net_device *dev, int vf, int min_tx_rate,
|
|
|
|
int max_tx_rate)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
return mlx4_set_vf_rate(mdev->dev, en_priv->port, vf, min_tx_rate,
|
|
|
|
max_tx_rate);
|
|
|
|
}
|
|
|
|
|
2013-04-25 12:22:29 +07:00
|
|
|
static int mlx4_en_set_vf_spoofchk(struct net_device *dev, int vf, bool setting)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
return mlx4_set_vf_spoofchk(mdev->dev, en_priv->port, vf, setting);
|
|
|
|
}
|
|
|
|
|
2013-04-25 12:22:30 +07:00
|
|
|
static int mlx4_en_get_vf_config(struct net_device *dev, int vf, struct ifla_vf_info *ivf)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
return mlx4_get_vf_config(mdev->dev, en_priv->port, vf, ivf);
|
|
|
|
}
|
2013-04-25 12:22:27 +07:00
|
|
|
|
2013-06-13 17:19:11 +07:00
|
|
|
static int mlx4_en_set_vf_link_state(struct net_device *dev, int vf, int link_state)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
return mlx4_set_vf_link_state(mdev->dev, en_priv->port, vf, link_state);
|
|
|
|
}
|
2013-12-20 02:20:13 +07:00
|
|
|
|
2015-06-15 21:59:08 +07:00
|
|
|
static int mlx4_en_get_vf_stats(struct net_device *dev, int vf,
|
|
|
|
struct ifla_vf_stats *vf_stats)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *en_priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = en_priv->mdev;
|
|
|
|
|
|
|
|
return mlx4_get_vf_stats(mdev->dev, en_priv->port, vf, vf_stats);
|
|
|
|
}
|
|
|
|
|
2013-12-20 02:20:13 +07:00
|
|
|
#define PORT_ID_BYTE_LEN 8
|
|
|
|
static int mlx4_en_get_phys_port_id(struct net_device *dev,
|
2014-11-28 20:34:16 +07:00
|
|
|
struct netdev_phys_item_id *ppid)
|
2013-12-20 02:20:13 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_dev *mdev = priv->mdev->dev;
|
|
|
|
int i;
|
|
|
|
u64 phys_port_id = mdev->caps.phys_port_id[priv->port];
|
|
|
|
|
|
|
|
if (!phys_port_id)
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
|
|
|
|
ppid->id_len = sizeof(phys_port_id);
|
|
|
|
for (i = PORT_ID_BYTE_LEN - 1; i >= 0; --i) {
|
|
|
|
ppid->id[i] = phys_port_id & 0xff;
|
|
|
|
phys_port_id >>= 8;
|
|
|
|
}
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2014-03-27 19:02:04 +07:00
|
|
|
static void mlx4_en_add_vxlan_offloads(struct work_struct *work)
|
|
|
|
{
|
|
|
|
int ret;
|
|
|
|
struct mlx4_en_priv *priv = container_of(work, struct mlx4_en_priv,
|
|
|
|
vxlan_add_task);
|
|
|
|
|
|
|
|
ret = mlx4_config_vxlan_port(priv->mdev->dev, priv->vxlan_port);
|
|
|
|
if (ret)
|
|
|
|
goto out;
|
|
|
|
|
|
|
|
ret = mlx4_SET_PORT_VXLAN(priv->mdev->dev, priv->port,
|
|
|
|
VXLAN_STEER_BY_OUTER_MAC, 1);
|
|
|
|
out:
|
2014-11-09 19:25:39 +07:00
|
|
|
if (ret) {
|
2014-03-27 19:02:04 +07:00
|
|
|
en_err(priv, "failed setting L2 tunnel configuration ret %d\n", ret);
|
2014-11-09 19:25:39 +07:00
|
|
|
return;
|
|
|
|
}
|
|
|
|
|
|
|
|
/* set offloads */
|
2016-05-02 23:38:37 +07:00
|
|
|
priv->dev->hw_enc_features |= NETIF_F_IP_CSUM | NETIF_F_IPV6_CSUM |
|
|
|
|
NETIF_F_RXCSUM |
|
|
|
|
NETIF_F_TSO | NETIF_F_TSO6 |
|
|
|
|
NETIF_F_GSO_UDP_TUNNEL |
|
2016-05-02 23:38:30 +07:00
|
|
|
NETIF_F_GSO_UDP_TUNNEL_CSUM |
|
|
|
|
NETIF_F_GSO_PARTIAL;
|
2014-03-27 19:02:04 +07:00
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_del_vxlan_offloads(struct work_struct *work)
|
|
|
|
{
|
|
|
|
int ret;
|
|
|
|
struct mlx4_en_priv *priv = container_of(work, struct mlx4_en_priv,
|
|
|
|
vxlan_del_task);
|
2014-11-09 19:25:39 +07:00
|
|
|
/* unset offloads */
|
2016-05-02 23:38:37 +07:00
|
|
|
priv->dev->hw_enc_features &= ~(NETIF_F_IP_CSUM | NETIF_F_IPV6_CSUM |
|
|
|
|
NETIF_F_RXCSUM |
|
|
|
|
NETIF_F_TSO | NETIF_F_TSO6 |
|
|
|
|
NETIF_F_GSO_UDP_TUNNEL |
|
2016-05-02 23:38:30 +07:00
|
|
|
NETIF_F_GSO_UDP_TUNNEL_CSUM |
|
|
|
|
NETIF_F_GSO_PARTIAL);
|
2014-03-27 19:02:04 +07:00
|
|
|
|
|
|
|
ret = mlx4_SET_PORT_VXLAN(priv->mdev->dev, priv->port,
|
|
|
|
VXLAN_STEER_BY_OUTER_MAC, 0);
|
|
|
|
if (ret)
|
|
|
|
en_err(priv, "failed setting L2 tunnel configuration ret %d\n", ret);
|
|
|
|
|
|
|
|
priv->vxlan_port = 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_add_vxlan_port(struct net_device *dev,
|
2016-06-17 02:22:30 +07:00
|
|
|
struct udp_tunnel_info *ti)
|
2014-03-27 19:02:04 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
2016-06-17 02:22:30 +07:00
|
|
|
__be16 port = ti->port;
|
2014-03-27 19:02:04 +07:00
|
|
|
__be16 current_port;
|
|
|
|
|
2016-06-17 02:22:30 +07:00
|
|
|
if (ti->type != UDP_TUNNEL_TYPE_VXLAN)
|
2014-03-27 19:02:04 +07:00
|
|
|
return;
|
|
|
|
|
2016-06-17 02:22:30 +07:00
|
|
|
if (ti->sa_family != AF_INET)
|
|
|
|
return;
|
|
|
|
|
|
|
|
if (priv->mdev->dev->caps.tunnel_offload_mode != MLX4_TUNNEL_OFFLOAD_MODE_VXLAN)
|
2014-03-27 19:02:04 +07:00
|
|
|
return;
|
|
|
|
|
|
|
|
current_port = priv->vxlan_port;
|
|
|
|
if (current_port && current_port != port) {
|
|
|
|
en_warn(priv, "vxlan port %d configured, can't add port %d\n",
|
|
|
|
ntohs(current_port), ntohs(port));
|
|
|
|
return;
|
|
|
|
}
|
|
|
|
|
|
|
|
priv->vxlan_port = port;
|
|
|
|
queue_work(priv->mdev->workqueue, &priv->vxlan_add_task);
|
|
|
|
}
|
|
|
|
|
|
|
|
static void mlx4_en_del_vxlan_port(struct net_device *dev,
|
2016-06-17 02:22:30 +07:00
|
|
|
struct udp_tunnel_info *ti)
|
2014-03-27 19:02:04 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
2016-06-17 02:22:30 +07:00
|
|
|
__be16 port = ti->port;
|
2014-03-27 19:02:04 +07:00
|
|
|
__be16 current_port;
|
|
|
|
|
2016-06-17 02:22:30 +07:00
|
|
|
if (ti->type != UDP_TUNNEL_TYPE_VXLAN)
|
2014-03-27 19:02:04 +07:00
|
|
|
return;
|
|
|
|
|
2016-06-17 02:22:30 +07:00
|
|
|
if (ti->sa_family != AF_INET)
|
|
|
|
return;
|
|
|
|
|
|
|
|
if (priv->mdev->dev->caps.tunnel_offload_mode != MLX4_TUNNEL_OFFLOAD_MODE_VXLAN)
|
2014-03-27 19:02:04 +07:00
|
|
|
return;
|
|
|
|
|
|
|
|
current_port = priv->vxlan_port;
|
|
|
|
if (current_port != port) {
|
|
|
|
en_dbg(DRV, priv, "vxlan port %d isn't configured, ignoring\n", ntohs(port));
|
|
|
|
return;
|
|
|
|
}
|
|
|
|
|
|
|
|
queue_work(priv->mdev->workqueue, &priv->vxlan_del_task);
|
|
|
|
}
|
2014-11-14 07:38:14 +07:00
|
|
|
|
2014-12-24 13:37:26 +07:00
|
|
|
static netdev_features_t mlx4_en_features_check(struct sk_buff *skb,
|
|
|
|
struct net_device *dev,
|
|
|
|
netdev_features_t features)
|
2014-11-14 07:38:14 +07:00
|
|
|
{
|
2015-03-27 12:31:12 +07:00
|
|
|
features = vlan_features_check(skb, features);
|
2016-05-02 23:38:37 +07:00
|
|
|
features = vxlan_features_check(skb, features);
|
|
|
|
|
|
|
|
/* The ConnectX-3 doesn't support outer IPv6 checksums but it does
|
|
|
|
* support inner IPv6 checksums and segmentation so we need to
|
|
|
|
* strip that feature if this is an IPv6 encapsulated frame.
|
|
|
|
*/
|
|
|
|
if (skb->encapsulation &&
|
2016-06-16 04:42:11 +07:00
|
|
|
(skb->ip_summed == CHECKSUM_PARTIAL)) {
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
|
|
|
|
if (!priv->vxlan_port ||
|
|
|
|
(ip_hdr(skb)->version != 4) ||
|
|
|
|
(udp_hdr(skb)->dest != priv->vxlan_port))
|
|
|
|
features &= ~(NETIF_F_CSUM_MASK | NETIF_F_GSO_MASK);
|
|
|
|
}
|
2016-05-02 23:38:37 +07:00
|
|
|
|
|
|
|
return features;
|
2014-11-14 07:38:14 +07:00
|
|
|
}
|
2014-03-27 19:02:04 +07:00
|
|
|
|
2015-03-19 07:51:27 +07:00
|
|
|
static int mlx4_en_set_tx_maxrate(struct net_device *dev, int queue_index, u32 maxrate)
|
2015-03-18 19:57:35 +07:00
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_tx_ring *tx_ring = priv->tx_ring[queue_index];
|
|
|
|
struct mlx4_update_qp_params params;
|
|
|
|
int err;
|
|
|
|
|
|
|
|
if (!(priv->mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_QP_RATE_LIMIT))
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
|
|
|
|
/* rate provided to us in Mbs, check if it fits into 12 bits, if not use Gbs */
|
|
|
|
if (maxrate >> 12) {
|
|
|
|
params.rate_unit = MLX4_QP_RATE_LIMIT_GBS;
|
|
|
|
params.rate_val = maxrate / 1000;
|
|
|
|
} else if (maxrate) {
|
|
|
|
params.rate_unit = MLX4_QP_RATE_LIMIT_MBS;
|
|
|
|
params.rate_val = maxrate;
|
|
|
|
} else { /* zero serves to revoke the QP rate-limitation */
|
|
|
|
params.rate_unit = 0;
|
|
|
|
params.rate_val = 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
err = mlx4_update_qp(priv->mdev->dev, tx_ring->qpn, MLX4_UPDATE_QP_RATE_LIMIT,
|
|
|
|
¶ms);
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:50 +07:00
|
|
|
static int mlx4_xdp_set(struct net_device *dev, struct bpf_prog *prog)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
2016-07-20 02:16:52 +07:00
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
2016-07-20 02:16:50 +07:00
|
|
|
struct bpf_prog *old_prog;
|
|
|
|
int xdp_ring_num;
|
2016-07-20 02:16:52 +07:00
|
|
|
int port_up = 0;
|
|
|
|
int err;
|
2016-07-20 02:16:50 +07:00
|
|
|
int i;
|
|
|
|
|
|
|
|
xdp_ring_num = prog ? ALIGN(priv->rx_ring_num, MLX4_EN_NUM_UP) : 0;
|
|
|
|
|
2016-07-20 02:16:52 +07:00
|
|
|
/* No need to reconfigure buffers when simply swapping the
|
|
|
|
* program for a new one.
|
|
|
|
*/
|
|
|
|
if (priv->xdp_ring_num == xdp_ring_num) {
|
|
|
|
if (prog) {
|
|
|
|
prog = bpf_prog_add(prog, priv->rx_ring_num - 1);
|
|
|
|
if (IS_ERR(prog))
|
|
|
|
return PTR_ERR(prog);
|
|
|
|
}
|
net/mlx4_en: protect ring->xdp_prog with rcu_read_lock
Depending on the preempt mode, the bpf_prog stored in xdp_prog may be
freed despite the use of call_rcu inside bpf_prog_put. The situation is
possible when running in PREEMPT_RCU=y mode, for instance, since the rcu
callback for destroying the bpf prog can run even during the bh handling
in the mlx4 rx path.
Several options were considered before this patch was settled on:
Add a napi_synchronize loop in mlx4_xdp_set, which would occur after all
of the rings are updated with the new program.
This approach has the disadvantage that as the number of rings
increases, the speed of update will slow down significantly due to
napi_synchronize's msleep(1).
Add a new rcu_head in bpf_prog_aux, to be used by a new bpf_prog_put_bh.
The action of the bpf_prog_put_bh would be to then call bpf_prog_put
later. Those drivers that consume a bpf prog in a bh context (like mlx4)
would then use the bpf_prog_put_bh instead when the ring is up. This has
the problem of complexity, in maintaining proper refcnts and rcu lists,
and would likely be harder to review. In addition, this approach to
freeing must be exclusive with other frees of the bpf prog, for instance
a _bh prog must not be referenced from a prog array that is consumed by
a non-_bh prog.
The placement of rcu_read_lock in this patch is functionally the same as
putting an rcu_read_lock in napi_poll. Actually doing so could be a
potentially controversial change, but would bring the implementation in
line with sk_busy_loop (though of course the nature of those two paths
is substantially different), and would also avoid future copy/paste
problems with future supporters of XDP. Still, this patch does not take
that opinionated option.
Testing was done with kernels in either PREEMPT_RCU=y or
CONFIG_PREEMPT_VOLUNTARY=y+PREEMPT_RCU=n modes, with neither exhibiting
any drawback. With PREEMPT_RCU=n, the extra call to rcu_read_lock did
not show up in the perf report whatsoever, and with PREEMPT_RCU=y the
overhead of rcu_read_lock (according to perf) was the same before/after.
In the rx path, rcu_read_lock is eventually called for every packet
from netif_receive_skb_internal, so the napi poll call's rcu_read_lock
is easily amortized.
v2:
Remove extra rcu_read_lock in mlx4_en_process_rx_cq body
Annotate xdp_prog with __rcu, and convert all usages to rcu_assign or
rcu_dereference[_protected] as appropriate.
Add explicit mutex lock around rcu_assign instead of xchg loop.
Fixes: d576acf0a22 ("net/mlx4_en: add page recycle to prepare rx ring for tx support")
Acked-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <alexei.starovoitov@gmail.com>
Signed-off-by: Brenden Blanco <bblanco@plumgrid.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2016-09-04 11:29:58 +07:00
|
|
|
mutex_lock(&mdev->state_lock);
|
2016-07-20 02:16:52 +07:00
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
net/mlx4_en: protect ring->xdp_prog with rcu_read_lock
Depending on the preempt mode, the bpf_prog stored in xdp_prog may be
freed despite the use of call_rcu inside bpf_prog_put. The situation is
possible when running in PREEMPT_RCU=y mode, for instance, since the rcu
callback for destroying the bpf prog can run even during the bh handling
in the mlx4 rx path.
Several options were considered before this patch was settled on:
Add a napi_synchronize loop in mlx4_xdp_set, which would occur after all
of the rings are updated with the new program.
This approach has the disadvantage that as the number of rings
increases, the speed of update will slow down significantly due to
napi_synchronize's msleep(1).
Add a new rcu_head in bpf_prog_aux, to be used by a new bpf_prog_put_bh.
The action of the bpf_prog_put_bh would be to then call bpf_prog_put
later. Those drivers that consume a bpf prog in a bh context (like mlx4)
would then use the bpf_prog_put_bh instead when the ring is up. This has
the problem of complexity, in maintaining proper refcnts and rcu lists,
and would likely be harder to review. In addition, this approach to
freeing must be exclusive with other frees of the bpf prog, for instance
a _bh prog must not be referenced from a prog array that is consumed by
a non-_bh prog.
The placement of rcu_read_lock in this patch is functionally the same as
putting an rcu_read_lock in napi_poll. Actually doing so could be a
potentially controversial change, but would bring the implementation in
line with sk_busy_loop (though of course the nature of those two paths
is substantially different), and would also avoid future copy/paste
problems with future supporters of XDP. Still, this patch does not take
that opinionated option.
Testing was done with kernels in either PREEMPT_RCU=y or
CONFIG_PREEMPT_VOLUNTARY=y+PREEMPT_RCU=n modes, with neither exhibiting
any drawback. With PREEMPT_RCU=n, the extra call to rcu_read_lock did
not show up in the perf report whatsoever, and with PREEMPT_RCU=y the
overhead of rcu_read_lock (according to perf) was the same before/after.
In the rx path, rcu_read_lock is eventually called for every packet
from netif_receive_skb_internal, so the napi poll call's rcu_read_lock
is easily amortized.
v2:
Remove extra rcu_read_lock in mlx4_en_process_rx_cq body
Annotate xdp_prog with __rcu, and convert all usages to rcu_assign or
rcu_dereference[_protected] as appropriate.
Add explicit mutex lock around rcu_assign instead of xchg loop.
Fixes: d576acf0a22 ("net/mlx4_en: add page recycle to prepare rx ring for tx support")
Acked-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <alexei.starovoitov@gmail.com>
Signed-off-by: Brenden Blanco <bblanco@plumgrid.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2016-09-04 11:29:58 +07:00
|
|
|
old_prog = rcu_dereference_protected(
|
|
|
|
priv->rx_ring[i]->xdp_prog,
|
|
|
|
lockdep_is_held(&mdev->state_lock));
|
|
|
|
rcu_assign_pointer(priv->rx_ring[i]->xdp_prog, prog);
|
2016-07-20 02:16:52 +07:00
|
|
|
if (old_prog)
|
|
|
|
bpf_prog_put(old_prog);
|
|
|
|
}
|
net/mlx4_en: protect ring->xdp_prog with rcu_read_lock
Depending on the preempt mode, the bpf_prog stored in xdp_prog may be
freed despite the use of call_rcu inside bpf_prog_put. The situation is
possible when running in PREEMPT_RCU=y mode, for instance, since the rcu
callback for destroying the bpf prog can run even during the bh handling
in the mlx4 rx path.
Several options were considered before this patch was settled on:
Add a napi_synchronize loop in mlx4_xdp_set, which would occur after all
of the rings are updated with the new program.
This approach has the disadvantage that as the number of rings
increases, the speed of update will slow down significantly due to
napi_synchronize's msleep(1).
Add a new rcu_head in bpf_prog_aux, to be used by a new bpf_prog_put_bh.
The action of the bpf_prog_put_bh would be to then call bpf_prog_put
later. Those drivers that consume a bpf prog in a bh context (like mlx4)
would then use the bpf_prog_put_bh instead when the ring is up. This has
the problem of complexity, in maintaining proper refcnts and rcu lists,
and would likely be harder to review. In addition, this approach to
freeing must be exclusive with other frees of the bpf prog, for instance
a _bh prog must not be referenced from a prog array that is consumed by
a non-_bh prog.
The placement of rcu_read_lock in this patch is functionally the same as
putting an rcu_read_lock in napi_poll. Actually doing so could be a
potentially controversial change, but would bring the implementation in
line with sk_busy_loop (though of course the nature of those two paths
is substantially different), and would also avoid future copy/paste
problems with future supporters of XDP. Still, this patch does not take
that opinionated option.
Testing was done with kernels in either PREEMPT_RCU=y or
CONFIG_PREEMPT_VOLUNTARY=y+PREEMPT_RCU=n modes, with neither exhibiting
any drawback. With PREEMPT_RCU=n, the extra call to rcu_read_lock did
not show up in the perf report whatsoever, and with PREEMPT_RCU=y the
overhead of rcu_read_lock (according to perf) was the same before/after.
In the rx path, rcu_read_lock is eventually called for every packet
from netif_receive_skb_internal, so the napi poll call's rcu_read_lock
is easily amortized.
v2:
Remove extra rcu_read_lock in mlx4_en_process_rx_cq body
Annotate xdp_prog with __rcu, and convert all usages to rcu_assign or
rcu_dereference[_protected] as appropriate.
Add explicit mutex lock around rcu_assign instead of xchg loop.
Fixes: d576acf0a22 ("net/mlx4_en: add page recycle to prepare rx ring for tx support")
Acked-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <alexei.starovoitov@gmail.com>
Signed-off-by: Brenden Blanco <bblanco@plumgrid.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2016-09-04 11:29:58 +07:00
|
|
|
mutex_unlock(&mdev->state_lock);
|
2016-07-20 02:16:52 +07:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:50 +07:00
|
|
|
if (priv->num_frags > 1) {
|
|
|
|
en_err(priv, "Cannot set XDP if MTU requires multiple frags\n");
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:55 +07:00
|
|
|
if (priv->tx_ring_num < xdp_ring_num + MLX4_EN_NUM_UP) {
|
|
|
|
en_err(priv,
|
|
|
|
"Minimum %d tx channels required to run XDP\n",
|
|
|
|
(xdp_ring_num + MLX4_EN_NUM_UP) / MLX4_EN_NUM_UP);
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:50 +07:00
|
|
|
if (prog) {
|
|
|
|
prog = bpf_prog_add(prog, priv->rx_ring_num - 1);
|
|
|
|
if (IS_ERR(prog))
|
|
|
|
return PTR_ERR(prog);
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:52 +07:00
|
|
|
mutex_lock(&mdev->state_lock);
|
|
|
|
if (priv->port_up) {
|
|
|
|
port_up = 1;
|
|
|
|
mlx4_en_stop_port(dev, 1);
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:50 +07:00
|
|
|
priv->xdp_ring_num = xdp_ring_num;
|
2016-07-20 02:16:55 +07:00
|
|
|
netif_set_real_num_tx_queues(dev, priv->tx_ring_num -
|
|
|
|
priv->xdp_ring_num);
|
2016-07-20 02:16:50 +07:00
|
|
|
|
|
|
|
for (i = 0; i < priv->rx_ring_num; i++) {
|
net/mlx4_en: protect ring->xdp_prog with rcu_read_lock
Depending on the preempt mode, the bpf_prog stored in xdp_prog may be
freed despite the use of call_rcu inside bpf_prog_put. The situation is
possible when running in PREEMPT_RCU=y mode, for instance, since the rcu
callback for destroying the bpf prog can run even during the bh handling
in the mlx4 rx path.
Several options were considered before this patch was settled on:
Add a napi_synchronize loop in mlx4_xdp_set, which would occur after all
of the rings are updated with the new program.
This approach has the disadvantage that as the number of rings
increases, the speed of update will slow down significantly due to
napi_synchronize's msleep(1).
Add a new rcu_head in bpf_prog_aux, to be used by a new bpf_prog_put_bh.
The action of the bpf_prog_put_bh would be to then call bpf_prog_put
later. Those drivers that consume a bpf prog in a bh context (like mlx4)
would then use the bpf_prog_put_bh instead when the ring is up. This has
the problem of complexity, in maintaining proper refcnts and rcu lists,
and would likely be harder to review. In addition, this approach to
freeing must be exclusive with other frees of the bpf prog, for instance
a _bh prog must not be referenced from a prog array that is consumed by
a non-_bh prog.
The placement of rcu_read_lock in this patch is functionally the same as
putting an rcu_read_lock in napi_poll. Actually doing so could be a
potentially controversial change, but would bring the implementation in
line with sk_busy_loop (though of course the nature of those two paths
is substantially different), and would also avoid future copy/paste
problems with future supporters of XDP. Still, this patch does not take
that opinionated option.
Testing was done with kernels in either PREEMPT_RCU=y or
CONFIG_PREEMPT_VOLUNTARY=y+PREEMPT_RCU=n modes, with neither exhibiting
any drawback. With PREEMPT_RCU=n, the extra call to rcu_read_lock did
not show up in the perf report whatsoever, and with PREEMPT_RCU=y the
overhead of rcu_read_lock (according to perf) was the same before/after.
In the rx path, rcu_read_lock is eventually called for every packet
from netif_receive_skb_internal, so the napi poll call's rcu_read_lock
is easily amortized.
v2:
Remove extra rcu_read_lock in mlx4_en_process_rx_cq body
Annotate xdp_prog with __rcu, and convert all usages to rcu_assign or
rcu_dereference[_protected] as appropriate.
Add explicit mutex lock around rcu_assign instead of xchg loop.
Fixes: d576acf0a22 ("net/mlx4_en: add page recycle to prepare rx ring for tx support")
Acked-by: Daniel Borkmann <daniel@iogearbox.net>
Acked-by: Alexei Starovoitov <alexei.starovoitov@gmail.com>
Signed-off-by: Brenden Blanco <bblanco@plumgrid.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2016-09-04 11:29:58 +07:00
|
|
|
old_prog = rcu_dereference_protected(
|
|
|
|
priv->rx_ring[i]->xdp_prog,
|
|
|
|
lockdep_is_held(&mdev->state_lock));
|
|
|
|
rcu_assign_pointer(priv->rx_ring[i]->xdp_prog, prog);
|
2016-07-20 02:16:50 +07:00
|
|
|
if (old_prog)
|
|
|
|
bpf_prog_put(old_prog);
|
|
|
|
}
|
|
|
|
|
2016-07-20 02:16:52 +07:00
|
|
|
if (port_up) {
|
|
|
|
err = mlx4_en_start_port(dev);
|
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed starting port %d for XDP change\n",
|
|
|
|
priv->port);
|
|
|
|
queue_work(mdev->workqueue, &priv->watchdog_task);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
2016-07-20 02:16:50 +07:00
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
static bool mlx4_xdp_attached(struct net_device *dev)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
|
|
|
|
return !!priv->xdp_ring_num;
|
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_xdp(struct net_device *dev, struct netdev_xdp *xdp)
|
|
|
|
{
|
|
|
|
switch (xdp->command) {
|
|
|
|
case XDP_SETUP_PROG:
|
|
|
|
return mlx4_xdp_set(dev, xdp->prog);
|
|
|
|
case XDP_QUERY_PROG:
|
|
|
|
xdp->prog_attached = mlx4_xdp_attached(dev);
|
|
|
|
return 0;
|
|
|
|
default:
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2008-11-22 08:30:58 +07:00
|
|
|
static const struct net_device_ops mlx4_netdev_ops = {
|
|
|
|
.ndo_open = mlx4_en_open,
|
|
|
|
.ndo_stop = mlx4_en_close,
|
|
|
|
.ndo_start_xmit = mlx4_en_xmit,
|
2009-06-02 06:24:07 +07:00
|
|
|
.ndo_select_queue = mlx4_en_select_queue,
|
2016-05-25 23:50:38 +07:00
|
|
|
.ndo_get_stats64 = mlx4_en_get_stats64,
|
2013-02-07 09:25:23 +07:00
|
|
|
.ndo_set_rx_mode = mlx4_en_set_rx_mode,
|
2008-11-22 08:30:58 +07:00
|
|
|
.ndo_set_mac_address = mlx4_en_set_mac,
|
2009-01-09 17:45:37 +07:00
|
|
|
.ndo_validate_addr = eth_validate_addr,
|
2008-11-22 08:30:58 +07:00
|
|
|
.ndo_change_mtu = mlx4_en_change_mtu,
|
2013-04-23 13:06:49 +07:00
|
|
|
.ndo_do_ioctl = mlx4_en_ioctl,
|
2008-11-22 08:30:58 +07:00
|
|
|
.ndo_tx_timeout = mlx4_en_tx_timeout,
|
|
|
|
.ndo_vlan_rx_add_vid = mlx4_en_vlan_rx_add_vid,
|
|
|
|
.ndo_vlan_rx_kill_vid = mlx4_en_vlan_rx_kill_vid,
|
|
|
|
#ifdef CONFIG_NET_POLL_CONTROLLER
|
|
|
|
.ndo_poll_controller = mlx4_en_netpoll,
|
|
|
|
#endif
|
2011-11-27 02:55:19 +07:00
|
|
|
.ndo_set_features = mlx4_en_set_features,
|
2015-07-27 18:46:34 +07:00
|
|
|
.ndo_fix_features = mlx4_en_fix_features,
|
2016-02-17 12:16:15 +07:00
|
|
|
.ndo_setup_tc = __mlx4_en_setup_tc,
|
2012-07-19 05:33:52 +07:00
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
|
|
|
.ndo_rx_flow_steer = mlx4_en_filter_rfs,
|
2013-06-18 20:18:27 +07:00
|
|
|
#endif
|
2013-12-20 02:20:13 +07:00
|
|
|
.ndo_get_phys_port_id = mlx4_en_get_phys_port_id,
|
2016-06-17 02:22:30 +07:00
|
|
|
.ndo_udp_tunnel_add = mlx4_en_add_vxlan_port,
|
|
|
|
.ndo_udp_tunnel_del = mlx4_en_del_vxlan_port,
|
2014-12-24 13:37:26 +07:00
|
|
|
.ndo_features_check = mlx4_en_features_check,
|
2015-03-18 19:57:35 +07:00
|
|
|
.ndo_set_tx_maxrate = mlx4_en_set_tx_maxrate,
|
2016-07-20 02:16:50 +07:00
|
|
|
.ndo_xdp = mlx4_xdp,
|
2008-11-22 08:30:58 +07:00
|
|
|
};
|
|
|
|
|
2013-04-25 12:22:27 +07:00
|
|
|
static const struct net_device_ops mlx4_netdev_ops_master = {
|
|
|
|
.ndo_open = mlx4_en_open,
|
|
|
|
.ndo_stop = mlx4_en_close,
|
|
|
|
.ndo_start_xmit = mlx4_en_xmit,
|
|
|
|
.ndo_select_queue = mlx4_en_select_queue,
|
2016-05-25 23:50:38 +07:00
|
|
|
.ndo_get_stats64 = mlx4_en_get_stats64,
|
2013-04-25 12:22:27 +07:00
|
|
|
.ndo_set_rx_mode = mlx4_en_set_rx_mode,
|
|
|
|
.ndo_set_mac_address = mlx4_en_set_mac,
|
|
|
|
.ndo_validate_addr = eth_validate_addr,
|
|
|
|
.ndo_change_mtu = mlx4_en_change_mtu,
|
|
|
|
.ndo_tx_timeout = mlx4_en_tx_timeout,
|
|
|
|
.ndo_vlan_rx_add_vid = mlx4_en_vlan_rx_add_vid,
|
|
|
|
.ndo_vlan_rx_kill_vid = mlx4_en_vlan_rx_kill_vid,
|
|
|
|
.ndo_set_vf_mac = mlx4_en_set_vf_mac,
|
2013-04-25 12:22:28 +07:00
|
|
|
.ndo_set_vf_vlan = mlx4_en_set_vf_vlan,
|
2015-04-02 20:31:16 +07:00
|
|
|
.ndo_set_vf_rate = mlx4_en_set_vf_rate,
|
2013-04-25 12:22:29 +07:00
|
|
|
.ndo_set_vf_spoofchk = mlx4_en_set_vf_spoofchk,
|
2013-06-13 17:19:11 +07:00
|
|
|
.ndo_set_vf_link_state = mlx4_en_set_vf_link_state,
|
2015-06-15 21:59:08 +07:00
|
|
|
.ndo_get_vf_stats = mlx4_en_get_vf_stats,
|
2013-04-25 12:22:30 +07:00
|
|
|
.ndo_get_vf_config = mlx4_en_get_vf_config,
|
2013-04-25 12:22:27 +07:00
|
|
|
#ifdef CONFIG_NET_POLL_CONTROLLER
|
|
|
|
.ndo_poll_controller = mlx4_en_netpoll,
|
|
|
|
#endif
|
|
|
|
.ndo_set_features = mlx4_en_set_features,
|
2015-07-27 18:46:34 +07:00
|
|
|
.ndo_fix_features = mlx4_en_fix_features,
|
2016-02-17 12:16:15 +07:00
|
|
|
.ndo_setup_tc = __mlx4_en_setup_tc,
|
2013-04-25 12:22:27 +07:00
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
|
|
|
.ndo_rx_flow_steer = mlx4_en_filter_rfs,
|
|
|
|
#endif
|
2013-12-20 02:20:13 +07:00
|
|
|
.ndo_get_phys_port_id = mlx4_en_get_phys_port_id,
|
2016-06-17 02:22:30 +07:00
|
|
|
.ndo_udp_tunnel_add = mlx4_en_add_vxlan_port,
|
|
|
|
.ndo_udp_tunnel_del = mlx4_en_del_vxlan_port,
|
2014-12-24 13:37:26 +07:00
|
|
|
.ndo_features_check = mlx4_en_features_check,
|
2015-03-18 19:57:35 +07:00
|
|
|
.ndo_set_tx_maxrate = mlx4_en_set_tx_maxrate,
|
2016-07-20 02:16:50 +07:00
|
|
|
.ndo_xdp = mlx4_xdp,
|
2013-04-25 12:22:27 +07:00
|
|
|
};
|
|
|
|
|
2015-02-03 21:48:34 +07:00
|
|
|
struct mlx4_en_bond {
|
|
|
|
struct work_struct work;
|
|
|
|
struct mlx4_en_priv *priv;
|
|
|
|
int is_bonded;
|
|
|
|
struct mlx4_port_map port_map;
|
|
|
|
};
|
|
|
|
|
|
|
|
static void mlx4_en_bond_work(struct work_struct *work)
|
|
|
|
{
|
|
|
|
struct mlx4_en_bond *bond = container_of(work,
|
|
|
|
struct mlx4_en_bond,
|
|
|
|
work);
|
|
|
|
int err = 0;
|
|
|
|
struct mlx4_dev *dev = bond->priv->mdev->dev;
|
|
|
|
|
|
|
|
if (bond->is_bonded) {
|
|
|
|
if (!mlx4_is_bonded(dev)) {
|
|
|
|
err = mlx4_bond(dev);
|
|
|
|
if (err)
|
|
|
|
en_err(bond->priv, "Fail to bond device\n");
|
|
|
|
}
|
|
|
|
if (!err) {
|
|
|
|
err = mlx4_port_map_set(dev, &bond->port_map);
|
|
|
|
if (err)
|
|
|
|
en_err(bond->priv, "Fail to set port map [%d][%d]: %d\n",
|
|
|
|
bond->port_map.port1,
|
|
|
|
bond->port_map.port2,
|
|
|
|
err);
|
|
|
|
}
|
|
|
|
} else if (mlx4_is_bonded(dev)) {
|
|
|
|
err = mlx4_unbond(dev);
|
|
|
|
if (err)
|
|
|
|
en_err(bond->priv, "Fail to unbond device\n");
|
|
|
|
}
|
|
|
|
dev_put(bond->priv->dev);
|
|
|
|
kfree(bond);
|
|
|
|
}
|
|
|
|
|
|
|
|
static int mlx4_en_queue_bond_work(struct mlx4_en_priv *priv, int is_bonded,
|
|
|
|
u8 v2p_p1, u8 v2p_p2)
|
|
|
|
{
|
|
|
|
struct mlx4_en_bond *bond = NULL;
|
|
|
|
|
|
|
|
bond = kzalloc(sizeof(*bond), GFP_ATOMIC);
|
|
|
|
if (!bond)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
|
|
|
INIT_WORK(&bond->work, mlx4_en_bond_work);
|
|
|
|
bond->priv = priv;
|
|
|
|
bond->is_bonded = is_bonded;
|
|
|
|
bond->port_map.port1 = v2p_p1;
|
|
|
|
bond->port_map.port2 = v2p_p2;
|
|
|
|
dev_hold(priv->dev);
|
|
|
|
queue_work(priv->mdev->workqueue, &bond->work);
|
|
|
|
return 0;
|
|
|
|
}
|
|
|
|
|
|
|
|
int mlx4_en_netdev_event(struct notifier_block *this,
|
|
|
|
unsigned long event, void *ptr)
|
|
|
|
{
|
|
|
|
struct net_device *ndev = netdev_notifier_info_to_dev(ptr);
|
|
|
|
u8 port = 0;
|
|
|
|
struct mlx4_en_dev *mdev;
|
|
|
|
struct mlx4_dev *dev;
|
|
|
|
int i, num_eth_ports = 0;
|
|
|
|
bool do_bond = true;
|
|
|
|
struct mlx4_en_priv *priv;
|
|
|
|
u8 v2p_port1 = 0;
|
|
|
|
u8 v2p_port2 = 0;
|
|
|
|
|
|
|
|
if (!net_eq(dev_net(ndev), &init_net))
|
|
|
|
return NOTIFY_DONE;
|
|
|
|
|
|
|
|
mdev = container_of(this, struct mlx4_en_dev, nb);
|
|
|
|
dev = mdev->dev;
|
|
|
|
|
|
|
|
/* Go into this mode only when two network devices set on two ports
|
|
|
|
* of the same mlx4 device are slaves of the same bonding master
|
|
|
|
*/
|
|
|
|
mlx4_foreach_port(i, dev, MLX4_PORT_TYPE_ETH) {
|
|
|
|
++num_eth_ports;
|
|
|
|
if (!port && (mdev->pndev[i] == ndev))
|
|
|
|
port = i;
|
|
|
|
mdev->upper[i] = mdev->pndev[i] ?
|
|
|
|
netdev_master_upper_dev_get(mdev->pndev[i]) : NULL;
|
|
|
|
/* condition not met: network device is a slave */
|
|
|
|
if (!mdev->upper[i])
|
|
|
|
do_bond = false;
|
|
|
|
if (num_eth_ports < 2)
|
|
|
|
continue;
|
|
|
|
/* condition not met: same master */
|
|
|
|
if (mdev->upper[i] != mdev->upper[i-1])
|
|
|
|
do_bond = false;
|
|
|
|
}
|
|
|
|
/* condition not met: 2 salves */
|
|
|
|
do_bond = (num_eth_ports == 2) ? do_bond : false;
|
|
|
|
|
|
|
|
/* handle only events that come with enough info */
|
|
|
|
if ((do_bond && (event != NETDEV_BONDING_INFO)) || !port)
|
|
|
|
return NOTIFY_DONE;
|
|
|
|
|
|
|
|
priv = netdev_priv(ndev);
|
|
|
|
if (do_bond) {
|
|
|
|
struct netdev_notifier_bonding_info *notifier_info = ptr;
|
|
|
|
struct netdev_bonding_info *bonding_info =
|
|
|
|
¬ifier_info->bonding_info;
|
|
|
|
|
|
|
|
/* required mode 1, 2 or 4 */
|
|
|
|
if ((bonding_info->master.bond_mode != BOND_MODE_ACTIVEBACKUP) &&
|
|
|
|
(bonding_info->master.bond_mode != BOND_MODE_XOR) &&
|
|
|
|
(bonding_info->master.bond_mode != BOND_MODE_8023AD))
|
|
|
|
do_bond = false;
|
|
|
|
|
|
|
|
/* require exactly 2 slaves */
|
|
|
|
if (bonding_info->master.num_slaves != 2)
|
|
|
|
do_bond = false;
|
|
|
|
|
|
|
|
/* calc v2p */
|
|
|
|
if (do_bond) {
|
|
|
|
if (bonding_info->master.bond_mode ==
|
|
|
|
BOND_MODE_ACTIVEBACKUP) {
|
|
|
|
/* in active-backup mode virtual ports are
|
|
|
|
* mapped to the physical port of the active
|
|
|
|
* slave */
|
|
|
|
if (bonding_info->slave.state ==
|
|
|
|
BOND_STATE_BACKUP) {
|
|
|
|
if (port == 1) {
|
|
|
|
v2p_port1 = 2;
|
|
|
|
v2p_port2 = 2;
|
|
|
|
} else {
|
|
|
|
v2p_port1 = 1;
|
|
|
|
v2p_port2 = 1;
|
|
|
|
}
|
|
|
|
} else { /* BOND_STATE_ACTIVE */
|
|
|
|
if (port == 1) {
|
|
|
|
v2p_port1 = 1;
|
|
|
|
v2p_port2 = 1;
|
|
|
|
} else {
|
|
|
|
v2p_port1 = 2;
|
|
|
|
v2p_port2 = 2;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
} else { /* Active-Active */
|
|
|
|
/* in active-active mode a virtual port is
|
|
|
|
* mapped to the native physical port if and only
|
|
|
|
* if the physical port is up */
|
|
|
|
__s8 link = bonding_info->slave.link;
|
|
|
|
|
|
|
|
if (port == 1)
|
|
|
|
v2p_port2 = 2;
|
|
|
|
else
|
|
|
|
v2p_port1 = 1;
|
|
|
|
if ((link == BOND_LINK_UP) ||
|
|
|
|
(link == BOND_LINK_FAIL)) {
|
|
|
|
if (port == 1)
|
|
|
|
v2p_port1 = 1;
|
|
|
|
else
|
|
|
|
v2p_port2 = 2;
|
|
|
|
} else { /* BOND_LINK_DOWN || BOND_LINK_BACK */
|
|
|
|
if (port == 1)
|
|
|
|
v2p_port1 = 2;
|
|
|
|
else
|
|
|
|
v2p_port2 = 1;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
mlx4_en_queue_bond_work(priv, do_bond,
|
|
|
|
v2p_port1, v2p_port2);
|
|
|
|
|
|
|
|
return NOTIFY_DONE;
|
|
|
|
}
|
|
|
|
|
2015-03-30 21:45:25 +07:00
|
|
|
void mlx4_en_update_pfc_stats_bitmap(struct mlx4_dev *dev,
|
|
|
|
struct mlx4_en_stats_bitmap *stats_bitmap,
|
|
|
|
u8 rx_ppp, u8 rx_pause,
|
|
|
|
u8 tx_ppp, u8 tx_pause)
|
|
|
|
{
|
2015-06-15 21:59:06 +07:00
|
|
|
int last_i = NUM_MAIN_STATS + NUM_PORT_STATS + NUM_PF_STATS;
|
2015-03-30 21:45:25 +07:00
|
|
|
|
|
|
|
if (!mlx4_is_slave(dev) &&
|
|
|
|
(dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_FLOWSTATS_EN)) {
|
|
|
|
mutex_lock(&stats_bitmap->mutex);
|
|
|
|
bitmap_clear(stats_bitmap->bitmap, last_i, NUM_FLOW_STATS);
|
|
|
|
|
|
|
|
if (rx_ppp)
|
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i,
|
|
|
|
NUM_FLOW_PRIORITY_STATS_RX);
|
|
|
|
last_i += NUM_FLOW_PRIORITY_STATS_RX;
|
|
|
|
|
|
|
|
if (rx_pause && !(rx_ppp))
|
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i,
|
|
|
|
NUM_FLOW_STATS_RX);
|
|
|
|
last_i += NUM_FLOW_STATS_RX;
|
|
|
|
|
|
|
|
if (tx_ppp)
|
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i,
|
|
|
|
NUM_FLOW_PRIORITY_STATS_TX);
|
|
|
|
last_i += NUM_FLOW_PRIORITY_STATS_TX;
|
|
|
|
|
|
|
|
if (tx_pause && !(tx_ppp))
|
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i,
|
|
|
|
NUM_FLOW_STATS_TX);
|
|
|
|
last_i += NUM_FLOW_STATS_TX;
|
|
|
|
|
|
|
|
mutex_unlock(&stats_bitmap->mutex);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-03-30 21:45:23 +07:00
|
|
|
void mlx4_en_set_stats_bitmap(struct mlx4_dev *dev,
|
2015-03-30 21:45:25 +07:00
|
|
|
struct mlx4_en_stats_bitmap *stats_bitmap,
|
|
|
|
u8 rx_ppp, u8 rx_pause,
|
|
|
|
u8 tx_ppp, u8 tx_pause)
|
2015-03-30 21:45:22 +07:00
|
|
|
{
|
2015-03-30 21:45:23 +07:00
|
|
|
int last_i = 0;
|
|
|
|
|
2015-03-30 21:45:24 +07:00
|
|
|
mutex_init(&stats_bitmap->mutex);
|
|
|
|
bitmap_zero(stats_bitmap->bitmap, NUM_ALL_STATS);
|
2015-03-30 21:45:23 +07:00
|
|
|
|
|
|
|
if (mlx4_is_slave(dev)) {
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i +
|
2015-03-30 21:45:23 +07:00
|
|
|
MLX4_FIND_NETDEV_STAT(rx_packets), 1);
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i +
|
2015-03-30 21:45:23 +07:00
|
|
|
MLX4_FIND_NETDEV_STAT(tx_packets), 1);
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i +
|
2015-03-30 21:45:23 +07:00
|
|
|
MLX4_FIND_NETDEV_STAT(rx_bytes), 1);
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i +
|
2015-03-30 21:45:23 +07:00
|
|
|
MLX4_FIND_NETDEV_STAT(tx_bytes), 1);
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i +
|
2015-03-30 21:45:23 +07:00
|
|
|
MLX4_FIND_NETDEV_STAT(rx_dropped), 1);
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i +
|
2015-03-30 21:45:23 +07:00
|
|
|
MLX4_FIND_NETDEV_STAT(tx_dropped), 1);
|
|
|
|
} else {
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i, NUM_MAIN_STATS);
|
2015-03-30 21:45:22 +07:00
|
|
|
}
|
2015-03-30 21:45:23 +07:00
|
|
|
last_i += NUM_MAIN_STATS;
|
2015-03-30 21:45:22 +07:00
|
|
|
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i, NUM_PORT_STATS);
|
2015-03-30 21:45:23 +07:00
|
|
|
last_i += NUM_PORT_STATS;
|
2015-03-30 21:45:22 +07:00
|
|
|
|
2015-06-15 21:59:06 +07:00
|
|
|
if (mlx4_is_master(dev))
|
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i,
|
|
|
|
NUM_PF_STATS);
|
|
|
|
last_i += NUM_PF_STATS;
|
|
|
|
|
2015-03-30 21:45:25 +07:00
|
|
|
mlx4_en_update_pfc_stats_bitmap(dev, stats_bitmap,
|
|
|
|
rx_ppp, rx_pause,
|
|
|
|
tx_ppp, tx_pause);
|
|
|
|
last_i += NUM_FLOW_STATS;
|
|
|
|
|
2015-03-30 21:45:23 +07:00
|
|
|
if (!mlx4_is_slave(dev))
|
2015-03-30 21:45:24 +07:00
|
|
|
bitmap_set(stats_bitmap->bitmap, last_i, NUM_PKT_STATS);
|
2015-03-30 21:45:22 +07:00
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
int mlx4_en_init_netdev(struct mlx4_en_dev *mdev, int port,
|
|
|
|
struct mlx4_en_port_profile *prof)
|
|
|
|
{
|
|
|
|
struct net_device *dev;
|
|
|
|
struct mlx4_en_priv *priv;
|
2013-02-07 09:25:25 +07:00
|
|
|
int i;
|
2008-10-23 05:47:49 +07:00
|
|
|
int err;
|
|
|
|
|
2011-01-10 02:36:36 +07:00
|
|
|
dev = alloc_etherdev_mqs(sizeof(struct mlx4_en_priv),
|
2012-12-02 10:49:23 +07:00
|
|
|
MAX_TX_RINGS, MAX_RX_RINGS);
|
2012-01-29 20:47:52 +07:00
|
|
|
if (dev == NULL)
|
2008-10-23 05:47:49 +07:00
|
|
|
return -ENOMEM;
|
|
|
|
|
2012-12-02 10:49:23 +07:00
|
|
|
netif_set_real_num_tx_queues(dev, prof->tx_ring_num);
|
|
|
|
netif_set_real_num_rx_queues(dev, prof->rx_ring_num);
|
|
|
|
|
2015-01-25 21:59:35 +07:00
|
|
|
SET_NETDEV_DEV(dev, &mdev->dev->persist->pdev->dev);
|
2014-02-25 23:17:51 +07:00
|
|
|
dev->dev_port = port - 1;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Initialize driver private data
|
|
|
|
*/
|
|
|
|
|
|
|
|
priv = netdev_priv(dev);
|
|
|
|
memset(priv, 0, sizeof(struct mlx4_en_priv));
|
2015-06-15 21:59:02 +07:00
|
|
|
priv->counter_index = MLX4_SINK_COUNTER_INDEX(mdev->dev);
|
2014-10-27 16:37:46 +07:00
|
|
|
spin_lock_init(&priv->stats_lock);
|
|
|
|
INIT_WORK(&priv->rx_mode_task, mlx4_en_do_set_rx_mode);
|
|
|
|
INIT_WORK(&priv->watchdog_task, mlx4_en_restart);
|
|
|
|
INIT_WORK(&priv->linkstate_task, mlx4_en_linkstate);
|
|
|
|
INIT_DELAYED_WORK(&priv->stats_task, mlx4_en_do_get_stats);
|
|
|
|
INIT_DELAYED_WORK(&priv->service_task, mlx4_en_service_task);
|
|
|
|
INIT_WORK(&priv->vxlan_add_task, mlx4_en_add_vxlan_offloads);
|
|
|
|
INIT_WORK(&priv->vxlan_del_task, mlx4_en_del_vxlan_offloads);
|
|
|
|
#ifdef CONFIG_RFS_ACCEL
|
|
|
|
INIT_LIST_HEAD(&priv->filters);
|
|
|
|
spin_lock_init(&priv->filters_lock);
|
|
|
|
#endif
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->dev = dev;
|
|
|
|
priv->mdev = mdev;
|
2012-03-06 11:03:34 +07:00
|
|
|
priv->ddev = &mdev->pdev->dev;
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->prof = prof;
|
|
|
|
priv->port = port;
|
|
|
|
priv->port_up = false;
|
|
|
|
priv->flags = prof->flags;
|
2014-07-22 19:44:10 +07:00
|
|
|
priv->pflags = MLX4_EN_PRIV_FLAGS_BLUEFLAME;
|
2011-11-27 02:55:19 +07:00
|
|
|
priv->ctrl_flags = cpu_to_be32(MLX4_WQE_CTRL_CQ_UPDATE |
|
|
|
|
MLX4_WQE_CTRL_SOLICITED);
|
2012-12-02 10:49:23 +07:00
|
|
|
priv->num_tx_rings_p_up = mdev->profile.num_tx_rings_p_up;
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->tx_ring_num = prof->tx_ring_num;
|
2014-07-08 15:28:12 +07:00
|
|
|
priv->tx_work_limit = MLX4_EN_DEFAULT_TX_WORK;
|
2014-11-23 08:24:19 +07:00
|
|
|
netdev_rss_key_fill(priv->rss_key, sizeof(priv->rss_key));
|
2012-12-02 10:49:23 +07:00
|
|
|
|
2013-11-07 17:19:52 +07:00
|
|
|
priv->tx_ring = kzalloc(sizeof(struct mlx4_en_tx_ring *) * MAX_TX_RINGS,
|
2012-12-02 10:49:23 +07:00
|
|
|
GFP_KERNEL);
|
2012-05-17 07:58:10 +07:00
|
|
|
if (!priv->tx_ring) {
|
|
|
|
err = -ENOMEM;
|
|
|
|
goto out;
|
|
|
|
}
|
2013-11-07 17:19:52 +07:00
|
|
|
priv->tx_cq = kzalloc(sizeof(struct mlx4_en_cq *) * MAX_TX_RINGS,
|
2012-12-02 10:49:23 +07:00
|
|
|
GFP_KERNEL);
|
2012-05-17 07:58:10 +07:00
|
|
|
if (!priv->tx_cq) {
|
|
|
|
err = -ENOMEM;
|
|
|
|
goto out;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->rx_ring_num = prof->rx_ring_num;
|
2012-10-21 21:59:24 +07:00
|
|
|
priv->cqe_factor = (mdev->dev->caps.cqe_size == 64) ? 1 : 0;
|
2014-09-18 15:51:01 +07:00
|
|
|
priv->cqe_size = mdev->dev->caps.cqe_size;
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->mac_index = -1;
|
|
|
|
priv->msg_enable = MLX4_EN_MSG_LEVEL;
|
2012-04-05 04:33:26 +07:00
|
|
|
#ifdef CONFIG_MLX4_EN_DCB
|
2013-04-07 10:44:07 +07:00
|
|
|
if (!mlx4_is_slave(priv->mdev->dev)) {
|
2016-09-11 14:56:19 +07:00
|
|
|
priv->dcbx_cap = DCB_CAP_DCBX_VER_CEE | DCB_CAP_DCBX_HOST |
|
|
|
|
DCB_CAP_DCBX_VER_IEEE;
|
2016-06-21 16:43:59 +07:00
|
|
|
priv->flags |= MLX4_EN_DCB_ENABLED;
|
2016-09-11 14:56:19 +07:00
|
|
|
priv->cee_config.pfc_state = false;
|
2016-06-21 16:43:59 +07:00
|
|
|
|
2016-09-11 14:56:19 +07:00
|
|
|
for (i = 0; i < MLX4_EN_NUM_UP; i++)
|
|
|
|
priv->cee_config.dcb_pfc[i] = pfc_disabled;
|
2016-06-21 16:43:59 +07:00
|
|
|
|
2015-04-02 20:31:17 +07:00
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_ETS_CFG) {
|
2013-04-07 10:44:07 +07:00
|
|
|
dev->dcbnl_ops = &mlx4_en_dcbnl_ops;
|
|
|
|
} else {
|
|
|
|
en_info(priv, "enabling only PFC DCB ops\n");
|
|
|
|
dev->dcbnl_ops = &mlx4_en_dcbnl_pfc_ops;
|
|
|
|
}
|
|
|
|
}
|
2012-04-05 04:33:26 +07:00
|
|
|
#endif
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2013-02-07 09:25:25 +07:00
|
|
|
for (i = 0; i < MLX4_EN_MAC_HASH_SIZE; ++i)
|
|
|
|
INIT_HLIST_HEAD(&priv->mac_hash[i]);
|
2013-02-07 09:25:22 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Query for default mac and max mtu */
|
|
|
|
priv->max_mtu = mdev->dev->caps.eth_mtu_cap[priv->port];
|
2013-02-07 09:25:20 +07:00
|
|
|
|
2014-11-09 18:51:53 +07:00
|
|
|
if (mdev->dev->caps.rx_checksum_flags_port[priv->port] &
|
|
|
|
MLX4_RX_CSUM_MODE_VAL_NON_TCP_UDP)
|
|
|
|
priv->flags |= MLX4_EN_FLAG_RX_CSUM_NON_TCP_UDP;
|
|
|
|
|
2013-02-07 09:25:20 +07:00
|
|
|
/* Set default MAC */
|
|
|
|
dev->addr_len = ETH_ALEN;
|
|
|
|
mlx4_en_u64_to_mac(dev->dev_addr, mdev->dev->caps.def_mac[priv->port]);
|
|
|
|
if (!is_valid_ether_addr(dev->dev_addr)) {
|
net/mlx4_core: Replace VF zero mac with random mac in mlx4_core
By design, when no default MAC addresses are set in the Hypervisor for VFs,
the VFs are passed zero-macs. When such a MAC is received by the VF, it
generates a random MAC address and registers that MAC address
with the Hypervisor.
This random mac generation is currently done in the mlx4_en module.
There is a problem, though, if the mlx4_ib module is loaded by a VF before
the mlx4_en module. In this case, for RoCE, mlx4_ib will see the un-replaced
zero-mac and register that zero-mac as part of QP1 initialization.
Having a zero-mac in the port's MAC table creates problems for a
Baseboard Management Console. The BMC occasionally sends packets with a
zero-mac destination MAC. If there is a zero-mac present in the port's
MAC table, the FW will send such BMC packets to the host driver rather than
to the wire, and BMC will stop working.
To address this problem, we move the replacement of zero-mac addresses
with random-mac addresses to procedure mlx4_slave_cap(), which is part of the
driver startup for VFs, and is before activation of mlx4_ib and mlx4_en.
As a result, zero-mac addresses will never be registered in the port MAC table
by the driver.
In addition, when mlx4_en does initialize the net device, it needs to set
the NET_ADDR_RANDOM flag in the netdev structure if the address was
randomly generated. This is done so that udev on the VM does not create
a new device name after each VF probe (VM boot and such). To accomplish this,
we add a per-port flag in mlx4_dev which gets set whenever mlx4_core replaces
a zero-mac with a randomly-generated mac. This flag is examined when mlx4_en
initializes the net-device.
Fix was suggested by Matan Barak <matanb@mellanox.com>
Signed-off-by: Jack Morgenstein <jackm@dev.mellanox.co.il>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2015-10-14 21:43:48 +07:00
|
|
|
en_err(priv, "Port: %d, invalid mac burned: %pM, quiting\n",
|
|
|
|
priv->port, dev->dev_addr);
|
|
|
|
err = -EINVAL;
|
|
|
|
goto out;
|
|
|
|
} else if (mlx4_is_slave(priv->mdev->dev) &&
|
|
|
|
(priv->mdev->dev->port_random_macs & 1 << priv->port)) {
|
|
|
|
/* Random MAC was assigned in mlx4_slave_cap
|
|
|
|
* in mlx4_core module
|
|
|
|
*/
|
|
|
|
dev->addr_assign_type |= NET_ADDR_RANDOM;
|
|
|
|
en_warn(priv, "Assigned random MAC address %pM\n", dev->dev_addr);
|
2008-10-23 05:47:49 +07:00
|
|
|
}
|
|
|
|
|
2014-07-08 15:25:24 +07:00
|
|
|
memcpy(priv->current_mac, dev->dev_addr, sizeof(priv->current_mac));
|
2013-02-07 09:25:20 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
priv->stride = roundup_pow_of_two(sizeof(struct mlx4_en_rx_desc) +
|
|
|
|
DS_SIZE * MLX4_EN_MAX_RX_FRAGS);
|
|
|
|
err = mlx4_en_alloc_resources(priv);
|
|
|
|
if (err)
|
|
|
|
goto out;
|
|
|
|
|
2013-04-23 13:06:49 +07:00
|
|
|
/* Initialize time stamping config */
|
|
|
|
priv->hwtstamp_config.flags = 0;
|
|
|
|
priv->hwtstamp_config.tx_type = HWTSTAMP_TX_OFF;
|
|
|
|
priv->hwtstamp_config.rx_filter = HWTSTAMP_FILTER_NONE;
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
/* Allocate page for receive rings */
|
|
|
|
err = mlx4_alloc_hwq_res(mdev->dev, &priv->res,
|
2016-05-04 18:50:15 +07:00
|
|
|
MLX4_EN_PAGE_SIZE);
|
2008-10-23 05:47:49 +07:00
|
|
|
if (err) {
|
2009-06-02 03:27:13 +07:00
|
|
|
en_err(priv, "Failed to allocate page for rx qps\n");
|
2008-10-23 05:47:49 +07:00
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
priv->allocated = 1;
|
|
|
|
|
|
|
|
/*
|
|
|
|
* Initialize netdev entry points
|
|
|
|
*/
|
2013-04-25 12:22:27 +07:00
|
|
|
if (mlx4_is_master(priv->mdev->dev))
|
|
|
|
dev->netdev_ops = &mlx4_netdev_ops_master;
|
|
|
|
else
|
|
|
|
dev->netdev_ops = &mlx4_netdev_ops;
|
2008-10-23 05:47:49 +07:00
|
|
|
dev->watchdog_timeo = MLX4_EN_WATCHDOG_TIMEOUT;
|
2010-09-27 15:29:34 +07:00
|
|
|
netif_set_real_num_tx_queues(dev, priv->tx_ring_num);
|
|
|
|
netif_set_real_num_rx_queues(dev, priv->rx_ring_num);
|
2008-11-22 08:30:58 +07:00
|
|
|
|
2014-05-11 07:12:32 +07:00
|
|
|
dev->ethtool_ops = &mlx4_en_ethtool_ops;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
/*
|
|
|
|
* Set driver features
|
|
|
|
*/
|
2011-04-15 11:50:49 +07:00
|
|
|
dev->hw_features = NETIF_F_SG | NETIF_F_IP_CSUM | NETIF_F_IPV6_CSUM;
|
|
|
|
if (mdev->LSO_support)
|
|
|
|
dev->hw_features |= NETIF_F_TSO | NETIF_F_TSO6;
|
|
|
|
|
|
|
|
dev->vlan_features = dev->hw_features;
|
|
|
|
|
2011-10-18 08:51:24 +07:00
|
|
|
dev->hw_features |= NETIF_F_RXCSUM | NETIF_F_RXHASH;
|
2011-04-15 11:50:49 +07:00
|
|
|
dev->features = dev->hw_features | NETIF_F_HIGHDMA |
|
2013-04-19 09:04:27 +07:00
|
|
|
NETIF_F_HW_VLAN_CTAG_TX | NETIF_F_HW_VLAN_CTAG_RX |
|
|
|
|
NETIF_F_HW_VLAN_CTAG_FILTER;
|
2014-10-27 16:37:43 +07:00
|
|
|
dev->hw_features |= NETIF_F_LOOPBACK |
|
|
|
|
NETIF_F_HW_VLAN_CTAG_TX | NETIF_F_HW_VLAN_CTAG_RX;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
2015-07-27 18:46:34 +07:00
|
|
|
if (!(mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_SKIP_OUTER_VLAN)) {
|
|
|
|
dev->features |= NETIF_F_HW_VLAN_STAG_RX |
|
|
|
|
NETIF_F_HW_VLAN_STAG_FILTER;
|
|
|
|
dev->hw_features |= NETIF_F_HW_VLAN_STAG_RX;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (mlx4_is_slave(mdev->dev)) {
|
|
|
|
int phv;
|
|
|
|
|
|
|
|
err = get_phv_bit(mdev->dev, port, &phv);
|
|
|
|
if (!err && phv) {
|
|
|
|
dev->hw_features |= NETIF_F_HW_VLAN_STAG_TX;
|
|
|
|
priv->pflags |= MLX4_EN_PRIV_FLAGS_PHV;
|
|
|
|
}
|
|
|
|
} else {
|
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_PHV_EN &&
|
|
|
|
!(mdev->dev->caps.flags2 &
|
|
|
|
MLX4_DEV_CAP_FLAG2_SKIP_OUTER_VLAN))
|
|
|
|
dev->hw_features |= NETIF_F_HW_VLAN_STAG_TX;
|
|
|
|
}
|
|
|
|
|
2015-04-02 20:31:21 +07:00
|
|
|
if (mdev->dev->caps.flags & MLX4_DEV_CAP_FLAG_FCS_KEEP)
|
|
|
|
dev->hw_features |= NETIF_F_RXFCS;
|
|
|
|
|
2015-04-02 20:31:22 +07:00
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_IGNORE_FCS)
|
|
|
|
dev->hw_features |= NETIF_F_RXALL;
|
|
|
|
|
2012-07-19 05:33:52 +07:00
|
|
|
if (mdev->dev->caps.steering_mode ==
|
net/mlx4: Add support for A0 steering
Add the required firmware commands for A0 steering and a way to enable
that. The firmware support focuses on INIT_HCA, QUERY_HCA, QUERY_PORT,
QUERY_DEV_CAP and QUERY_FUNC_CAP commands. Those commands are used
to configure and query the device.
The different A0 DMFS (steering) modes are:
Static - optimized performance, but flow steering rules are
limited. This mode should be choosed explicitly by the user
in order to be used.
Dynamic - this mode should be explicitly choosed by the user.
In this mode, the FW works in optimized steering mode as long as
it can and afterwards automatically drops to classic (full) DMFS.
Disable - this mode should be explicitly choosed by the user.
The user instructs the system not to use optimized steering, even if
the FW supports Dynamic A0 DMFS (and thus will be able to use optimized
steering in Default A0 DMFS mode).
Default - this mode is implicitly choosed. In this mode, if the FW
supports Dynamic A0 DMFS, it'll work in this mode. Otherwise, it'll
work at Disable A0 DMFS mode.
Under SRIOV configuration, when the A0 steering mode is enabled,
older guest VF drivers who aren't using the RX QP allocation flag
(MLX4_RESERVE_A0_QP) will get a QP from the general range and
fail when attempting to register a steering rule. To avoid that,
the PF context behaviour is changed once on A0 static mode, to
require support for the allocation flag in VF drivers too.
In order to enable A0 steering, we use log_num_mgm_entry_size param.
If the value of the parameter is not positive, we treat the absolute
value of log_num_mgm_entry_size as a bit field. Setting bit 2 of this
bit field enables static A0 steering.
Signed-off-by: Matan Barak <matanb@mellanox.com>
Signed-off-by: Or Gerlitz <ogerlitz@mellanox.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
2014-12-11 15:58:00 +07:00
|
|
|
MLX4_STEERING_MODE_DEVICE_MANAGED &&
|
|
|
|
mdev->dev->caps.dmfs_high_steer_mode != MLX4_STEERING_DMFS_A0_STATIC)
|
2012-07-19 05:33:52 +07:00
|
|
|
dev->hw_features |= NETIF_F_NTUPLE;
|
|
|
|
|
2013-02-07 09:25:26 +07:00
|
|
|
if (mdev->dev->caps.steering_mode != MLX4_STEERING_MODE_A0)
|
|
|
|
dev->priv_flags |= IFF_UNICAST_FLT;
|
|
|
|
|
2014-12-02 23:12:11 +07:00
|
|
|
/* Setting a default hash function value */
|
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_RSS_TOP) {
|
|
|
|
priv->rss_hash_fn = ETH_RSS_HASH_TOP;
|
|
|
|
} else if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_RSS_XOR) {
|
|
|
|
priv->rss_hash_fn = ETH_RSS_HASH_XOR;
|
|
|
|
} else {
|
|
|
|
en_warn(priv,
|
|
|
|
"No RSS hash capabilities exposed, using Toeplitz\n");
|
|
|
|
priv->rss_hash_fn = ETH_RSS_HASH_TOP;
|
|
|
|
}
|
|
|
|
|
2016-02-17 22:24:27 +07:00
|
|
|
if (mdev->dev->caps.tunnel_offload_mode == MLX4_TUNNEL_OFFLOAD_MODE_VXLAN) {
|
2016-05-02 23:38:30 +07:00
|
|
|
dev->hw_features |= NETIF_F_GSO_UDP_TUNNEL |
|
|
|
|
NETIF_F_GSO_UDP_TUNNEL_CSUM |
|
|
|
|
NETIF_F_GSO_PARTIAL;
|
|
|
|
dev->features |= NETIF_F_GSO_UDP_TUNNEL |
|
|
|
|
NETIF_F_GSO_UDP_TUNNEL_CSUM |
|
|
|
|
NETIF_F_GSO_PARTIAL;
|
|
|
|
dev->gso_partial_features = NETIF_F_GSO_UDP_TUNNEL_CSUM;
|
2016-02-17 22:24:27 +07:00
|
|
|
}
|
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
mdev->pndev[port] = dev;
|
2015-02-03 21:48:34 +07:00
|
|
|
mdev->upper[port] = NULL;
|
2008-10-23 05:47:49 +07:00
|
|
|
|
|
|
|
netif_carrier_off(dev);
|
2013-06-25 16:09:31 +07:00
|
|
|
mlx4_en_set_default_moderation(priv);
|
|
|
|
|
2009-06-02 03:27:13 +07:00
|
|
|
en_warn(priv, "Using %d TX rings\n", prof->tx_ring_num);
|
|
|
|
en_warn(priv, "Using %d RX rings\n", prof->rx_ring_num);
|
|
|
|
|
2013-02-07 09:25:19 +07:00
|
|
|
mlx4_en_update_loopback_state(priv->dev, priv->dev->features);
|
|
|
|
|
2011-03-23 05:37:41 +07:00
|
|
|
/* Configure port */
|
2012-06-25 07:24:11 +07:00
|
|
|
mlx4_en_calc_rx_buf(dev);
|
2011-03-23 05:37:41 +07:00
|
|
|
err = mlx4_SET_PORT_general(mdev->dev, priv->port,
|
2012-06-25 07:24:11 +07:00
|
|
|
priv->rx_skb_size + ETH_FCS_LEN,
|
|
|
|
prof->tx_pause, prof->tx_ppp,
|
|
|
|
prof->rx_pause, prof->rx_ppp);
|
2011-03-23 05:37:41 +07:00
|
|
|
if (err) {
|
2014-05-08 02:52:57 +07:00
|
|
|
en_err(priv, "Failed setting port general configurations for port %d, with error %d\n",
|
|
|
|
priv->port, err);
|
2011-03-23 05:37:41 +07:00
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
|
2013-12-23 21:09:44 +07:00
|
|
|
if (mdev->dev->caps.tunnel_offload_mode == MLX4_TUNNEL_OFFLOAD_MODE_VXLAN) {
|
2014-03-27 19:02:04 +07:00
|
|
|
err = mlx4_SET_PORT_VXLAN(mdev->dev, priv->port, VXLAN_STEER_BY_OUTER_MAC, 1);
|
2013-12-23 21:09:44 +07:00
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed setting port L2 tunnel configuration, err %d\n",
|
|
|
|
err);
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2011-03-23 05:37:41 +07:00
|
|
|
/* Init port */
|
|
|
|
en_warn(priv, "Initializing port\n");
|
|
|
|
err = mlx4_INIT_PORT(mdev->dev, priv->port);
|
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Failed Initializing port\n");
|
|
|
|
goto out;
|
|
|
|
}
|
2008-10-23 05:47:49 +07:00
|
|
|
queue_delayed_work(mdev->workqueue, &priv->stats_task, STATS_DELAY);
|
2013-04-25 12:22:24 +07:00
|
|
|
|
2015-12-17 20:35:38 +07:00
|
|
|
/* Initialize time stamp mechanism */
|
2013-04-25 12:22:24 +07:00
|
|
|
if (mdev->dev->caps.flags2 & MLX4_DEV_CAP_FLAG2_TS)
|
2015-12-17 20:35:38 +07:00
|
|
|
mlx4_en_init_timestamp(mdev);
|
|
|
|
|
2015-12-17 20:35:37 +07:00
|
|
|
queue_delayed_work(mdev->workqueue, &priv->service_task,
|
|
|
|
SERVICE_TASK_DELAY);
|
2013-04-25 12:22:24 +07:00
|
|
|
|
2015-03-30 21:45:25 +07:00
|
|
|
mlx4_en_set_stats_bitmap(mdev->dev, &priv->stats_bitmap,
|
|
|
|
mdev->profile.prof[priv->port].rx_ppp,
|
|
|
|
mdev->profile.prof[priv->port].rx_pause,
|
|
|
|
mdev->profile.prof[priv->port].tx_ppp,
|
|
|
|
mdev->profile.prof[priv->port].tx_pause);
|
2015-03-18 21:51:38 +07:00
|
|
|
|
2015-03-24 20:18:38 +07:00
|
|
|
err = register_netdev(dev);
|
|
|
|
if (err) {
|
|
|
|
en_err(priv, "Netdev registration failed for port %d\n", port);
|
|
|
|
goto out;
|
|
|
|
}
|
|
|
|
|
|
|
|
priv->registered = 1;
|
2016-02-26 23:32:24 +07:00
|
|
|
devlink_port_type_eth_set(mlx4_get_devlink_port(mdev->dev, priv->port),
|
|
|
|
dev);
|
2015-03-24 20:18:38 +07:00
|
|
|
|
2008-10-23 05:47:49 +07:00
|
|
|
return 0;
|
|
|
|
|
|
|
|
out:
|
|
|
|
mlx4_en_destroy_netdev(dev);
|
|
|
|
return err;
|
|
|
|
}
|
|
|
|
|
2014-10-27 16:37:43 +07:00
|
|
|
int mlx4_en_reset_config(struct net_device *dev,
|
|
|
|
struct hwtstamp_config ts_config,
|
|
|
|
netdev_features_t features)
|
|
|
|
{
|
|
|
|
struct mlx4_en_priv *priv = netdev_priv(dev);
|
|
|
|
struct mlx4_en_dev *mdev = priv->mdev;
|
2016-07-18 22:35:12 +07:00
|
|
|
struct mlx4_en_port_profile new_prof;
|
|
|
|
struct mlx4_en_priv *tmp;
|
2014-10-27 16:37:43 +07:00
|
|
|
int port_up = 0;
|
|
|
|
int err = 0;
|
|
|
|
|
|
|
|
if (priv->hwtstamp_config.tx_type == ts_config.tx_type &&
|
|
|
|
priv->hwtstamp_config.rx_filter == ts_config.rx_filter &&
|
2015-04-02 20:31:21 +07:00
|
|
|
!DEV_FEATURE_CHANGED(dev, features, NETIF_F_HW_VLAN_CTAG_RX) &&
|
|
|
|
!DEV_FEATURE_CHANGED(dev, features, NETIF_F_RXFCS))
|
2014-10-27 16:37:43 +07:00
|
|
|
return 0; /* Nothing to change */
|
|
|
|
|
|
|
|
if (DEV_FEATURE_CHANGED(dev, features, NETIF_F_HW_VLAN_CTAG_RX) &&
|
|
|
|
(features & NETIF_F_HW_VLAN_CTAG_RX) &&
|
|
|
|
(priv->hwtstamp_config.rx_filter != HWTSTAMP_FILTER_NONE)) {
|
|
|
|
en_warn(priv, "Can't turn ON rx vlan offload while time-stamping rx filter is ON\n");
|
|
|
|
return -EINVAL;
|
|
|
|
}
|
|
|
|
|
2016-07-18 22:35:12 +07:00
|
|
|
tmp = kzalloc(sizeof(*tmp), GFP_KERNEL);
|
|
|
|
if (!tmp)
|
|
|
|
return -ENOMEM;
|
|
|
|
|
2014-10-27 16:37:43 +07:00
|
|
|
mutex_lock(&mdev->state_lock);
|
2016-07-18 22:35:12 +07:00
|
|
|
|
|
|
|
memcpy(&new_prof, priv->prof, sizeof(struct mlx4_en_port_profile));
|
|
|
|
memcpy(&new_prof.hwtstamp_config, &ts_config, sizeof(ts_config));
|
|
|
|
|
|
|
|
err = mlx4_en_try_alloc_resources(priv, tmp, &new_prof);
|
|
|
|
if (err)
|
|
|
|
goto out;
|
|
|
|
|
2014-10-27 16:37:43 +07:00
|
|
|
if (priv->port_up) {
|
|
|
|
port_up = 1;
|
|
|
|
mlx4_en_stop_port(dev, 1);
|
|
|
|
}
|
|
|
|
|
|
|
|
en_warn(priv, "Changing device configuration rx filter(%x) rx vlan(%x)\n",
|
2016-07-18 22:35:12 +07:00
|
|
|
ts_config.rx_filter,
|
|
|
|
!!(features & NETIF_F_HW_VLAN_CTAG_RX));
|
2014-10-27 16:37:43 +07:00
|
|
|
|
2016-07-18 22:35:12 +07:00
|
|
|
mlx4_en_safe_replace_resources(priv, tmp);
|
2014-10-27 16:37:43 +07:00
|
|
|
|
|
|
|
if (DEV_FEATURE_CHANGED(dev, features, NETIF_F_HW_VLAN_CTAG_RX)) {
|
|
|
|
if (features & NETIF_F_HW_VLAN_CTAG_RX)
|
|
|
|
dev->features |= NETIF_F_HW_VLAN_CTAG_RX;
|
|
|
|
else
|
|
|
|
dev->features &= ~NETIF_F_HW_VLAN_CTAG_RX;
|
|
|
|
} else if (ts_config.rx_filter == HWTSTAMP_FILTER_NONE) {
|
|
|
|
/* RX time-stamping is OFF, update the RX vlan offload
|
|
|
|
* to the latest wanted state
|
|
|
|
*/
|
|
|
|
if (dev->wanted_features & NETIF_F_HW_VLAN_CTAG_RX)
|
|
|
|
dev->features |= NETIF_F_HW_VLAN_CTAG_RX;
|
|
|
|
else
|
|
|
|
dev->features &= ~NETIF_F_HW_VLAN_CTAG_RX;
|
|
|
|
}
|
|
|
|
|
2015-04-02 20:31:21 +07:00
|
|
|
if (DEV_FEATURE_CHANGED(dev, features, NETIF_F_RXFCS)) {
|
|
|
|
if (features & NETIF_F_RXFCS)
|
|
|
|
dev->features |= NETIF_F_RXFCS;
|
|
|
|
else
|
|
|
|
dev->features &= ~NETIF_F_RXFCS;
|
|
|
|
}
|
|
|
|
|
2014-10-27 16:37:43 +07:00
|
|
|
/* RX vlan offload and RX time-stamping can't co-exist !
|
|
|
|
* Regardless of the caller's choice,
|
|
|
|
* Turn Off RX vlan offload in case of time-stamping is ON
|
|
|
|
*/
|
|
|
|
if (ts_config.rx_filter != HWTSTAMP_FILTER_NONE) {
|
|
|
|
if (dev->features & NETIF_F_HW_VLAN_CTAG_RX)
|
|
|
|
en_warn(priv, "Turning off RX vlan offload since RX time-stamping is ON\n");
|
|
|
|
dev->features &= ~NETIF_F_HW_VLAN_CTAG_RX;
|
|
|
|
}
|
|
|
|
|
|
|
|
if (port_up) {
|
|
|
|
err = mlx4_en_start_port(dev);
|
|
|
|
if (err)
|
|
|
|
en_err(priv, "Failed starting port\n");
|
|
|
|
}
|
|
|
|
|
|
|
|
out:
|
|
|
|
mutex_unlock(&mdev->state_lock);
|
2016-07-18 22:35:12 +07:00
|
|
|
kfree(tmp);
|
|
|
|
if (!err)
|
|
|
|
netdev_features_change(dev);
|
2014-10-27 16:37:43 +07:00
|
|
|
return err;
|
|
|
|
}
|