2007-02-08 14:33:51 +07:00
|
|
|
/*
|
|
|
|
* Copyright 2006 IBM Corporation.
|
|
|
|
*
|
|
|
|
* This program is free software; you can redistribute it and/or
|
|
|
|
* modify it under the terms of the GNU General Public License
|
|
|
|
* as published by the Free Software Foundation; either version
|
|
|
|
* 2 of the License, or (at your option) any later version.
|
|
|
|
*/
|
|
|
|
|
|
|
|
#ifndef _PSERIES_PSERIES_H
|
|
|
|
#define _PSERIES_PSERIES_H
|
|
|
|
|
2010-05-19 05:51:00 +07:00
|
|
|
#include <linux/interrupt.h>
|
powerpc/pseries: Create new device hotplug entry point
The current hotplug (or dlpar) of devices (the process is generally the
same for memory, cpu, and pci) on PowerVM systems is initiated
from the HMC, which communicates the request to the partitions through
the RSCT framework. The RSCT framework then invokes the drmgr command.
The drmgr command performs the hotplug operation by doing some pieces,
such as most of the rtas calls and device tree parsing, in userspace
and make requests to the kernel to online/offline the device, update the
device tree and add/remove the device.
For PowerKVM the approach for device hotplug is to follow what is currently
being done for pci hotplug. A hotplug request is initiated from the host.
QEMU then generates an EPOW interrupt to the guest which causes the guest
to make the rtas,check-exception call. In QEMU, the rtas,check-exception call
returns a rtas hotplug event to the guest.
Please note that the current pci hotplug path for PowerKVM involves the
kernel receiving the rtas hotplug event, passing it to rtas_errd in
userspace, and having rtas_errd invoke drmgr. The drmgr command then
handles the request as described above for PowerVM systems.
There is no need for this circuitous route, we should just handle the entire
hotplug of devices in the kernel. What I am planning is to enable this
by moving the code to handle hotplug from drmgr into the kernel to
provide a single path for handling device hotplug for both PowerVM and
PowerKVM systems. This patch provides the common iframework and entry point.
For PowerKVM a future update to the kernel rtas code will recognize rtas
hotplug events returned from rtas,check-exception calls and use the common
entry point to handle hotplug of the device.
For PowerVM systems, This patch creates /sys/kernel/dlpar that can be
used by the drmgr command to initiate hotplug requests. In order to do
this a string of the format "<resource> <action> <id_type> <id>" is
written to this file. The string consists of a resource (cpu, memory, pci,
phb), an action (add or remove), an id_type (count, drc index, drc name),
and the corresponding id. The kernel will parse the string and create a
rtas hotplug section that can be passed to the common entry point for
handling hotplug requests.
It should be noted that there is no chance of updating how we receive
hotplug (dlpar) requests from the HMC on PowerVM systems.
Signed-off-by: Nathan Fontenot <nfont@linux.vnet.ibm.com>
Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
2015-02-11 02:47:02 +07:00
|
|
|
#include <asm/rtas.h>
|
2010-05-19 05:51:00 +07:00
|
|
|
|
|
|
|
struct device_node;
|
|
|
|
|
|
|
|
extern void request_event_sources_irqs(struct device_node *np,
|
|
|
|
irq_handler_t handler, const char *name);
|
|
|
|
|
2010-09-10 16:40:32 +07:00
|
|
|
#include <linux/of.h>
|
|
|
|
|
2007-02-08 14:33:51 +07:00
|
|
|
struct pt_regs;
|
|
|
|
|
|
|
|
extern int pSeries_system_reset_exception(struct pt_regs *regs);
|
|
|
|
extern int pSeries_machine_check_exception(struct pt_regs *regs);
|
|
|
|
|
2007-02-08 14:33:55 +07:00
|
|
|
#ifdef CONFIG_SMP
|
2016-05-30 13:18:12 +07:00
|
|
|
extern void smp_init_pseries(void);
|
2018-03-08 09:54:39 +07:00
|
|
|
|
|
|
|
/* Get state of physical CPU from query_cpu_stopped */
|
|
|
|
int smp_query_cpu_stopped(unsigned int pcpu);
|
|
|
|
#define QCSS_STOPPED 0
|
|
|
|
#define QCSS_STOPPING 1
|
|
|
|
#define QCSS_NOT_STOPPED 2
|
|
|
|
#define QCSS_HARDWARE_ERROR -1
|
|
|
|
#define QCSS_HARDWARE_BUSY -2
|
2007-02-08 14:33:55 +07:00
|
|
|
#else
|
2016-05-30 13:18:12 +07:00
|
|
|
static inline void smp_init_pseries(void) { };
|
2007-02-08 14:33:55 +07:00
|
|
|
#endif
|
|
|
|
|
2016-05-30 13:18:13 +07:00
|
|
|
extern void pseries_kexec_cpu_down(int crash_shutdown, int secondary);
|
2007-02-08 14:33:55 +07:00
|
|
|
|
[POWERPC] Rewrite IO allocation & mapping on powerpc64
This rewrites pretty much from scratch the handling of MMIO and PIO
space allocations on powerpc64. The main goals are:
- Get rid of imalloc and use more common code where possible
- Simplify the current mess so that PIO space is allocated and
mapped in a single place for PCI bridges
- Handle allocation constraints of PIO for all bridges including
hot plugged ones within the 2GB space reserved for IO ports,
so that devices on hotplugged busses will now work with drivers
that assume IO ports fit in an int.
- Cleanup and separate tracking of the ISA space in the reserved
low 64K of IO space. No ISA -> Nothing mapped there.
I booted a cell blade with IDE on PIO and MMIO and a dual G5 so
far, that's it :-)
With this patch, all allocations are done using the code in
mm/vmalloc.c, though we use the low level __get_vm_area with
explicit start/stop constraints in order to manage separate
areas for vmalloc/vmap, ioremap, and PCI IOs.
This greatly simplifies a lot of things, as you can see in the
diffstat of that patch :-)
A new pair of functions pcibios_map/unmap_io_space() now replace
all of the previous code that used to manipulate PCI IOs space.
The allocation is done at mapping time, which is now called from
scan_phb's, just before the devices are probed (instead of after,
which is by itself a bug fix). The only other caller is the PCI
hotplug code for hot adding PCI-PCI bridges (slots).
imalloc is gone, as is the "sub-allocation" thing, but I do beleive
that hotplug should still work in the sense that the space allocation
is always done by the PHB, but if you unmap a child bus of this PHB
(which seems to be possible), then the code should properly tear
down all the HPTE mappings for that area of the PHB allocated IO space.
I now always reserve the first 64K of IO space for the bridge with
the ISA bus on it. I have moved the code for tracking ISA in a separate
file which should also make it smarter if we ever are capable of
hot unplugging or re-plugging an ISA bridge.
This should have a side effect on platforms like powermac where VGA IOs
will no longer work. This is done on purpose though as they would have
worked semi-randomly before. The idea at this point is to isolate drivers
that might need to access those and fix them by providing a proper
function to obtain an offset to the legacy IOs of a given bus.
Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
Signed-off-by: Paul Mackerras <paulus@samba.org>
2007-06-04 12:15:36 +07:00
|
|
|
extern void pSeries_final_fixup(void);
|
|
|
|
|
2007-02-09 05:01:17 +07:00
|
|
|
/* Poweron flag used for enabling auto ups restart */
|
|
|
|
extern unsigned long rtas_poweron_auto;
|
|
|
|
|
powerpc/pseries: Re-implement HVSI as part of hvc_vio
On pseries machines, consoles are provided by the hypervisor using
a low level get_chars/put_chars type interface. However, this is
really just a transport to the service processor which implements
them either as "raw" console (networked consoles, HMC, ...) or as
"hvsi" serial ports.
The later is a simple packet protocol on top of the raw character
interface that is supposed to convey additional "serial port" style
semantics. In practice however, all it does is provide a way to
read the CD line and set/clear our DTR line, that's it.
We currently implement the "raw" protocol as an hvc console backend
(/dev/hvcN) and the "hvsi" protocol using a separate tty driver
(/dev/hvsi0).
However this is quite impractical. The arbitrary difference between
the two type of devices has been a major source of user (and distro)
confusion. Additionally, there's an additional mini -hvsi implementation
in the pseries platform code for our low level debug console and early
boot kernel messages, which means code duplication, though that low
level variant is impractical as it's incapable of doing the initial
protocol negociation to establish the link to the FSP.
This essentially replaces the dedicated hvsi driver and the platform
udbg code completely by extending the existing hvc_vio backend used
in "raw" mode so that:
- It now supports HVSI as well
- We add support for hvc backend providing tiocm{get,set}
- It also provides a udbg interface for early debug and boot console
This is overall less code, though this will only be obvious once we
remove the old "hvsi" driver, which is still available for now. When
the old driver is enabled, the new code still kicks in for the low
level udbg console, replacing the old mini implementation in the platform
code, it just doesn't provide the higher level "hvc" interface.
In addition to producing generally simler code, this has several benefits
over our current situation:
- The user/distro only has to deal with /dev/hvcN for the hypervisor
console, avoiding all sort of confusion that has plagued us in the past
- The tty, kernel and low level debug console all use the same code
base which supports the full protocol establishment process, thus the
console is now available much earlier than it used to be with the
old HVSI driver. The kernel console works much earlier and udbg is
available much earlier too. Hackers can enable a hard coded very-early
debug console as well that works with HVSI (previously that was only
supported for the "raw" mode).
I've tried to keep the same semantics as hvsi relative to how I react
to things like CD changes, with some subtle differences though:
- I clear DTR on close if HUPCL is set
- Current hvsi triggers a hangup if it detects a up->down transition
on CD (you can still open a console with CD down). My new implementation
triggers a hangup if the link to the FSP is severed, and severs it upon
detecting a up->down transition on CD.
Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
2011-05-12 10:46:38 +07:00
|
|
|
/* Provided by HVC VIO */
|
|
|
|
extern void hvc_vio_init_early(void);
|
2008-04-16 10:51:48 +07:00
|
|
|
|
2010-09-10 16:40:32 +07:00
|
|
|
/* Dynamic logical Partitioning/Mobility */
|
|
|
|
extern void dlpar_free_cc_nodes(struct device_node *);
|
|
|
|
extern void dlpar_free_cc_property(struct property *);
|
2014-09-17 03:15:45 +07:00
|
|
|
extern struct device_node *dlpar_configure_connector(__be32,
|
|
|
|
struct device_node *);
|
2017-08-21 22:16:49 +07:00
|
|
|
extern int dlpar_attach_node(struct device_node *, struct device_node *);
|
2010-09-10 16:40:32 +07:00
|
|
|
extern int dlpar_detach_node(struct device_node *);
|
2015-02-11 02:45:01 +07:00
|
|
|
extern int dlpar_acquire_drc(u32 drc_index);
|
|
|
|
extern int dlpar_release_drc(u32 drc_index);
|
2010-09-10 16:40:32 +07:00
|
|
|
|
2016-07-07 22:03:44 +07:00
|
|
|
void queue_hotplug_event(struct pseries_hp_errorlog *hp_errlog,
|
|
|
|
struct completion *hotplug_done, int *rc);
|
powerpc/pseries: Create new device hotplug entry point
The current hotplug (or dlpar) of devices (the process is generally the
same for memory, cpu, and pci) on PowerVM systems is initiated
from the HMC, which communicates the request to the partitions through
the RSCT framework. The RSCT framework then invokes the drmgr command.
The drmgr command performs the hotplug operation by doing some pieces,
such as most of the rtas calls and device tree parsing, in userspace
and make requests to the kernel to online/offline the device, update the
device tree and add/remove the device.
For PowerKVM the approach for device hotplug is to follow what is currently
being done for pci hotplug. A hotplug request is initiated from the host.
QEMU then generates an EPOW interrupt to the guest which causes the guest
to make the rtas,check-exception call. In QEMU, the rtas,check-exception call
returns a rtas hotplug event to the guest.
Please note that the current pci hotplug path for PowerKVM involves the
kernel receiving the rtas hotplug event, passing it to rtas_errd in
userspace, and having rtas_errd invoke drmgr. The drmgr command then
handles the request as described above for PowerVM systems.
There is no need for this circuitous route, we should just handle the entire
hotplug of devices in the kernel. What I am planning is to enable this
by moving the code to handle hotplug from drmgr into the kernel to
provide a single path for handling device hotplug for both PowerVM and
PowerKVM systems. This patch provides the common iframework and entry point.
For PowerKVM a future update to the kernel rtas code will recognize rtas
hotplug events returned from rtas,check-exception calls and use the common
entry point to handle hotplug of the device.
For PowerVM systems, This patch creates /sys/kernel/dlpar that can be
used by the drmgr command to initiate hotplug requests. In order to do
this a string of the format "<resource> <action> <id_type> <id>" is
written to this file. The string consists of a resource (cpu, memory, pci,
phb), an action (add or remove), an id_type (count, drc index, drc name),
and the corresponding id. The kernel will parse the string and create a
rtas hotplug section that can be passed to the common entry point for
handling hotplug requests.
It should be noted that there is no chance of updating how we receive
hotplug (dlpar) requests from the HMC on PowerVM systems.
Signed-off-by: Nathan Fontenot <nfont@linux.vnet.ibm.com>
Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org>
2015-02-11 02:47:02 +07:00
|
|
|
#ifdef CONFIG_MEMORY_HOTPLUG
|
|
|
|
int dlpar_memory(struct pseries_hp_errorlog *hp_elog);
|
|
|
|
#else
|
|
|
|
static inline int dlpar_memory(struct pseries_hp_errorlog *hp_elog)
|
|
|
|
{
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
}
|
|
|
|
#endif
|
2010-09-10 16:40:32 +07:00
|
|
|
|
2015-12-17 03:54:05 +07:00
|
|
|
#ifdef CONFIG_HOTPLUG_CPU
|
|
|
|
int dlpar_cpu(struct pseries_hp_errorlog *hp_elog);
|
|
|
|
#else
|
|
|
|
static inline int dlpar_cpu(struct pseries_hp_errorlog *hp_elog)
|
|
|
|
{
|
|
|
|
return -EOPNOTSUPP;
|
|
|
|
}
|
|
|
|
#endif
|
|
|
|
|
2013-05-03 19:43:12 +07:00
|
|
|
/* PCI root bridge prepare function override for pseries */
|
|
|
|
struct pci_host_bridge;
|
|
|
|
int pseries_root_bridge_prepare(struct pci_host_bridge *bridge);
|
|
|
|
|
2015-03-31 12:00:50 +07:00
|
|
|
extern struct pci_controller_ops pseries_pci_controller_ops;
|
|
|
|
|
2014-06-04 14:50:47 +07:00
|
|
|
unsigned long pseries_memory_block_size(void);
|
|
|
|
|
2016-11-14 12:28:10 +07:00
|
|
|
extern int CMO_PrPSP;
|
|
|
|
extern int CMO_SecPSP;
|
|
|
|
extern unsigned long CMO_PageSize;
|
|
|
|
|
|
|
|
static inline int cmo_get_primary_psp(void)
|
|
|
|
{
|
|
|
|
return CMO_PrPSP;
|
|
|
|
}
|
|
|
|
|
|
|
|
static inline int cmo_get_secondary_psp(void)
|
|
|
|
{
|
|
|
|
return CMO_SecPSP;
|
|
|
|
}
|
|
|
|
|
|
|
|
static inline unsigned long cmo_get_page_size(void)
|
|
|
|
{
|
|
|
|
return CMO_PageSize;
|
|
|
|
}
|
|
|
|
|
powerpc/pseries: Make RAS IRQ explicitly dependent on DLPAR WQ
The hotplug code uses its own workqueue to handle IRQ requests
(pseries_hp_wq), however that workqueue is initialized after
init_ras_IRQ(). That can lead to a kernel panic if any hotplug
interrupts fire after init_ras_IRQ() but before pseries_hp_wq is
initialised. eg:
UDP-Lite hash table entries: 2048 (order: 0, 65536 bytes)
NET: Registered protocol family 1
Unpacking initramfs...
(qemu) object_add memory-backend-ram,id=mem1,size=10G
(qemu) device_add pc-dimm,id=dimm1,memdev=mem1
Unable to handle kernel paging request for data at address 0xf94d03007c421378
Faulting instruction address: 0xc00000000012d744
Oops: Kernel access of bad area, sig: 11 [#1]
LE SMP NR_CPUS=2048 NUMA pSeries
Modules linked in:
CPU: 0 PID: 1 Comm: swapper/0 Not tainted 4.15.0-rc2-ziviani+ #26
task: (ptrval) task.stack: (ptrval)
NIP: c00000000012d744 LR: c00000000012d744 CTR: 0000000000000000
REGS: (ptrval) TRAP: 0380 Not tainted (4.15.0-rc2-ziviani+)
MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE> CR: 28088042 XER: 20040000
CFAR: c00000000012d3c4 SOFTE: 0
...
NIP [c00000000012d744] __queue_work+0xd4/0x5c0
LR [c00000000012d744] __queue_work+0xd4/0x5c0
Call Trace:
[c0000000fffefb90] [c00000000012d744] __queue_work+0xd4/0x5c0 (unreliable)
[c0000000fffefc70] [c00000000012dce4] queue_work_on+0xb4/0xf0
This commit makes the RAS IRQ registration explicitly dependent on the
creation of the pseries_hp_wq.
Reported-by: Min Deng <mdeng@redhat.com>
Reported-by: Daniel Henrique Barboza <danielhb@linux.vnet.ibm.com>
Tested-by: Jose Ricardo Ziviani <joserz@linux.vnet.ibm.com>
Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
Reviewed-by: David Gibson <david@gibson.dropbear.id.au>
2018-01-08 10:54:32 +07:00
|
|
|
int dlpar_workqueue_init(void);
|
|
|
|
|
2018-03-15 05:40:42 +07:00
|
|
|
void pseries_setup_rfi_flush(void);
|
|
|
|
|
2007-02-08 14:33:51 +07:00
|
|
|
#endif /* _PSERIES_PSERIES_H */
|