kata-containers

mirror of https://github.com/aljazceru/kata-containers.git synced 2026-02-10 00:54:21 +01:00

Author	SHA1	Message	Date
James O. D. Hunt	2ce8d4263c	clh: Suppress hypervisor output to make guest output visible Reduce the cloud-hypervisor log level from `Debug` to `Info` when hypervisor debug is enabled. This is required since `Debug` level: - Is overkill for debugging hypervisor failures. - Effectively hides the output from the guest kernel and userland: CLH generates so much output that the output from the guest gets "lost in the noise" (experiments show that for each full CLH debug message, at most 1 _byte_ of guest output is displayed). Fixes: #2726. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-09-30 14:22:09 +01:00
Jakob Naucke	8739a73dd3	Merge pull request #2736 from Amulyam24/kata-check-test cmd: Fix mismatched types in testModuleData	2021-09-30 10:20:19 +02:00
Chelsea Mafrica	96c033ba6c	Merge pull request #2763 from liubin/fix/2762-update-gitignore runtime: update .gitignore to ignore monitor_address file	2021-09-29 09:45:57 -07:00
Carlos Venegas	7183de47df	Merge pull request #2766 from YchauWang/wyc-runtime-cmd runtime: fix the make check-go-static command error	2021-09-29 10:53:02 -05:00
Bin Liu	4ac7199282	Merge pull request #2494 from rapiz1/clean-up-code virtcontainers: clean up useless code	2021-09-29 22:56:13 +08:00
wangyongchao.bj	bb99bfb45d	runtime: fix the make check-go-static command error modify the make script of the check-go-static, changing the `./cli` path to `./cmd/kata-runtime` Fixes: #2765 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-29 15:37:25 +08:00
David Gibson	b57613f53e	Merge pull request #1682 from dgibson/rescan Remove forced PCI rescans from agent	2021-09-29 13:03:55 +10:00
bin	870771d76d	runtime: update .gitignore to ignore monitor_address file Run tests sometimes generate pkg/containerd-shim-v2/monitor_address, and `git status` will treat it as a new file. Package containerd-shim-v2 has moved to pkg/containerd-shim-v2, the monitor_address in .gitignore should be updated too. Fixes: #2762 Signed-off-by: bin <bin@hyper.sh>	2021-09-29 09:24:14 +08:00
Fupan Li	823818cfbc	Merge pull request #2744 from fengwang666/nil-bug runtime: fix nil reference in cleanup rootless user	2021-09-28 22:43:24 +08:00
Feng Wang	e5fe53f0a9	runtime: fix nil reference in cleanup rootless user It seems the client (crio) can send multiple requests to stop the Kata VM, resulting a nil reference if the uid has already been cleaned up by a different thread. Fixes #2743 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-27 21:28:47 -07:00
Francesco Giudici	2304a59601	runtime: set the sandbox storage path static Since we now have "unix://" kind of socket returned by the SocketAddress() function, there is no more need to build the sandbox storage path dynamically to keep OS compatibility. Fixes: #2738 Suggested-by: Christophe de Dinechin <dinechin@redhat.com> Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-27 15:57:34 +02:00
Francesco Giudici	315295e0ef	runtime: rename GetSanboxesStoragePath() --> GetSandboxesStoragePath() Add the missing 'd'. Fixes: #2738 Suggested-by: Jakob Naucke <jakob.naucke@ibm.com> Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-27 15:56:14 +02:00
Bin Liu	3217b03b17	Merge pull request #2522 from Bevisy/main-2515 virtcontainers: Fix incorrect scripts path	2021-09-27 21:14:40 +08:00
Bin Liu	39df808f6a	Merge pull request #2695 from YchauWang/wyc-vc-cgroup runtime: clear virtcontainers cgroup duplicated function	2021-09-27 21:12:39 +08:00
Amulya Meka	13e65f2ee8	cmd: Fix mismatched types in testModuleData Rectify the values of testModuleData with the correct types in TestCCCheckCLiFunction in kata-check_(!x86)_test.go Fixes: #2735 Signed-off-by: Amulya Meka <amulmek1@in.ibm.com>	2021-09-27 07:17:07 +00:00
Peng Tao	05995632c3	Merge pull request #2566 from fgiudici/kata-monitor_improvements Kata monitor: cache improvements	2021-09-27 12:29:13 +08:00
David Gibson	907459c1c1	agent/device: Don't force PCI rescans The agent initiates a PCI rescan from two places. One is triggered for each virtio-blk PCI device, and one is triggered unconditionally when we start a new container. The PCI bus rescan code was added long time ago in Clear Containers due to lack of ACPI support in QEMU 2.9 + q35. Since Kata routinely plugs devices under a PCIe-to-PCI bridge, that left SHPC as the only available hotplug mechanism. However, while Kata was using SHPC on the qemu side, it wasn't actually using it on the guest side. Due to a quirk of our guest kernel configuration, the SHPC driver never bound to the bridge, and no hotplug was working at all. To work around that, Kata was forcing the rescan manually, which would discover the new device. That was very fragile (we were arguably relying on a kernel bug). Even if we were using SHPC propertly, it includes a mandatory 5s delay during plug operations (designed for physical cards and human operators), which makes it unsuitable quick start up. Worse, the forced PCI rescans could race with either SHPC or PCIe native hotplug sequences, causing several problems. In some cases this could put the device into an entirely broken state where it wouldn't respond to config space accesses at all. Since pull request #2323 was merged, we have instead used ACPI hotplug which is both fast, and more solid in terms of semantics and races. So, the forced PCI rescans are no longer necessary. Remove them all. fixes #683 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	75f426dd1e	agent: Simplify do_add_swap() do_add_swap() has some mildly complex code to translate the PCI path of a virtio-blk device (where the swap will reside) into a /dev path. However, the device module already has get_virtio_blk_pci_device_name() which does exactly that. The existing code has some further advantages: it uses more precise matching of the sysfs paths, and if necessary it will wait for the device to be added to the guest. While we're there, remove an unnecessary 'as u8' from the PCI path construction: pci::Path::new() already accepts anything which implements TryInfo<u8>, which u32 certainly does. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	aad1a8734f	runtime/device: Give the agent information about VFIO devices We send information about several kinds of devices to the agent so that it can apply specific handling. We don't currently do this with VFIO devices. However we need to do that so that the agent can properly wait for VFIO devices to be ready (previously it did that using a PCI rescan which may not be reliable and has some very bad side effects). This patch collates and sends the relevant information. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	ebd7b61884	runtime: Don't repeat GetDeviceByID between appendDevices() and append() Both appendBlockDevice and appendVhostUserBlkDevice start by using GetDeviceByID to lookup the api.Device object corresponding to their ContainerDevice object. However their common caller, appendDevices() has already done this. This changes it so the looked up api.Device is passed to the individual appendDevice() functions. This slightly reduces duplicated work, but more importantly it makes it clearer that append*Device() don't need to check for a nil result from GetDeviceByID, since the caller has already done that. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	ad45c52fbe	runtime/device: Record guest PCI path for VFIO devices For several device types which correspond to a PCI device in the guest we record the device's PCI path in the guest. We don't currently do that for VFIO devices, but we're going to need to for better handling of SR-IOV devices. To accomplish this, we have to determine the guest PCI path from the information the VMM gives us: For qemu, we query the slot of the device and its bridge from QMP. For cloud-hypervisor, the device add interface gives us a guest PCI address. In fact this represents a design error in the clh API - there's no way it can really know the guest PCI address in general. It works in this case, because clh doesn't use PCI bridges, so the device will always be on the root bus. Based on that, the PCI path is simply the device's slot number. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	5c2af3e308	runtime/device: Refactor hotplugVFIODevice() to have common exit path hotplugVFIODevice() has several different paths depending if we're plugging into a root port or a PCIE<->PCI bridge and if we're using a regular or mediated VFIO device. We're going to want some common code on the successful exit path here, so refactor the function to allow that without duplication. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	8bc71105f4	agent/device: Add device type for VFIO devices Currently, VFIO devices attached to a Kata container aren't described to the agent at all. We essentially just hope they're ready by the time we've entered the container proper, which is usually the case because of the PCI rescan - but that causes other problems. This adds a new device type to the agent representing VFIO devices. The agent will use its existing uevent watching mechanisms to wait for the associated guest PCI device to appear before proceeding. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	f7a2707505	agent: Move driver type constants into device.rs Currently the constants giving the names for each device/driver type in the protocol are in mount.rs, and used in device.rs. Since these constants are inherently related to, well, devices, it makes more sense to put them in device.rs and use them from mount.rs. This will become even more so with planned extensions which will add some device types that will not be used in mount.rs at all. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	5b1eb08bde	agent/uevent: Improve logging of wait_for_uevent() These messages will help when debugging matchers not matching properly. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
David Gibson	cf36fd87ad	runtime: Fix some leftover go fmt errors A few "go fmt" errors appear to have crept it. Clean them up with "go fmt ./..." in the src/runtime directory. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-27 12:46:33 +10:00
zhanghj	57e3712dbd	virtiofs: fix error report in TestVirtiofsdStart when go test running Initialize ctx with context.Background() instead of nil value. Fixes: #2718 Signed-off-by: zhanghj <zhanghj.lc@inspur.com>	2021-09-24 16:06:06 +08:00
Fabiano Fidêncio	279f8e9d03	Merge pull request #2590 from c3d/issue/2589-virtiofsd-perms virtiofs: Create shared directory with 0700 mode, not 0750	2021-09-24 09:16:40 +02:00
Eric Ernst	fa44e5c1e5	Merge pull request #2703 from egernst/watcher-fixup watcher: ensure we create target mount point for storage	2021-09-23 21:59:08 -07:00
Julio Montes	1766c93b08	Merge pull request #2662 from cmaf/tracing-stop-rootctx runtime: tracing: Use root context to stop tracing	2021-09-23 11:50:35 -05:00
Eric Ernst	272771dcf9	watcher: ensure we create target mount point for storage We would only create the target when updating files. We need to make sure that we create the target if the source is a directory. Without this, we'll fail to start a container that utilizes an empty configmap, for example. Add unit tests for this. Fixes: #2638 Signed-off-by: Eric Ernst <eric_ernst@apple.com>	2021-09-23 08:29:28 -07:00
Julio Montes	5d2a82fbf9	Merge pull request #2323 from dgibson/acpi-pcihp Replace SHPC with ACPI PCI hotplug for Kata guests	2021-09-23 09:55:31 -05:00
Francesco Giudici	8b0bc1f45e	kata-monitor: bump version to 0.2.0 We now support any container engine CRI compliant. Let's bump the kata-monitor version to 0.2.0. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Francesco Giudici	bfb556d56a	kata-monitor: refresh kata sandbox list on fs events This commit stops the container engine polling in favor of the kata sandbox storage path monitoring. The pod cache list is now refreshed based on fs events and synced with the container engine only when needed. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Francesco Giudici	0e854f3b80	kata-monitor: improve detection of kata workloads When the container engine is different than containerd or CRI-O we lack proper detection of kata workloads and consider all the pods as kata ones. Instead of querying the container engine for the lower level runtime used in each pod, check if a directory matching the pod exists in the virtualcontainers sandboxes storage path. This provides a container engine independent way to check for kata pods. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-23 14:32:09 +02:00
Fabiano Fidêncio	0ececc630f	Merge pull request #2666 from cmaf/tracing-newContainer-logger runtime: tracing: Fix logger passed in newContainer	2021-09-23 13:07:19 +02:00
Fabiano Fidêncio	e33c26ba18	Merge pull request #2622 from YchauWang/wyc-vc-api virtcontainers: update VC SandboxConfig API add SandboxBindMounts field	2021-09-23 13:05:33 +02:00
Fabiano Fidêncio	47170e302a	Merge pull request #2616 from Bevisy/main-2615 sandbox: Allow the device to be accessed,such as /dev/null and /dev/u…	2021-09-23 13:04:18 +02:00
David Gibson	8bbcb06af5	qemu: Disable SHPC hotplug Under certain circumstances[0] Kata will attempt to use SHPC hotplug for PCI devices on the guest. In fact we explicitly enable SHPC on our PCI to PCI bridges, regardless of the qemu default. SHPC was designed a long, long time ago for physical hotplugging and works very poorly for a virtual environment. In particular it has a mandatory 5s delay to allow a (real, human) operator to back out the operation if they press a button by mistake. This alone makes it unusable for a fast start up application like Kata. Worse, the agent forces a PCI rescan during startup. That will race with the SHPC hotplug operation causing the device to go into a bad state where config space can't be accessed from the guest at all. The only reason we've sort of gotten away with this is that our default guest kernel configuration triggers what's arguably a kernel bug effectively disabling SHPC. That makes the agent rescan the only reason we see the new device. Now that we require a qemu >=6.1, which includes ACPI PCI hotplug on the q35 machine, we can explicitly disable SHPC in all cases. It's nothing but trouble. fixes #2174 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-23 10:27:26 +10:00
David Gibson	cc4983eeac	runtime: Remove unused qemuArchBase.appendBridges definition qemuArchBase.appendBridges is never actually used, because the bare qemuArchBase type is itself never used (outside of unit tests). Instead all the subclasses of qemuArchBase override appendBridges() to call the very similar, but not identical genericAppendBridges. So, we can remove the qemuArchBase.appendBridges implementation. Furthermore, all those subclasses override appendBridges() in exactly the same way, and so we can remove those definitions and replace the base class qemuArchBase appendBridges() with that version, calling genericAppendBridges(). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-23 10:15:08 +10:00
David Gibson	e248de4616	vendor: Update govmm Update to commit `1b60b536f3`, in particular to get extensions to allow IO and memory window reservations to be set on PCI bridges. https://github.com/kata-containers/govmm/pull/201 Git log: `de039da` govmm/qemu: Let IO/memory reservations be specified for bridge devices Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-09-23 10:14:29 +10:00
wangyongchao.bj	3b0c4bf9a0	runtime: clear virtcontainers cgroup duplicated function There are `DeviceToDeviceCgroup` and `deviceToDeviceCgroup` two functions, creating a `specs.LinuxDeviceCgroup` object. We clear the new function `deviceToDeviceCgroup`. Fixes: #2694 Signed-off-by: wangyongchao.bj <wangyongchao.bj@inspur.com>	2021-09-22 15:13:34 +08:00
Fabiano Fidêncio	32c3fb71f2	Merge pull request #2546 from fengwang666/rootless-qemu-doc docs: documentation for running non-root VMM	2021-09-21 22:45:33 +02:00
Fabiano Fidêncio	2bee8bc6bd	Merge pull request #2432 from fengwang666/qemu-rootless runtime: run the QEMU VMM process with a non-root user	2021-09-21 21:37:02 +02:00
Feng Wang	305afc8b70	docs: documentation for running non-root VMM Documentation for running non-root QEMU VMM in Kata runtime Fixes: #2545 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-21 11:20:37 -07:00
Samuel Ortiz	3a4aca4d67	Merge pull request #2671 from YchauWang/wyc-runtime-config runtime: update .gitignore file cleare the vc shim config	2021-09-21 15:15:09 +02:00
Feng Wang	9a6d56f1ab	runtime: fix empty cgroup path validation error An empty cgroup path shouldn't fail cgroup creation Fixes #2674 Signed-off-by: Feng Wang <feng.wang@databricks.com>	2021-09-20 13:48:09 -07:00
Christophe de Dinechin	48fb1d9203	virtiofs: Create shared directory with 0700 mode, not 0750 A discussion on the Linux kernel mailing list [1] exposed that virtiofsd makes a core assumption that the file systems being shared are not accessible by any non-privileged user. We currently create the `shared` directory in the sandbox with the default `0750` permissions, which gives read and directory traversal access to the group. There is no real good reason for a non-root user to access the shared directory, and this is potentially dangerous. Fixes: #2589 [1]: https://lore.kernel.org/linux-fsdevel/YTI+k29AoeGdX13Q@redhat.com/ Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2021-09-20 10:47:18 +02:00
Francesco Giudici	afad910d0e	kata-monitor: add getSandboxFS() Retrieve the absolute sandbox storage path. We will soon need this to monitor the creation/deletion of new kata sandboxes. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00
Francesco Giudici	e38686f74d	runtime: add GetSandboxesStoragePath() The storage path we use to collect the sandbox files is defined in the virtcontainers/persist/fs package. We create the runtime socket in that storage path, by hardcoding the full path in the SocketAddress() function in the runtime package. This commit splits the hardcoded path by the socket address path so that the runtime package will be able to provide the storage path to all the components that may need it. Signed-off-by: Francesco Giudici <fgiudici@redhat.com>	2021-09-20 10:37:55 +02:00

1 2 3 4 5 ...

1429 Commits