kata-containers

mirror of https://github.com/aljazceru/kata-containers.git synced 2026-01-19 06:14:33 +01:00

Author	SHA1	Message	Date
Fabiano Fidêncio	1f9ed94d74	runtime: Fix QEMU cmdline for TDX This commit should've been part of the series that reverted a bunch of TDX changes that are not compatible with the TDX stack we're using in the Jenkins CI machine. The change made here is in order to match what's been undone here: `c29e5036a6` Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-22 11:29:46 +02:00
Fabiano Fidêncio	6763c41d7e	Merge pull request #6886 from fidencio/topic/cc-stick-to-2022ww44-for-tdx CC: tdx: Stick to the 2022ww44 TDX stack for the CCv0 branch	2023-05-19 11:55:13 +02:00
Fabiano Fidêncio	fe6e918ddc	Revert "virtcontainers: Drop check for the `tdx` CPU flag" This reverts commit `25b3cdd38c`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:55:20 +02:00
Fabiano Fidêncio	2962d8db45	Revert "runtime/qemu: Drop "kvm-type=tdx"" This reverts commit `ed145365ec`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:55:03 +02:00
Fabiano Fidêncio	eee621f431	Revert "govmm: Set "sept-ve-disable=on"" This reverts commit `3c5ffb0c85`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:54:46 +02:00
Fabiano Fidêncio	e005dfa718	Revert "govmm: Directly pass the firmware using -bios with TDX" This reverts commit `3e15800199`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:50:11 +02:00
Fabiano Fidêncio	1222800f7c	runtime: Use static_sandbox_resource_mgmt=true for TEEs When this option is enabled the runtime will attempt to determine the appropriate sandbox size (memory, CPU) before booting the virtual machine. As TEEs do not support memory and CPU hotplug, this approach must be used. Fixes: #6818 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-17 20:50:45 +02:00
Fabiano Fidêncio	f07b27d445	Merge pull request #6559 from stevenhorsman/CCv0-merge-30th-mar CCv0: Merge main into CCv0 branch	2023-05-17 17:00:39 +02:00
Pradipta Banerjee	1f0d709be6	CC: Add configurable context timeout for StopVM in remote hyp Add configurable context timeout for StopVM in remote hypervisor similar to StartVM Fixes: #6730 Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com>	2023-05-04 10:42:30 +00:00
stevenhorsman	dbe1fd9436	CCv0: Merge main into CCv0 branch Merge remote-tracking branch 'upstream/main' into CCv0 Fixes: #6558 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-04-27 09:42:44 +01:00
Fupan Li	a1568cd2f5	Merge pull request #6676 from zvonkok/gpu-runtime gpu: Add GPU enabled confguration and runtime	2023-04-19 13:01:49 +08:00
Zvonko Kaiser	a81fff706f	gpu: Adding a GPU enabled configuration We need to set hotplug on pci root port and enable at least one root port. Also set the guest-hooks-dir to the correct path Fixes: #6675 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:40:09 +00:00
Zvonko Kaiser	f4f958d53c	gpu: Do not pass-through PCI (Host) Bridges On some systems a GPU is in a IOMMU group with a PCI Bridge and PCI Host Bridge. Per default no PCI Bridge needs to be passed-through. When scanning the IOMMU group, ignore devices with a 0x60 class ID prefix. Fixes: #6663 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:08:23 +00:00
Fabiano Fidêncio	fffe2c6082	Merge pull request #6648 from fidencio/topic/gha-tdx-improvements-and-fixes gha: tdx: Ensure kata-deploy is removed after the tests run	2023-04-15 00:21:31 +02:00
Fabiano Fidêncio	dc662333df	runtime: Increase the dial_timeout When testing on AKS, we've been hitting the dial_timeout every now and then. Let's increase it to 45 seconds (instead of 30) for all the VMMs, and to 60 seconfs in case of TEEs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 22:42:52 +02:00
Fabiano Fidêncio	f478b9115e	clh: tdx: Update timeouts for confidential guest Booting up TDX takes more time than booting up a normal VM. Those values are being already used as part of the CCv0 branch, and we're just bringing them to the `main` branch as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Alexandru Matei	db2cac34d8	runtime: Don't create socket file in /run/kata The socket file for shim management is created in /run/kata and it isn't deleted after the container is stopped. After running and stopping thousands of containers /run folder will run out of space. Fixes #6622 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com> Co-authored-by: Greg Kurz <groug@kaod.org>	2023-04-13 10:21:29 +03:00
Steve Horsman	dfa7ed06eb	Merge pull request #6617 from yoheiueda/remote-static-resmgmt CCv0: runtime: Use static_sandbox_resource_mgmt by default in remote hypervisor	2023-04-12 07:49:11 +01:00
Fabiano Fidêncio	3b3656d96d	Merge pull request #6522 from fidencio/topic/add-tdx-artefacts-from-2023ww01-to-main tdx: Add artefacts from the latest TDX tools release into main	2023-04-11 20:43:02 +02:00
Fabiano Fidêncio	50ce33b02d	Merge pull request #6205 from fengwang666/non-root-clh runtime: support non-root for clh	2023-04-11 19:34:00 +02:00
Fabiano Fidêncio	98682805be	config: Add configuration for QEMU TDX As the QEMU configuration for TDX differs quite a lot from the normal QEMU configuration, let's add a new configuration file for the QEMU TDX. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 16:10:35 +02:00
Fabiano Fidêncio	3e15800199	govmm: Directly pass the firmware using -bios with TDX Since TDX doesn't support readonly memslot, TDVF cannot be mapped as pflash device and it actually works as RAM. "-bios" option is chosen to load TDVF. OVMF is the opensource firmware that implements the TDVF support. Thus the command line to specify and load TDVF is ``-bios OVMF.fd`` Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	3c5ffb0c85	govmm: Set "sept-ve-disable=on" This is needed since 22ww49. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	ed145365ec	runtime/qemu: Drop "kvm-type=tdx" This is not supported since 22ww49. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	25b3cdd38c	virtcontainers: Drop check for the `tdx` CPU flag In the recent kernels provided by Intel the `tdx` CPU flag is not present anymore. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Fabiano Fidêncio	01bdacb4e4	virtcontainers: Also check /sys/firmwares/tdx for TDX Let's make sure we also check /sys/firmwares/tdx for TDX guest protection, as the location may depend on whether TDX Seam is being used or not. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-11 15:23:42 +02:00
Hyounggyu Choi	b73f26440e	CC: Revive removed runtime config for IBM SE This is just to get the removed runtime config for IBM SE back again. Fixes: #6624 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-04-06 20:18:28 +09:00
Yohei Ueda	938447803b	runtime: Use static resource in remote hypervisor This patch updates the template configuration file for the remote hypervisor to set static_sandbox_resource_mgmt to be true. The remote hypervisor uses the peer pod config to determine the sandbox size, so requires this to be set to true by default. Fixes: #6616 Signed-off-by: Yohei Ueda <yohei@jp.ibm.com>	2023-04-06 16:53:05 +09:00
James O. D. Hunt	cbe6f04194	Merge pull request #6501 from shippomx/dev_metrics runtime: add filter metrics with specific names	2023-04-05 15:15:09 +01:00
Jeremi Piotrowski	5c27f27af0	Merge pull request #6135 from Alex-Carter01/snp-config config: Add SEV SNP config	2023-03-31 20:10:08 +02:00
Alex Carter	28fd92c478	kata-deploy: SNP version of Qemu An SNP runtimeclass needs a build of Qemu from https://github.com/AMDESE/qemu/tree/snp-v3. So a new target needs to be added to add it to a kata-deploy bundle. Building requires a qemu no_patches file Fixes: #6061 Signed-Off-By: Alex Carter <alex.carter@ibm.com>	2023-03-31 14:58:38 +00:00
Alex Carter	2b5c84b9ec	config: Add SEV SNP config Adding SNP template for config, and values to the runtime makefile to generate the output toml. Based on [Joana's SNP Guide](https://github.com/kata-containers/kata-containers/blob/main/docs/how-to/how-to-run-kata-containers-with-SNP-VMs.md). Fixes: #6112 Signed-Off-By: Alex Carter <alex.carter@ibm.com>	2023-03-31 13:51:09 +00:00
Pradipta Banerjee	3081cd5f8e	runtime: propagate configmap/secrets etc changes for remote-hyp For remote hypervisor, the configmap, secrets, downward-api or project-volumes are copied from host to guest. This patch watches for changes to the host files and copies the changes to the guest. Note that configmap updates takes significantly longer than updates via downward-api. This is similar across runc and Kata runtimes. Fixes: #6341 Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com> Signed-off-by: Julien Ropé <jrope@redhat.com>	2023-03-30 03:14:52 +00:00
Miao Xia	0f73515561	runtime: add filter metrics with specific names The kata monitor metrics API returns a huge size response, if containers or sandboxs are a large number, focus on what we need will be harder. Fixes: #6500 Signed-off-by: Miao Xia <xia.miao1@zte.com.cn>	2023-03-28 14:56:13 +08:00
Bin Liu	75987aae72	Merge pull request #6408 from jongwu/nydus_rm_hybrid nydus: upgrad to v2.2.0	2023-03-28 11:07:56 +08:00
Steve Horsman	4b027ba664	Merge pull request #6506 from Megan-Wright/CCv0 CCv0: Merge main into CCv0 branch	2023-03-27 14:42:03 +01:00
James O. D. Hunt	f06f72b5e9	Merge pull request #6467 from jongwu/qemu-uefi-path qemu/arm64: disable image nvdimm once no firmware offered	2023-03-22 08:43:01 +00:00
stevenhorsman	7c9a791d67	config: Add initial remote hypervisor config Remote hypervisor template config Fixes: #6349 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-03-20 17:58:27 +00:00
Megan Wright	f31c907f46	Fix bad merge	2023-03-20 13:51:10 +00:00
Megan Wright	42978f3e83	CCv0: Merge main into CCv0 branch Merge remote-tracking branch 'upstream/main' into CCv0 Fixes: #6504 Signed-off-by: Megan Wright <megan.wright@ibm.com>	2023-03-20 13:23:49 +00:00
Jianyong Wu	ece5edc641	qemu/arm64: disable image nvdimm if no firmware offered For now, image nvdimm on qemu/arm64 depends on UEFI/ACPI, so if there is no firmware offered, it should be disabled. Fixes: #6468 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-03-20 18:03:05 +08:00
Hyounggyu Choi	96baa83895	agent: Bring in VFIO-AP device handling again This PR is a continuing work for (kata-containers#3679). This generalizes the previous VFIO device handling which only focuses on PCI to include AP (IBM Z specific). Fixes: kata-containers#3678 Signed-off-by: Hyounggyu Choi <Hyounggyu.Choi@ibm.com>	2023-03-16 18:14:12 +09:00
Jakob Naucke	f666f8e2df	agent: Add VFIO-AP device handling Initial VFIO-AP support (#578) was simple, but somewhat hacky; a different code path would be chosen for performing the hotplug, and agent-side device handling was bound to knowing the assigned queue numbers (APQNs) through some other means; plus the code for awaiting them was written for the Go agent and never released. This code also artificially increased the hotplug timeout to wait for the (relatively expensive, thus limited to 5 seconds at the quickest) AP rescan, which is impractical for e.g. common k8s timeouts. Since then, the general handling logic was improved (#1190), but it assumed PCI in several places. In the runtime, introduce and parse AP devices. Annotate them as such when passing to the agent, and include information about the associated APQNs. The agent awaits the passed APQNs through uevents and triggers a rescan directly. Fixes: #3678 Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 10:07:48 +09:00
Jakob Naucke	b546eca26f	runtime: Generalize VFIO devices Generalize VFIO devices to allow for adding AP in the next patch. The logic for VFIOPciDeviceMediatedType() has been changed and IsAPVFIOMediatedDevice() has been removed. The rationale for the revomal is: - VFIODeviceMediatedType is divided into 2 subtypes for AP and PCI - Logic of checking a subtype of mediated device is included in GetVFIODeviceType() - VFIOPciDeviceMediatedType() can simply fulfill the device addition based on a type categorized by GetVFIODeviceType() Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 10:06:37 +09:00
Jakob Naucke	4c527d00c7	agent: Rename VFIO handling to VFIO PCI handling e.g., split_vfio_option is PCI-specific and should instead be named split_vfio_pci_option. This mutually affects the runtime, most notably how the labels are named for the agent. Signed-off-by: Jakob Naucke <jakob.naucke@ibm.com>	2023-03-16 07:43:39 +09:00
Fabiano Fidêncio	814d07af58	Merge pull request #6463 from sprt/sprt/mshv-compat runtime: add support for Hyper-V	2023-03-15 18:03:25 +01:00
Steve Horsman	c3059d0bd9	Merge pull request #6207 from BbolroC/s390x-prepare-se-image CC: Prepare secure execution image tarball for s390x	2023-03-15 14:33:59 +00:00
Steve Horsman	5da86f3e25	Merge pull request #6440 from GeorginaKin/CCv0 CCv0: Merge main into CCv0 branch	2023-03-15 10:40:55 +00:00
Henry Beberman	974a5c22f0	runtime: add support for Hyper-V This adds /dev/mshv to the list of sandbox devices so that VMMs can create Hyper-V VMs. In our testing, this also doesn't error out in case /dev/mshv isn't present. Fixes #6454. Signed-off-by: Aurélien Bombo <abombo@microsoft.com>	2023-03-13 17:13:51 -07:00
Sidhartha Mani	a6c67a161e	runtime: add support for ephemeral mounts to occupy entire sandbox memory On hotplug of memory as containers are started, remount all ephemeral mounts with size option set to the total sandbox memory Fixes: #6417 Signed-off-by: Sidhartha Mani <sidhartha_mani@apple.com>	2023-03-10 13:36:02 -08:00

1 2 3 4 5 ...

1710 Commits