kata-containers

mirror of https://github.com/aljazceru/kata-containers.git synced 2026-01-14 11:54:28 +01:00

Author	SHA1	Message	Date
Fabiano Fidêncio	efb0ac55c8	runtime: config: tdx: Enable service_offload This also as mistakenly overwritten by the `main` -> `CCv0` merge. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-24 07:57:49 +02:00
Fabiano Fidêncio	8b4b233358	runtime: config: Fix image path for QEMU TDX The rebase from `main` to `CCv0` ended up overwriting the image path that should be used for QEMU, in the CCv0 branch. Fixes: #6932 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-24 07:57:22 +02:00
Fabiano Fidêncio	1f9ed94d74	runtime: Fix QEMU cmdline for TDX This commit should've been part of the series that reverted a bunch of TDX changes that are not compatible with the TDX stack we're using in the Jenkins CI machine. The change made here is in order to match what's been undone here: `c29e5036a6` Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-22 11:29:46 +02:00
Fabiano Fidêncio	6763c41d7e	Merge pull request #6886 from fidencio/topic/cc-stick-to-2022ww44-for-tdx CC: tdx: Stick to the 2022ww44 TDX stack for the CCv0 branch	2023-05-19 11:55:13 +02:00
Fabiano Fidêncio	fe6e918ddc	Revert "virtcontainers: Drop check for the `tdx` CPU flag" This reverts commit `25b3cdd38c`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:55:20 +02:00
Fabiano Fidêncio	2962d8db45	Revert "runtime/qemu: Drop "kvm-type=tdx"" This reverts commit `ed145365ec`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:55:03 +02:00
Fabiano Fidêncio	eee621f431	Revert "govmm: Set "sept-ve-disable=on"" This reverts commit `3c5ffb0c85`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:54:46 +02:00
Fabiano Fidêncio	e005dfa718	Revert "govmm: Directly pass the firmware using -bios with TDX" This reverts commit `3e15800199`. As the Jenkins TDX CI is running on a system with a TDX stack called "2022ww44", we should keep the QEMU / kernel / OVMF versions matching what's provided in that stack. The reason we were able to update this on `main` is because the GHA TDX CI is running on a TDX stack called "2023ww01", but we have decided to NOT take the bullet, NOT updating the Jenkins CI in order to avoid unexepected breakages. This regression was introduced as part of the last CCv0 merge to main, and would've been caught by the CI, and should've been caught by the reviewer (myself :-)), but CI was having a hard time to even build the compoenents and I wrote in the PR and I'm quoting it here: "I rather deal with possible breakages on this later on, than block this PR to get in." ... and here we are. :-) Fixes: #6884 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-18 12:50:11 +02:00
Fabiano Fidêncio	1222800f7c	runtime: Use static_sandbox_resource_mgmt=true for TEEs When this option is enabled the runtime will attempt to determine the appropriate sandbox size (memory, CPU) before booting the virtual machine. As TEEs do not support memory and CPU hotplug, this approach must be used. Fixes: #6818 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-05-17 20:50:45 +02:00
Fabiano Fidêncio	f07b27d445	Merge pull request #6559 from stevenhorsman/CCv0-merge-30th-mar CCv0: Merge main into CCv0 branch	2023-05-17 17:00:39 +02:00
Pradipta Banerjee	1f0d709be6	CC: Add configurable context timeout for StopVM in remote hyp Add configurable context timeout for StopVM in remote hypervisor similar to StartVM Fixes: #6730 Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com>	2023-05-04 10:42:30 +00:00
stevenhorsman	9bdd9af898	agent: Update image_rpc - Update image_rpc to reflect the changes in code gen Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-04-27 15:12:13 +01:00
stevenhorsman	8b8e71fa65	protocols: Update type - Fix merge issue Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-04-27 11:25:41 +01:00
stevenhorsman	dbe1fd9436	CCv0: Merge main into CCv0 branch Merge remote-tracking branch 'upstream/main' into CCv0 Fixes: #6558 Signed-off-by: stevenhorsman <steven@uk.ibm.com>	2023-04-27 09:42:44 +01:00
Bin Liu	509bc8b6c8	Merge pull request #6718 from openanolis/mengze/keep_abnormal runtime-rs: support keep_abnormal in toml config	2023-04-26 12:36:52 +08:00
mengze	cc8ea3232e	runtime-rs: support keep_abnormal in toml config This patch adds keep_abnormal in runtime config. If keep_abnormal = true, it means that 1) if the runtime exits abnormally, the cleanup process will be skipped, and 2) the runtime will not exit even if the health check fails. This option is typically used to retain abnormal information for debugging and should NOT be enabled by default. Fixes: #6717 Signed-off-by: mengze <mengze@linux.alibaba.com> Signed-off-by: quanweiZhou <quanweiZhou@linux.alibaba.com>	2023-04-25 13:47:44 +08:00
David Esparza	7fdaab49bc	Merge pull request #6295 from dborquez/add_kernel_module_checks_kvm kata-ctl: checks for kvm, kvm_intel modules loaded	2023-04-24 13:33:18 -06:00
Greg Kurz	bc101b7f56	rustjail: Use CPUWeight with systemd and CgroupsV2 The CPU shares property belongs to CgroupsV1. CgroupsV2 uses CPU weight instead. The correct value is computed in the latter case but it is passed to systemd using the legacy property. Systemd rejects the request and the agent exists with the following error : Value specified in CPUShares is out of range: unknown Replace the "shares" wording with "weight" in the CgroupsV2 code to avoid confusions. Use the "CPUWeight" property since this is what systemd expects in this case. Fixes #6636 References: https://www.freedesktop.org/software/systemd/man/systemd.resource-control.html#CPUWeight=weight https://www.freedesktop.org/software/systemd/man/systemd.resource-control.html#systemd%20252 https://github.com/containers/crun/blob/main/crun.1.md#cpu-controller Signed-off-by: Greg Kurz <groug@kaod.org> (cherry picked from commit `c1fbaae8d6`)	2023-04-24 15:43:27 +01:00
David Esparza	432d407440	kata-ctl: checks for kvm, kvm_intel modules loaded Ensure that kvm and kvm_intel modules are loaded. Renames the get_cpu_info() function to read_file_contents() Fixes #5332 Signed-off-by: David Esparza <david.esparza.borquez@intel.com>	2023-04-20 11:29:36 -06:00
Fupan Li	ceefd50bd0	Merge pull request #6680 from Tim-Zhang/fix-ut-bad-fd agent: Fix ut issue caused by fd double closed	2023-04-20 11:18:27 +08:00
Fupan Li	a7b4b69230	Merge pull request #6673 from Tim-Zhang/upgrade-ttrpc-protobuf Bump ttrpc to 0.7.2 and protobuf to 3.2.0	2023-04-20 10:13:43 +08:00
Fupan Li	a1568cd2f5	Merge pull request #6676 from zvonkok/gpu-runtime gpu: Add GPU enabled confguration and runtime	2023-04-19 13:01:49 +08:00
Tim Zhang	53c749a9de	agent: Fix ut issue caused by fd double closed Never ever try to close the same fd double times, even in a unit test. A file descriptor is a number which will be reused, so when you close the same number twice you may close another file descriptor in the second time and then there will be an error 'Bad file descriptor (os error 9)' while the wrongly closed fd is being used. Fixes: #6679 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-18 23:19:10 +08:00
Tim Zhang	2e3f19af92	agent: fix clippy warnings caused by protobuf3 Fix warnings introduced by protobuf upgrade. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 20:15:49 +08:00
Tim Zhang	4849c56faa	agent: Fix unit test issue cuased by protobuf upgrade Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	0a582f7815	trace-forwarder: remove unused crate protobuf Remove unused crate protobuf. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	73253850e6	kata-ctl: remove unused crate ttrpc Remove unused crate ttrpc. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	76d2e30547	agent-ctl: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	eb3d20dccb	protocols: Add ut for Serde Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	59568c79dd	protocols: add support for Serde rust-protobuf@3 does not support Serde natively anymore. So we need to do it by ourselves. Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:21 +08:00
Tim Zhang	a6b4d92c84	runtime-rs: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 19:49:20 +08:00
Zvonko Kaiser	a81fff706f	gpu: Adding a GPU enabled configuration We need to set hotplug on pci root port and enable at least one root port. Also set the guest-hooks-dir to the correct path Fixes: #6675 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:40:09 +00:00
Tim Zhang	8af6fc77cd	agent: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 18:31:41 +08:00
Tim Zhang	009b42dbff	protocols: Fix unit test Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 18:31:41 +08:00
Tim Zhang	392732e213	protocols: Bump ttrpc from 0.6.0 to 0.7.1 Fixes: #6646 Signed-off-by: Tim Zhang <tim@hyper.sh>	2023-04-17 18:31:35 +08:00
Zvonko Kaiser	f4f958d53c	gpu: Do not pass-through PCI (Host) Bridges On some systems a GPU is in a IOMMU group with a PCI Bridge and PCI Host Bridge. Per default no PCI Bridge needs to be passed-through. When scanning the IOMMU group, ignore devices with a 0x60 class ID prefix. Fixes: #6663 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-04-17 10:08:23 +00:00
Fabiano Fidêncio	fffe2c6082	Merge pull request #6648 from fidencio/topic/gha-tdx-improvements-and-fixes gha: tdx: Ensure kata-deploy is removed after the tests run	2023-04-15 00:21:31 +02:00
Fabiano Fidêncio	a3b2358c77	agent: Update image-rs to v0.5.1 Let's update image-rs to the latest released version of the project. Fixes: #6650 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-14 15:20:35 +02:00
Fabiano Fidêncio	dc662333df	runtime: Increase the dial_timeout When testing on AKS, we've been hitting the dial_timeout every now and then. Let's increase it to 45 seconds (instead of 30) for all the VMMs, and to 60 seconfs in case of TEEs. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 22:42:52 +02:00
Fabiano Fidêncio	f478b9115e	clh: tdx: Update timeouts for confidential guest Booting up TDX takes more time than booting up a normal VM. Those values are being already used as part of the CCv0 branch, and we're just bringing them to the `main` branch as well. Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-04-13 10:18:07 +02:00
Alexandru Matei	db2cac34d8	runtime: Don't create socket file in /run/kata The socket file for shim management is created in /run/kata and it isn't deleted after the container is stopped. After running and stopping thousands of containers /run folder will run out of space. Fixes #6622 Signed-off-by: Alexandru Matei <alexandru.matei@uipath.com> Co-authored-by: Greg Kurz <groug@kaod.org>	2023-04-13 10:21:29 +03:00
Wainer dos Santos Moschetta	2bd2b12a0a	agent: update image-rs dependency In preparation for CoCo 0.5 release, updated image-rs to commit b28eaae. Fixes #6650 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2023-04-12 18:22:33 -03:00
Zhongtao Hu	328793bb27	Merge pull request #6585 from Apokleos/nydus_prefetch_files nydus_rootfs/prefetch_files: add prefetch_files for RAFS	2023-04-12 19:58:36 +08:00
Steve Horsman	dfa7ed06eb	Merge pull request #6617 from yoheiueda/remote-static-resmgmt CCv0: runtime: Use static_sandbox_resource_mgmt by default in remote hypervisor	2023-04-12 07:49:11 +01:00
Zhongtao Hu	fef531f565	Merge pull request #6618 from Apokleos/virtiofs_extra_cache_mode runtime-rs/virtio-fs: add support extra handler for cache mode.	2023-04-12 14:40:05 +08:00
Bin Liu	9327bb0912	Merge pull request #6639 from openanolis/nerdctl runtime-rs: enable nerdctl to setup cni plugin	2023-04-12 12:04:37 +08:00
Zhongtao Hu	69ba2098f8	runtime-rs: remove network entities and netns remove network entities and netns Fixes:#4693 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-04-12 10:21:06 +08:00
Zhongtao Hu	b31f103d12	runtime-rs: enable nerdctl cni plugin 1. when we use nerdctl to setup network for kata, no netns is created by nerdctl, kata need to create netns by its own 2. after start VM, nerdctl will call cni plugin via oci hook, we need to rescan the netns after the interfaces have been created, and hotplug the network device into the VM Fixes:#4693 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-04-12 10:21:04 +08:00
Fabiano Fidêncio	3b3656d96d	Merge pull request #6522 from fidencio/topic/add-tdx-artefacts-from-2023ww01-to-main tdx: Add artefacts from the latest TDX tools release into main	2023-04-11 20:43:02 +02:00
Fabiano Fidêncio	50ce33b02d	Merge pull request #6205 from fengwang666/non-root-clh runtime: support non-root for clh	2023-04-11 19:34:00 +02:00

1 2 3 4 5 ...

3452 Commits