With the introduction of CDI, the de-facto standard for enabling devices
in containers, we need to do proper sandbox sizing when cold-plugging
devices. containerd passes through annotations for the accumulated CPU,
memory and now also the CDI devices. With that information the sandbox
size can be derived correctly.
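For illustration only, here is a sketch of how the sandbox size could be
derived from those annotations. The keys follow containerd's
io.kubernetes.cri.sandbox-* and CDI's cdi.k8s.io/ conventions; the helper
itself is an assumption, not the actual runtime code:
```
// Sketch only: derive the sandbox size from the annotations containerd
// passes through to the shim. Hypothetical helper, not the runtime's code.
package main

import (
	"strconv"
	"strings"
)

const (
	cpuQuotaKey  = "io.kubernetes.cri.sandbox-cpu-quota"
	cpuPeriodKey = "io.kubernetes.cri.sandbox-cpu-period"
	memoryKey    = "io.kubernetes.cri.sandbox-memory"
	cdiPrefix    = "cdi.k8s.io/" // CDI device requests, e.g. "vendor.com/gpu=0"
)

// sandboxSizeFromAnnotations returns the vCPU count, memory in bytes and
// the CDI devices requested for the whole sandbox.
func sandboxSizeFromAnnotations(ann map[string]string) (vcpus uint32, memBytes int64, cdiDevices []string) {
	quota, _ := strconv.ParseUint(ann[cpuQuotaKey], 10, 64)
	period, _ := strconv.ParseUint(ann[cpuPeriodKey], 10, 64)
	if quota > 0 && period > 0 {
		// Round up, e.g. quota=150000 / period=100000 -> 2 vCPUs.
		vcpus = uint32((quota + period - 1) / period)
	}
	memBytes, _ = strconv.ParseInt(ann[memoryKey], 10, 64)

	for key, value := range ann {
		if strings.HasPrefix(key, cdiPrefix) {
			cdiDevices = append(cdiDevices, value)
		}
	}
	return vcpus, memBytes, cdiDevices
}
```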
Fixes: #7331
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
Ignore the cyclomatic complexity failure. I have fixed this in my PR that
is waiting to forward-port the remote-hypervisor support into main.
Signed-off-by: stevenhorsman <steven@uk.ibm.com>
- Add annotation enablement for machine_type, default_memory and
default_vcpus
- Remove the note that says that CPU and memory settings are ignored.
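As a rough, hypothetical sketch (not the runtime's actual code) of what
annotation enablement means: an annotation is only honoured if its name
matches one of the enabled patterns.
```
// Sketch: honour an annotation only when its name matches one of the
// patterns listed in an enable_annotations-style option.
package main

import "regexp"

func annotationEnabled(name string, enabled []string) bool {
	for _, pattern := range enabled {
		if ok, _ := regexp.MatchString(pattern, name); ok {
			return true
		}
	}
	return false
}

// Example: annotationEnabled("machine_type",
//	[]string{"machine_type", "default_memory", "default_vcpus"}) == true
```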
Fixes: #7256
Signed-off-by: stevenhorsman <steven@uk.ibm.com>
Currently, even when using devmapper, if the VMM supports virtio-fs /
virtio-9p, that's used to share a few files between the host and the
guest.
This is *needed*, as we have to share content such as secrets,
certificates, and configuration with the guest, via Kubernetes objects
like ConfigMaps or Secrets, and those are rotated and must be updated
in the guest whenever the rotation happens.
However, there are still use-cases where users can live with just
copying those files into the guest at pod creation time, and for those
there's absolutely no need to have a shared filesystem process running
with no obvious extra benefit, consuming memory and even increasing the
attack surface of Kata Containers.
For the case mentioned above, we should allow users to run Kata
Containers with devmapper without actually having to use a shared file
system, making it very clear which limitations this brings. This is
already the approach taken when using Firecracker as the VMM.
Fixes: #7207
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
In order to support different pod VM instance types via the
remote hypervisor implementation (cloud-api-adaptor),
we need to pass the machine_type, default_vcpus
and default_memory annotations to cloud-api-adaptor.
The cloud-api-adaptor then uses these annotations to spin
up the appropriate cloud instance.
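For illustration, the keys follow Kata's
io.katacontainers.config.hypervisor.* annotation convention; the values
below are made-up examples of what a user might request:
```
// Sketch: pod annotations that cloud-api-adaptor could map to a cloud
// instance size. The values are illustrative assumptions.
package main

var podVMSizingAnnotations = map[string]string{
	"io.katacontainers.config.hypervisor.machine_type":   "m5.large", // hypothetical instance type
	"io.katacontainers.config.hypervisor.default_vcpus":  "2",
	"io.katacontainers.config.hypervisor.default_memory": "4096", // MiB
}
```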
Reference PR for cloud-api-adaptor:
https://github.com/confidential-containers/cloud-api-adaptor/pull/1088
Fixes: #7140
Signed-off-by: Pradipta Banerjee <pradipta.banerjee@gmail.com>
The `-o` option is the legacy way to configure virtiofsd, inherited
from the C implementation. The rust implementation honours it for
compatibility but it logs deprecation warnings.
Let's use the replacement options in the go shim code. Also drop
references to `-o` from the configuration TOML file.
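A hedged sketch of what the shim-side argument building can look like
with the rust daemon's long options (only --socket-path, --shared-dir,
--cache and --announce-submounts are assumed here; this is not the shim's
actual code):
```
// Sketch: build virtiofsd arguments with the rust daemon's long options
// instead of the deprecated "-o key=value" form.
package main

func virtiofsdArgs(socketPath, sharedDir, cache string) []string {
	return []string{
		"--socket-path=" + socketPath,
		"--shared-dir=" + sharedDir,
		"--cache=" + cache, // e.g. "auto", replacing "-o cache=auto"
		"--announce-submounts",
	}
}
```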
Fixes: #7111
Signed-off-by: Greg Kurz <groug@kaod.org>
The C implementation of virtiofsd had some kind of limited support
for remote POSIX locks that was causing some workflows to fail with
Kata. Commit 432f9bea6e hard-coded `-o no_posix_lock` in order
to enforce guest local POSIX locks and avoid the issues.
We've switched to the rust implementation of virtiofsd since then,
but it emits a warning about `-o` being deprecated.
According to https://gitlab.com/virtio-fs/virtiofsd/-/issues/53 :
The C implementation of the daemon has limited support for
remote POSIX locks, restricted exclusively to non-blocking
operations. We tried to implement the same level of
functionality in #2, but we finally decided against it because,
in practice most applications will fail if non-blocking
operations aren't supported.
Implementing support for non-blocking isn't trivial and will
probably require extending the kernel interface before we can
even start working on the daemon side.
There is thus no justification to pass `-o no_posix_lock` anymore.
Signed-off-by: Greg Kurz <groug@kaod.org>
The rust implementation of virtiofsd always runs in the foreground and
emits a deprecation warning when `-f` is passed.
Signed-off-by: Greg Kurz <groug@kaod.org>
If we override the cold or hot plug setting with an annotation,
we need to reset the other plugging mechanism to NoPort,
otherwise both will be enabled.
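In rough pseudo-Go (field and constant names are illustrative, not the
actual ones):
```
// Sketch: when an annotation selects one plugging mechanism, the other is
// reset to "no port" so that only a single mechanism stays enabled.
package main

type pciePort string

const noPort pciePort = "no-port"

func applyPlugOverride(coldPlug, hotPlug pciePort, annotationSelectsColdPlug bool) (pciePort, pciePort) {
	if annotationSelectsColdPlug {
		hotPlug = noPort // cold plug overridden: disable hot plug
	} else {
		coldPlug = noPort // hot plug overridden: disable cold plug
	}
	return coldPlug, hotPlug
}
```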
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
In Virt the vhost-user-block is a PCIe device, so
we need to make sure to consider it as well. We now keep
track of vhost-user-block devices and deduce the correct
number of PCIe root ports.
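Conceptually (a sketch, not the actual topology code):
```
// Sketch: deduce the number of PCIe root ports from every device that
// needs one, now including vhost-user-block next to VFIO devices.
package main

func deduceRootPorts(numVFIODevices, numVhostUserBlkDevices uint32) uint32 {
	return numVFIODevices + numVhostUserBlkDevices
}
```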
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
It is now possible to configure the PCIe topology via annotations.
Also added a simple test, checking for Invalid and RootPort.
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
Removed the configuration of PCIeRootPort and PCIeSwitchPort; those
values can be deduced in createPCIeTopology.
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
Refactor the bus assignment so that the call to GetAllVFIODevicesFromIOMMUGroup
can be used by any module without affecting the topology.
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
The hypervisor_state file was the wrong location for the PCIe port
settings. Moved everything under the device umbrella, where it can be
consumed more easily and we do not run into circular deps.
Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>
The merge of main back into CCv0 caused the SNP QEMU build to move from
install_qemu to install_qemu_experimental.
Thus, reflect this change in the QEMU SNP command.
Fixes: #7059
Signed-off-by: Unmesh Deodhar <udeodhar@amd.com>
Some VMMs, such as dragonball, will actively help us
perform online CPU operations when doing CPU hotplug.
Under the old onlineCpuMem interface, it is difficult
to adapt to this situation.
So we modify the semantics of nb_cpus in onlineCpuMemRequest.
In the original semantics, nb_cpus represented the number of
newly added CPUs that need to be brought online. With the new
semantics, nb_cpus is the total number of CPUs that must be
online in the guest.
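A sketch of what the new semantics imply on the receiving side (the
helper is hypothetical): if the VMM has already onlined some CPUs, only
the difference is left to do.
```
// Sketch: nb_cpus is now the total number of vCPUs that must be online in
// the guest, so only the gap to the currently online count is hotplugged.
package main

func cpusLeftToOnline(nbCPUs, currentlyOnline uint32) uint32 {
	if nbCPUs <= currentlyOnline {
		return 0 // already satisfied, e.g. the VMM (dragonball) onlined them
	}
	return nbCPUs - currentlyOnline
}
```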
Fixes: #5030
Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>
Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>
Nobody has volunteered to maintain the (currently broken) snap build, so
remove it.
Fixes: #6769.
Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>
Once we have a guest kernel with a builtin initramfs that provides
the rootfs measurement capability, and a Kata rootfs image with a
hash device, we need to set the related root hash value and
measurement config in the kernel params of the Kata configuration file.
Fixes: #6674
Signed-off-by: Wang, Arron <arron.wang@intel.com>
Instead of setting:
```
firmware = "/path/to/OVMF.fd"
firmware_volume = "/path/to/OVMF_VARS.fd"
```
We should either be setting:
```
firmware = "/path/to/OVMF.fd"
```
Or:
```
firmware = "/path/to/OVMF_CODE.fd"
firmware_volume = "/path/to/OVMF_VARS.fd"
```
I'm taking the approach of setting up the latter, as that's what's been
tested as part of our TDX CI.
Fixes: #4926
This patch is the same as #4927, but it ended up being reverted somewhere in
the CCv0 -> main process, or in the attempts to fix TDX after that.
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
There is a race condition when virtiofsd is killed without all of its
clients having finished. Because of that, when a pod is stopped, QEMU
detects virtiofsd is gone, which is legitimate.
Sending a SIGTERM first, before killing, could introduce some latency
during the shutdown.
Fixes: #6757
Signed-off-by: Beraldo Leal <bleal@redhat.com>
The rebase from `main` to `CCv0` ended up overwriting the image path
that should be used for QEMU, in the CCv0 branch.
Fixes: #6932
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This commit should've been part of the series that reverted a bunch of
TDX changes that are not compatible with the TDX stack we're using in
the Jenkins CI machine.
The change made here is in order to match what's been undone here:
c29e5036a6
Fixes: #6884
Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>
This patch re-generates the client code for Cloud Hypervisor v32.0.
Note: The client code of cloud-hypervisor's OpenAPI is automatically
generated by openapi-generator.
Fixes: #6632
Signed-off-by: Bo Chen <chen.bo@intel.com>
If a hypervisor debug console is enabled and sandbox_cgroup_only is set,
the hypervisor can fail to open /dev/ptmx, which prevents the sandbox
from launching.
This is caused by the absence of a device cgroup entry to allow access
to /dev/ptmx. When sandbox_cgroup_only is not set, the hypervisor
inherits the default unrestricted device cgroup, but with it enabled it
runs into allow / deny list restrictions.
Fix by adding an allowlist entry for /dev/ptmx when debug is enabled,
sandbox_cgroup_only is true, and no /dev/ptmx is already in the list of
devices.
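For reference, the kind of rule this adds, expressed with the OCI
runtime-spec Go types (/dev/ptmx is char device 5:2); a sketch, not the
exact code:
```
// Sketch: a device-cgroup allowlist entry for /dev/ptmx (char 5:2).
package main

import specs "github.com/opencontainers/runtime-spec/specs-go"

func ptmxAllowRule() specs.LinuxDeviceCgroup {
	major, minor := int64(5), int64(2)
	return specs.LinuxDeviceCgroup{
		Allow:  true,
		Type:   "c",
		Major:  &major,
		Minor:  &minor,
		Access: "rwm",
	}
}
```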
Fixes: #6870
Signed-off-by: Krister Johansen <kjlx@templeofstupid.com>
This PR updates the container network model URL that is part of the
virtcontainers documentation.
Fixes: #6889
Signed-off-by: Gabriela Cervantes <gabriela.cervantes.tellez@intel.com>