kata-containers

mirror of https://github.com/aljazceru/kata-containers.git synced 2026-01-15 12:24:26 +01:00

Author	SHA1	Message	Date
Fabiano Fidêncio	96e9374d4b	dragonball: Don't fail if a request asks for more CPUs than allowed Let's take the same approach of the go runtime, instead, and allocate the maximum allowed number of vcpus instead. Fixes: #7270 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 15:50:23 +02:00
Fabiano Fidêncio	275c84e7b5	Revert "agent: fix the issue of exec hang with a backgroud process" This reverts commit `25d2fb0fde`. The reason we're reverting the commit is to check whether it's the cause of the regression on devmapper tests. Fixes: #7253 Depends-on: github.com/kata-containers/tests#5705 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-07-08 14:27:40 +02:00
Bin Liu	f214058b07	Merge pull request #7202 from wedsonaf/macros Convert `is_allowed`, `ttrpc_error` and `sl` to functions	2023-07-04 14:23:08 +08:00
Peng Tao	581be92b25	Merge pull request #4492 from zvonkok/pcie-topology runtime: fix PCIe topology for GPUDirect use-case	2023-07-03 09:17:12 +08:00
Fabiano Fidêncio	6a21e20c63	runtime: Add "none" as a shared_fs option Currently, even when using devmapper, if the VMM supports virtio-fs / virtio-9p, that's used to share a few files between the host and the guest. This is needed, as we need to share with the guest contents like secrets, certificates, and configurations, via Kubernetes objects like configMaps or secrets, and those are rotated and must be updated into the guest whenever the rotation happens. However, there are still use-cases where users can live with just copying those files into the guest at the pod creation time, and for those there's absolutely no need to have a shared filesystem process running with no extra obvious benefit, consuming memory and even increasing the attack surface used by Kata Containers. For the case mentioned above, we should allow users, making it very clear which limitations it'll bring, to run Kata Containers with devmapper without actually having to use a shared file system, which is already the approach taken when using Firecracker as the VMM. Fixes: #7207 Signed-off-by: Fabiano Fidêncio <fabiano.fidencio@intel.com>	2023-06-30 20:45:00 +02:00
Zvonko Kaiser	0f454d0c04	gpu: Fixing typos for PCIe topology changes Some comments and functions had typos and wrong capitalization. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-30 08:42:55 +00:00
Fupan Li	4288b935e1	Merge pull request #7104 from openanolis/physical/endpoint runtime-rs: support physical endpoint using device manager	2023-06-29 14:43:44 +08:00
GabyCT	19890133e9	Merge pull request #7189 from Apokleos/direct-vol-bugfix runtime-rs: bugfix for direct volume path's validation.	2023-06-28 12:26:22 -06:00
Wedson Almeida Filho	0504bd7254	agent: convert the `sl` macros to functions There is nothing in them that requires them to be macros. Converting them to functions allows for better error messages. Fixes: #7201 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:32 -03:00
Wedson Almeida Filho	0860fbd410	agent: convert the `ttrpc_error` macro to a function There is nothing in it that requires it to be a macro. Converting it to a function allows for better error messages. Fixes: #7201 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:32 -03:00
Wedson Almeida Filho	0e5d6ce6d7	agent: convert the `is_allowed` macro to a function Having a function allows for better error messages from the type checker and it makes it clearer to callers what can happen. For example: is_allowed!(req); Gives no indication that it may result in an early return, and no simple way for callers to modify the behaviour. It also makes it look like ownership of `req` is being transferred. On the other hand, is_allowed(&req)?; Indicates that `req` is being borrowed (immutably) and may fail. The question mark indicates that the caller wants an early return on failure. Fixes: #7201 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:32 -03:00
Wedson Almeida Filho	f680fc52be	agent: change `AGENT_CONFIG`'s lazy type to just `AgentConfig` Since it is never modified, it doesn't really need a lock of any kind. Removing the `RwLock` wrapper allows us to remove all `.read().await` calls when accessing it. Additionally, `AGENT_CONFIG` already has a static lifetime, so there is no need to wrap it in a ref-counted heap allocation. Fixes: #5409 Signed-off-by: Wedson Almeida Filho <walmeida@microsoft.com>	2023-06-28 14:05:27 -03:00
Jianyong Wu	1f3e837e4b	runtime-rs: fix build error on AArch64 Vfio support introduced a build error on AArch64. Removing the arch-related annotation avoids this error. Fixes: #7187 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2023-06-28 07:10:43 +00:00
alex.lyn	6fd25968c6	runtime-rs: bugfix for direct volume path's validation. The failure is mainly caused by the encoded volume path and the mount/src. The src is validated with stat, but it is encoded and not a full path, which causes the stat of the mount source to fail. Fixes: #7186 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-28 10:07:07 +08:00
Zhongtao Hu	bff4672f7d	runtime-rs: support physical endpoint using device manager use device manager to attach physical endpoint Fixes: #7103 Signed-off-by: Zhongtao Hu <zhongtaohu.tim@linux.alibaba.com>	2023-06-27 10:25:51 +08:00
alex.lyn	0df2fc2702	runtime-rs: add support spdk/vhost-user based volume. Unlike the previous usage which requires creating /dev/xxx by mknod on the host, the new approach will fully utilize the DirectVolume-related usage method, and pass the spdk controller to vmm. A user guide on using the spdk volume when running kata-containers can be found in docs/how-to. Fixes: #6526 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-25 16:23:19 +08:00
GabyCT	388b55175e	Merge pull request #7056 from FuuuOverclocking/fuu/fix-console_manager dragonball: avoid obtaining lock twice in create_stdio_console	2023-06-23 16:47:00 -06:00
Zvonko Kaiser	8330fb8ee7	gpu: Update unit tests Some tests are now failing due to the changes how PCIe is handled. Update the test accordingly. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-23 11:16:25 +00:00
Fupan Li	469c678425	Merge pull request #7058 from Apokleos/vfio-dev add support vfio device manager	2023-06-22 17:51:22 -06:00
Archana Shinde	2d329125fd	Merge pull request #6800 from amshinde/check-vm-capability kata-ctl: Check for vm capability	2023-06-21 23:52:46 -07:00
Archana Shinde	610f7986e4	check: Relax the unrestricted_guest check when running in a VM When running on a VM, the kernel parameter "unrestricted_guest" for kernel module "kvm_intel" is not required. So, return success when running on a VM without checking value of this kernel parameter. Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-06-21 07:30:35 -07:00
Archana Shinde	1b406b9d0c	kata-ctl: Implement functionality to check host is capable of running VM Implement functionality to add to the env output if the host is capable of running a VM. Fixes: #6727 Signed-off-by: Archana Shinde <archana.m.shinde@intel.com>	2023-06-21 07:30:22 -07:00
soup	09720babc3	docs: fix spelling of "crate" Fixes: #7153 Signed-off-by: soup <lqh348659137@outlook.com>	2023-06-21 16:10:54 +08:00
alex.lyn	59510cfee0	runtime-rs: add support vfio device based volume A new choice of using vfio device based volume for kata-containers. With the help of kata-ctl direct-volume, users are able to add a specified device which is BDF or IOMMU group ID. To help users use it smoothly, a how-to doc was added in docs/how-to/how-to-run-kata-containers-with-kinds-of-Block-Volumes. Fixes: #6525 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-18 14:07:05 +08:00
alex.lyn	1e3b372bbb	runtime-rs: add support vfio device manager Limitations: As no rust vmm vfio manager is ready yet, runtime-rs only supports part of vfio. The remaining part is to call the vmm interfaces related to vfio add/remove. So when the vmm/vfio manager is ready, a new PR will be pushed to narrow the gap. Fixes: #6525 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-18 14:05:59 +08:00
Greg Kurz	a43ea24dfc	virtiofsd: Convert legacy `-o` sub-options to their `--` replacement The `-o` option is the legacy way to configure virtiofsd, inherited from the C implementation. The rust implementation honours it for compatibility but it logs deprecation warnings. Let's use the replacement options in the go shim code. Also drop references to `-o` from the configuration TOML file. Fixes #7111 Signed-off-by: Greg Kurz <groug@kaod.org>	2023-06-16 11:42:54 +02:00
Greg Kurz	8e00dc6944	virtiofsd: Drop `-o no_posix_lock` The C implementation of virtiofsd had some kind of limited support for remote POSIX locks that was causing some workflows to fail with kata. Commit `432f9bea6e` hard coded `-o no_posix_lock` in order to enforce guest local POSIX locks and avoid the issues. We've switched to the rust implementation of virtiofsd since then, but it emits a warning about `-o` being deprecated. According to https://gitlab.com/virtio-fs/virtiofsd/-/issues/53 : The C implementation of the daemon has limited support for remote POSIX locks, restricted exclusively to non-blocking operations. We tried to implement the same level of functionality in #2, but we finally decided against it because, in practice most applications will fail if non-blocking operations aren't supported. Implementing support for non-blocking isn't trivial and will probably require extending the kernel interface before we can even start working on the daemon side. There is thus no justification to pass `-o no_posix_lock` anymore. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-06-16 11:42:39 +02:00
Greg Kurz	2a15ad9788	virtiofsd: Stop using deprecated `-f` option The rust implementation of virtiofsd always runs foreground and spits a deprecation warning when `-f` is passed. Signed-off-by: Greg Kurz <groug@kaod.org>	2023-06-16 10:30:40 +02:00
Zvonko Kaiser	72f2cb84e6	gpu: Reset cold or hot plug after overriding If we override the cold, hot plug with an annotation we need to reset the other plugging mechanism to NoPort otherwise both will be enabled. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-15 17:51:01 +00:00
Zvonko Kaiser	fbacc09646	gpu: PCIe topology, consider vhost-user-block in Virt In Virt the vhost-user-block is a PCIe device so we need to make sure to consider it as well. We're keeping track of vhost-user-block devices and deduce the correct amount of PCIe root ports. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-15 17:39:55 +00:00
Zvonko Kaiser	b11246c3aa	gpu: Various fixes for virt machine type The PCI qom path was not deduced correctly; added regex for correct path walking. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:33:57 +00:00
Zvonko Kaiser	40101ea7db	vfio: Added annotation for hot(cold) plug Now it is possible to configure the PCIe topology via annotations. Also added a simple test, checking for Invalid and RootPort Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	8f0d4e2612	vfio: Cleanup of Cold and Hot Plug Removed the configuration of PCIeRootPort and PCIeSwitchPort, those values can be deduced in createPCIeTopology Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	b5c4677e0e	vfio: Rearrange the bus assignment Refactor the bus assignment so that the call to GetAllVFIODevicesFromIOMMUGroup can be used by any module without affecting the topology. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	b1aa8c8a24	gpu: Moved the PCIe configs to drivers The hypervisor_state file was the wrong location for the PCIe Port settings, moved everything under device umbrella, where it can be consumed more easily and we do not get into circular deps. Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	55a66eb7fb	gpu: Add config to TOML Update cold-plug and hot-plug setting to include bridge, root and switch-port Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	da42801c38	gpu: Add config settings tests for hot-plug Updated all references and config settings for hot-plug to match cold-plug Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
Zvonko Kaiser	de39fb7d38	runtime: Add support for GPUDirect and GPUDirect RDMA PCIe topology Fixes: #4491 Signed-off-by: Zvonko Kaiser <zkaiser@nvidia.com>	2023-06-14 08:20:24 +00:00
alex.lyn	347385b4ee	runtime-rs: Enhance flexibility of virtio-fs config support more and flexible options for inline virtiofs. Fixes: #7091 Signed-off-by: alex.lyn <alex.lyn@antgroup.com>	2023-06-13 15:12:47 +08:00
Zhongtao Hu	355a24e0e1	Merge pull request #6289 from openanolis/runtime_vcpu_resize feat(runtime): vcpu resize capability	2023-06-13 10:54:11 +08:00
Yushuo	ae2cfa8263	doc: add vcpu handling doc for runtime-rs Kubernetes and Containerd will help calculate the Sandbox Size and pass it to Kata Containers through annotations. In order to accommodate this favorable change and be compatible with the past, we have implemented the handling of the number of vCPUs in runtime-rs. This is slightly different from the original runtime-go design. This doc introduces how we handle vCPU size in runtime-rs. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 19:23:11 +08:00
Yushuo	7b1e67819c	fix(clippy): fix clippy error Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	67972ec48a	feat(runtime-rs): calculate initial size In this commit, we refactored the logic of static resource management. We defined the sandbox size calculated from PodSandbox's annotation and SingleContainer's spec as initial size, which will always be the sandbox size when booting the VM. The configuration static_sandbox_resource_mgmt controls whether we will modify the sandbox size in the following container operation. Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	aaa96c749b	feat(runtime-rs): modify onlineCpuMemRequest Some vmms, such as dragonball, will actively help us perform online cpu operations when doing cpu hotplug. Under the old onlineCpuMem interface, it is difficult to adapt to this situation. So we modify the semantics of nb_cpus in onlineCpuMemRequest. In the original semantics, nb_cpus represents the number of newly added CPUs that need to be online. The modified semantics become that the number of online CPUs in the guest needs to be guaranteed. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	d66f7572dd	feat(runtime-rs): clear cpuset in runtime side The declaration of the cpu number in the cpuset is greater than the actual number of vcpus, which will cause an error when updating the cgroup in the guest. This problem is difficult to solve, so we temporarily clean up the cpuset in the container spec before passing in the agent. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	a0385e1383	feat(runtime-rs): update linux resource when stop_process Update the resource when deleting a container, which happens in stop_process in runtime-rs. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Yushuo	a39e1e6cd1	feat(runtime-rs): merge the update_cgroups in update_linux_resources Updating vCPU resources and memory resources of the sandbox and updating cgroups on the host will always happen together, and they are all updated based on the linux resources declarations of all the containers. So we merge update_cgroups into the update_linux_resources, so we can better manage the resources allocated to one pod in the host. Fixes: #5030 Signed-off-by: Yushuo <y-shuo@linux.alibaba.com> Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com>	2023-06-12 17:53:16 +08:00
Ji-Xinyou	fa6dff9f70	feat(runtime-rs): support vcpu resizing on runtime side Support vcpu resizing on runtime side: 1. Calculate vcpu numbers in resource_manager using all the containers' linux_resources in the spec. 2. Call the hypervisor(vmm) to do the vcpu resize. 3. Call the agent to online vcpus. Fixes: #5030 Signed-off-by: Ji-Xinyou <jerryji0414@outlook.com> Signed-off-by: Yushuo <y-shuo@linux.alibaba.com>	2023-06-12 17:53:16 +08:00
James O. D. Hunt	8cb4238b46	packaging: Remove snap package Nobody has volunteered to maintain the (currently broken) snap build, so remove it. Fixes: #6769. Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2023-06-12 09:24:09 +01:00
Chao Wu	2988553305	Merge pull request #6998 from HerlinCoder/herlincoder/vpa Dragonball: support resize memory	2023-06-11 17:21:12 +08:00
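
Commit `0e5d6ce6d7` above argues that `is_allowed(&req)?` is clearer at the call site than `is_allowed!(req)`. A minimal Rust sketch of that function form follows. The `Request` type, the `handle` caller, the allow rule, and the `String` error type are hypothetical and chosen only for illustration; they are not the agent's actual types or policy.

```rust
/// Hypothetical stand-in for a request type; not the agent's real type.
#[derive(Debug)]
pub struct Request {
    pub endpoint: String,
}

/// Function form: borrows the request immutably and reports failure
/// through `Result` instead of hiding an early return inside a macro.
pub fn is_allowed(req: &Request) -> Result<(), String> {
    // Assumption for illustration: only this one endpoint is permitted.
    if req.endpoint == "CreateContainerRequest" {
        Ok(())
    } else {
        Err(format!("{} is blocked", req.endpoint))
    }
}

/// Hypothetical caller showing the call-site shape from the commit message.
pub fn handle(req: Request) -> Result<String, String> {
    // The `?` makes the possible early return visible to the reader.
    is_allowed(&req)?;
    // `req` is still usable here because it was only borrowed above.
    Ok(format!("handled {}", req.endpoint))
}
```

With this shape, a caller that wants different behaviour on failure can match on the returned `Result` rather than using `?`, which the macro form did not allow.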

3333 Commits