kata-containers

mirror of https://github.com/aljazceru/kata-containers.git synced 2025-12-22 08:44:25 +01:00

Author	SHA1	Message	Date
Christophe de Dinechin	4e89b885d2	config: Protect file_mem_backend against annotation attacks This one could theoretically be used to overwrite data on the host. It seems somewhat less risky than the earlier ones for a number of reasons, but worth protecting a little anyway. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	aae9656d8b	config: Protect vhost_user_store_path against annotation attacks This path could be used to overwrite data on the host. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	5588165399	config: Add security warning on configuration examples Add the following text explaining the risk of using regular expressions in path lists: Each member of the list can be a regular expression, but prefer names. Otherwise, please read and understand the following carefully. SECURITY WARNING: If you use regular expressions, be mindful that an attacker could craft an annotation that uses .. to escape the paths you gave. For example, if your regexp is /bin/qemu.* then if there is a directory named /bin/qemu.d/, then an attacker can pass an annotation containing /bin/qemu.d/../put-any-binary-name-here and attack your host. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	b21a829c61	config: Protect ctlpath from annotation attack This also adds annotations for ctlpath, which were not present before. It's better to implement the code consistently right now to make sure that we don't end up with a leaky implementation tacked on later. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	27b6620b23	config: Protect jailer_path annotation The jailer_path annotation can be used to execute arbitrary code on the host. Add a jailer_path_list configuration entry providing a list of regular expressions that can be used to filter annotations that represent valid file names. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	076690179d	config: Add examples for path_list configuration The path_list configuration gives a series of regular expressions that limit which values are acceptable through annotations in order to avoid kata launching arbitrary binaries on the host when receiving an annotation. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	2d431c61c6	annotations: Simplify negative logic Replace strange negative logic (!ok -> continue) with positive logic (ok -> do it) Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	2ca9ca892d	config: Add hypervisor path override through annotations The annotation is provided, so it should be respected. Furthermore, it is important to implement it with the appropriate protections, similar to what was done for virtiofsd. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	2e093dfd8b	config: Fix typo in function name There was an extra 'p' in addHypervisorVirtioFsOverrides. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	bf13ff0a3a	config: Protect virtio_fs_daemon annotation Sending the virtio_fs_daemon annotation can be used to execute arbitrary code on the host. In order to prevent this, restrict the values of the annotation to a list provided by the configuration file. Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Christophe de Dinechin	8c75de1966	config: Add 'List' alternates for hypervisor configuration paths Paths mentioned in the hypervisor configuration can be overridden using annotations, which is potentially dangerous. For each path, add a 'List' variant that specifies the list of acceptable values from annotations. Bug: https://bugs.launchpad.net/katacontainers.io/+bug/1878234 Fixes: #901 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-14 16:10:12 +02:00
Peng Tao	fc6468efdb	agent: fix panic on malformed device resource in container update Somehow containerd is sending a malformed device in update API. While it should not happen, we should not panic either. Fixes: #946 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2020-10-14 13:27:23 +08:00
Eric Ernst	d8a8fe47fb	cpuset: don't set cpuset.mems in the guest Kata doesn't map any numa topologies in the guest. Let's make sure we clear the Cpuset fields before passing container updates to the guest. Note, in the future we may want to have a vCPU to guest CPU mapping and still include the cpuset.Cpus. Until we have this support, clear this as well. Fixes: #932 Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-13 15:54:03 -07:00
Eric Ernst	88cd712876	sandbox: consider cpusets if quota is not enforced CPUSet cgroup allows for pinning the memory associated with a cpuset to a given numa node. Similar to cpuset.cpus, we should take cpuset.mems into account for the sandbox-cgroup that Kata creates. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-13 15:54:03 -07:00
Eric Ernst	77a463e57a	cpuset: support setting mems for sandbox CPUSet cgroup allows for pinning the memory associated with a cpuset to a given numa node. Similar to cpuset.cpus, we should take cpuset.mems into account for the sandbox-cgroup that Kata creates. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-13 15:54:03 -07:00
Eric Ernst	2d690536b8	cpuset: add cpuset pkg Pulled from 1.18.4 Kubernetes, adding the cpuset pkg for managing CPUSet calculations on the host. Go mod'ing the original code from k8s.io/kubernetes was very painful, and this is very static, so let's just pull in what we need. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-13 15:54:03 -07:00
Fabiano Fidêncio	1a9515a998	runtime: Pass `--thread-pool-size=1` to virtiofsd Dave Gilbert brought up that passing --thread-pool-size=1 to virtiofsd may result in a performance improvement, especially when using `cache=none`. While our current default is `cache=auto`, Dave mentioned that he sees no harm in having it set, and he also mentioned that it may use a lot less stack space on aarch/arm. Fixes: #943 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2020-10-13 22:33:08 +02:00
Fupan Li	25cdf2d728	Merge pull request #931 from dgibson/bug703 Forward port device conflict fixes from Kata 1 / Go agent	2020-10-13 15:59:17 +08:00
David Gibson	ae6b8ec747	agent/device: Check type as well as major:minor when looking up devices To update device resource entries from host to guest, we search for the right entry by host major:minor numbers, then later update it. However block and character devices exist in separate major:minor namespaces so we could have one block and one character device with matching major:minor and thus incorrectly update both with the details for whichever device is processed second. Add a check on device type to prevent this. Port from the Kata 1 Go agent https://github.com/kata-containers/agent/commit/27ebdc9d2761 Fixes: #703 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2020-10-13 16:26:52 +11:00
David Gibson	859301b009	agent/device: Index all devices in spec before updating them The agent needs to update device entries in the OCI spec so that it has the correct major:minor numbers for the guest, which may differ from the host. Entries in the main device list are looked up by device path, but entries in the device resources list are looked up by (host) major:minor. This is done one device at a time, updating as we go in update_spec_device_list(). But since the host and guest have different namespaces, one device might have the same major:minor as a different device on the host. In that case we could update one resource entry to the correct guest values, then mistakenly update it again because it now matches a different host device. To avoid this, rather than looking up and updating one by one, we make all the lookups in advance, creating a map from (host) device path to the indices in the spec where the device and resource entries can be found. Port from the Go agent in Kata 1, https://github.com/kata-containers/agent/commit/d88d46849130 Fixes: #703 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2020-10-13 16:26:26 +11:00
David Gibson	2477c355bc	agent/device: Forward port update_spec_device_list() unit test The Kata 1 Go agent included a unit test for updateSpecDeviceList, but no such unit test exists for the Rust agent's equivalent update_spec_device_list(). Port the Kata1 test to Rust. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2020-10-13 16:25:58 +11:00
David Gibson	08d80c1aaa	agent/device: update_spec_device_list() should error if dev not found If update_spec_device_list() is given a device that can't be found in the OCI spec, it currently does nothing, and returns Ok(()). That doesn't seem like what we'd expect and is not what the Go agent in Kata 1 does. Change it to return an error in that case, like Kata 1. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2020-10-13 16:25:36 +11:00
Eric Ernst	12cc0ee168	sandbox: don't constrain cpus, mem only cpuset, devices Allow for constraining the cpuset as well as the devices-whitelist. Revert sandbox constraints for cpu/memory, as they break the K8S use case. Can re-add behind a non-default flag in the future. The sandbox CPUSet should be updated every time a container is created, updated, or removed. To facilitate this without rewriting the 'non constrained cgroup' handling, let's add to the Sandbox's cgroupsUpdate function. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-12 21:31:27 -07:00
Eric Ernst	b6cf68a985	cgroups: add ability to update CPUSet Add function for applying a cpuset change to a cgroup Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-12 21:31:27 -07:00
Eric Ernst	b812d4f7fa	virtcontainers: add method for calculating cpuset for sandbox Calculate sandbox's CPUSet as the union of each of the container's CPUSets. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-12 21:31:27 -07:00
Peng Tao	16a6427ca9	Merge pull request #923 from liubin/fix/simplify-codes agent: simplify codes	2020-10-13 09:54:46 +08:00
Eric Ernst	2e72972cd7	Merge pull request #910 from egernst/fix-parsing agent: fix errorneous parsing for guest block size	2020-10-12 12:40:02 -07:00
Eric Ernst	f63f740545	agent: fix errorneous parsing for guest block size We were assuming base 10 string before, when the block size from sysfs is actually a hex string. Let's fix that. Fixes: #908 Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2020-10-12 11:18:39 -07:00
Fupan Li	27634982f7	Merge pull request #915 from liubin/fix/914-use-macro-to-simplify-codes agent: use macro to simplify parse_cmdline function in config.rs	2020-10-12 22:23:30 +08:00
bin liu	11c1ab8bca	agent: use ok_or/map_err instead of match Sometimes `Option.ok_or` and `Result.map_err` may be simpler than a match statement, especially in rpc.rs, where many calls to `ctr.get_process` and `sandbox.get_container` use `match`. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-12 16:59:02 +08:00
bin liu	6b9f99156e	rustjail: use Iterator to manipulate vector elements Using Iterator saves code and makes it more readable. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-12 14:26:33 +08:00
bin liu	dc1442c33a	rustjail: delete codes commented out Some use statements, code, and struct fields are commented out and are unlikely to be uncommented, so delete them. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-12 12:29:23 +08:00
bin liu	aa04111d9f	rustjail: delete unused test code The auto-generated test code is meaningless; delete it. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-12 10:23:22 +08:00
bin liu	eae685dc53	agent: use chain of Result to avoid early return Using Rust `Result`'s `or_else`/`and_then` yields cleaner code and avoids early returns that check whether the `Result` is `Ok` or `Err`. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 22:22:54 +08:00
bin liu	5e3d1fb60b	agent: add blank lines between methods In rpc.rs, there are no blank lines between methods; this commit adds them. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 12:37:34 +00:00
bin liu	980e48ca94	agent: delete unused field in agentService The code was for testing and is no longer needed. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 12:23:44 +00:00
bin liu	52b821fa5f	agent: use no-named closure to reduce codes For simple closures, inlining them saves code. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 20:10:16 +08:00
bin liu	b1f95e8d27	agent: use a local fn to reduce duplicated codes The same code is used twice; moving it into a function reduces duplication. Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 19:55:05 +08:00
Jianyong Wu	c781a80820	agent: fix aarch64 build aarch64 needs libgcc to resolve some non-builtin symbols. Fixes: #909 Signed-off-by: Jianyong Wu <jianyong.wu@arm.com> Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2020-10-10 18:29:23 +08:00
bin liu	906b38441c	agent: update not accurate comments This commit includes: - updating comments that did not match the function name - fixing file paths with a doubled slash Fixes: #922 Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 17:57:13 +08:00
bin liu	b7309943af	agent: use macro to simplify parse_cmdline function in config.rs The parse_cmdline function contains several similar blocks of code; if we add more command-line arguments, it will grow too long. Using a macro reduces the code that shares the same logic. Fixes: #914 Signed-off-by: bin liu <bin@hyper.sh>	2020-10-10 15:20:47 +08:00
Julio Montes	4f0fe8473b	Merge pull request #886 from bergwolf/CVE-2019-19921 agent: do not follow link when mounting container proc and sysfs	2020-10-09 09:47:30 -05:00
Fupan Li	3a659a6733	Merge pull request #891 from bergwolf/CVE-2016-9962 agent: set init process non-dumpable	2020-10-09 19:03:24 +08:00
Peng Tao	b7147edadb	agent: do not follow link when mounting container proc and sysfs Attackers might use it to explore other containers in the same pod. While it is still safe to allow it, we can just close the race window like runc does. Fixes: #885 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2020-10-09 18:54:26 +08:00
Bin Liu	43da14e7b3	Merge pull request #752 from YchauWang/clear-moke-code01 runtime: Clear the VCMock 1.x API Methods from 2.0	2020-10-09 17:41:21 +08:00
Peng Tao	15b7156348	agent: set init process non-dumpable On old kernels (like v4.9), the kernel applies O_CLOEXEC in the wrong order w.r.t. dumpable task flags. As a result, we might leak guest file descriptors to containers. This is a former runc CVE-2016-9962 and still applies to the kata agent. Although Kata Containers still protects the host, we should not leak extra resources to user containers. This sets the init processes that join and set up the container's namespaces as non-dumpable before they setns to the container's pid (or any other) namespace. This setting is automatically reset to the default after the Exec in the container so that it does not change functionality for the applications that are running inside, just our init processes. This prevents parent processes, the pid 1 of the container, from ptracing the init process before it drops caps and sets LSMs. The order during the exec syscall is that the process is set back to dumpable before O_CLOEXEC are processed. Refs: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=613cc2b6f272c1a8ad33aefa21cad77af23139f7 https://github.com/torvalds/linux/blob/v4.9/fs/exec.c#L1290-L1318 opencontainers/runc@50a19c6 https://nvd.nist.gov/vuln/detail/CVE-2016-9962 Fixes: #890 Signed-off-by: Peng Tao <bergwolf@hyper.sh>	2020-10-09 17:12:06 +08:00
Peng Tao	3f8e619c2f	Merge pull request #876 from jcvenegas/dax-off virtiofs: Disable DAX	2020-10-09 13:39:42 +08:00
Jose Carlos Venegas Munoz	c4472481bc	virtiofs: Disable DAX virtiofs DAX support is not stable today; a few corner cases remain before it can be made the default. Fixes: #862 Fixes: #875 Signed-off-by: Jose Carlos Venegas Munoz <jose.carlos.venegas.munoz@intel.com>	2020-10-08 10:59:10 -05:00
Christophe de Dinechin	0e898c6bc4	rust-agent: Treat warnings as error Avoid the accumulation of warnings we had, as reported in #750. Fixes: #750 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-07 17:30:21 +02:00
Christophe de Dinechin	0e4baaabcc	rust-agent: Identify unused results in tests Assign unused results to _ in order to silence warnings. This addresses the following warnings: warning: unused `std::result::Result` that must be used --> rustjail/src/mount.rs:1182:16 \| 1182 \| defer!(unistd::chdir(&olddir);); \| ^^^^^^^^^^^^^^^^^^^^^^^ \| = note: `#[warn(unused_must_use)]` on by default = note: this `Result` may be an `Err` variant, which should be handled warning: unused `std::result::Result` that must be used --> rustjail/src/mount.rs:1183:9 \| 1183 \| unistd::chdir(tempdir.path()); \| ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ \| = note: this `Result` may be an `Err` variant, which should be handled While in regular code, we want to log possible errors, in test code it's OK to simply ignore the returned value. Fixes: #750 Signed-off-by: Christophe de Dinechin <dinechin@redhat.com>	2020-10-07 17:30:13 +02:00
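
The security warning quoted in commit 5588165399 (a regexp such as /bin/qemu.* being escaped with ..) can be illustrated with a small Go sketch. This is not the code from the kata-containers runtime: the helper names matchesAny, naiveAllowed and cleanedAllowed are hypothetical, and the use of path.Clean as a mitigation is an assumption made for illustration only. path.Clean is purely lexical and does not resolve symlinks, so a real check would likely need more than this.

```go
package main

import (
	"fmt"
	"path"
	"regexp"
)

// matchesAny reports whether value fully matches at least one of the
// regular expressions in patterns. Invalid patterns are skipped.
// (Hypothetical helper, not part of kata-containers.)
func matchesAny(patterns []string, value string) bool {
	for _, p := range patterns {
		re, err := regexp.Compile("^(?:" + p + ")$")
		if err != nil {
			continue
		}
		if re.MatchString(value) {
			return true
		}
	}
	return false
}

// naiveAllowed matches the raw annotation value against the list.
// A value containing ".." can still satisfy a loose regexp.
func naiveAllowed(patterns []string, annotation string) bool {
	return matchesAny(patterns, annotation)
}

// cleanedAllowed lexically resolves "." and ".." before matching,
// so the regexp sees the path that would actually be used.
func cleanedAllowed(patterns []string, annotation string) bool {
	return matchesAny(patterns, path.Clean(annotation))
}

func main() {
	patterns := []string{`/bin/qemu.*`}
	attack := "/bin/qemu.d/../put-any-binary-name-here"

	fmt.Println(naiveAllowed(patterns, attack))                       // true: the escape is accepted
	fmt.Println(cleanedAllowed(patterns, attack))                     // false: cleaned path is /bin/put-any-binary-name-here
	fmt.Println(cleanedAllowed(patterns, "/bin/qemu-system-x86_64")) // true: legitimate binary still passes
}
```

Listing exact names instead of regular expressions, as the warning recommends, avoids the problem without relying on any normalization step.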


2192 Commits