kata-containers

mirror of https://github.com/aljazceru/kata-containers.git synced 2025-12-19 23:34:23 +01:00

Author	SHA1	Message	Date
Fupan Li	17d33868c2	Merge pull request #1670 from liubin/1668-remove-ProcessListContainer-API remove ProcessListContainer API	2021-04-12 10:22:37 +08:00
Chelsea Mafrica	543f9da3ba	runtime: Disable trace for healthcheck With tracing enabled, grpc health check generates a large number of spans which creates too much data for tasks running longer than a few minutes. To solve this, remove span creation from kata agent check() and sendReq() where the majority of the spans come from. Leave contexts in functions for subsequent calls that create spans. Fixes #1395 Signed-off-by: Chelsea Mafrica <chelsea.e.mafrica@intel.com>	2021-04-09 15:47:00 -07:00
bin	421439c633	API: remove ProcessListContainer/ListProcesses This commit will remove ProcessListContainer API from VCSandbox and ListProcesses from agent.proto. Fixes: #1668 Signed-off-by: bin <bin@hyper.sh>	2021-04-09 17:34:25 +08:00
David Gibson	0e04d6299b	Merge pull request #1642 from dgibson/ueventplus Refine uevent matching conditions	2021-04-09 13:10:52 +10:00
Eric Ernst	2334b858a0	Merge pull request #1661 from liubin/1660-replace-newStore-by-store virtcontainers: replace newStore by store in Sandbox struct	2021-04-08 13:17:44 -07:00
bin	d75fe95685	virtcontainers: replace newStore by store in Sandbox struct The property name confuses newcomers when reading the code. Since in Kata Containers 2.0 there will only be one type of store, it's safe to simply replace it with `store`. Fixes: #1660 Signed-off-by: bin <bin@hyper.sh>	2021-04-08 23:59:16 +08:00
Eric Ernst	324b026a77	Merge pull request #1604 from wainersm/agent_mount-1 agent: log the mount point if it is already mounted	2021-04-08 08:26:12 -07:00
Tim Zhang	24b0703fda	agent: fix test for the debug console Fix test for the debug console. Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-08 14:57:40 +08:00
Tim Zhang	790332575b	agent: async the debug console Make the debug console in this commit. Finish the rework of debug console. Fixes: #1647 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-08 14:57:36 +08:00
David Gibson	8ea2ce9a31	agent/device: Remove legacy uevent matching DevAddrMatcher existed purely as a transitional step as we refined the uevent matching logic for each of the different device types we care about. We've now done that, so it can be removed along with several related pieces. fixes #1628 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 12:30:18 +10:00
David Gibson	5d007743c1	agent/device: Refine uevent matching for pmem devices Use the new uevent matching infrastructure to refine the matching for pmem devices to something more pinned down to that device type. While we're there, fix a few ancillary problems with get_pmem_device_name(): - The name is poor - the input to this function is the expected device name, so the result isn't helpful, except that it needs to wait for the device to be ready in the guest. Change it to wait_for_pmem_device() and explicitly check that the returned device name matches the one expected. - Remove an incorrect comment in nvdimm_storage_handler() (the only caller) which appears to have been copied from the virtio-blk path, but then become stale. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 12:02:39 +10:00
James O. D. Hunt	9017e1100b	agent: start to rework the debug console It's the first commit of the rework. Fixes: #1647 Signed-off-by: James O. D. Hunt <james.o.hunt@intel.com>	2021-04-08 09:57:48 +08:00
David Gibson	a59e07c1f9	agent/define: Refine uevent matching for virtio-scsi devices Current get_scsi_device_name() uses the legacy uevent matching which isn't very precise. This refines it to use a specific matcher implementation. While we're at it: - No longer insist on the SCSI controller being under the PCI root. It generally will be, but there's no particular reason to require it. The matcher still has a problem in that it won't work sensibly if there are multiple SCSI busses in the guest. Fixing that requires changes on the runtime side as well, though, so it's beyond scope for this change. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 11:13:00 +10:00
David Gibson	484a364729	agent/device: Rework uevent handling for virtio-blk devices There are some problems with get_pci_device_name(): 1) It's misnamed: in fact it is only used for handling virtio-blk PCI devices. It's also only correct for virtio-blk devices, the event matching doesn't locate the "raw" PCI device, but rather the block device created by virtio-blk as a child of the PCI device itself. 2) The uevent matching is imprecise. As all things using the legacy DevAddrMatcher, it matches on a bunch of conditions used across several different device types, not all of which make sense for virtio-blk pci devices specifically. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-08 11:13:00 +10:00
Eric Ernst	15c2d7ed30	Merge pull request #1400 from ManaSugi/update-oci-seccomp oci: Update seccomp configuration	2021-04-07 15:18:19 -07:00
GabyCT	d922070c50	Merge pull request #1644 from lifupan/fix_env rustjail: fix the issue of missing default home env	2021-04-07 10:16:07 -05:00
GabyCT	81bcded9a3	Merge pull request #1492 from dgibson/uevent Make uevent watching mechanism more flexible	2021-04-07 10:15:33 -05:00
fupan.lfp	a938d90310	rustjail: fix the issue of missing default home env first get the "HOME" env from "/etc/passwd", if there's no corresponding uid entry in /etc/passwd, then set "/" as the home env. Fixes: #1643 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-07 15:11:28 +08:00
GabyCT	0b87fd436f	Merge pull request #1544 from snir911/timeout runtime: increase dial timeout	2021-04-06 16:10:51 -05:00
Wainer dos Santos Moschetta	49eec92038	agent: log the tag and mount point if it is already mounted Commit `17e9a2cff5` introduced a guard for the case where the mount point is already mounted. Instead of logging only the mount tag ("kataShared"), this change prints both the tag and the mount point path. Fixes: #1398 Signed-off-by: Wainer dos Santos Moschetta <wainersm@redhat.com>	2021-04-06 14:14:59 -04:00
GabyCT	aac852a0bc	Merge pull request #1561 from Jakob-Naucke/s390x-statfs-constants agent: s390x statfs constants	2021-04-06 11:11:40 -05:00
David Gibson	0828f9ba70	agent/uevent: Introduce wait_for_uevent() helper get_device_name() contains logic to wait for a specific uevent, then extract the /dev node name from it. In future we're going to want similar logic to wait on uevents, but using different match criteria, or getting different information out. To simplify this, add a wait_for_uevent() helper in the uevent module, which takes an explicit UeventMatcher object and returns the whole uevent found. To make testing easier, we also extract the cut down uevent watcher from test_get_device_name() into a new spawn_test_watcher() helper. It's used for both test_get_device_name() and a new test_wait_for_uevent() and will be useful for more tests in future. fixes #1484 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 21:14:52 +10:00
David Gibson	16ed55e440	agent/device: Use consistent matching for past and future uevents get_device_name() looks at kernel uevents to work out the device name for a given PCI (usually) address. However, when we call it we can't know if the uevent we're interested in has already happened (in which case it will have been recorded in Sandbox::uevent_map) or yet to come, in which case we need to register to watch it. However, we currently match differently against past and future events. For past events we simply look for a sysfs path including the address, but for future events we use a complex bit of logic in the is_match() closure. Change it to use the exact same matching logic in both cases. fixes #1397 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 21:14:33 +10:00
David Gibson	4b16681d87	agent/uevent: Put matcher object rather than "device address" in watch list Currently, Sandbox::uevent_watchers lists uevents to watch for by a "device address" string. This is not very clearly defined, and is matched against events with a rather complex closure created in Uevent::process_add(). That closure makes a bunch of fragile assumptions about what sort of events we could ever be interested in. In some ways it is too restrictive (requires everything to be a block device), but in others is not restrictive enough (allows things matching NVDIMM paths, even if we're looking for a PCI block device). To allow the clients more precise control over uevent matching, we define a new UeventMatcher trait with a method to match uevents. We then have the watchers list include UeventMatcher trait objects which are used directly by Uevent::process_add(), instead of constructing our match directly from dev_addr. For now we don't actually change the matching function, or even use multiple different trait implementations, but we'll refine that in future. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 21:14:18 +10:00
David Gibson	b8b322482c	agent/uevent: Consolidate event matching logic The event matching logic in Uevent::process_add() is split into two parts. The first checks if we care about the event at all, the second checks whether the event is relevant to a particular watcher. However, we're going to be adding more types of watchers in future, which will make the global filter too restrictive. Fold the two bits of logic together into a per-watcher filter function. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:59:43 +10:00
David Gibson	d2caff6c55	agent: Re-organize uevent processing Uevent::process() is a bit oddly organized. It treats the onlining of hotplugged memory as the "default" case, although that's quite specific, while treating the handling of hotplugged block devices more like a special case, although that's pretty close to being very general. Furthermore splitting Uevent::is_block_add_event() from Uevent::handle_block_add_event() doesn't make a lot of sense, since their logic is intimately related to each other. Alter the code to be a bit more sensible: first split on the "action" type since that's the most fundamental difference, then handle the memory onlining special case, then the block device add (which will become a lot more general in future changes). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:59:20 +10:00
David Gibson	55ed2ddd07	agent: Store uevent watchers in Vec rather than HashMap Sandbox:dev_watcher is a HashMap from a "device address" to a channel used to notify get_device_name() that a suitable uevent has been found. However, "device address" isn't well defined, having somewhat different meanings for different device/event types. We never actually look up this HashMap by key, except to remove entries. Not looking up by key suggests that a map is not the appropriate data structure here. Furthermore, HashMap imposes limitations on the types which will prevent some future extensions we want. So, replace the HashMap with a Vec<Option<>>. We need the Option<> so that we can remove entries by index (removing them from the Vec completely would change the indices of other entries, possibly breaking concurrent work). This does mean that the vector will keep growing as we watch for different events during startup. However, we don't expect the number of device events we watch for during a run to be very large, so that shouldn't be a problem. We can optimize this later if it becomes a problem. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:59:19 +10:00
David Gibson	91e0ef5c90	agent/uevent: Report whole Uevents to device watchers Currently, when Uevent::handle_block_add_event() receives an event matching a registered watcher, it reports the /dev node name from the event back to the watcher. This changes it to report the entire uevent, not just the /dev node name. This will allow various future extensions. It also makes the client side of the uevent watching - get_device_name() - more consistent between its two paths: finding a past uevent in Sandbox::uevent_map() or waiting for a new uevent via a watcher. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:47 +10:00
David Gibson	3642005479	agent: Store whole Uevent in map, rather than just /dev name Sandbox::pci_device_map contains a mapping from sysfs paths to /dev entries which is used by get_device_name() to look up the right /dev node. But, the map only supplies the answer if the uevent for the device has already been received, otherwise get_device_name() has to wait for it. However the matching for already-received and yet-to-come uevents isn't quite the same which makes the whole system fragile. In order to make sure the matching for both cases is identical, we need the already-received side to store the whole uevent to match against, not just the sysfs path and device name. So, rename pci_device_map to uevent_map and store the whole uevent there verbatim. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:47 +10:00
David Gibson	0616202580	agent/device: Move GLOBAL_DEVICE_WATCHER into Sandbox In Kata 1.x, both the sysToDevMap and the deviceWatchers are in the sandbox structure. For some reason in Kata 2.x, the device watchers have moved to a separate global variable, GLOBAL_DEVICE_WATCHER. This is a bad idea: apart from introducing an extra global variable unnecessarily, it means that Sandbox::pci_device_map and GLOBAL_DEVICE_WATCHER are protected by separate mutexes. Since the information in these two structures has to be kept in sync with each other, it makes much more sense to keep them both under the same single Sandbox mutex. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:45 +10:00
David Gibson	11ae32e3c0	agent/device: Fix path matching for PCI devices For the case of virtio-blk PCI devices, when matching uevents we create a pci_p temporary. However, we build it incorrectly: the dev_addr values we use for PCI devices are relative sysfs paths from the PCI root to the device in question including an initial /. But when we construct pci_p we add an extra /, meaning the resulting path will not match properly. AFAICT the only reason we got away with this is because in practice the virtio-blk devices were discovered by the kernel before we looked for them, meaning the looser matching in get_device_name() was used, rather than the pci_p logic in handle_block_add_event(). Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:58:06 +10:00
David Gibson	4f60880414	agent/device: Update test_get_device_name() The current test_get_device_name(), ported from Kata 1.x doesn't really reflect how the function is used in practice. The example path appears to be for a virtio-blk device, but it's an s390 specific variant, not a PCI device. The s390 form isn't actually supported by any of the existing users of get_device_name(). Change it to a plausible virtio-blk-pci style path to better test how get_device_name() will actually be used in practice. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 20:49:48 +10:00
Bin Liu	117c59150d	Merge pull request #1613 from Tim-Zhang/pipestream-shutdown-do-nothing Don't do anything in Pipestream::shutdown	2021-04-06 14:03:00 +08:00
Tim Zhang	ee6a590db1	agent: add test test_pipestream_shutdown Make sure PipeStream::shutdown() does not close the inner fd. Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-06 11:44:56 +08:00
Tim Zhang	4a2d437043	agent: don't do anything in Pipestream::shutdown The only right way to shut down a pipe is to drop it; otherwise PipeStream will conflict with its twins, because they both have the same fd and are both registered. Fixes: #1614 Signed-off-by: Tim Zhang <tim@hyper.sh>	2021-04-06 11:44:38 +08:00
Peng Tao	d5600641dd	Merge pull request #1603 from lifupan/fix_fsgroup Fix fsgroup	2021-04-06 11:35:03 +08:00
David Gibson	e3e670c56f	agent/device: Forward port test for get_device_name() from Kata 1.x Kata 1.x had a testcase for the equivalent getDeviceName function in Go, this adapts it to Rust and adds it to Kata 2.x. Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 13:29:37 +10:00
David Gibson	ed08980fc1	agent: Remove many "panic message is not string literal" warnings Rust 1.51 appears to have added a new warning in anticipation of Rust 2021, which requires the format string for panic!()s (including via the various assert!() macros) to be a string literal. This triggers quite a few times in the agent code. This patch fixes them. fixes #1626 Signed-off-by: David Gibson <david@gibson.dropbear.id.au>	2021-04-06 11:51:34 +10:00
Snir Sheriber	13653e7b55	runtime: increase dial timeout On some setups, starting multiple kata pods (qemu) simultaneously on the same node might cause kata VMs booting time to increase and the pods to fail with: Failed to check if grpc server is working: rpc error: code = DeadlineExceeded desc = timed out connecting to vsock 1358662990:1024: unknown Increasing default dialing timeout to 30s should cover most cases. Signed-off-by: Snir Sheriber <ssheribe@redhat.com> Fixes: #1543	2021-04-04 09:37:38 +03:00
Chelsea Mafrica	17b1452c2a	Merge pull request #1607 from fidencio/wip/only-keep-one-VERSION-file Only keep one VERSION file	2021-04-02 11:14:12 -07:00
Bo Chen	1511d966aa	Merge pull request #1616 from egernst/dechat-deruntime Dechat deruntime	2021-04-01 11:02:27 -07:00
Chelsea Mafrica	4a3282cf1a	Merge pull request #1608 from likebreath/0331/go_fmt_clh_clinet_code runtime: Format auto-generated client code for cloud-hypervisor API	2021-04-01 10:39:02 -07:00
Eric Ernst	a4c125a8b9	trace: move gRPC requests from debug to trace There are many requests to the agent that happen with relatively high frequency when a workload is running (checkRequest, as an example). Let's move from Debug to Trace to avoid bombarding journal. Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2021-04-01 09:03:26 -07:00
Eric Ernst	50fff97753	trace: move trace span chatter to trace rather than info No human should ever read that output. Let's at least move it to trace for now. Fixes: #1615 Signed-off-by: Eric Ernst <eric.g.ernst@gmail.com>	2021-04-01 09:02:56 -07:00
Fupan Li	5524bc806b	Merge pull request #1612 from liubin/1610/use-concrete-kata-agent-config-type runtime: use concrete KataAgentConfig instead of interface type	2021-04-01 21:26:38 +08:00
bin	6fe48329b5	runtime: use concrete KataAgentConfig instead of interface type Kata Containers 2.0 only has one type of agent, so there is no need to use an interface as the config's type. Fixes: #1610 Signed-off-by: bin <bin@hyper.sh>	2021-04-01 13:44:45 +08:00
fupan.lfp	6493942568	mount: fix the issue of missing set fsGroup For k8s emptyDir volume, a specific fsGroup would be set for it, thus guest should get this fsGroup from runtime and set it properly on the emptyDir volume in guest. Fixes: #1580 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-01 11:33:26 +08:00
fupan.lfp	88e58a4f4b	agent: fix the issue of missing pass fsGroup For k8s emptyDir volume, a specific fsGroup would be set for it, thus runtime should pass this fsGroup to guest and set it properly on the emptyDir volume in guest. Fixes: #1580 Signed-off-by: fupan.lfp <fupan.lfp@antfin.com>	2021-04-01 11:33:18 +08:00
Fabiano Fidêncio	572aff53e8	build: Only keep one VERSION file Instead of having different VERSION files spread across the project, let's always use the one in the topsrcdir and remove all the others, keeping only a symlink to the topsrcdir one. Fixes: #1579 Signed-off-by: Fabiano Fidêncio <fidencio@redhat.com>	2021-03-31 23:51:20 +02:00
Bo Chen	0c38d9ecc4	runtime: Fix the format of the client code of cloud-hypervisor APIs Regenerate the client code with the added `go-fmt` step. No functional changes. Fixes: #1606 Signed-off-by: Bo Chen <chen.bo@intel.com>	2021-03-31 14:41:44 -07:00


3879 Commits