
Commit d984a6d

JohnGarbutt authored and melwitt committed
Tell oslo.limit how to count nova resources
A follow-on patch will use this code to enforce the limits; this patch provides the integration with oslo.limit and a new internal nova API that is able to enforce those limits.

The first part is providing a callback for oslo.limit to be able to count the resources being used. We only count resources grouped by project_id. For counting servers, we make use of the instance mappings list in the API database, just as the existing quota code does. While we do check to ensure the queued_for_delete migration has been completed, we simply error out if that is not the case, rather than attempting to fall back to any other counting system. We hope one day we can count this in placement using consumer records, or similar.

For counting all other resource usage, the limits must refer to some usage of a resource class being consumed in placement. This is similar to how the count-with-placement variant of the existing quota code works today. This is not restricted to RAM and VCPU; it is open to any resource class that is known to placement.

The second part is the enforcement method, which keeps a similar signature to the existing enforce_num_instances call that is used to check quotas using the legacy quota system. From the flavor we extract the current resource usage. This is considered the simplest first step that helps us deliver Ironic limits alongside all the existing RAM and VCPU limits. At a later date, we would ideally get passed a more complete view of what resources are being requested from placement.

NOTE: given the instance object doesn't exist when enforce is called, we can't just pass the instance in here.

A [workarounds] option is also available for operators who need the legacy quota usage behavior where VCPU = VCPU + PCPU.

blueprint unified-limits-nova

Change-Id: I272b59b7bc8975bfd602640789f80d2d5f7ee698
1 parent c384824 commit d984a6d
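As context for the callback/enforce split described in the commit message, here is a minimal, self-contained sketch of the data contract between nova and oslo.limit: a usage callback that returns current counts per resource name, and a deltas dict describing what the new request would add. Nothing below is part of this commit; all names and values are illustrative.

# Illustrative only: the shape of the usage/deltas data exchanged with
# oslo.limit. In the real patch the callback is
# nova.limit.placement._get_usage(), which counts 'servers' from instance
# mappings and 'class:<RESOURCE_CLASS>' resources from placement.
def fake_usage_callback(project_id, resource_names):
    usage = {"servers": 2, "class:VCPU": 4, "class:MEMORY_MB": 4096}
    return {name: usage.get(name, 0) for name in resource_names}


# Deltas for booting one instance of a 2 VCPU / 2048 MB flavor, using the
# same resource naming scheme as the patch.
deltas = {"servers": 1, "class:VCPU": 2, "class:MEMORY_MB": 2048}

current = fake_usage_callback("example-project", list(deltas))
print(current)  # {'servers': 2, 'class:VCPU': 4, 'class:MEMORY_MB': 4096}
# oslo.limit's Enforcer(callback).enforce(project_id, deltas) raises
# ProjectOverLimit when usage + delta exceeds a registered limit.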

File tree: 7 files changed (+570 −8 lines changed)


nova/conf/workarounds.py

Lines changed: 18 additions & 0 deletions
@@ -383,6 +383,24 @@
 before compute nodes have been able to update their service record. In an FFU,
 the service records in the database will be more than one version old until
 the compute nodes start up, but control services need to be online first.
+"""),
+    cfg.BoolOpt('unified_limits_count_pcpu_as_vcpu',
+                default=False,
+                help="""
+When using unified limits, use VCPU + PCPU for VCPU quota usage.
+
+If the deployment is configured to use unified limits via
+``[quota]driver=nova.quota.UnifiedLimitsDriver``, by default VCPU resources are
+counted independently from PCPU resources, consistent with how they are
+represented in the placement service.
+
+Legacy quota behavior counts PCPU as VCPU and returns the sum of VCPU + PCPU
+usage as the usage count for VCPU. Operators relying on the aggregation of
+VCPU and PCPU resource usage counts should set this option to True.
+
+Related options:
+
+* :oslo.config:option:`quota.driver`
 """),
 ]
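For reference, a minimal nova.conf fragment that enables unified limits and opts into this workaround might look like the following; this is a sketch assembled from the option names in the diff above, not taken from the commit itself.

[quota]
driver = nova.quota.UnifiedLimitsDriver

[workarounds]
# Fold PCPU usage into the VCPU count, matching legacy quota behavior.
unified_limits_count_pcpu_as_vcpu = True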

nova/limit/placement.py

Lines changed: 168 additions & 0 deletions
@@ -0,0 +1,168 @@
+# Copyright 2022 StackHPC
+#
+# Licensed under the Apache License, Version 2.0 (the "License"); you may
+# not use this file except in compliance with the License. You may obtain
+# a copy of the License at
+#
+#    http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
+# WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
+# License for the specific language governing permissions and limitations
+# under the License.
+
+
+import os_resource_classes as orc
+from oslo_limit import exception as limit_exceptions
+from oslo_limit import limit
+from oslo_log import log as logging
+
+import nova.conf
+from nova import exception
+from nova.limit import utils as limit_utils
+from nova import objects
+from nova import quota
+from nova.scheduler.client import report
+from nova.scheduler import utils
+
+LOG = logging.getLogger(__name__)
+CONF = nova.conf.CONF
+
+# Cache to avoid repopulating ksa state
+PLACEMENT_CLIENT = None
+
+
+def _get_placement_usages(context, project_id):
+    global PLACEMENT_CLIENT
+    if not PLACEMENT_CLIENT:
+        PLACEMENT_CLIENT = report.SchedulerReportClient()
+    return PLACEMENT_CLIENT.get_usages_counts_for_limits(context, project_id)
+
+
+def _get_usage(context, project_id, resource_names):
+    """Called by oslo_limit's enforcer"""
+    if not limit_utils.use_unified_limits():
+        raise NotImplementedError("unified limits is disabled")
+
+    count_servers = False
+    resource_classes = []
+
+    for resource in resource_names:
+        if resource == "servers":
+            count_servers = True
+            continue
+
+        if not resource.startswith("class:"):
+            raise ValueError("Unknown resource type: %s" % resource)
+
+        # Temporarily strip resource class prefix as placement does not use it.
+        # Example: limit resource 'class:VCPU' will be returned as 'VCPU' from
+        # placement.
+        r_class = resource.lstrip("class:")
+        if r_class in orc.STANDARDS or orc.is_custom(r_class):
+            resource_classes.append(r_class)
+        else:
+            raise ValueError("Unknown resource class: %s" % r_class)
+
+    if not count_servers and len(resource_classes) == 0:
+        raise ValueError("no resources to check")
+
+    resource_counts = {}
+    if count_servers:
+        # TODO(melwitt): Change this to count servers from placement once nova
+        # is using placement consumer types and is able to differentiate
+        # between "instance" allocations vs "migration" allocations.
+        if not quota.is_qfd_populated(context):
+            LOG.error('Must migrate all instance mappings before using '
+                      'unified limits')
+            raise ValueError("must first migrate instance mappings")
+        mappings = objects.InstanceMappingList.get_counts(context, project_id)
+        resource_counts['servers'] = mappings['project']['instances']
+
+    try:
+        usages = _get_placement_usages(context, project_id)
+    except exception.UsagesRetrievalFailed as e:
+        msg = ("Failed to retrieve usages from placement while enforcing "
+               "%s quota limits." % ", ".join(resource_names))
+        LOG.error(msg + " Error: " + str(e))
+        raise exception.UsagesRetrievalFailed(msg)
+
+    # Use legacy behavior VCPU = VCPU + PCPU if configured.
+    if CONF.workarounds.unified_limits_count_pcpu_as_vcpu:
+        # If PCPU is in resource_classes, that means it was specified in the
+        # flavor explicitly. In that case, we expect it to have its own limit
+        # registered and we should not fold it into VCPU.
+        if orc.PCPU in usages and orc.PCPU not in resource_classes:
+            usages[orc.VCPU] = (usages.get(orc.VCPU, 0) +
+                                usages.get(orc.PCPU, 0))
+
+    for resource_class in resource_classes:
+        # Need to add back resource class prefix that was stripped earlier
+        resource_name = 'class:' + resource_class
+        # Placement doesn't know about classes with zero usage
+        # so default to zero to tell oslo.limit usage is zero
+        resource_counts[resource_name] = usages.get(resource_class, 0)
+
+    return resource_counts
+
+
+def _get_deltas_by_flavor(flavor, is_bfv, count):
+    if flavor is None:
+        raise ValueError("flavor")
+    if count < 0:
+        raise ValueError("count")
+
+    # NOTE(johngarbutt): this skips bfv, port, and cyborg resources
+    # but it still gives us better checks than before unified limits
+    # We need an instance in the DB to use the current is_bfv logic
+    # which doesn't work well for instances that don't yet have a uuid
+    deltas_from_flavor = utils.resources_for_limits(flavor, is_bfv)
+
+    deltas = {"servers": count}
+    for resource, amount in deltas_from_flavor.items():
+        if amount != 0:
+            deltas["class:%s" % resource] = amount * count
+    return deltas
+
+
+def _get_enforcer(context, project_id):
+    # NOTE(johngarbutt) should we move context arg into oslo.limit?
+    def callback(project_id, resource_names):
+        return _get_usage(context, project_id, resource_names)
+
+    return limit.Enforcer(callback)
+
+
+def enforce_num_instances_and_flavor(context, project_id, flavor, is_bfvm,
+                                     min_count, max_count, enforcer=None):
+    """Return max instances possible, else raise TooManyInstances exception."""
+    if not limit_utils.use_unified_limits():
+        return max_count
+
+    # Ensure the recursion will always complete
+    if min_count < 0 or min_count > max_count:
+        raise ValueError("invalid min_count")
+    if max_count < 0:
+        raise ValueError("invalid max_count")
+
+    deltas = _get_deltas_by_flavor(flavor, is_bfvm, max_count)
+    enforcer = _get_enforcer(context, project_id)
+    try:
+        enforcer.enforce(project_id, deltas)
+    except limit_exceptions.ProjectOverLimit as e:
+        # NOTE(johngarbutt) we can do better, but this is very simple
+        LOG.debug("Limit check failed with count %s retrying with count %s",
+                  max_count, max_count - 1)
+        try:
+            return enforce_num_instances_and_flavor(context, project_id,
+                                                    flavor, is_bfvm, min_count,
+                                                    max_count - 1,
+                                                    enforcer=enforcer)
+        except ValueError:
+            # Copy the *original* exception message to a OverQuota to
+            # propagate to the API layer
+            raise exception.TooManyInstances(str(e))
+
+    # no problems with max_count, so we return max count
+    return max_count
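To make the retry behaviour of enforce_num_instances_and_flavor() above easier to follow, here is a self-contained sketch of the same strategy with simple stand-ins (nothing below is nova code): try max_count, and on an over-limit failure retry with max_count - 1 until a count fits or the recursion drops below min_count.

# Minimal sketch, assuming simplified stand-ins for oslo.limit and nova.
class ProjectOverLimit(Exception):
    """Stand-in for oslo_limit.exception.ProjectOverLimit."""


def check_limits(count):
    # Pretend only 2 more servers fit under the project's limits.
    if count > 2:
        raise ProjectOverLimit("over limit for %d servers" % count)


def enforce(min_count, max_count):
    if min_count < 0 or min_count > max_count:
        raise ValueError("invalid min_count")
    if max_count < 0:
        raise ValueError("invalid max_count")
    try:
        check_limits(max_count)
    except ProjectOverLimit as e:
        try:
            return enforce(min_count, max_count - 1)
        except ValueError:
            # Recursion bottomed out below min_count: report the original
            # failure, as the real code does via TooManyInstances.
            raise ProjectOverLimit(str(e))
    return max_count


print(enforce(1, 5))  # -> 2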

nova/quota.py

Lines changed: 12 additions & 8 deletions
@@ -1223,6 +1223,17 @@ def _server_group_count_members_by_user_legacy(context, group, user_id):
     return {'user': {'server_group_members': count}}
 
 
+def is_qfd_populated(context):
+    global UID_QFD_POPULATED_CACHE_ALL
+    if not UID_QFD_POPULATED_CACHE_ALL:
+        LOG.debug('Checking whether user_id and queued_for_delete are '
+                  'populated for all projects')
+        UID_QFD_POPULATED_CACHE_ALL = _user_id_queued_for_delete_populated(
+            context)
+
+    return UID_QFD_POPULATED_CACHE_ALL
+
+
 def _server_group_count_members_by_user(context, group, user_id):
     """Get the count of server group members for a group by user.
@@ -1240,14 +1251,7 @@ def _server_group_count_members_by_user(context, group, user_id):
     # So, we check whether user_id/queued_for_delete is populated for all
     # records and cache the result to prevent unnecessary checking once the
     # data migration has been completed.
-    global UID_QFD_POPULATED_CACHE_ALL
-    if not UID_QFD_POPULATED_CACHE_ALL:
-        LOG.debug('Checking whether user_id and queued_for_delete are '
-                  'populated for all projects')
-        UID_QFD_POPULATED_CACHE_ALL = _user_id_queued_for_delete_populated(
-            context)
-
-    if UID_QFD_POPULATED_CACHE_ALL:
+    if is_qfd_populated(context):
         count = objects.InstanceMappingList.get_count_by_uuids_and_user(
             context, group.members, user_id)
         return {'user': {'server_group_members': count}}

nova/scheduler/client/report.py

Lines changed: 24 additions & 0 deletions
@@ -2486,6 +2486,30 @@ def _get_usages(self, context, project_id, user_id=None):
         return self.get(url, version=GET_USAGES_VERSION,
                         global_request_id=context.global_id)
 
+    def get_usages_counts_for_limits(self, context, project_id):
+        """Get the usages counts for the purpose of enforcing unified limits
+
+        The response from placement will not contain a resource class if
+        there is no usage. i.e. if there is no usage, you get an empty dict.
+
+        Note resources are counted as placement sees them, as such note
+        that VCPUs and PCPUs will be counted independently.
+
+        :param context: The request context
+        :param project_id: The project_id to count across
+        :return: A dict containing the project-scoped counts, for example:
+                 {'VCPU': 2, 'MEMORY_MB': 1024}
+        :raises: `exception.UsagesRetrievalFailed` if a placement API call
+                 fails
+        """
+        LOG.debug('Getting usages for project_id %s from placement',
+                  project_id)
+        resp = self._get_usages(context, project_id)
+        if resp:
+            data = resp.json()
+            return data['usages']
+        self._handle_usages_error_from_placement(resp, project_id)
+
     def get_usages_counts_for_quota(self, context, project_id, user_id=None):
         """Get the usages counts for the purpose of counting quota usage.
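For illustration, the placement usages body that get_usages_counts_for_limits() slices the 'usages' key out of looks roughly like the dict below; the values are invented, and resource classes with zero usage are simply absent, which is why the limit code defaults missing classes to zero.

# Hypothetical example of a placement GET /usages?project_id=<id> body; the
# new helper returns only the inner 'usages' mapping (an empty dict when the
# project has no allocations).
placement_body = {
    "usages": {
        "VCPU": 4,
        "MEMORY_MB": 8192,
        "DISK_GB": 40,
    }
}

usages = placement_body["usages"]
print(usages.get("VCPU", 0))  # 4
print(usages.get("PCPU", 0))  # 0 -> classes with no usage are not reported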

nova/scheduler/utils.py

Lines changed: 9 additions & 0 deletions
@@ -615,6 +615,10 @@ def resources_from_flavor(instance, flavor):
     """
     is_bfv = compute_utils.is_volume_backed_instance(instance._context,
                                                      instance)
+    return _get_resources(flavor, is_bfv)
+
+
+def _get_resources(flavor, is_bfv):
     # create a fake RequestSpec as a wrapper to the caller
     req_spec = objects.RequestSpec(flavor=flavor, is_bfv=is_bfv)
@@ -628,6 +632,11 @@ def resources_from_flavor(instance, flavor):
     return res_req.merged_resources()
 
 
+def resources_for_limits(flavor, is_bfv):
+    """Work out what unified limits may be exceeded."""
+    return _get_resources(flavor, is_bfv)
+
+
 def resources_from_request_spec(ctxt, spec_obj, host_manager,
                                 enable_pinning_translate=True):
     """Given a RequestSpec object, returns a ResourceRequest of the resources,

0 commit comments