Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-55240

Upgrading from 4.16 to 4.17 causes the Cluster Resource Override Operator to stop working

XMLWordPrintable

    • None
    • 0
    • AUTOSCALE - Sprint 270
    • 1
    • False
    • Hide

      None

      Show
      None
    • Hide
      Cause - the user attempts to upgrade the operator from 4.16 to 4.17+
      Consequence - this causes the CRO webhook to stop working and prevents pods from being created in namespaces with CRO enabled
      Fix - delete a stale secret and allow openshift to regenerate it with the correct fields
      Result - operator now proceeds to work as normal and allows pods to be created in CRO-enabled namespaces
      Show
      Cause - the user attempts to upgrade the operator from 4.16 to 4.17+ Consequence - this causes the CRO webhook to stop working and prevents pods from being created in namespaces with CRO enabled Fix - delete a stale secret and allow openshift to regenerate it with the correct fields Result - operator now proceeds to work as normal and allows pods to be created in CRO-enabled namespaces
    • Bug Fix
    • Done

      This is a clone of issue OCPBUGS-54886. The following is the description of the original issue:

      Description of problem:

      When the Cluster Resource Override Operator (CROO) is upgraded from v4.16 to v4.17 as a result of an OpenShift cluster upgrade, there are multiple errors:
      
      When attempting to create a pod in a namespace with CROO enabled, the following error is seen:
      ~~~
      Error from server (InternalError): error when creating "example.yaml": Internal error occurred: failed calling webhook 
      "clusterresourceoverrides.admission.autoscaling.openshift.io": failed to call webhook: the server is currently unable to handle the request
      ~~~
      Additionally, namespaces will fail to be deleted with the following error under status.conditions:
      ~~~
      Discovery failed for some groups, 1 failing: unable to retrieve the complete list of server APIs: admission.autoscaling.openshift.io/v1: stale GroupVersion discovery: admission.autoscaling.openshift.io/v1
      ~~~

      Version-Release number of selected component (if applicable):

      clusterresourceoverride-operator.v4.17.0-202503210032
      OpenShift 4.17 that was upgraded from 4.16

      How reproducible:

      I believe this should be reproducible every time, however I've only had time to test this once due to the upgrade involved.  

      Steps to Reproduce:

      1. On a OpenShift 4.16 cluster, install the clusterresourceoverride operator from the stable channel.      
      2. Create a clusterresourceoverride resource with default values. 
      3. Upgrade the OpenShift cluster to 4.17. After the upgrade is complete, CROO will be automatically upgraded to v4.17 as well.      

      Actual results:

      There are errors and CROO doesn't work properly.     

      Expected results:

      There are no errors and CROO works properly.     

      Additional info:

          

              rh-ee-macao Max Cao
              openshift-crt-jira-prow OpenShift Prow Bot
              Paul Rozehnal Paul Rozehnal
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: