Lighthouse hub-controller enhancement proposal #82
Conversation
🤖 Created branch: z_pr82/donaldh/hub-controller
Signed-off-by: Donald Hunter <donald.hunter@redhat.com>
The current proposal doesn't mention any cons of having a central MCS controller. Chiefly, it will require the broker cluster to have API access to all other clusters. In the current model, all clusters only need credentials to access the broker cluster. This was one of the key reasons why Submariner was designed without any controller running on the broker.
All of this can also be achieved by making the current Lighthouse agent create the aggregated version of the ServiceImport on the broker:
- When a ServiceExport is created, the agent checks if a ServiceImport already exists.
- If not, it creates a new ServiceImport on the broker based on the Service present in the cluster.
- If present, it checks whether it conflicts with the exported Service.
- If it conflicts, set the ServiceExport status to conflict.
- If not, update the ServiceImport with the aggregated information.
- If a Service is exported from two different clusters, only one will be able to successfully update the copy on the broker; the other will get a conflict error and will have to retry.
So, by maintaining a single copy on the broker and syncing that, we can achieve the same benefits without requiring a central MCS controller. The only drawback of this approach would be too many conflicts when a service is exported from multiple clusters at the same time, but that shouldn't happen frequently enough to be an issue. It would only be a problem if two such clusters "restart" at the same time.
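The agent-side flow above can be sketched as follows. This is an illustrative model only, not Submariner code: the `Broker` class, field names, and in-memory `resourceVersion` handling are all hypothetical stand-ins for the Kubernetes API's optimistic-concurrency behaviour.

```python
class Conflict(Exception):
    """Raised when an update races with another writer (like an HTTP 409)."""

class Broker:
    """Hypothetical in-memory stand-in for the broker apiserver."""
    def __init__(self):
        self.imports = {}  # (namespace, name) -> ServiceImport-like dict

    def get(self, key):
        return self.imports.get(key)

    def create(self, key, obj):
        obj["resourceVersion"] = 1
        self.imports[key] = dict(obj)
        return self.imports[key]

    def update(self, key, obj, expected_version):
        current = self.imports[key]
        if current["resourceVersion"] != expected_version:
            raise Conflict(key)  # another cluster won the race; caller retries
        obj["resourceVersion"] = expected_version + 1
        self.imports[key] = dict(obj)
        return self.imports[key]

def export_service(broker, cluster, svc, max_retries=3):
    """Agent-side logic: create the aggregated import, or merge into it."""
    key = (svc["namespace"], svc["name"])
    for _ in range(max_retries):
        existing = broker.get(key)
        if existing is None:
            # First exporter: create the aggregated ServiceImport.
            return broker.create(key, {
                "namespace": svc["namespace"], "name": svc["name"],
                "ports": svc["ports"], "clusters": [cluster],
            })
        if existing["ports"] != svc["ports"]:
            # Incompatible export: would be reflected in ServiceExport status.
            return {"status": "Conflict"}
        merged = dict(existing)
        merged["clusters"] = sorted(set(existing["clusters"]) | {cluster})
        try:
            return broker.update(key, merged, existing["resourceVersion"])
        except Conflict:
            continue  # re-read the latest copy and retry
    raise RuntimeError("gave up after retries")
```

The retry loop is where the "only one will successfully update the copy on the broker" behaviour lands: the loser re-reads and merges again, so no central controller is needed to serialise writers.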
The proposed approach will not require the broker cluster to have API access to the other clusters.
I agree that a drawback of this approach is the need to run a highly available controller on the broker.
Could you clarify this? ServiceExports are not synced to the broker today. Or did you mean ServiceImport here? The design of this MCS controller is very similar to the federator that was in use during the very early days of Submariner: https://github.com/kubernetes-sigs/kubefed. You can even see remnants of that code in some APIs/functions that still use "Federator" in their naming. The approach that I mentioned will likely have some complications of its own, but I'd like you to think along those lines too. I am not against an MCS controller or in favour of what I proposed; I just want to make sure we weigh both options and choose whichever is best.
So currently, the service export syncer transforms ServiceExport to ServiceImport and then syncs the ServiceImports to the broker, right? The idea is to separate the transformation from the syncing.
This would mean that the controller responsible for creating the ServiceImport resource would have visibility of all ServiceExport objects from the same watcher and can generate the ServiceImport according to the MCS specification. All clusters would receive a consistent view of the aggregated ServiceImport.
But to do that, the ServiceExport alone isn't enough to create a ServiceImport. A ServiceExport is just a service name and namespace, with no other relevant information. The broker will also need the Service to create the ServiceImport and get information like the service IP and ports. Without the Service, how will you even detect an export conflict?
Yep, the current proof of concept serialises the associated Service and adds it as a label on the ServiceExport. There could be an internal "implementation" CRD instead. It would also be possible to reuse ServiceImport, but that is likely to be confusing and would require strict separation between "source" objects and "merged" objects that get synced by clusters.
That seems a bit kludgy to me. There's also globalnet info needed and we also update the ServiceExport status on the managed cluster. I understand the motivation for implementing a hub MCS controller, as that's how Google has implemented the MCS API and may have even tailored parts of it assuming a central controller, but I just don't see a compelling reason to re-architect Submariner in that manner, especially since a central controller implementation seems more suited where the controller connects to/watches/updates managed cluster resources (a la kubefed). It seems the main motivation here is to add the missing Lighthouse behavior to fully align with the MCS specification which, as @vthapar outlined earlier, can be implemented with the current architecture.
I agree that a label containing serialised Service info is kludgy, but it was expedient for a proof of concept. My preference would be to have an implementation CRD that contains all required info for the exported service, rather than syncing several different objects to the broker or overloading ServiceImport as a means of describing export details. I think we do need to find a solution to ServiceImport merging that is based on having one writer per resource rather than multiple contending writers. A separation of concerns between syncing to the broker and transforming exports to imports would achieve this.
Note that ServiceImport merging will only be required when a service is exported from different clusters. So unless two different clusters happen to export a service at exactly the same moment [a rare occurrence even with automated scripts], conflicts will be unlikely and not frequent enough to be an issue. Between ServiceExport and ServiceImport we already have all the info; anything else at this point would be superfluous.

Another option is to aggregate at the destination. This provides a single writer for the aggregated ServiceImport. It can mean that different clusters briefly have different views of service imports as they're being distributed, but that can happen even with an MCS controller. Eventual consistency is what really matters in a distributed environment.

As Tom pointed out, Google proposed the KEP and API tailored specifically to their implementation using a central MCS controller. In Submariner we were doing things differently, and even today some of the deviations from the spec exist because those work better for us. Another example is not using a virtual IP in the ServiceImport, but each individual import carrying the ClusterIP of the service in that cluster. So we'll never be 100% compliant by design, and if someone wants a 100% compliant implementation based on a central MCS controller, that is already available from the MCS SIG.

I'd be okay with a central controller if there were no other way to solve the problems we're trying to solve. Why not do some PoC around these other two solutions, aggregate at destination or single import on broker, and then decide what works best?
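The "aggregate at destination" alternative above can be sketched as a pure merge function run by each cluster's local agent, which is then the single writer of its own aggregated view. The record shape and the `conflict` flag are hypothetical, chosen only to illustrate the idea:

```python
def aggregate_at_destination(per_cluster_imports):
    """Merge per-cluster ServiceImport-like records (synced from the
    broker) into one local aggregated view per (namespace, name)."""
    merged = {}
    for imp in per_cluster_imports:
        key = (imp["namespace"], imp["name"])
        entry = merged.setdefault(key, {
            "namespace": imp["namespace"], "name": imp["name"],
            "ports": imp["ports"], "clusters": [],
        })
        if entry["ports"] != imp["ports"]:
            # Incompatible exports: surfaced on ServiceExport status
            # rather than fought over on a shared broker object.
            entry["conflict"] = True
        entry["clusters"].append(imp["cluster"])
    return merged
```

Because each cluster recomputes its own aggregate from the synced inputs, there is no contended write at all; the cost, as noted above, is that views may briefly diverge while syncing.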
Another aspect is that the status of each local user-created ServiceExport needs to be updated, which would seem problematic from a hub controller with no access to the managed clusters.
The proposed approach is consistent with the kcp architecture, where transformation logic is executed on the kcp cluster and workload clusters run syncers that sync config in one direction and status in the other direction.
But in order to do the latter, the central controller would need access and credentials to each managed cluster, unless I'm missing something. |
I don't think that would be required, because the syncers can run on the workload clusters and connect to the broker cluster using the same authentication model that Submariner uses today. @vthapar makes a fair point:
It is possible that in the common case no ServiceImport merging needs to happen; only the first ServiceExport causes a ServiceImport to be created. Subsequent ServiceExports would only need to sync an EndpointSlice. I think there are several edge cases that need to be considered, such as identifying when the ServiceImport can be deleted. There are other issues too, such as: should the ServiceImport VIP be set at all, since DNS resolution should be based on the EndpointSlices?
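The common case described above, including the deletion edge case, can be sketched like this. All structures and handler names here are illustrative, not the Lighthouse API:

```python
imports = {}          # (namespace, name) -> aggregated ServiceImport
endpoint_slices = {}  # (namespace, name) -> {cluster: EndpointSlice-like dict}

def on_export(ns, name, cluster, slice_):
    """Handle a ServiceExport: only the first exporter creates the import."""
    key = (ns, name)
    if key not in imports:
        imports[key] = {"namespace": ns, "name": name}  # first exporter wins
        endpoint_slices[key] = {}
    endpoint_slices[key][cluster] = slice_  # every exporter syncs a slice

def on_unexport(ns, name, cluster):
    """Handle export removal: delete the import when the last slice goes."""
    key = (ns, name)
    endpoint_slices.get(key, {}).pop(cluster, None)
    if not endpoint_slices.get(key):  # last exporter gone
        imports.pop(key, None)
        endpoint_slices.pop(key, None)
```

In this model merging is the exception, not the rule: steady-state traffic is just EndpointSlice churn, and the ServiceImport only changes on the first export and the last unexport.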
FWIU, the hub controller would process the synced |
The hub controller would write status to the ServiceExport object on the hub/broker. The syncer running on the workload cluster would sync the status back to the ServiceExport object on the workload cluster. This is the pattern that kcp syncers use; they sync config in one direction and status updates in the opposite direction.
Which seems a lot more complicated, certainly more than it needs to be in our case. |
The current Lighthouse implementation has a passive broker: it is a central apiserver that is accessed by each cluster in the clusterset. Clusters synchronise resources to and from the broker, with all sync and reconciliation logic implemented in the clusters.

There are feature gaps that would be much easier to address if we introduce an active broker that can implement reconciliation logic.
The passive broker is part of the broader Submariner architecture, which emphasizes distributed clusters where each cluster is autonomous and the broker is simply a mechanism to disseminate the data, and a very simple one (it could've been a plain etcd service rather than the K8s API).
If we're going to change that, then it should be discussed on a higher level architecturally and not just in the context of Lighthouse.
Closing this PR given there is no consensus for introducing controller functionality on the broker. I will focus instead on aggregating service imports for alignment with MCS.
🤖 Closed branches: [z_pr82/donaldh/hub-controller]