[k8s] Add elastic-agent helm-chart #5331

pkoutsovasilis · 2024-08-21T07:44:12Z

What does this PR do?

This PR introduces a Helm Chart for deploying Elastic-Agent in Kubernetes

Why is it important?

By having an Elastic-Agent Helm Chart we get a simplified deployment on k8s

Checklist

My code follows the style guidelines of this project
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
~~[ ] I have made corresponding change to the default configuration files~~
I have added tests that prove my fix is effective or that my feature works
~~[ ] I have added an entry in ./changelog/fragments using the changelog tool~~
I have added an integration test or an E2E test

Disruptive User Impact

N/A

How to test this PR locally

Go to deploy/helm/elastic-agent/examples

Related issues

elasticmachine · 2024-08-26T14:33:27Z

Pinging @elastic/elastic-agent-control-plane (Team:Elastic-Agent-Control-Plane)

jlind23 · 2024-08-28T05:31:10Z

buildkite test it

ycombinator · 2024-08-28T22:41:12Z

@pkoutsovasilis Just a general comment: we plan to use this Helm chart to create an AWS Marketplace listing for Elastic Agent. To this end, I believe AWS has some requirements that the Helm chart must fulfill. Just wanted to bring that up in this PR in case it means some things in the chart need to change to meet these requirements. Thanks!

pkoutsovasilis · 2024-08-29T07:14:40Z

@pkoutsovasilis Just a general comment: we plan to use this Helm chart to create an AWS Marketplace listing for Elastic Agent. To this end, I believe AWS has some requirements that the Helm chart must fulfill. Just wanted to bring that up in this PR in case it means some things in the chart need to change to meet these requirements. Thanks!

Hi @ycombinator 👋 I have read these requirements before opening this PR and I think that all of them are satisfied:

it support both rootful and rootless deployments (we need the latter for AWS Marketplace)
the Helm chart does not utilise anything from Helm that isn't supported by the EKS addon framework, such Helm hooks, lookup function, etc.
although this is up for configuration but for the AWS Marketplace configuration we can offer deployment that have a Daemonset and thus meet the requirement of Traceable Deployments or Daemonsets

I don't see something that it isn't covered but maybe another extra pair of eyes on this PR would be helpful 🙂

swiatekm

I like all the examples! I was wondering if we could also render the final manifests using the example configurations and keep these in version control. The Otel Helm Charts do something like this and it acts partially as a test, and partially as code review aid, where it lets you visualize how a change to the templates impacts the rendered manifests.

pkoutsovasilis · 2024-08-29T08:08:10Z

I like all the examples! I was wondering if we could also render the final manifests using the example configurations and keep these in version control. The Otel Helm Charts do something like this and it acts partially as a test, and partially as code review aid, where it lets you visualize how a change to the templates impacts the rendered manifests.

oh that's a great idea @swiatekm , ty for bringing it up! 🙂 I say definitely let's add them; Initially I am gonna push them for each example and maybe afterwards we could investigate how to utilise them in a more meaningful way in CI

UPDATE: added two mage targets, one that renders the k8s manifest for each example inside the Helm chart and one that lints the Helm chart. I have also added these two in the check-ci makefile recipe, thus, a change that causes either the Helm chart to fail linting or the rendered examples to be changed without any commit will be captured

blakerouse

Okay. I have a few questions.

I did not review the templates. They are so large and complex that it didn't seem like I should sit there and go through each and every line. I think it would be better to just merge this and start using it and make incremental fixes as issues are identified.

I am good with giving a +1 after and review some of the answers to my questions.

deploy/helm/elastic-agent/Chart.yaml

deploy/helm/elastic-agent/README.md.gotmpl

deploy/helm/elastic-agent/examples/kubernetes-only-logs/agent-kubernetes.yaml

deploy/helm/elastic-agent/values.yaml

blakerouse · 2024-08-30T15:55:18Z

deploy/helm/elastic-agent/values.yaml

+  enabled: true
+  # -- elastic-agent version
+  # @section -- 3 - Elastic-Agent Configuration
+  version: 8.15.0


I assume this needs to get bump every version. We should ensure this is added to the automation for this repo so that it gets bumped when we do new releases.

@pkoutsovasilis are we intentionally not putting this under appVersion in Chart.yaml? Normally that indicates the Chart works with a wide array of application versions.

yes I do that intentionally, because this Helm chart should be able to work with multiple elastic-agent versions. This was at least part of the old requirements, however maybe it is non-relevant any more?! @strawgate any opinion on that?

blakerouse · 2024-08-30T15:55:58Z

deploy/helm/elastic-agent/values.yaml

+  # -- image configuration
+  # @section -- 3 - Elastic-Agent Configuration
+  image:
+    repository: docker.elastic.co/beats/elastic-agent-complete


Why is -complete being used? synthetics? Seems like it would be better to not use that one unless they are wanting to do synthetics.

maybe we want to support that?! if we don't this can be default to something else or maybe switch to wolfi-based images?!

I would not default to -complete its much larger, just for synthetics. I would expect them to adjust the chart to use -complete if they want to run synthetics. Wolfi image could also be the default, no issues there from me.

So I am in a pickle here, for unprivileged agent we need version 8.16.0 and the same goes for wolfi image. Maybe I we should switch to 8.16.0-SNAPSHOT by default? PS: I did fallback to an image without -complete

I think the open question here is how is this Helm chart getting released? Does it get updated out of sync of the Elastic Agent release or does it get released at the same time of the full stack release? If its stack release based then this should really sync with the version of the repository here and when its the main repository it should really be -SNAPSHOT as that is the only released version for running code in actual main. When 8.16 is created, then it should not use -SNAPSHOT.

Just a quick thinking about that; I would like having this chart "following" the agent release to always have the appropriate agent image in it, but also be able to bump helm chart version to fix any agent-unrelated issue, e.g. issues with templating?!

Can we defer the release process question for now? It sounds like it could be its own major discussion, but whatever we decide is not going to require major changes to this Chart, which we're not publishing yet anyway. I think it might be better to merge this PR without answering this question, and do that in a separate issue. WDYT @blakerouse @pkoutsovasilis ?

yep I agree with deferring the release process question for now. That said, I added a helm:updateAgentVersion mage target that I deem it would be useful in this discussion

I agree we should defer. A follow up PR to change this is simple.

deploy/helm/elastic-agent/values.yaml

app.kubernetes.io/version in common labels

…asilis/agent_helm_chart

swiatekm

I have one final comment about names, but otherwise I think we're good to go.

I would still like to move away from named templates as much as possible, and definitely add a schema file for validation, but that can happen in a follow-up.

deploy/helm/elastic-agent/templates/agent/k8s/daemonset.yaml

…helm_chart # Conflicts: # NOTICE.txt # go.mod # go.sum

swiatekm

👍

blakerouse

Looks good, I am ready for this to be merged.

pkoutsovasilis · 2024-09-05T15:04:14Z

Looks good, I am ready for this to be merged.

haha!! this phrase sounded like "release the Kraken"! 😄 ty both @swiatekm and @blakerouse for the comments and the reviews. That said, the agent-extended-testing CI step, which is required, is failing but it is not something caused by this PR?!

mergify · 2024-09-06T01:32:34Z

This pull request is now in conflicts. Could you fix it? 🙏
To fixup this pull request, you can check out it locally. See documentation: https://help.github.com/articles/checking-out-pull-requests-locally/

git fetch upstream
git checkout -b pkoutsovasilis/agent_helm_chart upstream/pkoutsovasilis/agent_helm_chart
git merge upstream/main
git push upstream pkoutsovasilis/agent_helm_chart

blakerouse · 2024-09-06T02:08:09Z

@pkoutsovasilis It is very much that!

The failures are flaky tests that we are working on, also seems there is a conflict now. I would merge main into the PR and fix the conflict and have it re-run to get a green CI run.

…helm_chart # Conflicts: # go.sum

elastic-sonarqube · 2024-09-06T08:23:08Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
No data about Coverage
0.0% Duplication on New Code

See analysis details on SonarQube

ycombinator · 2024-09-09T14:22:47Z

@pkoutsovasilis If we want to release the Helm chart as a beta initially, is there any way we can mention this as part of the chart itself, such that users would clearly see it before they use it or right as they start to use it?

pkoutsovasilis · 2024-09-09T14:43:04Z

@ycombinator just thinking out loud the available options here:

I assume that we can push the helm chart with a beta suffix at the version and update the NOTES.txt accordingly here https://helm.elastic.co/
Somebody that wants to try this helm chart can clone the repo and install the helm chart from the folder deploy/helm/elastic-agent
We could utilise github pages on this repo and commit directly the helm chart package.tgz (that points to the SNAPSHOT agent image) [my least favourite option]

that's why a discussion about the Helm chart release process is wise to have before any next step 🙂

swiatekm · 2024-09-09T15:52:04Z

Helm Charts use semver 2.0 for versioning, so it's simple enough to just designate our releases as betas by setting it to va.b.c-beta.d or something similar. This makes Helm automatically treat it as a prerelease. This will cause them not to show up in searches without the --devel flag, amongst other consequences.

ycombinator · 2024-09-09T23:48:51Z

Thanks @pkoutsovasilis and @swiatekm. I've created #5485 to continue the discussion on the beta marking and to track the implementation as well.

Likewise, I have created #5486 to resume the discussion about the release process that we deferred from this PR.

pkoutsovasilis added enhancement New feature or request backport-skip skip-changelog labels Aug 21, 2024

mergify bot assigned pkoutsovasilis Aug 21, 2024

pierrehilbert added the Team:Elastic-Agent-Control-Plane Label for the Agent Control Plane team label Aug 21, 2024

pkoutsovasilis force-pushed the pkoutsovasilis/agent_helm_chart branch 2 times, most recently from 6f3d557 to 50f534b Compare August 26, 2024 14:26

pkoutsovasilis changed the title ~~[WIP] Add elastic-agent helm-chart~~ [k8s] Add elastic-agent helm-chart Aug 26, 2024

pkoutsovasilis marked this pull request as ready for review August 26, 2024 14:33

pkoutsovasilis requested a review from a team as a code owner August 26, 2024 14:33

pkoutsovasilis requested review from blakerouse and andrzej-stencel August 26, 2024 14:33

pkoutsovasilis force-pushed the pkoutsovasilis/agent_helm_chart branch from 50f534b to 4e7b5b7 Compare August 26, 2024 21:28

pkoutsovasilis added 2 commits August 27, 2024 18:14

feat: add elastic-agent helm-chart

f794e93

feat: update examples root README.md

6f94863

pkoutsovasilis force-pushed the pkoutsovasilis/agent_helm_chart branch from 4e7b5b7 to dc88b1d Compare August 27, 2024 15:16

elastic deleted a comment from mergify bot Aug 27, 2024

feat: add integration tests for the elastic-agent helm chart

e27504e

pkoutsovasilis force-pushed the pkoutsovasilis/agent_helm_chart branch from dc88b1d to e27504e Compare August 28, 2024 06:30

ycombinator requested review from swiatekm and removed request for andrzej-stencel August 28, 2024 22:23

swiatekm reviewed Aug 29, 2024

View reviewed changes

blakerouse reviewed Aug 30, 2024

View reviewed changes

pkoutsovasilis added 6 commits September 2, 2024 16:19

fix: add

d9b849c

app.kubernetes.io/version in common labels

fix: change cluster role and cluster role binding naming pattern

f7e33d3

fix: revisit cluster role and cluster role binding naming pattern

10fd1a5

fix: change to elastic-agent image that doesn't contain synthetics

3a3fd4b

fix: typo in eck example README.md

b1ef152

Merge remote-tracking branch 'refs/remotes/origin/main' into pkoutsov…

75a47c3

…asilis/agent_helm_chart

swiatekm reviewed Sep 4, 2024

View reviewed changes

deploy/helm/elastic-agent/templates/agent/k8s/daemonset.yaml Outdated Show resolved Hide resolved

pkoutsovasilis added 4 commits September 5, 2024 05:34

fix: include helm release name in k8s objects

67a9556

feat: add helm:updateAgentVersion mage target

36cae72

Merge remote-tracking branch 'origin/main' into pkoutsovasilis/agent_…

f3f6db9

…helm_chart # Conflicts: # NOTICE.txt # go.mod # go.sum

fix: resolve merge conflicts

882b6cf

pkoutsovasilis force-pushed the pkoutsovasilis/agent_helm_chart branch from 673385b to 882b6cf Compare September 5, 2024 03:19

elastic deleted a comment from mergify bot Sep 5, 2024

swiatekm approved these changes Sep 5, 2024

View reviewed changes

blakerouse approved these changes Sep 5, 2024

View reviewed changes

pkoutsovasilis added 2 commits September 6, 2024 10:45

Merge remote-tracking branch 'origin/main' into pkoutsovasilis/agent_…

19138ea

…helm_chart # Conflicts: # go.sum

fix: resolve merge conflicts

3fc84bc

pkoutsovasilis merged commit 189ec2b into elastic:main Sep 6, 2024
13 checks passed

ycombinator mentioned this pull request Sep 6, 2024

[REQUEST]: Document how to deploy Elastic Agent using Helm elastic/ingest-docs#1304

Closed

This was referenced Sep 9, 2024

Mark the Elastic Agent Helm Chart as beta #5485

Closed

[Discuss] Elastic Agent Helm Chart release process #5486

Open

kilfoyle mentioned this pull request Nov 5, 2024

Add Fleet & Agent 8.16.0 Release Notes elastic/ingest-docs#1412

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[k8s] Add elastic-agent helm-chart #5331

[k8s] Add elastic-agent helm-chart #5331

pkoutsovasilis commented Aug 21, 2024 •

edited

Loading

elasticmachine commented Aug 26, 2024

jlind23 commented Aug 28, 2024

ycombinator commented Aug 28, 2024

pkoutsovasilis commented Aug 29, 2024

swiatekm left a comment

pkoutsovasilis commented Aug 29, 2024 •

edited

Loading

blakerouse left a comment

blakerouse Aug 30, 2024

swiatekm Sep 2, 2024

pkoutsovasilis Sep 2, 2024 •

edited

Loading

blakerouse Aug 30, 2024

pkoutsovasilis Aug 30, 2024

blakerouse Sep 3, 2024

pkoutsovasilis Sep 3, 2024

blakerouse Sep 3, 2024

pkoutsovasilis Sep 3, 2024

swiatekm Sep 4, 2024

pkoutsovasilis Sep 5, 2024

blakerouse Sep 5, 2024

swiatekm left a comment

swiatekm left a comment

blakerouse left a comment

pkoutsovasilis commented Sep 5, 2024

mergify bot commented Sep 6, 2024

blakerouse commented Sep 6, 2024

elastic-sonarqube bot commented Sep 6, 2024

ycombinator commented Sep 9, 2024

pkoutsovasilis commented Sep 9, 2024

swiatekm commented Sep 9, 2024 •

edited

Loading

ycombinator commented Sep 9, 2024

[k8s] Add elastic-agent helm-chart #5331

[k8s] Add elastic-agent helm-chart #5331

Conversation

pkoutsovasilis commented Aug 21, 2024 • edited Loading

What does this PR do?

Why is it important?

Checklist

Disruptive User Impact

How to test this PR locally

Related issues

elasticmachine commented Aug 26, 2024

jlind23 commented Aug 28, 2024

ycombinator commented Aug 28, 2024

pkoutsovasilis commented Aug 29, 2024

swiatekm left a comment

Choose a reason for hiding this comment

pkoutsovasilis commented Aug 29, 2024 • edited Loading

blakerouse left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pkoutsovasilis Sep 2, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

swiatekm left a comment

Choose a reason for hiding this comment

swiatekm left a comment

Choose a reason for hiding this comment

blakerouse left a comment

Choose a reason for hiding this comment

pkoutsovasilis commented Sep 5, 2024

mergify bot commented Sep 6, 2024

blakerouse commented Sep 6, 2024

elastic-sonarqube bot commented Sep 6, 2024

Quality Gate passed

ycombinator commented Sep 9, 2024

pkoutsovasilis commented Sep 9, 2024

swiatekm commented Sep 9, 2024 • edited Loading

ycombinator commented Sep 9, 2024

pkoutsovasilis commented Aug 21, 2024 •

edited

Loading

pkoutsovasilis commented Aug 29, 2024 •

edited

Loading

pkoutsovasilis Sep 2, 2024 •

edited

Loading

swiatekm commented Sep 9, 2024 •

edited

Loading