Skip to content

Latest commit

 

History

History
347 lines (235 loc) · 14.5 KB

CONTRIBUTING.md

File metadata and controls

347 lines (235 loc) · 14.5 KB

Contributing

This page is intended for contributors to the registry and packages.

Definitions

Package

A package contains the dashboards, visualisations, and configurations to monitor the logs and metrics of a particular technology or group of related services, such as “MySQL”, or “System”.

The package consists of:

  • Name
  • Zero or more dashboards and visualisations and Canvas workpads
  • Zero or more ML job definitions
  • Zero or more dataset templates

The package is versioned.

Integration

An integration is a specific type of a package defining datasets used to observe the same product (logs and metrics).

Dataset Template

A dataset template is part of a package and contains all the assets which are needed to create a dataset. Example for assets are: ingest pipeline, agent config template, ES index template, ...

Dataset templates are inside the package directory under dataset.

The dataset template consists of:

  • An alias templates (or the fields.yml to create it)
  • Zero or more ingest pipelines
  • An Elastic Agent config template

Migration from Beats

A defined importing procedure used to transform both Filebeat and Metricbeat modules, related to the same observed product, into a single integration. The integration contains extracted dataset configuration of beat modules, hence no modules are required to exist anymore.

Package structure

Elements

Link: https://github.com/elastic/package-registry/blob/master/ASSETS.md

Reference packages

The following packages can be considered as reference points for all integrations.

Integration: reference

Link: https://github.com/elastic/package-registry/tree/master/dev/packages/example/reference-1.0.0

The directory contains mandatory manifest files defining the integration and its datasets. All manifests have fields annotated with comments to better understand their goals.

Keep in mind that this package doesn't contain all file resources (images, screenshots, icons) referenced in manifests. Let's assume that they're also there.

Integration: mysql

Link: https://github.com/mtojek/package-registry/tree/package-mysql-0.0.2/dev/packages/alpha/mysql-0.0.2

The MySQL integration was the first integration built using the import-beats script. The script imported filesets and metricsets from both MySQL modules, and converted them to a package.

The MySQL integration contains all parts that should be present (or are required) in the integration package.

After using the import-beats script, the integration has been manually adjusted and extended with dedicated docs.

Create a new integration

This section describes steps required to build a new integration. If you plan to prepare the integration with a product unsupported by Beats, feel free to skip the section about importing existing modules.

Import from existing modules

The import procedure heavily uses on the import-beats script. If you are interested how does it work internally, feel free to review the script's README.

  1. Create an issue in the package-registry to track ongoing progress with the integration (especially manual changes).

    Focus on the one particular product (e.g. MySQL, ActiveMQ) you would like to integrate with. Use this issue to mention every manual change that has been applied. It will help in adjusting the import-beats script and reviewing the integration.

  2. Prepare the developer environment:

    1. Clone/refresh the following repositories:

      Make sure you don't have any manual changes applied as they will reflect on the integration.

    2. Clone/refresh the Elastic Package Registry (EPR) to always use the latest version of the script:

    3. Make sure you've the mage tool installed:

      $ go get -u -d github.com/magefile/mage
  3. Boot up required dependencies:

    1. Elasticseach instance:
      • Kibana's dependency
    2. Kibana instance:
      • used to migrate dashboards, if not available, you can skip the generation (SKIP_KIBANA=true)

    Hint. There is dockerized environment in beats (cd testing/environments). Boot it up with the following command: docker-compose -f snapshot.yml -f local.yml up --force-recreate elasticsearch kibana.

  4. Create a new branch for the integration in package-registry repository (diverge from master).

  5. Run the command: mage ImportBeats to start the import process.

    The outcome of running the import-beats script is directory with refreshed and updated integrations.

    It will take a while to finish, but the console output should be updated frequently to track the progress. The command must end up with the exit code 0. Kindly please to open an issue if it doesn't.

    Generated packages are stored by default in the dev/packages/beats directory. Generally, the import process updates all of the integrations, so don't be surprised if you notice updates to multiple integrations, including the one you're currently working on (e.g. dev/packages/beats/foobarbaz-0.0.1). You can either commit this changes or leave them for later.

    If you want to select a subgroup of packages, set the environment variable PACKAGES (comma-delimited list):

    $ PACKAGES=aws,cisco mage ImportBeats
  6. Copy the package output for your integration (e.g. dev/packages/beats/foobarbaz-0.0.1) to the alpha directory and raise the version manually: dev/packages/alpha/foobarbaz-0.0.2.

Fine-tune the integration

Motivation

Most of migration work has been done by the import-beats script, but there're tasks that require developer's interaction.

It may happen that your integration misses a screenshot or an icon, it's a good moment to add missing resources to Beats/Kibana repositories and re-import the integration (idempotent).

Checklist

The order of action items on the checklist is advised to prevent the contributor from repeating some actions (fixing what's been already fixed, as the script has overridden part of it).

  1. Add icon if missing.

    The integration icons are presented in different places in Kibana, hence it's better to define custom icons to make the UI easier to navigate.

    As the import-beats script looks for icons in Kibana and EUI repositories, add an icon to the first one the same way as for tutorial resources (Kibana directory: src/legacy/core_plugins/kibana/public/home/tutorial_resources/logos/).

  2. Add screenshot if missing.

    The Kibana Integration Manager shows screenshots related with the integration. Screenshots present Kibana dashboards visualizing the metric/log data.

    The import-beats script finds references to screenshots mentioned in _meta/docs.asciidoc and copies image files from the Beats directories:

    • metricbeat/docs/images
    • filebeat/docs/images
  3. Improve/correct spelling product names.

    The correct spelling of product names simply makes better impression. The import-beats scripts uses the fields.yml file as the source of the correct spelling (title property), e.g. Mysql - MySQL, Nginx - NGINX, Aws - AWS.

    Keep in mind that this step requires reimporting package contents.

  4. Write README template file for the integration.

    The README template is used to render the final README file including exported fields. The template should be placed in the dev/import-beats-resources/<integration-name>/docs/README.md.

    Review the MySQL docs template to see how to use template functions (e.g. {{fields "dataset-name"}}). If the same dataset name is used in both metrics and logs, please add -metrics and -logs in the template. For example, elb is a dataset for log and also a dataset for metrics. In README.md template, {{fields "elb-logs"}} and {{fields "elb-metrics"}} are used to separate them.

  5. Review fields file and exported fields in docs.

    The goal of this action item is to verify if produced artifacts are correct.

    The fields files (package-fields.yml, fields.yml and ecs.yml) in the package were created from original fields.yml files (that may contain ECS schema fields) and fields.epr.yml (defining some other fields used in the ingest pipeline). It may happen that original sources have a typo, bad description or misses a field definition. The sum of fields in all present files should contain only fields that are really used, e.g. not all existing ECS fields.

    It may happen that the ingest pipeline uses fields abstracted from ECS, but not mentioned in fields.yml. Integrations should contain these fields and also have them documented.

    See the PR elastic/beats#17895 to understand how to add them to Beats (e.g. event.code, event.provider) using the fields.epr.yml file.

  6. Metricbeat: add missing configuration options.

    The import-beats script extracts configuration options from Metricbeat module's _meta directory. It analyzes the configuration files and selects options based on enabled metricsets (not commented). If you notice that some configuration options are missing in your package's manifest files, simply create the config.epr.yml file with all required options.

    Sample PR: elastic/beats#17323

  7. Review titles and descriptions in manifest files.

    Titles and descriptions are fields visualized in the Kibana UI. Most users will use them to see how to configure the integration with their installation of a product or to how to use advanced configuration options.

  8. Compact configuration options (vars).

    Currently, all configuration options are set by the import-beats script on the stream level (path: dataset/<dataset-name>/manifest.yml).

    It may happen that some of them in different datasets are simply duplicates or concern the same setting, which will be always equal (e.g. MySQL username, password). Keep in mind that two datasets may have the same configuration option, but different values (e.g. period, paths), hence can't be compacted.

    To sum up, compacting takes down from the user the necessity to setup the same configuration option few times (one per dataset).

  9. Define all variable properties.

    The variable properties customize visualization of configuration options in the Kibana UI. Make sure they're defined in all manifest files.

    vars:
      - name: paths
        required: true
        show_user: true
        title: Access log paths
        description: Paths to the nginx access log file.
        type: text
        multi: true
        default:
          - /var/log/nginx/access.log*

required - option is required

show_user - don't hide the configuration option (collapsed menu)

title - human readable variable name

description - variable description (may contain some details)

type - field type (according to the reference: text, password, bool, integer)

multi - the field has mutliple values.

  1. Review stream configuration.

    Due to changed templating engine from a standard Golang one to handlebars, it may be hard to automatically convert the Filebeat input configuration (nested variables, many representations, conditions, loops). Kindly please to review the output stream configuration and review potential bugs.

  2. Update docs template with sample events.

    The events collected by the agent slightly differ from original, Metricbeat's and Filebeat's, ones. Adjust the event content manually basing on already migrated integrations (e.g. MySQL integration) or copy them once managed to run whole setup with real agent.

  3. Kibana: use stream.dataset field instead of event.dataset.

    Using stream.dataset instead of event.dataset also makes queries a lot more efficient as this is a constant_keyword. Make sure that dashboards in your package don't use the event.dataset field. If so, simply replace them with the more efficient one.

Testing and validation

Run the whole setup

  1. Build docker image with EPR:

    $ docker build --rm -t docker.elastic.co/package-registry/package-registry:master .
  2. Start testing environment:

    $ cd testing/environments
    $ docker-compose -f snapshot.yml -f local.yml up

    The command will boot up a docker cluster with Elasticsearch, Kibana and Package Registry.

  3. Verify that your integration is available (in the right version), e.g. MySQL: http://localhost:8080/search?package=mysql

    [
      {
        "description": "MySQL Integration",
        "download": "/epr/mysql/mysql-0.0.1.tar.gz",
        "icons": [
          {
            "src": "/package/mysql/0.0.1/img/logo_mysql.svg",
            "title": "logo mysql",
            "size": "32x32",
            "type": "image/svg+xml"
          }
        ],
        "name": "mysql",
        "path": "/package/mysql/0.0.1",
        "title": "MySQL",
        "type": "integration",
        "version": "0.0.1"
      }
    ]
  4. Build agent code:

    $ cd $GOPATH/src/github.com/elastic/beats/x-pack/elastic-agent
    $ PLATFORMS=darwin mage package

    Unpack the distribution you'd like to use (e.g. tar.gz):

    $ cd build/distributions/
    $ tar xzf elastic-agent-8.0.0-darwin-x86_64.tar.gz
    $ cd elastic-agent-8.0.0-darwin-x86_64/
  5. Enroll the agent and start it:

    Use the "Enroll new agent" option in the Kibana UI (Ingest Manager -> Fleet -> Create user and enable Fleet) and run a similar command:

    $ ./elastic-agent enroll http://localhost:5601/rel cFhNVlZIRUIxYjhmbFhqNTBoS2o6OUhMWkF4SFJRZmFNZTh3QmtvR1cxZw==
    $ ./elastic-agent run

    The elastic-agent will start two other processes - metricbeat and filebeat.

  6. Run the product you're integrating with (e.g. a docker image with MySQL).

  7. Install package.

    Click out the configuration in the Kibana UI, deploy it and wait for the agent to pick out the updated configuration.

  8. Navigate with Kibana UI to freshly installed dashboards, verify the metrics/logs flow.