Skip to content
Max Irwin edited this page Apr 10, 2019 · 11 revisions

The RRE Maven Plugin is the component which is in charge of executing the RRE evaluation process during a Maven build cycle. The following RRE Maven plugin are available:

  • io.sease:rre-maven-solr-plugin:<version>: for Apache Solr
  • io.sease:rre-maven-elasticsearch-plugin:<version>: for Elasticsearch

You should replace the version placeholder with the platform version you want to use. We choose to go with the same versions of the target platforms so the association between the product and the plugin is easy to remember.
We are constantly updating the supported versions for both platform, but it's possible that your specific version couldn't yet in.

The following snippet provides an example of the (Solr in this case) plugin configuration.

<plugin>
    <groupId>io.sease</groupId>
    <artifactId>rre-maven-solr-plugin</artifactId>
    <version>7.1.0</version>
    <configuration>
        <configurations-folder>${basedir}/src/solr/configuration_sets</configurations-folder>
        <corpora-folder>${basedir}/src/solr/corpora</corpora-folder>
        <ratings-folder>${basedir}/src/solr/ratings</ratings-folder>
        <templates-folder>${basedir}/src/solr/templates</templates-folder>
        <fields>id,product_name</fields>
        <include>
            <param>v1.[0|1]</param>
            <param>v1.10</param>
        </include>
        <exclude>
            <param>v2.*</param>
        </exclude> 
        <metrics>
            <param>io.sease.rre.core.domain.metrics.impl.Precision</param>
            <param>io.sease.rre.core.domain.metrics.impl.Recall</param>
            <param>io.sease.rre.core.domain.metrics.impl.ReciprocalRank</param>
            <param>io.sease.rre.core.domain.metrics.impl.AveragePrecision</param>
            <param>io.sease.rre.core.domain.metrics.impl.NDCGAtTen</param>
            <param>io.sease.rre.core.domain.metrics.impl.PrecisionAtOne</param>
            <param>io.sease.rre.core.domain.metrics.impl.PrecisionAtTwo</param>
            <param>io.sease.rre.core.domain.metrics.impl.PrecisionAtThree</param>
            <param>io.sease.rre.core.domain.metrics.impl.PrecisionAtTen</param>
        </metrics>
    </configuration>
    <executions>
        <execution>
            <phase>package</phase>
            <goals>
                <goal>evaluate</goal>
            </goals>
        </execution>
    </executions>
</plugin>

Where:

  • configurations-folder: the folder that contains the configuration sets. The content, as described here depends on the target search platform.
  • corpora-folder: the folder that contains the datasets. Each datasets must have the format required by the target search platform. See here for more details.
  • corpora-folder: the folder that contains one or more ratings/judgements set. See here for more details.
  • templates-folder: the folder that contains the query templates.
  • fields: RRE collects, for each query execution, the top 10 results. This parameter declares the stored fields that will be retained, for each result.
  • include/exclude: optional constraints which declare the versions which will be part of the evaluation process. Values can be literals or regex. In the example above, we want to include version v1.0,v1.1,v1.10 and we want to exclude all versions 2.*
  • metrics: a list of metrics that will be computed in the evaluation process. Each metric is declared using its FQDN. See here for an updated list of all available metrics.

As you can see from the last part, you can bind the evaluation to a specific phase (package in the example above) or you can run it isolately with the following command:

> mvn rre:evaluate

Each concrete plugin implementation (e.g. ApacheSolr or Elasticsearch at the moment) can add specific configuration parameters. The following sections list those additional settings.

Elasticsearch

The Elasticsearch plugin binding allows an additional "plugins" parameter which is a list of custom plugins we want to load as part of the embedded instance. This is useful when the project contains some customisations (i.e. a plugin) in the search workflow we want to deploy in the Elasticsearch cluster.

<plugins>
    <param>com.yourcompany.project.PluginXYZ</param>
    <param>com.othercompany.es.plugins.PluginHJQ</param>
    ...
</plugins>   

Parameterized Metrics

Some metrics are parameterized for configurable K values. For example, the below parameterizedMetric f1At1 will evaluate F-Measure with the parameters beta=1 and k=1, and store the output value in the field 'f1At1', which matches the tag name:

        <metrics>
            <param>io.sease.rre.core.domain.metrics.impl.Precision</param>
            <param>io.sease.rre.core.domain.metrics.impl.Recall</param>
        </metrics>
        <parameterizedMetrics>
            <f1At1>
                <class>io.sease.rre.core.domain.metrics.impl.FMeasureAtK</class>
                <beta>1</beta>
                <k>1</k>
            </f1At1>
        </parameterizedMetrics>