[0.9.2] Query with both raw and aggregates should throw error #3407

otoolep · 2015-07-21T02:22:38Z

Not sure if this expected, but a combination raw and aggregate query doesn't work as one might expect. I checked 0.9.1 (pre-DQ work) and it also suffers from the same problem.

#!/bin/bash

curl -G http://localhost:8086/query --data-urlencode "q=CREATE DATABASE db"

# Point's date is 01 Jul 2010 18:47:02 GMT
curl -d '
{
    "database": "db",
    "retentionPolicy": "default",
    "points": [
        {
            "time": 1278010022,
            "precision": "s",
            "measurement": "cpu",
            "fields":{
                "value": 64
            }
        }
    ]
}
' -H "Content-Type: application/json" http://localhost:8086/write
sleep 1

curl -G 'http://localhost:8086/query?db=db&pretty=true' --data-urlencode 'q=SELECT value,mean(value) FROM cpu'

Results:

{"results":[{}]}{
    "results": [
        {
            "series": [
                {
                    "name": "cpu",
                    "columns": [
                        "time",
                        "value",
                        "mean"
                    ],
                    "values": [
                        [
                            "1970-01-01T00:00:00Z",
                            64
                        ]
                    ]
                }
            ]
        }
    ]
}

The text was updated successfully, but these errors were encountered:

pauldix · 2015-08-18T18:47:45Z

@otoolep not sure this is still an issue. The query that you entered should actually return an error because it's invalid. Can you verify this is still a problem? I think we have it updated to return a query error now.

beckettsean · 2015-08-18T18:48:02Z

In 0.9.2 the behavior is weirder. The aggregate comes back as the only value and is assigned to the first column in the output. The other column is empty.

> insert agg value=12
> insert agg value=36
> select value, mean(value) from agg
name: agg
---------
time            value   mean
1970-01-01T00:00:00Z    24

beckettsean · 2015-08-18T18:50:53Z

Aggregation functions are not valid with direct value selections. Selector functions (mix, max, etc.) that return a single point are valid for some queries and should be allowed. E.g. SELECT max(value), value2 FROM ...

pauldix · 2015-08-18T18:55:14Z

The following combinations are invalid:

select field2, mean(field1) ...
select field2, sum(field1) ...
select field2, count(field1) ...
select field2, percentile(field1, 90) ...
select field2, spread(field1) ...
select field2, stddev(field1) ...
select field2, median(field1) ...
select field2, distinct(field1) ...
select field2, derivative(field1) ...

The other aggregates select a single data point, so it's valid to select another field or tag along with those functions. These include min, max, first, and last.

beckettsean · 2015-08-18T22:10:18Z

@pauldix what about TOP and BOTTOM? I would think those can return more than one value so should be invalid.

beckettsean · 2015-08-18T22:10:52Z

Verified partially fixed in 0.9.3 nightly:

Connected to http://localhost:8086 version 0.9.3-nightly-1548f62
InfluxDB shell 0.9.3-nightly-1548f62
> select mean(value), value from thing
ERR: error parsing query: mixing aggregate and non-aggregate queries is not supported

otoolep · 2015-08-18T22:15:04Z

@beckettsean -- which part is not fixed?

beckettsean · 2015-08-18T22:23:39Z

@pauldix's comment implies that for MIN, MAX, FIRST, and LAST the error should not be thrown, but it is:

> select max(value) from thing
name: thing
-----------
time            max
1970-01-01T00:00:00Z    60

> select max(value), value from thing
ERR: error parsing query: mixing aggregate and non-aggregate queries is not supported

beckettsean · 2015-08-18T22:23:59Z

@otoolep ^^

otoolep · 2015-08-18T22:41:26Z

This is not an trivial fix to our code. A query with max is considered an aggregate statement, and there is a hard difference between aggregate queries and "raw" queries.

Supporting the mixed statements is a significant change to our code. This restriction has been in place since 0.9.0.

beckettsean · 2015-08-18T22:45:52Z

@otoolep I'm all for making it throw a parser error for all functions, aggregations or not. @pauldix can we do that for now and only revisit the special cases if/when there's a community need?

pauldix · 2015-08-19T15:25:02Z

@beckettsean TOP and BOTTOM are both valid. There are two different cases:

select top(value, 5), host from cpu where time > now() - 1h

The reason that is valid is because it will return 5 data points per group by interval (in this case the entire time range we're looking at), and each of those maps to a single point.

This query is also valid:

select top(mean(value), 5), host from cpu where time > now() -1h

What this query should do is compute the mean for each host in the interval, then get the top 5 means and output those.

Those features aren't wired up yet, so it's probably best to return an error for all functions that have an aggregate and a single field or tag along with them for now. I'll update the TOP, BOTTOM, and select with aggregate issues to references this one.

peterbollen · 2016-02-03T12:04:12Z

What about this query?

select count(amount), type from order where time <= now() and time >= (now() - 2h) and type = 'x' group by time(1h)

I would to copy the aggregation of points of type x into another RP.

Atm I get: ERR: error parsing query: mixing aggregate and non-aggregate queries is not supported

Version: 2016/02/03 09:41:58 InfluxDB starting, version 0.9.6.1, branch 0.9.6, commit 6d3a860, built unknown

pauldix · 2016-02-03T14:11:46Z

@peterbollen that's invalid since count doesn't map to a single data point. If you want the count per different type include type in your group by clause.

jsternberg · 2016-04-06T19:50:14Z

This works in the new query engine so I'm going to close it. Please reopen if this is still an issue.

beckettsean · 2016-04-07T00:57:08Z

Verified fixed in 0.11, at least in some cases:

> select max(usage_idle), usage_irq from cpu where time > now() - 10s
name: cpu
---------
time            max         usage_irq
1459990595333165705 99.59919839641957   0

> select mean(usage_idle), usage_irq from cpu where time > now() - 10s
ERR: error parsing query: mixing aggregate and non-aggregate queries is not supported

jpuigsegur · 2016-05-03T22:02:54Z

@beckettsean it's great to be able to retrieve values and tags associated with a max or min value. Not usually easy to do with a database. However, my question is: Is it possible to do the same with the timestamp?

For example:

> insert meteo,sensor=1 temperature=10,humidity=50
> insert meteo,sensor=1 temperature=11,humidity=50
> insert meteo,sensor=1 temperature=9,humidity=49
> insert meteo,sensor=1 temperature=12,humidity=47
> select min(temperature), humidity, sensor from meteo where time > now() - 10m group by time(10m)
name: meteo
-----------
time            min humidity    sensor
1462311600000000000             
1462312200000000000 9   47      1

> select min(temperature), humidity, sensor, time from meteo where time > now() - 10m group by time(10m)
name: meteo
-----------
time            min humidity    sensor
1462311600000000000             
1462312200000000000 9   47      1

Is there any way to obtain the timestamp of the measure corresponding to the min value?

beckettsean · 2016-05-03T22:18:19Z

If there is a GROUP BY clause in the query, the returned timestamps will always be a GROUP BY interval boundary, not the actual timestamp of the point. This is the current implementation, and we hope to return the full point even with GROUP BY clauses, but it will require substantial effort and likely won't be a 1.0 feature.

See #5890 and #6510 for more context.

michapr · 2016-05-27T12:51:21Z

select filename, sum(number) from products group by filename limit 100

I get "...mixing aggregate and non-aggregate queries is not supported"
Using 0.14 - where can be the problem?

beckettsean · 2016-05-27T20:51:03Z

@michapr that is intended behavior. You should not repeat the GROUP BY tag in the SELECT clause. Just issue select sum(number) from products group by filename limit 100. The query response will include the filename, as that's the GROUP BY property.

houming818 · 2017-03-20T02:52:55Z

@beckettsean I try to select the mean on a field value, I used the group by on some tags. how can I get tags information in return query set? I only get the mean column in return query set.

otoolep added the area/queries label Jul 21, 2015

beckettsean added the area/error handling label Aug 18, 2015

beckettsean changed the title ~~Query with both raw and aggregates -- questionable output~~ [0.9.2] Query with both raw and aggregates should throw error Aug 18, 2015

beckettsean added this to the 0.9.4 milestone Aug 18, 2015

beckettsean mentioned this issue Aug 19, 2015

[0.9.1] panic caused by continuous queries #3284

Closed

This was referenced Aug 19, 2015

Wire up BOTTOM aggregate #1820

Closed

Wire up TOP aggregate #1821

Closed

[feature request] selectors (e.g. min, max, first, last) should have equivalents to return the actual point #1577

Closed

jsternberg removed this from the 0.9.4 milestone Apr 6, 2016

jsternberg closed this as completed Apr 6, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[0.9.2] Query with both raw and aggregates should throw error #3407

[0.9.2] Query with both raw and aggregates should throw error #3407

otoolep commented Jul 21, 2015

pauldix commented Aug 18, 2015

beckettsean commented Aug 18, 2015

beckettsean commented Aug 18, 2015

pauldix commented Aug 18, 2015

beckettsean commented Aug 18, 2015

beckettsean commented Aug 18, 2015

otoolep commented Aug 18, 2015

beckettsean commented Aug 18, 2015

beckettsean commented Aug 18, 2015

otoolep commented Aug 18, 2015

beckettsean commented Aug 18, 2015

pauldix commented Aug 19, 2015

peterbollen commented Feb 3, 2016

pauldix commented Feb 3, 2016

jsternberg commented Apr 6, 2016

beckettsean commented Apr 7, 2016

jpuigsegur commented May 3, 2016 •

edited

Loading

beckettsean commented May 3, 2016

michapr commented May 27, 2016

beckettsean commented May 27, 2016

houming818 commented Mar 20, 2017

[0.9.2] Query with both raw and aggregates should throw error #3407

[0.9.2] Query with both raw and aggregates should throw error #3407

Comments

otoolep commented Jul 21, 2015

pauldix commented Aug 18, 2015

beckettsean commented Aug 18, 2015

beckettsean commented Aug 18, 2015

pauldix commented Aug 18, 2015

beckettsean commented Aug 18, 2015

beckettsean commented Aug 18, 2015

otoolep commented Aug 18, 2015

beckettsean commented Aug 18, 2015

beckettsean commented Aug 18, 2015

otoolep commented Aug 18, 2015

beckettsean commented Aug 18, 2015

pauldix commented Aug 19, 2015

peterbollen commented Feb 3, 2016

pauldix commented Feb 3, 2016

jsternberg commented Apr 6, 2016

beckettsean commented Apr 7, 2016

jpuigsegur commented May 3, 2016 • edited Loading

beckettsean commented May 3, 2016

michapr commented May 27, 2016

beckettsean commented May 27, 2016

houming818 commented Mar 20, 2017

jpuigsegur commented May 3, 2016 •

edited

Loading