Consistent use of timestamps in API #522

candlerb · 2019-04-26T16:56:54Z

Is your feature request related to a problem? Please describe.

The Loki API appears inconsistent in how it handles timestamps:

start and end query offsets are numeric, nanosecond-based epoch times
"ts" values for logs being pushed in or queried are string ISO8601 format (with nanosecond resolution)

This makes the API uncomfortable to use, and may require unnecessary levels of processing. Examples:

loki may return an ISO8601 UTC timestamp, but if the browser wants to display it in local time, it will have to reparse it before reconstructing local time. (Or vice versa - the API doesn't define whether it returns timestamps in UTC or the host system local timezone)
If you are reading a batch of logs, and the call terminates due to limit being reached, then calculating the next start or end time involves having to parse the last log's timestamp.

(Iteration might be better handled with an opaque "next event" token, but that's a separate issue)
Converting ISO8601 to nanosecond start and end values accurately in Javascript is problematic. The built-in Date() function only handles integral numbers of milliseconds; and even if it used floating point, the 53-bit mantissa would limit resolution to slightly better than a microsecond.

I think you have to do the following: pad the string timestamp to ensure it has 9 digits after the period; trim off the last 6 digits; parse the remainder; convert integer milliseconds to string; and then stick the last 6 digits back on again. Yuk.

Describe the solution you'd like

I think that log timestamps should be be milliseconds from epoch everywhere. These are easy to use, native to Javascript, and perfectly accurate enough for system logs (more accurate than most machines' clock sync anyway).

As far as I can tell, cortex and loki use epoch timestamps internally. e.g. I can see the "from" and "to" timestamps in base64-encoded chunk filenames are millisecond-based epoch times:

fake/7a257c9eb6a62090:169f316b816:169f316b862:7d7324d2

Time.at(0x169f316b816 / 1000.0) = 2019-04-06 14:39:06 +0000

Also, the Loki design appears to be loosely based on AWS Cloudwatch Logs, which uses millisecond epoch timestamps for both queries and responses.

It's a breaking change, but loki is pre-beta.

Describe alternatives you've considered

Having millisecond timestamps does make it more likely that successive log lines will have identical timestamps, but that has always been a possibility.

To eliminate that problem, you could define that loki timestamps have two parts: a millisecond timestamp plus a sequence number which resets to zero on every new millisecond. That happens to be exactly how redis streams generates its message IDs. It makes iterating over logs trouble-free, and they are trivial to process: e.g.
```
> "1556295624536-17".split("-")
```
If it's considered important to retain nanosecond resolution, then "ts" values could be nanoseconds from epoch (as numbers). These can be processed on systems which have 64-bit integers; however, Javascript will only achieve slightly better than microsecond accuracy, due to the 53-bit mantissa. JSON parsers in other languages which treat numbers as floating point will be similarly affected.
Return "ts" values as numeric nanoseconds, but in JSON decimal strings rather than numbers. This makes it moderately easy to split off the last 6 digits, thus giving millisecond time plus sub-millisecond nanoseconds (0-999999), which can be reassembled later:
```
> x="1556295624123456789"
> [Number(x.slice(0,x.length-6)), Number(x.slice(x.length-6))]
```
Seems messy to me, but is similar work to option 1 (millisecond plus sequence).
Keep "ts" as ISO8601, but change start and end query parameters to be ISO8601 strings as well for consistency. This makes it easier when querying logs in batches, as you don't need to parse the time to create the start or end time of the next batch (which as mentioned above, is hard to do in Javascript as it has insufficient numeric accuracy).

However it still requires lots of parsing and conversion both in Loki and in its clients, between ISO and epoch formats, and the sub-milliseconds still have to be handled separately.

Additional context
AWS GetLogEvents API

The text was updated successfully, but these errors were encountered:

cyriltovena · 2019-06-17T15:57:49Z

we took another approach and we allow to send ISO query string for start and end does that work for you ? if not please reopen.

candlerb · 2019-06-17T17:27:26Z

For me, it's not as good as the AWS Cloudwatch approach (ms timestamp); but at least it's consistent so that's acceptable.

candlerb · 2019-07-09T08:52:45Z

Relates to: #656, #597

cyriltovena closed this as completed Jun 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consistent use of timestamps in API #522

Consistent use of timestamps in API #522

candlerb commented Apr 26, 2019

cyriltovena commented Jun 17, 2019

candlerb commented Jun 17, 2019

candlerb commented Jul 9, 2019

Consistent use of timestamps in API #522

Consistent use of timestamps in API #522

Comments

candlerb commented Apr 26, 2019

cyriltovena commented Jun 17, 2019

candlerb commented Jun 17, 2019

candlerb commented Jul 9, 2019