Ability to cache more than 1000 results for a scheduled query #25068

aysiu · 2024-12-31T19:00:00Z

Problem

There isn't a clear path forward for reporting meaningfully from FleetDM for a large fleet if the cached results of a scheduled query are capped at 1,000 records. Log destinations are just a dump for data and don't automatically reconcile back to FleetDM, so a lot of decommissioned or lost/stolen devices will still be in the log destination results. The 1,000-record cap seems arbitrary. If it's SaaS, you can charge more for more storage / database indexing, but if it's self-hosted, the limit should be configurable or removable altogether.

What have you tried?

I've tried log destinations. It's just a mess of data, especially with snapshots. I've also tried using the API, but the API still leverages the cache that's maxed out at 1,000 records.

Potential solutions

Keep 1,000 as the default, but allow the limit to be raised or removed altogether (no limit), so people using FleetDM with larger fleets can get meaningful reporting from scheduled queries. Or, if a configurable option is too hard to add, just remove the limit completely.

What is the expected workflow as a result of your proposal?

Use the API to report on custom queries for the entire fleet or entire segments (separated by OS) of the fleet. If you have 2,000 or 5,000 or 10,000 or 50,000 devices, getting 1,000 results back isn't super helpful for reporting purposes.

iansltx · 2025-01-01T19:39:41Z

Hi @aysiu, your issue appears to be the same as #19600, which we made configurable via query_report_cap in server configuration in v4.53.0. Please confirm whether that's what you're looking for (you can't remove the limit entirely, but you can raise it to an arbitrarily large number); thanks!

iansltx · 2025-01-01T19:47:25Z

If the above is the fix, I understand why you missed this issue; the most obvious place where query reporting is documented doesn't mention that you can configure the result cap upward. I'll PR in a docs update for that today.

Noticed this hole in #25068. Fingers crossed the wording here matches what folks will search when they need to bump the cap. Also added query data discard config instructions for the UI, and moved how-to-disable instructions to the bottom of the "View a query report" section since users won't need those disclaimers until they have a few queries set up. Finally, dropped the mention of where an old UI was 25+ minor releases ago.

iansltx · 2025-01-02T23:19:45Z

@aysiu Docs are updated. Does that address your issue?

aysiu · 2025-01-03T15:11:40Z

Thanks, @iansltx. I haven't had a chance yet to verify, but that looks promising.

aysiu · 2025-01-03T17:05:27Z

Is this something that needs to be modified by API PATCH call? Or can it also be done through the web UI (Settings > Organization Settings > Agent Options)?

iansltx · 2025-01-03T19:06:39Z

It needs to be modified via API PATCH.
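For anyone landing here later, a rough sketch of what that PATCH could look like in Python. The thread only confirms the setting name (query_report_cap) and that it is changed via an API PATCH; the endpoint path /api/v1/fleet/config and the server_settings nesting are assumptions based on Fleet's REST API docs, and the host, token, and function name are placeholders, so check them against your Fleet version before running this.

```python
import json
import urllib.request


def build_cap_request(base_url: str, api_token: str, cap: int) -> urllib.request.Request:
    """Build (but do not send) a PATCH request that raises query_report_cap.

    Assumptions, not confirmed in this thread: the config endpoint is
    /api/v1/fleet/config and the setting lives under "server_settings".
    """
    if cap <= 0:
        raise ValueError("cap must be a positive integer")
    body = json.dumps({"server_settings": {"query_report_cap": cap}}).encode("utf-8")
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/api/v1/fleet/config",
        data=body,
        method="PATCH",
        headers={
            "Authorization": f"Bearer {api_token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder host and token; replace with real values before sending.
    req = build_cap_request("https://fleet.example.com", "YOUR_API_TOKEN", 50000)
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
    # To actually apply the change: urllib.request.urlopen(req)
```

Building the request separately from sending it makes it easy to inspect the payload first, which is worth doing since the cap affects how much report data the server stores.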

aysiu · 2025-01-03T19:24:03Z

Thanks for clarifying.

One odd thing: query_report_cap appears to be 0 before modification. Does 0 essentially mean the default, which is 1,000?

iansltx · 2025-01-03T19:30:45Z

That's correct. 0 maps to the hard-coded default. Probably a "Golang zero value" convention.
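To illustrate the convention (this is only a sketch of the behavior described above, not Fleet's actual code; the constant and function names are made up for the example): an unset integer reads as 0, and the reader of the setting substitutes the default.

```python
# Hypothetical names; the 1,000 default is the value discussed in this thread.
DEFAULT_QUERY_REPORT_CAP = 1000


def effective_report_cap(configured: int) -> int:
    """Return the cap to enforce, treating 0 (unset) as the default."""
    if configured == 0:
        return DEFAULT_QUERY_REPORT_CAP
    return configured


if __name__ == "__main__":
    print(effective_report_cap(0))      # unset, falls back to the default
    print(effective_report_cap(50000))  # explicitly configured value is used
```

So seeing 0 in the config before any modification is expected, and it also means 0 can't be used to request "no limit".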

aysiu · 2025-01-06T18:53:23Z

This seems to work. Thanks!

fleet-release · 2025-01-06T18:53:31Z

Cache limit expands,
Fleet's knowledge grows with it,
Clarity descends.

Like a city in clouds,
Data in volumes profound,
No device is lost in crowds.

In this vast expanse,
Every query finds its chance,
Users' trust enhances.

aysiu added the :product Product Design department (shows up on 🦢 Drafting board) label Dec 31, 2024

iansltx added the prospect-snowdonia label Jan 1, 2025

iansltx mentioned this issue Jan 1, 2025

Mention configurable query result set cap in query docs #25082

Merged

iansltx added the :improve documentation Involves writing improvements or additions to documentation label Jan 1, 2025

aysiu closed this as completed Jan 6, 2025
