Optimize Datasource-based queries #467

kyri-petrou · 2024-04-02T03:12:10Z

This PR contains a series of optimizations aimed at improving the performance of Datasource-backed queries.

Main optimizations

Implement Cache.Default using a java.util.ConcurrentHashMap[_, _] in favour of a Ref[Map[_, _]]. This allows us to check if the cache provided is the default one, and if yes, use the "unsafe" methods to mutate the cache, reducing the number of flatmaps for each ZQuery.fromRequest
Futher reduce the number of flatmaps for each ZQuery.fromRequest by using the FiberRef#getWith methods for ZQuery.currentCache and ZQuery.cachingEnabled.
Rewrite BlockedRequests.run using mutable datastructures and fulfilling promises unsafely
Add a new empty constructor to Cache which allows pre-sizing the underlying ConcurrentHashMap
Optimize BlockedRequests.flatten by using mutable datastructures

Benchmarking results:

Based on the newly added benchmarks, throughput is improved by:

Up to 250% when using the default Cache size
Up to 300% when using a Cache large enough to fit all requests

series/2.x:

[info] Benchmark                                (count)   Mode  Cnt      Score    Error  Units
[info] FromRequestBenchmark.fromRequestDefault      100  thrpt   10  12168.714 ± 97.771  ops/s
[info] FromRequestBenchmark.fromRequestDefault     1000  thrpt   10   1043.727 ± 34.488  ops/s

PR:

[info] Benchmark                                (count)   Mode  Cnt      Score     Error  Units
[info] FromRequestBenchmark.fromRequestDefault      100  thrpt   10  27293.203 ± 254.516  ops/s
[info] FromRequestBenchmark.fromRequestDefault     1000  thrpt   10   2640.471 ±  32.877  ops/s
[info] FromRequestBenchmark.fromRequestSized        100  thrpt   10  29534.166 ± 147.223  ops/s
[info] FromRequestBenchmark.fromRequestSized       1000  thrpt   10   3006.937 ±  14.040  ops/s

…tasource-optimizations-attempt-2 # Conflicts: # zio-query/shared/src/main/scala/zio/query/internal/BlockedRequests.scala

…tempt-2 # Conflicts: # zio-query/shared/src/main/scala/zio/query/CompletedRequestMap.scala # zio-query/shared/src/main/scala/zio/query/DataSource.scala

ghostdogpr

Nice! Maybe try to run the caliban test suite as well to catch anything. Does it improve the caliban benchmarks too?

kyri-petrou · 2024-04-03T00:40:18Z

Does it improve the caliban benchmarks too?

@ghostdogpr Doubtful, the optimizations are for datasource-backed queries and we only use ZQuery.succeed in the Caliban benchmarks. However, I think we should add a param in Caliban in the execution config to size the Cache. Default value of 16 seems way too low

kyri-petrou · 2024-04-03T00:41:53Z

Maybe try to run the caliban test suite as well to catch anything

I don't think we have anything in Caliban's test suites that uses datasources. I'll test it against some projects at $WORK (which heavily rely on ZQueries) when we have a snapshot version of zio-query and caliban

kyri-petrou added 15 commits March 14, 2024 06:10

Optimize collection of datasource-based queries

e1bff8f

Use an iMap in CompletedRequestMap again

9b38bd0

Update comment

cfaa5ab

Add benchmark for sized cache

3492892

Cleanups

0229275

BlockedRequests.step optimization

3693d9f

Merge remote-tracking branch 'refs/remotes/origin/series/2.x' into da…

eb0ba76

…tasource-optimizations-attempt-2 # Conflicts: # zio-query/shared/src/main/scala/zio/query/internal/BlockedRequests.scala

Reduce number of flatMaps in ZQuery.fromRequest

aa19cba

More optimizations

3ab7495

Merge branch 'refs/heads/series/2.x' into datasource-optimizations-at…

3898cc6

…tempt-2 # Conflicts: # zio-query/shared/src/main/scala/zio/query/CompletedRequestMap.scala # zio-query/shared/src/main/scala/zio/query/DataSource.scala

Fix compiling with Scala 2.12

76e123c

Refactor BlockedRequests.run

e833db1

Only cache leftovers when caching is enabled & add tests

62000d0

Fix Scala 2.12 compiling

1624410

Fix linting

f660fbc

kyri-petrou marked this pull request as ready for review April 2, 2024 23:50

kyri-petrou requested review from paulpdaniels and ghostdogpr April 2, 2024 23:51

ghostdogpr previously approved these changes Apr 3, 2024

View reviewed changes

Avoid using a HashSet in BlockedRequests#run

c6ba3f3

kyri-petrou dismissed ghostdogpr’s stale review via c6ba3f3 April 5, 2024 22:15

kyri-petrou added 2 commits April 6, 2024 06:18

Can't have nice things with Scala 2.12

ad3d5a5

One more time

1adbaa8

kyri-petrou merged commit 9a807b6 into zio:series/2.x Apr 5, 2024
26 checks passed

kyri-petrou deleted the datasource-optimizations-attempt-2 branch April 5, 2024 22:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize Datasource-based queries #467

Optimize Datasource-based queries #467

kyri-petrou commented Apr 2, 2024 •

edited

Loading

ghostdogpr left a comment

kyri-petrou commented Apr 3, 2024

kyri-petrou commented Apr 3, 2024 •

edited

Loading

Optimize Datasource-based queries #467

Optimize Datasource-based queries #467

Conversation

kyri-petrou commented Apr 2, 2024 • edited Loading

Main optimizations

Benchmarking results:

ghostdogpr left a comment

Choose a reason for hiding this comment

kyri-petrou commented Apr 3, 2024

kyri-petrou commented Apr 3, 2024 • edited Loading

kyri-petrou commented Apr 2, 2024 •

edited

Loading

kyri-petrou commented Apr 3, 2024 •

edited

Loading