DM-37249-v24: backport transaction-level-pooling compatibility to v24 #771

TallJimbo · 2023-01-12T19:25:08Z

Checklist

ran Jenkins

Apparently "#region ... #endregion" is used to trigger some special linter behavior, and I don't want to fight the battle of learning to disable it.

This was probably a bug that we've never happened to trigger.

This addresses a long-standing TODO comment in Database.query (dating back to the introduction Session and our first real attempt to reduce connection contention). In some cases this might seem to be transforming lazy iteration over database results into aggressive fetching, via calls to fetchall(), but for at least PostgreSQL it seems we were already doing _some_ aggressive fetching before without realizing it, see e.g. https://docs.sqlalchemy.org/en/14/core/connections.html#using-server-side-cursors-a-k-a-stream-results So what we're really doing in the new fetchalls() here is invoking SQLAlchemy's client-side row processing more aggressively. I don't think that's a big change, and in exchange we can now guarantee that we never return iterators to user code that are responsible for closing a connection when the user is "done" with them - a problematic definition for any garbage-collected entity, but a particular problem for iterators that might not ever be exhausted. This also include some typing adjustments for SQLAlchemy objects. Those aren't actually checked yet, since typing support in SQLAlchemy won't be out until it's 2.0 release.

Temporary tables are now always created inside transactions, and via the temporary_table context-manager-returning method, which has been moved to Database from Session. Session has been removed. This should bring us into compatibility with pgbouncer transaction-level pooling - we won't care if a connection we hold actually gets multiplexed onto a different database connection, as long as it doesn't happen during transactions.

Each of the gazillion small queries we run a butler startup was previously grabbing a new connection and then returning it. That was probably wasn't too bad, since it should have just been getting them from the pool (for PostgreSQL) or opening a local file (for SQLite), but this should still be better in both cases. It should also remove a lot of rollbacks, since SQLAlchemy emits those whenever connections are returned to the pool. SQLAlchemy's docs say those shouldn't matter in terms of performance, but they're still noisy.

Now that we're careful to consistently use the same connection, we shouldn't ever need more.

Making SQLite temp tables inside transactions leads to very long exclusive locks during QG generation, causing timeouts in (at least) ci_cpp.

We now have public final session() and transaction() methods that return nothing and protected _session() and _transaction() methods that return connections, reducing the need to assert on self._session_connection being not None and avoiding returning things to users that they don't need.

codecov · 2023-01-13T17:34:55Z

Codecov Report

Base: 84.52% // Head: 84.56% // Increases project coverage by +0.03% 🎉

Coverage data is based on head (c517410) compared to base (dde7253).
Patch coverage: 95.50% of modified lines in pull request are covered.

Additional details and impacted files

@@             Coverage Diff             @@
##           v24.0.x     #771      +/-   ##
===========================================
+ Coverage    84.52%   84.56%   +0.03%     
===========================================
  Files          243      243              
  Lines        31854    31903      +49     
  Branches      5428     5464      +36     
===========================================
+ Hits         26926    26978      +52     
+ Misses        3749     3748       -1     
+ Partials      1179     1177       -2

Impacted Files	Coverage Δ
python/lsst/daf/butler/registry/tests/_registry.py	`99.07% <ø> (ø)`
...thon/lsst/daf/butler/registry/bridge/monolithic.py	`83.49% <70.00%> (-1.05%)`	⬇️
...n/lsst/daf/butler/registry/interfaces/_database.py	`87.71% <89.65%> (+0.49%)`	⬆️
python/lsst/daf/butler/registries/sql.py	`81.23% <100.00%> (+0.15%)`	⬆️
python/lsst/daf/butler/registry/attributes.py	`100.00% <100.00%> (ø)`
...thon/lsst/daf/butler/registry/collections/_base.py	`87.74% <100.00%> (+0.32%)`	⬆️
...n/lsst/daf/butler/registry/databases/postgresql.py	`81.46% <100.00%> (+2.27%)`	⬆️
...ython/lsst/daf/butler/registry/databases/sqlite.py	`84.61% <100.00%> (+0.21%)`	⬆️
.../butler/registry/datasets/byDimensions/_manager.py	`90.06% <100.00%> (+0.20%)`	⬆️
.../butler/registry/datasets/byDimensions/_storage.py	`84.54% <100.00%> (+0.14%)`	⬆️
... and 8 more

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

☔ View full report at Codecov.
📢 Do you have feedback about the report comment? Let us know in this issue.

TallJimbo added 5 commits January 12, 2023 14:14

Reword comment to avoid pylance false-positive.

5bca687

Apparently "#region ... #endregion" is used to trigger some special linter behavior, and I don't want to fight the battle of learning to disable it.

Set PostgreSQL timezone even in read-only transactions.

a2f81fb

This was probably a bug that we've never happened to trigger.

Comment on compatibility with pgbouncer transaction pooling.

7911e48

Remove workaround in favor of SQLAlchemy 1.4+ method call.

c41067d

Add limit(1) to queries where only one result is needed.

6886e3f

TallJimbo changed the base branch from main to v24.0.x January 12, 2023 19:25

TallJimbo added 9 commits January 13, 2023 12:19

Add changelog entry.

008d422

Set connection pool size to one for PostgreSQL.

d4608c6

Now that we're careful to consistently use the same connection, we shouldn't ever need more.

Use transaction to ensure consistency in Registry initialization.

0afb18e

Only wrap PostgreSQL temp tables in transactions.

ac07514

Making SQLite temp tables inside transactions leads to very long exclusive locks during QG generation, causing timeouts in (at least) ci_cpp.

Only start read-only transactions in read-only SQLite databases.

0ab0fb2

TallJimbo force-pushed the tickets/DM-37249-v24 branch from 2968815 to 0ab0fb2 Compare January 13, 2023 17:19

Switch to v24 in requirements.txt.

c517410

TallJimbo force-pushed the tickets/DM-37249-v24 branch from 42d67ed to c517410 Compare January 13, 2023 20:19

TallJimbo merged commit 01021c5 into v24.0.x Jan 14, 2023

TallJimbo deleted the tickets/DM-37249-v24 branch January 14, 2023 02:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DM-37249-v24: backport transaction-level-pooling compatibility to v24 #771

DM-37249-v24: backport transaction-level-pooling compatibility to v24 #771

TallJimbo commented Jan 12, 2023 •

edited

Loading

codecov bot commented Jan 13, 2023 •

edited

Loading

DM-37249-v24: backport transaction-level-pooling compatibility to v24 #771

DM-37249-v24: backport transaction-level-pooling compatibility to v24 #771

Conversation

TallJimbo commented Jan 12, 2023 • edited Loading

Checklist

codecov bot commented Jan 13, 2023 • edited Loading

Codecov Report

TallJimbo commented Jan 12, 2023 •

edited

Loading

codecov bot commented Jan 13, 2023 •

edited

Loading