
Add file read ahead through AsyncDataCache #3389

Closed
wants to merge 1 commit into from

Conversation

@oerling (Contributor) commented Dec 1, 2022

Adds a code sample and supporting functions for using CachedBufferedInput for smart prefetching of sequential files. A SeekableInputStream is registered for each file of interest. Each time the read proceeds past a given fraction of the current load quantum, the load of the next quantum is scheduled. The load prefetches into AsyncDataCache, where the data will be found when the read moves past the end of the current quantum. Prefetch fails silently if there is no memory or if more than half the cache is taken by prefetched data that has not yet been accessed.

Adds a special StreamIdentifier to denote expected sequential access. When enqueue is called with no StreamIdentifier, the default is to preload the whole enqueued range.

Adds a mode where each visited cache entry is made evictable immediately after unpinning. This avoids polluting the cache with large one-time sequential accesses.

Adds a test that simulates a multi-file merge. Each thread has 100 files and consumes a chunk of each in turn.

@netlify netlify bot commented Dec 1, 2022

Deploy Preview for meta-velox canceled.

Latest commit: d0b326b
Latest deploy log: https://app.netlify.com/sites/meta-velox/deploys/638a4655461f0a000b54a8b6

@facebook-github-bot added the CLA Signed label (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) Dec 1, 2022
@oerling oerling requested a review from Yuhta December 1, 2022 04:27
@facebook-github-bot
@oerling has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.


@Yuhta Yuhta left a comment


There are some suggestions regarding removing doneIndices entries from allCoalescedLoads_. Also, the new test is not passing TSAN.

auto nextQuantum = position_ - offsetInQuantum + loadQuantum_;
auto prefetchThreshold = loadQuantum_ * prefetchPct_ / 100;
if (!prefetchStarted_ && offsetInQuantum + *size > prefetchThreshold &&
    position_ - offsetInQuantum + loadQuantum_ < region_.length) {

Should this be nextQuantum < region_.length? The expression recomputes the value already assigned to nextQuantum above.

// Remove the loads that were done. There can be done loads if the same
// CachedBufferedInput has multiple cycles of enqueues and loads.
for (int32_t i = doneIndices.size() - 1; i >= 0; --i) {
  allCoalescedLoads_.erase(allCoalescedLoads_.begin() + doneIndices[i]);

This should be outside the loop over allCoalescedLoads_.

Also, this is more efficient if allCoalescedLoads_ is large:

std::swap(allCoalescedLoads_[doneIndices[i]], allCoalescedLoads_.back());
allCoalescedLoads_.pop_back();

@oerling oerling force-pushed the cache-ra-pr branch 2 times, most recently from c1c8bb0 to fb50810 Compare December 1, 2022 21:42

oerling pushed a commit to oerling/velox-1 that referenced this pull request Dec 1, 2022
Pull Request resolved: facebookincubator#3389

Reviewed By: Yuhta

Differential Revision: D41642368

Pulled By: oerling

fbshipit-source-id: 3272aca53cc3beb7b8a5489418802d552b92441e
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D41642368

