RFC: Make self-inplace broadcast vectorlizable based on our effect inference #43185

N5N3 · 2021-11-22T03:27:38Z

As found in #43153, a self-inplace broadcast over more than one dimension is not vectorized by LLVM, because LLVM fails to prove that `a[I]` and `extrude(a)[I]` share the same access pattern.
#43852 makes it possible to fix this on the Julia side: `@simd ivdep` should be safe if `bc[I]` is proven to be `effect_free` and `dest` is not self-aliased.

Currently the self-aliasing check for `dest` is purely type-based, to avoid binary bloat.
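
As a rough illustration of the idea (not the code in this PR), the sketch below gates `@simd ivdep` on a type-based check. The names `no_self_alias` and `add_one!` are hypothetical, and the assumption that a plain `Array` is never self-aliased while every other array type might be is a deliberately conservative simplification.

```julia
# Hypothetical trait, for illustration only: a dense `Array` maps distinct
# indices to distinct memory, so a write to dest[I] cannot touch dest[J], J != I.
# Every other array type is conservatively treated as possibly self-aliased.
no_self_alias(::Array) = true
no_self_alias(::AbstractArray) = false

# Hand-written stand-in for `dest .+= 1`.
function add_one!(dest::AbstractArray)
    if no_self_alias(dest)
        # `ivdep` asserts that there are no loop-carried memory dependencies,
        # which is only sound because of the check above.
        @simd ivdep for i in eachindex(dest)
            @inbounds dest[i] += 1
        end
    else
        # Fallback: plain loop, no vectorization promise.
        for i in eachindex(dest)
            @inbounds dest[i] += 1
        end
    end
    return dest
end

a = rand(3, 4)
add_one!(a)               # Array: takes the `ivdep` branch

x = zeros(2)
v = view(x, [1, 1, 2])    # v[1] and v[2] both refer to x[1]
add_one!(v)               # possibly self-aliased: takes the plain loop
```

Because the check dispatches on the type alone, it resolves at compile time and adds no runtime branch, at the cost of rejecting some array types that would in fact be safe.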

Force to derive `effects` during inference stage. Co-Authored-By: Takafumi Arakaki <29282+tkf@users.noreply.github.com>

mcabbott · 2023-01-31T15:11:13Z

Can you comment on the status here -- does this cause problems, and are there possible alternatives? Would still be nice to solve #43153

N5N3 · 2023-01-31T16:18:10Z

I ran into a mess when rebasing #44822 onto master. Since we can't finish this PR without it, I think it would be better to reopen this once #44822 has landed.

vtjnash requested a review from mbauman November 22, 2021 04:37

N5N3 marked this pull request as draft November 22, 2021 09:02

N5N3 changed the title ~~RFC: Add ivdep to make a .+= 1 vectorlizable~~ RFC: Try to make a .+= 1 vectorlizable Nov 22, 2021

N5N3 marked this pull request as ready for review November 22, 2021 14:26

N5N3 marked this pull request as draft November 24, 2021 03:16

N5N3 force-pushed the isivdepsafe branch from 0d5fde8 to 2a14248 Compare November 24, 2021 07:23

N5N3 marked this pull request as ready for review November 24, 2021 07:47

N5N3 changed the title ~~RFC: Try to make a .+= 1 vectorlizable~~ RFC: Try to make a .+= 1 vectorlizable when a isa Array Nov 24, 2021

N5N3 marked this pull request as draft November 30, 2021 05:15

N5N3 force-pushed the isivdepsafe branch 2 times, most recently from 2bf16ee to 0543f5d Compare January 26, 2022 09:46

N5N3 force-pushed the isivdepsafe branch from 0543f5d to 991961b Compare February 11, 2022 13:00

N5N3 changed the title ~~RFC: Try to make a .+= 1 vectorlizable when a isa Array~~ RFC: Make self-inplace broadcast vectorlizable based on our effect inference Feb 11, 2022

N5N3 marked this pull request as ready for review February 11, 2022 13:06

N5N3 force-pushed the isivdepsafe branch from 991961b to d1d9c5b Compare February 11, 2022 13:11

N5N3 added the broadcast and performance labels Feb 11, 2022

N5N3 mentioned this pull request Feb 18, 2022

Use mul! in Diagonal*Matrix #42321

Merged

Infer_effects prototype

03404c1

Force to derive `effects` during inference stage. Co-Authored-By: Takafumi Arakaki <29282+tkf@users.noreply.github.com>

N5N3 force-pushed the isivdepsafe branch from d1d9c5b to 8aabb16 Compare February 18, 2022 16:09

Make inplace broadcast simdable

846d678

N5N3 force-pushed the isivdepsafe branch from 8aabb16 to 846d678 Compare February 18, 2022 17:34

N5N3 closed this Jan 31, 2023

N5N3 deleted the isivdepsafe branch January 31, 2023 15:06
