(WIP) Multi platform abstraction tweaks #1195

Titus-von-Koeller · 2024-05-06T13:57:32Z

I'll fill in more details as I go along here, but for now I'm just publicly sharing what I'm working on. Feel free to already comment if anything catches your eye.

I've gathered extensive feedback from previous PRs (some closed) around this topic and am implementing my interpretation of what's needed, what Tim wants (API structure/ sub-interfaces) and what the community suggested, e.g. tensor-driven dispatch + deferred initialization.

The idea is that I'll ask for extensive feedback from the community once I'm through with the changes I'm imagining (aiming for this week) and we see how to continue from there and then merge the Intel PR #1178, once it's been adapted to this slightly modified architecture.

…ulti-platform-tweaks

matthewdouglas · 2024-05-06T16:59:47Z

I was thinking about this a little bit in #1173 actually. It may be good opportunity to rename some of these ops to make them more clear, even if they still are aliased for BC reasons.

Possible examples:
double_quant => double_quant_int8
mm_dequant => dequant_mm_int32_fp16

matthewdouglas · 2024-05-06T17:05:16Z

bitsandbytes/backends/_subinterfaces.py

+        raise NotImplementedError
+
+
+class FourBitMatmul(ABC):


Something to think about here is that for the nested quantization, the interface for KBitQuantization is needed (quantize_blockwise/dequantize_blockwise)

…dependency issues, see https://huggingface.slack.com/archives/C021H1P1HKR/p1714469081588139

github-actions · 2024-05-06T17:21:13Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Titus-von-Koeller · 2024-07-23T15:53:05Z

Closing this as it will be replaced by upcoming torch.library refactor

Titus-von-Koeller added 3 commits May 5, 2024 16:41

delete obsolete comment

0ec54fe

Merge remote-tracking branch 'upstream/multi-backend-refactor' into m…

4540300

…ulti-platform-tweaks

sketch out modular sub-interfaces

64eff52

Titus-von-Koeller added high priority (first issues that will be worked on) Intel Integration AMD integration High Risk Risk of bugs in transformers and other libraries cross-platform macOS labels May 6, 2024

Titus-von-Koeller self-assigned this May 6, 2024

Titus-von-Koeller marked this pull request as draft May 6, 2024 13:57

Titus-von-Koeller removed Intel Integration AMD integration High Risk Risk of bugs in transformers and other libraries macOS labels May 6, 2024

matthewdouglas reviewed May 6, 2024

View reviewed changes

Titus-von-Koeller added 2 commits May 6, 2024 19:06

doc-builder image had been changed, need to revert to old one due to …

7772fa3

…dependency issues, see https://huggingface.slack.com/archives/C021H1P1HKR/p1714469081588139

Merge remote-tracking branch 'upstream/main' into multi-platform-tweaks

6efe03c

Titus-von-Koeller mentioned this pull request May 7, 2024

Add int8 ops for CPU #1178

Merged

Titus-von-Koeller changed the title ~~WIP: Multi platform abstraction tweaks~~ (WIP) Multi platform abstraction tweaks May 28, 2024

Titus-von-Koeller closed this Jul 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(WIP) Multi platform abstraction tweaks #1195

(WIP) Multi platform abstraction tweaks #1195

Titus-von-Koeller commented May 6, 2024

matthewdouglas commented May 6, 2024

matthewdouglas May 6, 2024

github-actions bot commented May 6, 2024

Titus-von-Koeller commented Jul 23, 2024

(WIP) Multi platform abstraction tweaks #1195

(WIP) Multi platform abstraction tweaks #1195

Conversation

Titus-von-Koeller commented May 6, 2024

matthewdouglas commented May 6, 2024

matthewdouglas May 6, 2024

Choose a reason for hiding this comment

github-actions bot commented May 6, 2024

Titus-von-Koeller commented Jul 23, 2024