Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Organize normal loading metadata #381

Merged
merged 2 commits into from
Jun 5, 2024
Merged

Conversation

EricLBuehler
Copy link
Owner

Prepares for automatic detection of GPU VRAM and therefore calculating maximum number of device layers. Refs #377.

Copy link

github-actions bot commented Jun 5, 2024

Code Metrics Report
  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 Dockerfile              1           34           25            0            9
 Happy                   1          442          369            0           73
 JSON                    9           21           21            0            0
 Python                 24          864          731           25          108
 TOML                   15          403          365            1           37
-------------------------------------------------------------------------------
 Jupyter Notebooks       1            0            0            0            0
 |- Markdown             1           60           30           22            8
 |- Python               1           96           87            1            8
 (Total)                            156          117           23           16
-------------------------------------------------------------------------------
 Markdown               16         1063            0          786          277
 |- BASH                 6          204          191            0           13
 |- Python               6          122          110            0           12
 |- Rust                 3          185          172            9            4
 (Total)                           1574          473          795          306
-------------------------------------------------------------------------------
 Rust                   94        29676        27015          437         2224
 |- Markdown            47          476            0          463           13
 (Total)                          30152        27015          900         2237
===============================================================================
 Total                 162        32503        28526         1249         2728
===============================================================================
  

@EricLBuehler EricLBuehler merged commit 19ca7ac into master Jun 5, 2024
11 checks passed
@EricLBuehler EricLBuehler deleted the normal_loading_metadata branch June 5, 2024 01:19
EricLBuehler added a commit that referenced this pull request Jun 5, 2024
* Organize normal loading metadata

* Fix
EricLBuehler added a commit that referenced this pull request Jun 7, 2024
* Intial work on phi3v

* Add the image embedding layer

* Lints

* Implement the loader

* Add infrastructure for phi3 image processor

* Merge

* Merge

* Merge

* Merge

* Partially implement padding

* Implement the hd transform step

* Work on the image processor

* Clippy

* Complete the phi3v inputs processor

* Rename

* Merge

* Merge

* Rename to phi3v and fix deser

* Fix varbuilder

* Fix varbuilder

* Default for do convert rgb

* Some defaults

* Allow no processor config

* Setup debug flag

* Add phi3v

* Implement messages flattening

* Update

* Rewrite the pad, hd transform

* Clippy

* Detect num channels

* Fix reshape

* Fix global image channel dim

* Fix assert

* Fix dtype

* Fix gt

* Fix image id neg

* Fix dim0 of pixel values

* Fix dtype

* Check if model supports gemm

* Fix some shape errors

* Fix some shape errors

* Fix rank of slice_assign

* Fix image toks

* Properly downcase

* Fix response

* Fix response

* Allow no images in prompt

* Output correct hidden state

* Fix nonzero and add test

* Fix n image toks

* Add mistralrs_vision

* Typo

* Fix and add tests

* Fix indexing

* Fix test condition

* Fix unsqueeze

* Fix dtype for norm

* Update clip

* Clippy

* Run clip in f32

* Run in bf16

* Run in bf16 again

* Fix dtype

* Set toks to have correct context lens

* Set toks to have correct context lens

* Support multiple GGUF files (#379)

* Move to gguf module

* Add content abstraction for multiple gguf files

* Fix test

* Allow specifying and loading multiple gguf files

* Update docs and examples

* Print some info

* Merge

* Organize normal loading metadata (#381)

* Organize normal loading metadata

* Fix

* Bump version 0.1.13 -> 0.1.14 (#382)

* Patch incorrect unwrap and bump version (#383)

* Patch incorrect unwrap

* Bump version to 0.1.15

* More verbose logging during loading (#385)

* More verbose logging when loading

* More logging

* Refactor enabling debug logging (#387)

* Refactor enabling debug logging

* Fix reversed order

* Merge

* Merge

* Merge

* Use precise gelu

* Use correct kernel

* Debugging commit

* Add fused bias linear

* Finish merge

* Use fused layer in clip

* Save progress

* Remove debugs

* Update example

* Resize exact

* Update interpolate

* Fix batch dim

* Update test and transform

* It works

* Add some examples

* Allow more than one image

* Add support in python api

* Add to toml selector

* Update python api

* Overhaul readme and docs

* Update

* Export vision arch

* Export vision arch

* Export vision arch

* Fix max img dim

* Fix unwrap
EricLBuehler added a commit that referenced this pull request Jun 8, 2024
* Intial work on phi3v

* Add the image embedding layer

* Lints

* Implement the loader

* Add infrastructure for phi3 image processor

* Merge

* Merge

* Merge

* Merge

* Partially implement padding

* Implement the hd transform step

* Work on the image processor

* Clippy

* Complete the phi3v inputs processor

* Rename

* Merge

* Merge

* Rename to phi3v and fix deser

* Fix varbuilder

* Fix varbuilder

* Default for do convert rgb

* Some defaults

* Allow no processor config

* Setup debug flag

* Add phi3v

* Implement messages flattening

* Update

* Rewrite the pad, hd transform

* Clippy

* Detect num channels

* Fix reshape

* Fix global image channel dim

* Fix assert

* Fix dtype

* Fix gt

* Fix image id neg

* Fix dim0 of pixel values

* Fix dtype

* Check if model supports gemm

* Fix some shape errors

* Fix some shape errors

* Fix rank of slice_assign

* Fix image toks

* Properly downcase

* Fix response

* Fix response

* Allow no images in prompt

* Output correct hidden state

* Fix nonzero and add test

* Fix n image toks

* Add mistralrs_vision

* Typo

* Fix and add tests

* Fix indexing

* Fix test condition

* Fix unsqueeze

* Fix dtype for norm

* Update clip

* Clippy

* Run clip in f32

* Run in bf16

* Run in bf16 again

* Fix dtype

* Set toks to have correct context lens

* Set toks to have correct context lens

* Support multiple GGUF files (#379)

* Move to gguf module

* Add content abstraction for multiple gguf files

* Fix test

* Allow specifying and loading multiple gguf files

* Update docs and examples

* Print some info

* Merge

* Organize normal loading metadata (#381)

* Organize normal loading metadata

* Fix

* Bump version 0.1.13 -> 0.1.14 (#382)

* Patch incorrect unwrap and bump version (#383)

* Patch incorrect unwrap

* Bump version to 0.1.15

* More verbose logging during loading (#385)

* More verbose logging when loading

* More logging

* Refactor enabling debug logging (#387)

* Refactor enabling debug logging

* Fix reversed order

* Merge

* Merge

* Merge

* Use precise gelu

* Use correct kernel

* Debugging commit

* Add fused bias linear

* Finish merge

* Use fused layer in clip

* Save progress

* Remove debugs

* Update example

* Resize exact

* Update interpolate

* Fix batch dim

* Update test and transform

* It works

* Add some examples

* Allow more than one image

* Add support in python api

* Add to toml selector

* Update python api

* Overhaul readme and docs

* Update

* Export vision arch

* Export vision arch

* Export vision arch

* Fix max img dim

* Fix unwrap
@EricLBuehler EricLBuehler restored the normal_loading_metadata branch June 8, 2024 09:43
@EricLBuehler EricLBuehler deleted the normal_loading_metadata branch June 8, 2024 09:46
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant