Skip to content

Commit

Permalink
Update measure_ppl2_MC.py
Browse files Browse the repository at this point in the history
Adding functionality to ingest scaling factors upon merge of the PR vllm-project#3290
  • Loading branch information
Alexei-V-Ivanov-AMD authored Apr 4, 2024
1 parent 1e2b203 commit bbc24c7
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions benchmarks/measure_ppl2_MC.py
Original file line number Diff line number Diff line change
Expand Up @@ -72,6 +72,8 @@ def vllm_init(args):
kv_cache_dtype=args.kv_cache_dtype,
#scales_path=args.kv_cache_scales_path
# if args.kv_cache_scales_path!='' else None,
quantization-param-path=args.kv_cache_scales_path
if args.kv_cache_scales_path!='' else None,
enforce_eager=args.enforce_eager)

sampling_params = SamplingParams(n=1,
Expand Down

0 comments on commit bbc24c7

Please sign in to comment.