Update docker ENTRYPOINT to ensure proper argument handling #962

shashankmangla · 2024-06-12T15:55:05Z

Summary

This PR updates the ENTRYPOINT instruction in the Dockerfile to ensure that additional arguments passed to the container via docker run are correctly appended to the entrypoint command.

Before the change:

Parameter model is not passed to the entrypoint command and the default model facebook/opt-125m is loaded instead.

> sudo docker run --runtime=nvidia --gpus all -p 8000:8000 my-outlines-image --model="microsoft/phi-2"

/usr/local/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
  warnings.warn(
INFO 06-12 14:45:46 llm_engine.py:161] Initializing an LLM engine (v0.5.0) with config: model='facebook/opt-125m', speculative_config=None, tokenizer='facebook/opt-125m', skip_tokenizer_init=False, tokenizer_mode=auto, revision=None, rope_scaling=None, rope_theta=None, tokenizer_revision=None, trust_remote_code=False, dtype=torch.float16, max_seq_len=2048, download_dir=None, load_format=LoadFormat.AUTO, tensor_parallel_size=1, disable_custom_all_reduce=False, quantization=None, enforce_eager=False, kv_cache_dtype=auto, quantization_param_path=None, device_config=cuda, decoding_config=DecodingConfig(guided_decoding_backend='outlines'), seed=0, served_model_name=facebook/opt-125m)

After the change:

Parameter model is correctly passed to the entrypoint command

> sudo docker run --runtime=nvidia --gpus all -p 8000:8000 my-outlines-image --model="microsoft/phi-2"

/usr/local/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
  warnings.warn(
INFO 06-12 14:59:17 llm_engine.py:161] Initializing an LLM engine (v0.5.0) with config: model='microsoft/phi-2', speculative_config=None, tokenizer='microsoft/phi-2', skip_tokenizer_init=False, tokenizer_mode=auto, revision=None, rope_scaling=None, rope_theta=None, tokenizer_revision=None, trust_remote_code=False, dtype=torch.float16, max_seq_len=2048, download_dir=None, load_format=LoadFormat.AUTO, tensor_parallel_size=1, disable_custom_all_reduce=False, quantization=None, enforce_eager=False, kv_cache_dtype=auto, quantization_param_path=None, device_config=cuda, decoding_config=DecodingConfig(guided_decoding_backend='outlines'), seed=0, served_model_name=microsoft/phi-2)

rlouf · 2024-06-13T06:54:38Z

Thank you for contributing!

shashankmangla · 2024-06-13T10:15:25Z

@rlouf Thanks for reviewing and merging! Could you please clarify when the next release will be made or if the Release Docker workflow will be manually trigged?

rlouf · 2024-06-13T10:35:32Z

Just ran the workflow!

…i#962) ## Summary This PR updates the `ENTRYPOINT` instruction in the Dockerfile to ensure that additional arguments passed to the container via `docker run` are correctly appended to the entrypoint command. ### Before the change: Parameter `model` is not passed to the entrypoint command and the default model `facebook/opt-125m` is loaded instead. ```bash > sudo docker run --runtime=nvidia --gpus all -p 8000:8000 my-outlines-image --model="microsoft/phi-2" /usr/local/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`. warnings.warn( INFO 06-12 14:45:46 llm_engine.py:161] Initializing an LLM engine (v0.5.0) with config: model='facebook/opt-125m', speculative_config=None, tokenizer='facebook/opt-125m', skip_tokenizer_init=False, tokenizer_mode=auto, revision=None, rope_scaling=None, rope_theta=None, tokenizer_revision=None, trust_remote_code=False, dtype=torch.float16, max_seq_len=2048, download_dir=None, load_format=LoadFormat.AUTO, tensor_parallel_size=1, disable_custom_all_reduce=False, quantization=None, enforce_eager=False, kv_cache_dtype=auto, quantization_param_path=None, device_config=cuda, decoding_config=DecodingConfig(guided_decoding_backend='outlines'), seed=0, served_model_name=facebook/opt-125m) ``` ### After the change: Parameter `model` is correctly passed to the entrypoint command ```bash > sudo docker run --runtime=nvidia --gpus all -p 8000:8000 my-outlines-image --model="microsoft/phi-2" /usr/local/lib/python3.10/site-packages/huggingface_hub/file_download.py:1132: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`. warnings.warn( INFO 06-12 14:59:17 llm_engine.py:161] Initializing an LLM engine (v0.5.0) with config: model='microsoft/phi-2', speculative_config=None, tokenizer='microsoft/phi-2', skip_tokenizer_init=False, tokenizer_mode=auto, revision=None, rope_scaling=None, rope_theta=None, tokenizer_revision=None, trust_remote_code=False, dtype=torch.float16, max_seq_len=2048, download_dir=None, load_format=LoadFormat.AUTO, tensor_parallel_size=1, disable_custom_all_reduce=False, quantization=None, enforce_eager=False, kv_cache_dtype=auto, quantization_param_path=None, device_config=cuda, decoding_config=DecodingConfig(guided_decoding_backend='outlines'), seed=0, served_model_name=microsoft/phi-2) ```

shashankmangla and others added 3 commits June 12, 2024 15:13

update ENTRYPOINT to exec form for correct argument handling

37dc6ee

newline at end of file

823422e

Merge branch 'main' into fix-docker-entrypoint

01b9eff

shashankmangla changed the title ~~Fix: Update ENTRYPOINT to ensure proper argument handling~~ Fix: Update docker ENTRYPOINT to ensure proper argument handling Jun 12, 2024

lapp0 approved these changes Jun 13, 2024

View reviewed changes

rlouf changed the title ~~Fix: Update docker ENTRYPOINT to ensure proper argument handling~~ Update docker ENTRYPOINT to ensure proper argument handling Jun 13, 2024

rlouf merged commit 1bdcaa5 into dottxt-ai:main Jun 13, 2024
6 checks passed

shashankmangla deleted the fix-docker-entrypoint branch June 13, 2024 10:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update docker ENTRYPOINT to ensure proper argument handling #962

Update docker ENTRYPOINT to ensure proper argument handling #962

shashankmangla commented Jun 12, 2024

rlouf commented Jun 13, 2024

shashankmangla commented Jun 13, 2024

rlouf commented Jun 13, 2024

Update docker ENTRYPOINT to ensure proper argument handling #962

Update docker ENTRYPOINT to ensure proper argument handling #962

Conversation

shashankmangla commented Jun 12, 2024

Summary

Before the change:

After the change:

rlouf commented Jun 13, 2024

shashankmangla commented Jun 13, 2024

rlouf commented Jun 13, 2024