Model-Gemini-2b-dvc

Task

Model-Gemini-2b-dvc

🔥🔥🔥 Deploy Gemma-2b model on VDP.

This repository contains the Gemma-2b Text Completion Generation Model in the Transformers format, managed using DVC.

Notes:

Disk Space Requirements: 1.7G
GPU Memory Requirements: 4G

Following is an example of query parameters:

Create Model

{
    "id": "gemma-2b-gpu",
    "description": "test containerized gemma 2b gpu model.",
    "model_definition": "model-definitions/container",
    "visibility": "VISIBILITY_PUBLIC",
    "region": "REGION_GCP_EUROPE_WEST_4",
    "hardware": "GPU",
    "configuration": {
        "task": "TEXT_GENERATION"
    }
}

Inference model

{
    "task_inputs": [
        {
            "text_generation": {
                "prompt": "The capital city of Franch is ",
                "max_new_tokens": "300",
                "temperature": "0.8",
                "top_k": "50",
                "random_seed": "42",
                "extra_params": "{\"top_p\": 0.8, \"repetition_penalty\": 2.0}"
            }
        }
    ]
}```

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
instill.yaml		instill.yaml
model.py		model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Model-Gemini-2b-dvc

About

Releases

Packages

Languages

instill-ai/model-gemini-2b-dvc

Folders and files

Latest commit

History

Repository files navigation

Model-Gemini-2b-dvc

About

Topics

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages