Skip to content

cc-api/confidential-huggingface-runner

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Confidential Hugging Face Runner

This project is designed to build a runner to make the Hugging Face project easily run inside Trusted Execution Environment (TEE), especially for Large Language Model (LLM), and demonstrate whole process from evidence collection to attestation.

Protect AI Models by Confidential AI Loader

Confidential AI loader is designed to prepare AI models before loading it to memory. There are two major parts to protect the models in TEE:

  • AI model encryption
  • Decrypt model with attestation

Architecture Overview

Architecture

AI Model Preprocessing

To protect the model and its encryption key, the following preprocessing steps are taken:

*Note:The following steps must be run on a trusted system.

  • Generate a key, a AES GCM key can be used for confidentiality and integrity.

    # Generate a 256 bit (32 bytes) random key
    KEY=$(head -c 32 /dev/random | base64)
  • Encrypt AI model by the key.

    # Install confidential ai loader
    cd container/confidential-huggingface-runner/confidential-ai-loader && pip install -e .
    # Encrypt models
    python crypto.py -i ${INPUT_DIR} -o ${OUTPUT_DIR} -k ${KEY}
  • Upload the encrypted model to Model Server. For example, using Hugging Face model server:

    # Clone the repo
    
    git clone git@hf.co:<your-username>/<your-model-name>
    cd <your-model-name>
    
    # Enable large files
    git lfs install
    huggingface-cli lfs-enable-largefiles .
    
    # Upload the models
    git add .
    git commit -m "First model version"  # You can choose any descriptive message
    git push
  • Register key to Key Broker Service (KBS), and KBS will communicate with Key Management Service (KMS) to store the key in its database. For example, using Intel Trust Authority Key Broker Service:

    • Get access token

      BODY='{"username":"'$USER'", "password":"'$PASS'"}'
      TOKEN=$(curl -sL -X POST "${URL}/${API_TOKEN}" -d "$BODY" -H "Accept:application/jwt" -H "Content-Type:application/json")
      
    • Create a key transfer policy, such as cat policy.json

      {
          "attestation_type":"tdx",
          "tdx":{
          "attributes":{
              "enforce_tcb_upto_date":false,
              "mrseam":["5b38e33a6487958b72c3c12a938eaa5e3fd4510c51aeeab58c7d5ecee41d7c436489d6c8e4f92f160b7cad34207b00c1"],
              "mrsignerseam":["000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000"],
              "seamsvn":6
          },
          "policy_ids": []
          }
      }
      # Register policy
      RESP=`curl -X POST $URL$API_POLICY -d @policy.json -H "Accept:application/json" \
      -H "Content-Type:application/json" -H "Authorization:Bearer $TOKEN"`
      POLICY_ID=`echo $RESP | jq -r .id`
    • Register key

      BODY='{"key_information":{"algorithm":"AES","key_data":"'$KEY'","key_length":256},"transfer_policy_id":"'$POLICY_ID'"}'
      RESP=`curl -X POST $URL$API_KEYS -d $BODY -H "Accept:application/json" \
          -H "Content-Type:application/json" -H "Authorization:Bearer $TOKEN"`
      KEY_ID=`echo $RESP | jq -r .id`
      echo key id: $KEY_ID

Build and Run

Confidential AI Loader, encrypted models and hugging face project can be built to one container, then the project can be easily ran inside TEE.

  • Build the container
cd container
./build.sh -c confidential-huggingface-runner -a build -m dongx1x/Llama-2-7b-chat-hf-sharded-bf16-aes -p https://github.com/dongx1x/llama-2-7b-chat -r gar-registry.caas.intel.com/cpio

the example output: Build container

  • Run
docker run --rm -it --privileged -v /sys/kernel/config:/sys/kernel/config -v /models:/models -e http_proxy -e https_proxy -e no_proxy -p 7860:7860 gar-registry.caas.intel.com/cpio/confidential-huggingface-runner

the example output: Run container

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •