Skip to content

E2E (NVIDIA L40S x4) #121

E2E (NVIDIA L40S x4)

E2E (NVIDIA L40S x4) #121

Triggered via schedule January 23, 2025 16:01
Status Failure
Total duration 54s
Artifacts

e2e-nvidia-l40s-x4.yml

on: schedule
start-large-ec2-runner
9s
start-large-ec2-runner
e2e-large-test
0s
e2e-large-test
stop-large-ec2-runner
8s
stop-large-ec2-runner
loss-graphs
12s
loss-graphs
Fit to window
Zoom out
Zoom in

Annotations

8 errors and 3 warnings
start-large-ec2-runner
AWS EC2 instance starting error
start-large-ec2-runner
InsufficientInstanceCapacity: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.
start-large-ec2-runner
We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.
stop-large-ec2-runner
Error: Not all the required inputs are provided for the 'stop' mode
stop-large-ec2-runner
Not all the required inputs are provided for the 'stop' mode
stop-large-ec2-runner
TypeError: Cannot read properties of undefined (reading 'mode')
stop-large-ec2-runner
Cannot read properties of undefined (reading 'mode')
loss-graphs
Unable to download artifact(s): Artifact not found for name: phase-1-training-log.jsonl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
start-large-ec2-runner
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
stop-large-ec2-runner
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
loss-graphs
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636