E2E (NVIDIA L40S x4) #121
e2e-nvidia-l40s-x4.yml
on: schedule
Annotations
8 errors and 3 warnings
start-large-ec2-runner
AWS EC2 instance starting error
|
start-large-ec2-runner
InsufficientInstanceCapacity: We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.
|
start-large-ec2-runner
We currently do not have sufficient g6e.12xlarge capacity in the Availability Zone you requested (us-east-2b). Our system will be working on provisioning additional capacity. You can currently get g6e.12xlarge capacity by not specifying an Availability Zone in your request or choosing us-east-2a, us-east-2c.
|
stop-large-ec2-runner
Error: Not all the required inputs are provided for the 'stop' mode
|
stop-large-ec2-runner
Not all the required inputs are provided for the 'stop' mode
|
stop-large-ec2-runner
TypeError: Cannot read properties of undefined (reading 'mode')
|
stop-large-ec2-runner
Cannot read properties of undefined (reading 'mode')
|
loss-graphs
Unable to download artifact(s): Artifact not found for name: phase-1-training-log.jsonl
Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact.
For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
|
start-large-ec2-runner
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
stop-large-ec2-runner
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
loss-graphs
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636
|
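The `InsufficientInstanceCapacity` annotation above names us-east-2a and us-east-2c as zones that currently have g6e.12xlarge capacity. One way to act on that is to try several Availability Zones in order instead of pinning the runner to us-east-2b. The following is a minimal sketch of that retry logic only, not the workflow's actual implementation: `launch_with_az_fallback`, `InsufficientCapacity`, and `fake_launch` are hypothetical names, and the launcher is simulated rather than calling AWS.

```python
from typing import Callable, Optional, Sequence, Tuple


class InsufficientCapacity(Exception):
    """Hypothetical stand-in for the InsufficientInstanceCapacity error."""


def launch_with_az_fallback(
    launch: Callable[[str], str],
    zones: Sequence[str],
) -> Tuple[str, str]:
    """Try each zone in order; return (zone, instance_id) for the first success."""
    last_error: Optional[InsufficientCapacity] = None
    for zone in zones:
        try:
            return zone, launch(zone)
        except InsufficientCapacity as err:
            last_error = err  # remember the failure and move on to the next zone
    raise RuntimeError(f"no capacity in any of: {', '.join(zones)}") from last_error


def fake_launch(zone: str) -> str:
    # Simulated launcher (assumption): only us-east-2b lacks capacity, as in the log.
    if zone == "us-east-2b":
        raise InsufficientCapacity(zone)
    return f"i-simulated-{zone}"


if __name__ == "__main__":
    zone, instance_id = launch_with_az_fallback(
        fake_launch, ["us-east-2b", "us-east-2a", "us-east-2c"]
    )
    print(zone, instance_id)
```

In a real workflow the `launch` callable would wrap whatever starts the EC2 instance. The `stop-large-ec2-runner` errors in this run appear to follow from the failed start, since no instance details were available to pass to the stop step.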