CI

CI #4377

Triggered via schedule May 17, 2025 09:32
Status Failure
Total duration 4h 39m 43s
Artifacts 42

ci.yaml

on: schedule
metadata: 0s
bump-manifest: 12s
Matrix: amd64 / test-distribution
Matrix: arm64 / test-distribution
amd64 / build-base / build-base: 3m 40s
arm64 / build-base / build-base: 3m 26s
amd64 / build-jax / build-jax: 20m 49s
arm64 / build-jax / build-jax: 29m 14s
Matrix: amd64 / test-jax / run-unit-test
Matrix: amd64 / test-te-h100 / te-test-eks
Matrix: amd64 / test-te-unit-a100 / run-unit-test
amd64 / test-jax / runner / launch-slurm-runner: 28m 55s
amd64 / test-nsys-jax-eks: 4m 18s
amd64 / test-te-unit-a100 / runner / launch-slurm-runner: 41m 14s
amd64 / build-maxtext / build-maxtext: 9m 52s
amd64 / build-upstream-t5x / build-upstream-t5x: 10m 39s
amd64 / build-axlearn / build-axlearn: 6m 1s
Matrix: amd64 / test-nsys-jax / run-unit-test
amd64 / build-equinox / build-equinox: 7m 12s
amd64 / test-nsys-jax / runner / launch-slurm-runner: 55m 13s
Matrix: arm64 / test-jax / run-unit-test (waiting for pending jobs)
Matrix: arm64 / test-te-h100 / te-test-eks (waiting for pending jobs)
Matrix: arm64 / test-te-unit-a100 / run-unit-test (waiting for pending jobs)
arm64 / test-nsys-jax-eks: 0s
arm64 / test-jax / runner / launch-slurm-runner: no duration shown
arm64 / test-te-unit-a100 / runner / launch-slurm-runner: no duration shown
arm64 / build-maxtext / build-maxtext: 11m 6s
arm64 / build-upstream-t5x / build-upstream-t5x: 11m 2s
arm64 / build-axlearn / build-axlearn: 7m 1s
Matrix: arm64 / test-nsys-jax / run-unit-test (waiting for pending jobs)
arm64 / build-equinox / build-equinox: 7m 23s
arm64 / test-nsys-jax / runner / launch-slurm-runner: no duration shown
Matrix: amd64 / test-maxtext / maxtext-multinode
Matrix: amd64 / test-maxtext / single-process-multi-device
amd64 / build-rosetta-t5x / build-rosetta: 15m 38s
amd64 / test-axlearn-eks: 4h 6m
amd64 / test-axlearn-fuji-models-eks: 6m 17s
Matrix: amd64 / test-nsys-jax-archive
Matrix: arm64 / test-maxtext / maxtext-multinode (waiting for pending jobs)
Matrix: arm64 / test-maxtext / single-process-multi-device (waiting for pending jobs)
arm64 / build-rosetta-t5x / build-rosetta: 17m 1s
arm64 / test-axlearn-eks: 0s
arm64 / test-axlearn-fuji-models-eks: 0s
Matrix: arm64 / test-nsys-jax-archive
amd64 / test-maxtext / test-maxtext-metrics: 14s
amd64 / collect-docker-tags: 0s
Matrix: amd64 / test-rosetta-t5x / vit-multi-gpu-multi-node
arm64 / test-maxtext / test-maxtext-metrics: no duration shown
arm64 / collect-docker-tags: 0s
Matrix: arm64 / test-rosetta-t5x / vit-multi-gpu-multi-node (waiting for pending jobs)
amd64 / test-maxtext / test-maxtext-sitrep / sitrep: 6s
amd64 / test-rosetta-t5x / test-t5x-rosetta-summary: 0s
amd64 / test-rosetta-t5x / test-t5x-rosetta-metrics: 13s
arm64 / test-maxtext / test-maxtext-sitrep / sitrep: no duration shown
arm64 / test-rosetta-t5x / test-t5x-rosetta-summary: no duration shown
arm64 / test-rosetta-t5x / test-t5x-rosetta-metrics: no duration shown
amd64 / test-maxtext / test-maxtext-outcome: 1s
amd64 / test-rosetta-t5x / test-t5x-rosetta-sitrep / sitrep: 19s
arm64 / test-maxtext / test-maxtext-outcome: no duration shown
arm64 / test-rosetta-t5x / test-t5x-rosetta-sitrep / sitrep: no duration shown
amd64 / test-rosetta-t5x / test-t5x-rosetta-outcome: 3s
arm64 / test-rosetta-t5x / test-t5x-rosetta-outcome: no duration shown
make-publish-configs: 2s
merge-new-manifest: 6s
Matrix: publish-containers
finalize / workflow-badge: 4s
finalize / report: 19s
finalize / upload-badge: 5s
finalize / publish-badge: 3s
Annotations

3 errors and 3 warnings
amd64 / test-nsys-jax / nsys-jax-A100-unit-test: Process completed with exit code 1.
amd64 / test-nsys-jax / nsys-jax-A100-unit-test: Process completed with exit code 1.
amd64 / test-axlearn-eks: Process completed with exit code 1.
amd64 / test-axlearn-eks: This job failure may be caused by using an out of date version of GitHub runner on your self-hosted runner. You are currently using GitHub runner version 2.323.0. Please update to the latest version 2.324.0
merge-new-manifest: Unexpected input(s) 'owner_and_repo', valid inputs are ['route', 'mediaType']
merge-new-manifest: Unexpected input(s) 'owner_and_repo', 'head', 'base', 'body', 'title', 'draft', valid inputs are ['route', 'mediaType']

Artifacts

Produced during runtime
Name | Size | Digest
artifact-axlearn-build-amd64 | 566 Bytes | sha256:240424a0707393b04d6416ae0fef1b5a65c08248ded6f83523cce7de92016578
artifact-axlearn-build-arm64 | 566 Bytes | sha256:88c933fd9fcf56f97d706a931cdd56bb3d651b9fff74b71fdd4b54648a46b19d
artifact-axlearn-test | 106 KB | sha256:4b9e1e7d6a1be88a0dfdc2551bac5d9a6b2aaa5a6353ff0d084678641c9be8fc
artifact-base-build-amd64 | 567 Bytes | sha256:6e65822e3db4919edceb0bb3cf84815c5115c51a68741cc92b5416b6cf6baa38
artifact-base-build-arm64 | 566 Bytes | sha256:c7bb627acbcc772991dda3571d532992efb7ecae47a150cc1092bc3233709e8a
artifact-equinox-build-amd64 | 568 Bytes | sha256:b77e542e1e3d69459d6128d119faa1a826457519cf4851194e6c784b26c7a7f0
artifact-equinox-build-arm64 | 569 Bytes | sha256:78dcc813b4a0eeb0447592c4059a16eeaa5b997dd0543e8b6adaf9508f2d9ee2
artifact-final-report | 3.93 KB | sha256:a3ec42a82509a357bb25ec0b0e9e6ab39cdf90a526eba33bec98339c6cf7ca00
artifact-jax-build-amd64 | 553 Bytes | sha256:6308782ecf0043760f0bd0e46b14f98e913c934f16789b6e43bfd2f0b8e569d2
artifact-jax-build-arm64 | 554 Bytes | sha256:96c42ad61988a9beaf5cbe4460dcca482b5f718caae7d4dc891f05525cbad9f9
artifact-maxtext-build-amd64 | 568 Bytes | sha256:48bcfa008e6bd87eed6f910d13253df9d6e67cac6e0b8e364689c888e7fda41f
artifact-maxtext-build-arm64 | 567 Bytes | sha256:2581fa021568d68f864858d5b018671a47e1a5db335385a550136f235b31eaf8
artifact-maxtext-test | 1.83 KB | sha256:554ef6e6d9ae813a89fadfe26ffb72eccb503634e07a4061524bf3a4857cc135
artifact-multigpu-test-transformerengine-15083908343-8gpu-unittest | 584 Bytes | sha256:ccc359b428b1790b7515455d87828d9ce54b6ef7262827acfada0a344b666b9b
artifact-rosetta-build-t5x-amd64 | 584 Bytes | sha256:f057638c74d5d54497d7d98ed9571f0ba58c310e6c3fc35680d18850764da79d
artifact-rosetta-build-t5x-arm64 | 585 Bytes | sha256:457135bbb6e7610ababf2ec30ed8af222e92a18c82415d543bcb93f150b18846
artifact-rosetta-t5x-mgmn-test | 1.28 KB | sha256:7d1b8055d7b8bba8ad29bc2915c8a2db2b286491340f904c969e8325ccbd8ee2
artifact-t5x-build-amd64 | 568 Bytes | sha256:764232b230e73c00cacdad46e417c8d2aebd8fe73bcf97649ad463ed884f678e
artifact-t5x-build-arm64 | 569 Bytes | sha256:bea1bde05130e6c752f4a92529c57843932dcfac92706b14ec54e9b30b2bae5c
artifact-workflow-metadata | 278 Bytes | sha256:ddb6e48cfb2564a149e3140c0626a68c2b3225493cf9928704177eaa2e08aae4
bumped-manifest | 46.9 KB | sha256:987f769e359c41d7d9375ea0e98f4b37797da6414bee6e27d93315bf392945b5
final-axlearn | 258 Bytes | sha256:546ef158d5cd883cb28b996eb56cd7822f49a2c4d86dd836b292bcb56a725879
final-base | 249 Bytes | sha256:89e56045f3a74858071b5b92775ce4842427288a28e0da58dc7c0c9b1318c78f
final-equinox | 258 Bytes | sha256:74d196dfc4e2fe2dd5321a31596776552bd701a155925c6faca2d29d4e26a95c
final-jax | 246 Bytes | sha256:ef4f5cd8a71c594e2aae7142dcb1e356d887e49210409631d7ff73fb5a656af3
final-maxtext | 258 Bytes | sha256:a222961c0ed8e05a7c8cc29a51835f70245d86e3be389bfce7cb5d795c33f14b
final-t5x | 246 Bytes | sha256:b481eb251bab6cb4c9bade6dcc4b45207967abbd16f08e0c332d56d048f57cf5
final-upstream-t5x | 273 Bytes | sha256:89c09742060218e89ddb530c2dd65eceec6ec371d98fd098cd6984bac7415444
jax-unit-test-A100 | 20 KB | sha256:7780b7b25810ae2181653f29c9c0a025a1131f4d0209e6b21d801f9bdaa856de
mealkit-axlearn | 269 Bytes | sha256:1c495ad6bcf61b91cf9a870dc5a457cb7c0d6da1feee581afc3984bbc64f42e4
mealkit-equinox | 269 Bytes | sha256:70710fc2bd0ea20c448e2b6a5aefc647655c87a7cc0a135eeeea7abe3ad93036
mealkit-jax | 256 Bytes | sha256:863b2ecb94d582e80b3e2fd8e0d33388f7688e60489e624792ee10ba019a8db9
mealkit-maxtext | 269 Bytes | sha256:be992afbda5cb3dd375095822566c48864050761ac7b486c8fb37a4a6db4a320
mealkit-t5x | 258 Bytes | sha256:645469f2b087fee35b9cdc2b78618d3dba61a7d051c969b243387acc8a8bc92d
mealkit-upstream-t5x | 282 Bytes | sha256:655f6d7c54a1f477bd92de856535ef1d9e24647cdbfa0dcd7242bd383e884562
nsys-jax-unit-test-A100 | 33.2 MB | sha256:e4b8cce78d4b3cb1766d259b7d520d08bd15402b966c856b47bdfb134becb43c
rosetta-t5x-metrics-test-log | 1.03 KB | sha256:abdae611c982986ed143f7ab5314d68a73b5760503151403740b064b41f1a3a7
rosetta-t5x-vit-15083908343-VIT8G1N | 31.9 KB | sha256:0e1a80e914052a127ebe46f6ecdf23c7606c714fd824147387e8f2228566c891
te-unit-test-A100 | 931 KB | sha256:795f0421fd590245f9eed0a4a83fd4e74dc39c3bd8610ffec2a628fc39bdf38e
upstream-maxtext-15083908343-1DP2FSDP4TP1PP_single_process | 20.1 KB | sha256:457e1d79362223824f30f3dad979d5dc2413adb7bdf1725515e9fdec1e8084ed
upstream-maxtext-15083908343-2DP2FSDP2TP1PP | 28.9 KB | sha256:55731f44951dea1674671f89314583033ac6ccbb21b2aab7dcb3ab7014f6570c
upstream-maxtext-metrics-test-log | 1.82 KB | sha256:a0230aaf714032a2a368215b66bba55d5af0105e2831eb04eb611f55a54730c6
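
Each artifact above is listed with a digest of the form sha256:<hex>. As a minimal sketch of how a downloaded artifact might be checked against that value, the Python below hashes a byte string and compares it with the listed digest. The function name matches_digest is hypothetical, and the sketch assumes the digest is computed over the raw bytes of the downloaded archive, which this page does not state.

```python
import hashlib
import hmac


def matches_digest(data: bytes, digest: str) -> bool:
    """Return True if the SHA-256 of `data` equals a 'sha256:<hex>' digest string.

    Hypothetical helper: assumes the listed digest covers the raw bytes
    of the downloaded artifact archive.
    """
    algo, _, expected = digest.partition(":")
    if algo != "sha256" or not expected:
        raise ValueError(f"unsupported digest format: {digest!r}")
    actual = hashlib.sha256(data).hexdigest()
    # Constant-time comparison of the two hex strings.
    return hmac.compare_digest(actual, expected.lower())
```

In use, the bytes would come from reading the downloaded file in binary mode, and the digest string would be copied from the matching row of the table.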