rebase wasm-on-ray branch to latest master #35727

Merged: 1,089 commits, May 24, 2023
12c9724
[Overview][Serve] Add Recent Serve Applications Card (#34642)
chaowanggg May 9, 2023
74549fc
Clean SWR cache between each test cases (#35097)
chaowanggg May 9, 2023
faae43a
Revert "[Overview][Serve] Add Recent Serve Applications Card" (#35155)
aslonnie May 9, 2023
0377474
[air/output] Fix trial status at end (more info + cut off) (#35128)
krfricke May 9, 2023
094dc15
[Core] Fix async actor shutdown issue when exit_actor is used (#32407)
rkooo567 May 9, 2023
91014a2
[serve] Add log file path to replica details (#33640)
zcin May 9, 2023
13056e9
[Docker] [runtime env] Bump boto3 version from 1.4.8 to 1.26.82, add …
architkulkarni May 9, 2023
3b58d24
[core] Make ray.get(timeout=0) to throw timeout error (#35126)
rickyyx May 9, 2023
cc3fa33
[core] Change worker niceness in job submission environment (#34727)
vitsai May 9, 2023
9067905
[ci/release] Resolve dependencies with python 3.9 inside conda. (#35176)
aslonnie May 9, 2023
7204027
[Data] Remove "Scalable Batch Inference with Ray" from batch inferenc…
bveeramani May 9, 2023
4b2f198
[core] Start ray syncer reconnection after a delay (#35115)
fishbone May 9, 2023
e211688
[docker] Add netbase to base deps docker image (#35174)
krfricke May 9, 2023
8b20111
[Data] Update `pipelined_training_50_gb.aws` instance type (#35150)
bveeramani May 9, 2023
e276929
[data] Update the strict mode message to be less confusing (#35185)
ericl May 9, 2023
9e0a000
[RLlib] Activate RLModules and Learner together in docs (#35145)
ArturNiederfahrenhorst May 9, 2023
4a17d8c
[RLlib] Add test utils for rllib contrib (#35056)
avnishn May 9, 2023
dfeef56
[Data] Allow fusing `MapOperator` -> `Repartition` operators (#35178)
scottjlee May 9, 2023
7bce9f3
[core] Add object owner and copy metrics to node stats (#35119)
fishbone May 10, 2023
3febc5e
[data] Revert the dataset to datastream class rename (#35082)
ericl May 10, 2023
886926c
[AIR] LightningTrainer Dolly V2 FSDP Fine-tuning Example (#34990)
woshiyyya May 10, 2023
6cd79c9
[ci][core] Remove test_ray_get_timeout_zero #35196
rickyyx May 10, 2023
2400117
[Core]Fixing the flakey test cases caused by Redis startup failure du…
larrylian May 10, 2023
48d3b29
[core][state] Push down filtering to GCS for listing/getting task fro…
rickyyx May 10, 2023
cba7d35
[Core] Add bundles_to_node_id info in placement_group_table (#35122)
larrylian May 10, 2023
790a16d
Downgrade hermetic python to 3.8 (#35198)
aslonnie May 10, 2023
acffd2f
[Core] Put pg state to kv store when pg rescheduling (resubmit) (#34948)
larrylian May 10, 2023
c70331d
Add runtime env metadata to jobs detail page. (#34984)
alanwguo May 10, 2023
0e917ef
[RLlib] Unity3D adapter: Disable env pre-checking (agent IDs not know…
sven1977 May 10, 2023
acf18fa
[docs] update batch guide link, fix tensor ref (#35171)
maxpumperla May 10, 2023
7891a92
[docs] synced tabs in AIR getting started (#35170)
maxpumperla May 10, 2023
4c83da1
[docs] fixing missing libs in batch-x examples (#35169)
maxpumperla May 10, 2023
9e01c20
[docs] replace deprecated sklearn by scikit-learn installation (#35168)
maxpumperla May 10, 2023
9a5ef51
[RLlib] No longer return action distribution objects from RLModule's …
ArturNiederfahrenhorst May 10, 2023
a514ade
[docs][serve] add note that Ray doesn't pickle (#35194)
angelinalg May 10, 2023
959328b
[docs] batch inference pass (#35041)
maxpumperla May 10, 2023
68073ee
[docs] fix outdated tensor data ref (#35212)
maxpumperla May 10, 2023
ce95fb9
[data] [doc] Fix dataset images
ericl May 10, 2023
11d4cdb
[RLlib] Replace calls to socket in learner group for getting ip addre…
avnishn May 10, 2023
805e6ce
[Data][CI] Mark `dataset_shuffle_sort_1tb` tests as unstable (#35203)
bveeramani May 10, 2023
5a4e571
[data] Update tagline to datasets for ML (#35228)
ericl May 10, 2023
8ec3572
[Data] Improve notebook widget display (#34359)
peytondmurray May 10, 2023
fe5d45e
[CI/air] Fix lightning_gpu_tune_.* release test (#35193)
woshiyyya May 10, 2023
de91fc9
[Doc] Correctly Render the Enumerate Numbers in `convert_torch_code_t…
woshiyyya May 10, 2023
ab7384b
[Data] Clarify `map` slow warning (#35204)
amogkam May 10, 2023
d49a749
[Release tests] Moving Ray Data bulk ingest test ownership team to Da…
cadedaniel May 10, 2023
d432cfb
[core] Reduce self alive check from 60s to 5s. (#34992)
fishbone May 11, 2023
a8a6c59
[core] Turn on ray syncer again. (#35116)
fishbone May 11, 2023
33b9680
[Data][Docs] Fix `hf_quick_start.py` (#35240)
bveeramani May 11, 2023
d56b04f
[core] Deflakey test advanced 9 (#35247)
fishbone May 11, 2023
0980d59
[ci] Fix dask Ray client tests (#35233)
krfricke May 11, 2023
e81963d
[docs] auto-remove gen apis on make clean (#35210)
maxpumperla May 11, 2023
384ad04
[RLlib] APPO+new-stack (Atari benchmark) - Preparatory PR 04 - Learne…
sven1977 May 11, 2023
650bc36
[ci] Use python 3.9 in WORKSPACE (#35255)
aslonnie May 11, 2023
e9c69ee
[Client] Optimize chunk size (#35025)
zen-xu May 11, 2023
93c4163
Run bisect with the correct python version (#35186)
can-anyscale May 11, 2023
207dcc9
[ci/bazel][2] bazelize all other ray_release tests (#35032)
can-anyscale May 11, 2023
2ddebc0
Update README.rst (#35267)
jcoffi May 11, 2023
81a4a5e
Bring back "[Core] Port GcsPublisher to Cython" (#34393) (#35179)
pcmoritz May 11, 2023
62c9bcc
[serve] Add controller metadata (#35182)
zcin May 11, 2023
d5b4701
[serve] Stream Serve logs across different drivers (#35070)
zcin May 11, 2023
9d741ee
[Overview][Serve] Add Recent Serve Applications Card #34642 (#35227)
chaowanggg May 11, 2023
8963978
[Serve] Add status_code to http qps & latency (#35134)
sihanwang41 May 11, 2023
db4715d
[train] Fix HuggingFace -> Transformers wrapping logic (#35276)
matthewdeng May 11, 2023
0470383
[core][dashboard][state] Support task logs from state API (#35101)
rickyyx May 11, 2023
a627696
[core][dashboard] Task backend GC policy - worker update [1/3] (#34896)
rickyyx May 11, 2023
977cac2
Use state-api for job driver logs (#35235)
alanwguo May 11, 2023
04e414a
Add docs for setting up metrics for homebrew installations (#35026)
alanwguo May 11, 2023
55b9097
Add HTTPProxy details to Serve Dashboard UI (#35159)
alanwguo May 11, 2023
f91f5e8
[AIR] Remove hard-deprecated and unused code (#35163)
justinvyu May 11, 2023
8f9355a
[Doc] [no_early_kickoff] Revamp ray core api reference [1/n] (#34428)
jjyao May 11, 2023
a44b00d
[docs][observability] O11y refactor 1/N (#35158)
angelinalg May 12, 2023
108f34a
[train] Fix HuggingFace -> Transformers wrapping logic 2 (#35284)
matthewdeng May 12, 2023
639b97e
[Serve] Add route tags with custom metrics (#35246)
sihanwang41 May 12, 2023
72e5f6b
[Serve] Add more bucket size (#35242)
sihanwang41 May 12, 2023
014f149
[release test] [Cluster launcher] Add gcp minimal and full cluster la…
architkulkarni May 12, 2023
39b3247
[docs] [data] Update use case doc links and resources (#35277)
ericl May 12, 2023
a7d75fb
[Release test] Disabling empty-runtime-env tests in benchmark_worker_…
cadedaniel May 12, 2023
710c17a
[docs] nav fixes #34583 (#35296)
maxpumperla May 12, 2023
f936826
[AIR] Distributed checkpointing (#34709)
May 12, 2023
3185543
[RLlib] RLlib contrib (#35141)
avnishn May 12, 2023
f0d89af
[telemetry] Add libc version to ray telemetry. (#33444)
fishbone May 12, 2023
813c57e
[data] Capture the context when the dataset is first created (#35239)
ericl May 12, 2023
1240a81
[core] Make execute_after accept chrono (#35099)
fishbone May 12, 2023
0785e97
[no_early_kickoff] [data] Improve our handling of tensor returns in s…
ericl May 13, 2023
e2c6d9b
[Serve] Add multiplex support (#34941)
sihanwang41 May 13, 2023
9eb9c67
Add support for multi-tab log viewer (#35280)
alanwguo May 14, 2023
1c859f0
Fix "ImportError: sys.meta_path is None, Python is likely shutting do…
raulchen May 14, 2023
a38ed6e
[data] Add GPU data ingestion nightly test (#34986)
raulchen May 14, 2023
ecd9996
[core][state][dashboard][log] Fix subdirectory log getting (#35283)
rickyyx May 15, 2023
3d977b8
[core][state][no_early_kickoff] Add "humanify" feature to StateSchema…
ProjectsByJackHe May 15, 2023
6088de3
[docs] clarify FAST build option, fixes #35293 (#35297)
maxpumperla May 15, 2023
5d0b15e
[AIR] Deprecate `ray.tune.logger.Logger` interface (#35162)
justinvyu May 15, 2023
2dbe747
[Core/Logging] Worker startup hook (#34738)
rkooo567 May 15, 2023
21e9d38
[serve] Log to files in JSON format by default (#35118)
sihanwang41 May 15, 2023
a0e318b
[autoscaler v2][4/n] introducing node-provider and node-provider-conf…
scv119 May 15, 2023
e08ef63
[docs] fix map_batches ActorPoolStrategy ref (#35331)
maxpumperla May 15, 2023
fdcc58b
[ci/github] Track external code changes (blogs, tutorials) (#35261)
krfricke May 15, 2023
27f4713
[Data] Improve compute validation error (#35234)
bveeramani May 15, 2023
20ee164
[Data] Fix inference release test (#35339)
bveeramani May 15, 2023
c7337ca
[Data] Improve `Schema` representation (#35278)
bveeramani May 15, 2023
3ca8632
During GCS restarts, grpc based resource broadcaster should only add …
jjyao May 15, 2023
5a5155c
[Data] Improve docstring and warning message for `from_huggingface` (…
amogkam May 15, 2023
c8fc8f6
[Data] Add `column` API to Dataset (#35241)
amogkam May 15, 2023
84833a4
[Doc] Make doc code snippet testable [2/n] (#35274)
jjyao May 15, 2023
6b7e412
[RLlib] Remove some specs from encoders to smoothen dev experience (#…
ArturNiederfahrenhorst May 15, 2023
adddc2b
[Train] Don't repartition if xgboost-ray>=0.1.16 (#32960)
Yard1 May 15, 2023
2e08421
[Serve] Multiplex API Impl (#35326)
sihanwang41 May 15, 2023
01f932c
[UI] Unify colors of different status for Jobs, Services, Actors (#35…
chaowanggg May 16, 2023
79784c5
[core] Delete disconnected node view in ray syncer when connection is…
fishbone May 16, 2023
152e06d
[doc] [data] Update dataset intro page and fix some typos (#35361)
ericl May 16, 2023
a21713e
[data] Fix bugs in handling of nested ndarrays (and other complex obj…
ericl May 16, 2023
5197da2
[Train] LightningTrainer: Enable prog bar (#35350)
woshiyyya May 16, 2023
496024d
Add "all" option for SessionName (#35303)
alanwguo May 16, 2023
35cb004
[Dashboard] Add task detail page with logs (#35328)
alanwguo May 16, 2023
5ccbb7c
[core][state][ci] Fix stress_test_state_api_scale (#35332)
rickyyx May 16, 2023
37fa98f
[core][state][job] Supporting job listing(getting) and logs from stat…
rickyyx May 16, 2023
548e70b
[tests] fix lint and dependency issues in tests (#35373)
matthewdeng May 16, 2023
fb41024
[ci] External code tracker: Ignore if file is not found (#35376)
krfricke May 16, 2023
bdf3e56
[Dashboard] Add serve controller info to the Serve page (#35327)
alanwguo May 16, 2023
0d880e3
[core][dashboard] Make actor tasks'name default to <actor_repr>.<task…
rickyyx May 16, 2023
7fd2edf
[tune/execution] 1/n Add more unittests for TuneController (#34833)
krfricke May 16, 2023
bf33878
[air/output] Context-aware output engine: Add docs, experimental feat…
krfricke May 16, 2023
8f4a0e4
[ci][byod/1] clean up local environment setup for release tests (#35355)
can-anyscale May 16, 2023
a862547
[AIR][Telemetry] Cluster storage configuration (#34905)
justinvyu May 16, 2023
42a8d14
Remove previously added debug logs (#35360)
jjyao May 16, 2023
681ce0d
[serve] Catch all exceptions during deploy (#35307)
zcin May 16, 2023
a6cb363
[AIR][Telemetry] Experiment tracking integrations + callbacks (#34904)
justinvyu May 16, 2023
1a88c05
[Train] Change `num_boost_round` to target iterations (#33602)
Yard1 May 16, 2023
8c2e850
[serve][docs] Add user guide for application builders (#35392)
edoakes May 16, 2023
1ccd59a
[air/output] Add parameter columns to status table (#35388)
krfricke May 16, 2023
e19ab03
Revert "Add "all" option for SessionName (#35303)" (#35403)
cadedaniel May 16, 2023
53c638f
[Serve] Mutliplexed information report impl (#35372)
sihanwang41 May 16, 2023
736060e
[core][state] Move state API out of experimental (#35318)
rickyyx May 16, 2023
fa077fb
[data] Improve map batches error message for strict mode migration (#…
ericl May 16, 2023
ea6b926
[RLlib] Add missing `sampler_results` key to fetch min desired reward…
ArturNiederfahrenhorst May 16, 2023
7e61d0b
[core] Change log error to log info when node disconnect and got dete…
fishbone May 16, 2023
f31d70e
Add a disconnect button to the context widgets in notebooks (#34815)
peytondmurray May 17, 2023
8b264df
[core] Serialize requests in redis store client. (#35123)
fishbone May 17, 2023
a9f333a
[CI] Increase parallelism for Train tests to 4 (#35401)
Yard1 May 17, 2023
9cd5097
[core] Sending ReportWorkerFailure after the process died. (#35320)
fishbone May 17, 2023
119e12b
[AIR] Deprecate modules in `ray.tune.integration` (#35160)
justinvyu May 17, 2023
cde7109
[RLlib] DreamerV3: RLModule class, tf.keras model components, dreamer…
sven1977 May 17, 2023
1bd30c3
[serve] Shutdown http proxy state (#35395)
zcin May 17, 2023
ae5e1c7
[Development] Fix unbound `BUILDKITE` variable in `install-bazel.sh` …
architkulkarni May 17, 2023
45067ae
Revert "Add a disconnect button to the context widgets in notebooks (…
rkooo567 May 17, 2023
5ec3a36
Revert "[core] Sending ReportWorkerFailure after the process died. (#…
cadedaniel May 17, 2023
9ac0d44
[Serve] Http proxy & router & handle to support multiplex impl (#35399)
sihanwang41 May 17, 2023
987a1e3
[serve] Remove print statement + fix lint (#35439)
zcin May 17, 2023
4202bc2
[RLlib] Make CNN encoder test larger (#35374)
ArturNiederfahrenhorst May 17, 2023
53d3ef1
[train] Restructure `ray.train` HuggingFace modules (#35270)
matthewdeng May 17, 2023
429316c
[serve] Fix `app_builder` doc code test (#35456)
edoakes May 17, 2023
f796635
[RLlib] Add torch compile capabilities to TorchRLModule (#34640)
ArturNiederfahrenhorst May 17, 2023
f7f18b2
[Data/Train] Fix ipython representation (#35414)
amogkam May 17, 2023
dad924e
Add "all" option for SessionName (#35408)
alanwguo May 17, 2023
4bfdacf
Update version in dask on ray guide for 2.5.0 release (#35458)
ArturNiederfahrenhorst May 17, 2023
fdcdd39
[Data] Fix `read_tfrecords_benchmark` (#35152)
bveeramani May 17, 2023
e231d46
[core] Retry failed redis request (#35249)
fishbone May 18, 2023
539256a
[Dask on Ray] Attempt to fix line in dask doc (#35479)
ArturNiederfahrenhorst May 18, 2023
56ae7eb
[Doc] Make doc code snippet testable [3/n] (#35407)
jjyao May 18, 2023
fa4b59e
[install] fix installation instructions for ray[default] (#35442)
eax-anyscale May 18, 2023
b1d4249
[Data] Add `num_cpus` and `num_gpus` as top-level args to map functio…
amogkam May 18, 2023
82d5568
[AIR] Move Constants from tune/results.py to air/constants.py (#35404)
woshiyyya May 18, 2023
31caa12
[Tune] Fix hyperband scheduler raising an error for good `PENDING` tr…
justinvyu May 18, 2023
e47f4c2
[CI] Fix ml_user_ray_lightning_user_test_(master|latest).aws release …
woshiyyya May 18, 2023
4385ec8
Make sure connector related tests do not use pre-generated policy che…
May 18, 2023
dedc47d
Make test_torch_predictor a medium test. (#35466)
May 18, 2023
bfec451
[1/N] Streaming Generator. Cpp interfaces and implementation (#35291)
rkooo567 May 18, 2023
97fc841
[tune] Track PyTorch tutorials file in our CI (#35351)
krfricke May 18, 2023
2bfe43f
Revert "[Data] Add `num_cpus` and `num_gpus` as top-level args to map…
amogkam May 18, 2023
006bb51
[docker] Preserve date and git commit prefix in tags. (#35474)
aslonnie May 18, 2023
658c8e3
[RLlib][RLlib contrib] add soft deprecation notices to maml and a3c (…
avnishn May 18, 2023
db97dda
[RLlib contrib] add contributing.md to RLlib contrib (#35346)
avnishn May 18, 2023
21d66e4
[RLlib contrib] Fix rllib contrib readmes (#35347)
avnishn May 18, 2023
601a3ea
[RLlib] Fix IMPALA/APPO when using multi GPU setup and Multi-Agent En…
RocketRider May 18, 2023
fdd7c87
[Doc] Fix error in "Writing code snippets" (#35462)
bveeramani May 18, 2023
d62b7dd
[Doc] Fix batch_forecasting.ipynb (#35467)
May 18, 2023
f1b7eb6
[data] Fix ragged tensor conversion with map() (#35419)
ericl May 18, 2023
092d25b
[Core] Fix the recursion error when async actor has lots of deseriali…
rkooo567 May 18, 2023
de3df47
Revert "Revert "[Data] Add `num_cpus` and `num_gpus` as top-level arg…
amogkam May 18, 2023
72205c6
[Doc] Make doc code snippet testable [4/n] (#35506)
jjyao May 18, 2023
ec94bd4
[core] Graceful handling of returning bundles when node is removed (#…
rickyyx May 18, 2023
35305e4
[docs]change recommended install to ray[air] (#35149)
angelinalg May 19, 2023
43a20c1
[AIR][Doc] New Example: LightningTrainer with experiment tracking too…
woshiyyya May 19, 2023
ce7764b
Revert "Revert "Global logging format changes" (#34126)" (#34182)
peytondmurray May 19, 2023
f170d13
[Doc] Pin deepspeed version to fix fine_tuning example (#35477)
May 19, 2023
17f2319
[docker] Publish installed packages in docker containers (#35390)
krfricke May 19, 2023
06d25a7
[CI] Skip `test_multiprocessing_client_mode` on windows (#35534)
architkulkarni May 19, 2023
b3d97e8
[Serve] [Docs] Add section about LLM developers to Ray Serve docs (#3…
shrekris-anyscale May 19, 2023
c0f04f2
[Test] Deflake test_client_reconnect (#35567)
ckw017 May 20, 2023
3275be3
[RLlib contrib] Added Table of content to list all externally contrib…
kouroshHakha May 20, 2023
63ebee2
[2/N] Streaming Generator. Support core worker APIs + cython generat…
rkooo567 May 21, 2023
946459a
[CI] Build jar on manylinux2014 (#35538)
XiaodongLv May 21, 2023
0a1f435
[RLlib] Fit ES and ARS results dict to rest of RLlib, enable results …
ArturNiederfahrenhorst May 21, 2023
323d9d5
[RLlib] Use requests.Session object to reuse connections and use reso…
MattiasDC May 21, 2023
44ef068
[3/N] Streaming Generator. E2e integration (#35325)
rkooo567 May 22, 2023
decc28d
[Serve] Fix object detection doc test failures (#35528)
sihanwang41 May 22, 2023
0fd06ad
[RLlib] Learner API (+DreamerV3 prep): `Learner.register_metrics` API…
sven1977 May 22, 2023
d0153a3
[Doc] Remove link (#35575)
ArturNiederfahrenhorst May 22, 2023
8427de2
[RLlib] Fix ARS release test (#35608)
ArturNiederfahrenhorst May 22, 2023
ea5bcb9
[autoscaler v2] add unit tests for NodeProviderConfig (#35590)
scv119 May 22, 2023
c2c2e13
[Core][Test] Change `test_streaming_generator` to a large test (#35606)
rkooo567 May 22, 2023
fb95f03
[RLlib] Add a KL ratio test to PPO and remove APPO's own learner keys…
ArturNiederfahrenhorst May 22, 2023
d0b059d
[Data] Add batch inference object detection example (#35143)
raulchen May 22, 2023
d092f3d
[core][autoscaler] GCS Autoscaler V2: Interface [1/x] (#35549)
rickyyx May 22, 2023
668384d
[docs][observability] O11y refactor 2 (#35279)
angelinalg May 22, 2023
eea395c
[ci][byod/2] compute anyscale byod for release tests (#35369)
can-anyscale May 23, 2023
c4660b1
Update the default bazel installation location (#35482)
WanNJ May 23, 2023
18b5efd
[ci][byod/4] enrich byod definition with type and pre_run_cmds (#35400)
can-anyscale May 23, 2023
7e02ec4
[Data] Fix object detection example test (#35636)
raulchen May 23, 2023
cfe5dad
[Data] Mark chaos_pipelined_ingestion_1500_gb_15_windows as unstable …
amogkam May 23, 2023
531503b
Fix single_client_wait_1k perf regression (#35614)
jjyao May 23, 2023
62e5264
[RLlib] Fix test supported spaces with RLM (#35485)
ArturNiederfahrenhorst May 23, 2023
f79e9dd
[RLlib] Fix self play examples with RL Modules (#35633)
ArturNiederfahrenhorst May 23, 2023
6a3ea34
[RLlib] Fix _fake model to support testing with RL Modules and torch …
ArturNiederfahrenhorst May 23, 2023
fdbd252
[RLlib] Fix Checkpointing test when using Learner API (#35642)
ArturNiederfahrenhorst May 23, 2023
3fb808e
[RLlib] PPO on new API stack: Add `use_kl_loss` setting (already exis…
sven1977 May 23, 2023
ee9827d
[GCS] Give result variables a default zero value. (#34994)
aslonnie May 23, 2023
9fcc5a1
[core][dashboard] Add task name in task log magic token (#35377)
rickyyx May 23, 2023
8fa6fb5
[docs] fix bazel build command (#35298)
eax-anyscale May 23, 2023
045e617
[git] enforce signoff in commit hooks (#35502)
eax-anyscale May 23, 2023
9c54d28
[ci][byod/5] build anyscale byod images (#35418)
can-anyscale May 23, 2023
f7e6928
[Doc] Make doc code snippet testable [5/n] (#35562)
jjyao May 23, 2023
852f0f1
[Core] Print out warning every 1s when sched_cls_id is greater than 1…
jjyao May 23, 2023
86fab17
[core] Fix raylet memory leak in the wrong setup. (#35647)
fishbone May 23, 2023
26cae68
[core] Don't drop rpc status in favor of reply status (#35530)
vitsai May 23, 2023
89e16dd
[serve] Move `num_queued_queries` logic out of `ReplicaSet` and into …
edoakes May 23, 2023
6e9dfd9
[air/output] Switch on per default (#35389)
krfricke May 23, 2023
5073be7
[RLlib] Don't add a cpu to bundle for learner when using gpu (#35529)
avnishn May 23, 2023
dad239a
[RLlib] Fix bug in tf_learner.py; Learner.update method has accidenta…
sven1977 May 23, 2023
51190a1
[RLlib] Fix missing action distribution error in policy files (#35625)
ArturNiederfahrenhorst May 23, 2023
999fbf9
[RLlib] Increase the required time for PPO learner tests (#35651)
ArturNiederfahrenhorst May 23, 2023
aff48cf
Fix literalinclude path (#35660)
jjyao May 23, 2023
327d8e5
[RLlib] Attempt to deflake gym API tests and make them run quicker (#…
ArturNiederfahrenhorst May 23, 2023
62f6fd2
[doc][runtime env] Update that local py_module wheels are unsupported…
architkulkarni May 23, 2023
0b706fe
[Serve] Set `app_msg` to empty string by default (#35646)
shrekris-anyscale May 23, 2023
e15758e
[AIR, Doc] Separate computer_vision.py into its own test of enormous …
May 23, 2023
b81b1be
[RLlib] Make resource requests for multi gpu learners not request cpu…
avnishn May 24, 2023
ec68b86
[Doc] Fix doc build on M1 (#35689)
pcmoritz May 24, 2023
0d09bf6
[Data] Don't convert variable-length byte data to Numpy array (#35638)
amogkam May 24, 2023
0a5d69d
[Data] Clarify GPT2 outputs in HuggingFace example (#35688)
bveeramani May 24, 2023
eaa02f3
Revert "[core][dashboard] Add task name in task log magic token" (#35…
rickyyx May 24, 2023
10f2d7d
[Streaming Generator] Fix a reference leak when a stream is deleted w…
rkooo567 May 24, 2023
f035000
[Core] Guarantee the ordering of put ActorTaskSpecTable and ActorTabl…
jjyao May 24, 2023
c3232c5
[Core] Make some grpcio imports lazy (#35705)
pcmoritz May 24, 2023
5acf41e
[air] Store unflattened metrics in _TrackedCheckpoint (#35658)
krfricke May 24, 2023
806b633
[RLlib] Minor doc fixes (#35675)
Rohan138 May 24, 2023
09e07fd
[RLlib] Fix tfp import (#34756)
Rohan138 May 24, 2023
8e49d2a
[RLlib] Fix ope evaluation bug (#35697)
Rohan138 May 24, 2023
dc88d31
[Train] Fix `TorchCheckpoint` encoding directories (#35669)
Yard1 May 24, 2023
2 changes: 1 addition & 1 deletion .bazeliskrc
@@ -1 +1 @@
-USE_BAZEL_VERSION=5.x
+USE_BAZEL_VERSION=5.4.1
8 changes: 4 additions & 4 deletions .bazelrc
@@ -44,8 +44,8 @@ build:clang-cl --per_file_copt="-\\.(asm|S)$@-Werror"
 build:msvc-cl --per_file_copt="-\\.(asm|S)$@-WX"
 # Ignore warnings for protobuf generated files and external projects.
 build --per_file_copt="\\.pb\\.cc$@-w"
-build --per_file_copt="-\\.(asm|S)$,external/.*@-w"
-#build --per_file_copt="external/.*@-Wno-unused-result"
+build:linux --per_file_copt="-\\.(asm|S)$,external/.*@-w,-Wno-error=implicit-function-declaration"
+build:macos --per_file_copt="-\\.(asm|S)$,external/.*@-w,-Wno-error=implicit-function-declaration"
 # Ignore minor warnings for host tools, which we generally can't control
 build:clang-cl --host_copt="-Wno-inconsistent-missing-override"
 build:clang-cl --host_copt="-Wno-microsoft-unqualified-friend"
@@ -200,5 +200,5 @@ try-import %workspace%/.llvm-local.bazelrc
 # It picks up the system headers when someone has protobuf installed via Homebrew.
 # Work around for https://github.com/bazelbuild/bazel/issues/8053
 build:macos --sandbox_block_path=/usr/local/
-#This option controls whether javac checks for missing direct dependencies.
-build --strict_java_deps=off
+# This option controls whether javac checks for missing direct dependencies.
+build --experimental_strict_java_deps=off
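
Review note: the `--per_file_copt` values in this file follow Bazel's `[+-]regex[,[+-]regex]...@option[,option]...` shape, where a leading `-` marks an exclusion. The sketch below is a simplified Python model of that matching, not Bazel's implementation. The helper names and sample paths are hypothetical, it assumes no escaped `@` or `,` in the spec, and it takes the regex as it looks after `.bazelrc` unescaping (`\\.` becomes `\.`).

```python
import re


def parse_per_file_copt(spec):
    """Split '[+-]regex,...@opt,...' into (includes, excludes, options).

    Simplification: assumes the spec contains no escaped '@' or ','.
    """
    filters, _, options = spec.partition("@")
    includes, excludes = [], []
    for pattern in filters.split(","):
        if pattern.startswith("-"):
            excludes.append(re.compile(pattern[1:]))
        elif pattern.startswith("+"):
            includes.append(re.compile(pattern[1:]))
        else:
            includes.append(re.compile(pattern))
    return includes, excludes, options.split(",")


def copts_for(spec, path):
    """Return the extra compiler options the spec would add for `path`."""
    includes, excludes, options = parse_per_file_copt(spec)
    # Any exclusion match wins over inclusions.
    if any(p.search(path) for p in excludes):
        return []
    # With no inclusion patterns, every non-excluded file matches.
    if includes and not any(p.search(path) for p in includes):
        return []
    return options


# The Linux/macOS value from this diff, after .bazelrc unescaping.
SPEC = r"-\.(asm|S)$,external/.*@-w,-Wno-error=implicit-function-declaration"
```

Under this model, a C file under `external/` picks up both options, while assembly files and first-party sources are left alone.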
2 changes: 1 addition & 1 deletion .buildkite/copy_files.py
@@ -46,7 +46,7 @@ def perform_auth():

 def handle_docker_login(resp):
     pwd = resp.json()["docker_password"]
-    subprocess.call(
+    subprocess.check_call(
         ["docker", "login", "--username", "raytravisbot", "--password", pwd]
     )

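
Review note: the `handle_docker_login` change swaps `subprocess.call` for `subprocess.check_call`. The practical difference is that `call` only returns the exit status, so a failed `docker login` would pass silently unless the caller inspects it, whereas `check_call` raises `subprocess.CalledProcessError`. A minimal sketch of that difference follows. It uses a stand-in child process that exits with status 3 instead of the real `docker` command, and `run_checked` is a hypothetical helper.

```python
import subprocess
import sys

# Stand-in for a failing command: a child Python process that exits with 3.
FAILING = [sys.executable, "-c", "import sys; sys.exit(3)"]
PASSING = [sys.executable, "-c", "pass"]


def run_unchecked(cmd):
    """subprocess.call returns the exit status and never raises on failure."""
    return subprocess.call(cmd)


def run_checked(cmd):
    """subprocess.check_call raises CalledProcessError on a non-zero exit."""
    try:
        subprocess.check_call(cmd)
    except subprocess.CalledProcessError as err:
        return err.returncode
    return 0
```

With `check_call`, a failing login step stops the script at the point of failure rather than surfacing later as an unauthenticated push.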
32 changes: 16 additions & 16 deletions .buildkite/pipeline.arm64.yml
@@ -60,77 +60,77 @@
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.7 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py37 --device-types cpu cu112 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py37 -T cpu -T cu112 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py37 [aarch64] (2/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.7 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py37 --device-types cu113 cu116 cu118 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py37 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py38 [aarch64] (1/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.8 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py38 --device-types cpu cu112 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py38 -T cpu -T cu112 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py38 [aarch64] (2/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.8 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py38 --device-types cu113 cu116 cu118 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py38 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py39 [aarch64] (1/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.9 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py39 --device-types cpu cu112 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py39 -T cpu -T cu112 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py39 [aarch64] (2/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.9 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py39 --device-types cu113 cu116 cu118 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py39 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py310 [aarch64] (1/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.10 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - ./ci/env/env_info.sh
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py310 --device-types cpu cu112 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py310 -T cpu -T cu112 --build-type BUILDKITE --build-base --suffix aarch64

 - label: ":mechanical_arm: :docker: Build Images: py310 [aarch64] (2/2)"
   conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
   instance_size: arm64-medium
   commands:
-    - LINUX_WHEELS=1 ./ci/ci.sh build
+    - LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.10 ./ci/ci.sh build
     - pip install -q docker aws_requests_auth boto3
     - if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
-    - python ./ci/build/build-docker-images.py --py-versions py310 --device-types cu113 cu116 cu118 --build-type BUILDKITE --build-base --suffix aarch64
+    - python ./ci/build/build-docker-images.py --py-versions py310 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base --suffix aarch64
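
Review note: across this pipeline the `build-docker-images.py` invocations move from one space-separated `--device-types` list to a repeated `-T` flag, one value per occurrence. The diff does not show how the script parses its arguments, so the sketch below is only an illustration of the repeated-flag pattern using the standard library's `argparse`. The parser and the `--device-types` long form of `-T` are assumptions, and the real script may use a different library.

```python
import argparse


def build_parser():
    """Hypothetical stand-in parser; not the real build-docker-images.py."""
    parser = argparse.ArgumentParser(description="repeated-flag illustration")
    parser.add_argument("--py-versions", nargs="+", default=[])
    # action="append" collects one value per occurrence of -T.
    parser.add_argument("-T", "--device-types", action="append", default=[])
    parser.add_argument("--build-type")
    parser.add_argument("--build-base", action="store_true")
    parser.add_argument("--suffix")
    return parser


def parse(command_line):
    """Parse a whitespace-separated command line into a namespace."""
    return build_parser().parse_args(command_line.split())


ARGS = parse(
    "--py-versions py37 -T cpu -T cu112 "
    "--build-type BUILDKITE --build-base --suffix aarch64"
)
```

A repeated flag keeps each value attached to its own option, so a following positional or misordered argument cannot be absorbed into the device list.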
73 changes: 28 additions & 45 deletions .buildkite/pipeline.build.yml
@@ -14,7 +14,6 @@
   commands:
     # Build the wheels and jars
     - UPLOAD_WHEELS_AS_ARTIFACTS=1 LINUX_WHEELS=1 LINUX_JARS=1 ./ci/ci.sh build
-    - bash ./java/build-jar-multiplatform.sh linux
     # Upload the wheels and jars
     # We don't want to push on PRs, in fact, the copy_files will fail because unauthenticated.
     - if [ "$BUILDKITE_PULL_REQUEST" != "false" ]; then exit 0; fi
@@ -88,80 +87,80 @@
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.7 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py37 --device-types cpu cu101 cu102 cu110 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py37 -T cpu -T cu101 -T cu102 -T cu110 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py37 (2/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.7 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py37 --device-types cu111 cu112 cu113 cu116 cu118 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py37 -T cu111 -T cu112 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py38 (1/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.8 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py38 --device-types cpu cu101 cu102 cu110 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py38 -T cpu -T cu101 -T cu102 -T cu110 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py38 (2/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.8 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py38 --device-types cu111 cu112 cu113 cu116 cu118 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py38 -T cu111 -T cu112 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py39 (1/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.9 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py39 --device-types cpu cu101 cu102 cu110 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py39 -T cpu -T cu101 -T cu102 -T cu110 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py39 (2/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.9 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py39 --device-types cu111 cu112 cu113 cu116 cu118 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py39 -T cu111 -T cu112 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py310 (1/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.10 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- ./ci/env/env_info.sh
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py310 --device-types cpu cu101 cu102 cu110 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py310 -T cpu -T cu101 -T cu102 -T cu110 --build-type BUILDKITE --build-base

- label: ":docker: Build Images: py310 (2/2)"
conditions: ["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_DOCKER_AFFECTED", "RAY_CI_CORE_CPP_AFFECTED"]
instance_size: medium
commands:
- LINUX_WHEELS=1 ./ci/ci.sh build
- LINUX_WHEELS=1 BUILD_ONE_PYTHON_ONLY=3.10 ./ci/ci.sh build
- pip install -q docker aws_requests_auth boto3
- if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then python .buildkite/copy_files.py --destination docker_login; fi
- python ./ci/build/build-docker-images.py --py-versions py310 --device-types cu111 cu112 cu113 cu116 cu118 --build-type BUILDKITE --build-base
- python ./ci/build/build-docker-images.py --py-versions py310 -T cu111 -T cu112 -T cu113 -T cu116 -T cu118 --build-type BUILDKITE --build-base

- label: ":java: Java"
conditions: ["RAY_CI_JAVA_AFFECTED"]
@@ -298,15 +297,6 @@
python/ray/serve/test_gradio
python/ray/serve/test_gradio_visualization


- label: ":python: Minimal install 3.6"
conditions: ["RAY_CI_PYTHON_AFFECTED"]
instance_size: medium
commands:
- cleanup() { if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then ./ci/build/upload_build_info.sh; fi }; trap cleanup EXIT
- ./ci/ci.sh test_minimal 3.6
- ./ci/ci.sh test_latest_core_dependencies 3.6

- label: ":python: Minimal install 3.7"
conditions: ["RAY_CI_PYTHON_AFFECTED"]
instance_size: medium
@@ -377,6 +367,9 @@
- DL=1 ./ci/env/install-dependencies.sh
- bash ./ci/ci.sh prepare_docker
- ./ci/env/env_info.sh
# This is needed or else the Ray Client tests run into a gRPC forking problem
# similar to https://github.com/grpc/grpc/issues/31885
- pip install grpcio==1.50.0
- bazel test --config=ci $(./ci/run/bazel_export_options)
--test_tag_filters=client_tests,small_size_python_tests
-- python/ray/tests/...
@@ -387,6 +380,8 @@
--test_env=DOCKER_CERT_PATH=/certs/client
--test_env=DOCKER_TLS_CERTDIR=/certs
-- python/ray/tests/...
- bazel test --config=ci $(./ci/run/bazel_export_options)
-- python/ray/autoscaler/v2/...

- label: ":python: (Large)"
conditions: ["RAY_CI_PYTHON_AFFECTED"]
@@ -425,6 +420,9 @@
- cleanup() { if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then ./ci/build/upload_build_info.sh; fi }; trap cleanup EXIT
- DL=1 ./ci/env/install-dependencies.sh
- ./ci/env/env_info.sh
# This is needed or else the Ray Client tests run into a gRPC forking problem
# similar to https://github.com/grpc/grpc/issues/31885
- pip install grpcio==1.50.0
- bazel test --config=ci $(./scripts/bazel_export_options)
--test_tag_filters=client_tests,small_size_python_tests
--test_env=TEST_EXTERNAL_REDIS=1
@@ -500,6 +498,8 @@
- cleanup() { if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then ./ci/build/upload_build_info.sh; fi }; trap cleanup EXIT
- pip uninstall -y ray
- RAY_DEBUG_BUILD=debug ./ci/ci.sh build
# Install latest pyspark. We cannot move this to the requirements file as subdependencies conflict
- pip install -U https://ml-team-public-read.s3.us-west-2.amazonaws.com/spark-pkgs/pyspark-3.4.0.dev0-0cb0fa313979e1b82ddd711a05d8c4e78cf6c9f5.tar.gz
- ./ci/env/env_info.sh
- bazel test --config=ci-debug $(./ci/run/bazel_export_options)
--test_env=RAY_ON_SPARK_BACKGROUND_JOB_STARTUP_WAIT=1
@@ -577,7 +577,8 @@
# Horovod needs to be installed separately (needed for API ref imports)
- ./ci/env/install-horovod.sh
# See https://stackoverflow.com/questions/63383400/error-cannot-uninstall-ruamel-yaml-while-creating-docker-image-for-azure-ml-a
- pip install mosaicml==0.12.1 --ignore-installed
# Pin urllib to avoid downstream ssl incompatibility issues. This matches requirements-doc.txt.
- pip install "mosaicml==0.12.1" "urllib3<1.27" --ignore-installed
- ./ci/ci.sh build

- label: ":octopus: Tune multinode tests"
@@ -626,24 +627,6 @@
- bazel test --config=ci $(./ci/run/bazel_export_options) --build_tests_only --test_tag_filters=minimal python/ray/train/...


- label: ":cold_face: :python: Ray Python 3.6 ML compatibility tests"
conditions:
["ALWAYS", "RAY_CI_TUNE_AFFECTED", "RAY_CI_TRAIN_AFFECTED", "RAY_CI_ML_AFFECTED", ]
instance_size: large
commands:
- cleanup() { if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then ./ci/build/upload_build_info.sh; fi }; trap cleanup EXIT
- ./ci/env/install-minimal.sh 3.6
- pip install -r python/requirements/compat/requirements_py36_compat.txt
- pip install -U typing-extensions
- HOROVOD_WITH_GLOO=1 HOROVOD_WITHOUT_MPI=1 HOROVOD_WITHOUT_MXNET=1 HOROVOD_WITH_TENSORFLOW=1 HOROVOD_WITH_PYTORCH=1 pip install horovod
- ./ci/env/env_info.sh
- bazel test --config=ci $(./scripts/bazel_export_options) --test_tag_filters=compat_py36
python/ray/tests/horovod/...
python/ray/tests/lightgbm/...
python/ray/tests/ml_py36_compat/...
python/ray/tests/xgboost/...


- label: ":cold_face: :python: Ray Python legacy dependency ML compatibility tests"
conditions:
["RAY_CI_PYTHON_DEPENDENCIES_AFFECTED", "RAY_CI_TUNE_AFFECTED", "RAY_CI_TRAIN_AFFECTED", "RAY_CI_ML_AFFECTED"]
13 changes: 13 additions & 0 deletions .buildkite/pipeline.gpu_large.yml
@@ -49,3 +49,16 @@
- pip install -Ur ./python/requirements/ml/requirements_ml_docker.txt
- ./ci/env/env_info.sh
- bazel test --config=ci $(./ci/run/bazel_export_options) --build_tests_only --test_tag_filters=gpu,-timeseries_libs,-py37,-post_wheel_build doc/...

- label: ":zap: :python: Lightning 2.0 Train GPU tests"
conditions:
["NO_WHEELS_REQUIRED", "RAY_CI_TRAIN_AFFECTED"]
commands:
- cleanup() { if [ "${BUILDKITE_PULL_REQUEST}" = "false" ]; then ./ci/build/upload_build_info.sh; fi }; trap cleanup EXIT
- NO_DASHBOARD=1 ./ci/env/install-minimal.sh 3.8
- PYTHON=3.8 DOC_TESTING=1 TRAIN_TESTING=1 TUNE_TESTING=1 ./ci/env/install-dependencies.sh
- pip install -Ur ./python/requirements/ml/requirements_ml_docker.txt
- pip uninstall -y pytorch-lightning
- pip install lightning==2.0.0
- ./ci/env/env_info.sh
- bazel test --config=ci $(./scripts/bazel_export_options) --test_tag_filters=ptl_v2 python/ray/train/...