Skip to content
This repository has been archived by the owner on Nov 15, 2021. It is now read-only.

Merge with ray master #36

Merged
merged 131 commits into from
Jun 26, 2019
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
131 commits
Select commit Hold shift + click to select a range
351753a
[rllib] Remove dependency on TensorFlow (#4764)
ericl May 11, 2019
004440f
Dynamic Custom Resources - create and delete resources (#3742)
romilbhardwaj May 11, 2019
f3b8b90
Update tutorial link in doc (#4777)
May 12, 2019
69352e3
[rllib] Implement learn_on_batch() in torch policy graph
ericl May 13, 2019
62c949b
Fix `ray stop` by killing raylet before plasma (#4778)
jovany-wang May 13, 2019
1622fc2
Fatal check if object store dies (#4763)
stephanie-wang May 13, 2019
c5161a2
[rllib] fix clip by value issue as TF upgraded (#4697)
joneswong May 13, 2019
67af103
Merge with ray master
stefanpantic May 14, 2019
3bbafc7
[autoscaler] Fix submit (#4782)
richardliaw May 15, 2019
d6bf680
Merge with master
stefanpantic May 15, 2019
cb1a195
Queue tasks in the raylet in between async callbacks (#4766)
stephanie-wang May 15, 2019
643f62d
[Java][Bazel] Refine auto-generated pom files (#4780)
raulchen May 16, 2019
1490a98
Bump version to 0.7.0 (#4791)
devin-petersohn May 16, 2019
98dd033
[JAVA] setDefaultUncaughtExceptionHandler to log uncaught exception i…
May 16, 2019
9f2645d
[tune] Fix CLI test (#4801)
richardliaw May 16, 2019
ffd596d
Fix pom file generation (#4800)
raulchen May 17, 2019
7d5ef6d
[rllib] Support continuous action distributions in IMPALA/APPO (#4771)
ericl May 17, 2019
3807fb5
[rllib] TensorFlow 2 compatibility (#4802)
ericl May 17, 2019
84cf474
Change tagline in documentation and README. (#4807)
pcmoritz May 17, 2019
ffe61fc
[tune] Support non-arg submit (#4803)
richardliaw May 17, 2019
88b45a5
[autoscaler] rsync cluster (#4785)
richardliaw May 17, 2019
e20855c
[tune] Remove extra parsing functionality (#4804)
richardliaw May 17, 2019
dcd6d49
Fix Java worker log dir (#4781)
jovany-wang May 17, 2019
1ef9c07
[tune] Initial track integration (#4362)
noahgolmant May 17, 2019
6cb5b90
[rllib] [RFC] Dynamic definition of loss functions and modularization…
ericl May 18, 2019
04294d9
Merge remote-tracking branch 'remotes/main/master' into unstable
pimpke May 20, 2019
02583a8
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy…
ericl May 20, 2019
081708b
[Java] Dynamic resource API in Java (#4824)
jovany-wang May 21, 2019
ac47d03
Merge with master
stefanpantic May 21, 2019
5391b61
Add default values for Wgym flags
stefanpantic May 21, 2019
87bb2e5
Fix import
stefanpantic May 21, 2019
259cdfa
Fix issue when starting `raylet_monitor` (#4829)
jovany-wang May 22, 2019
1a39fee
Refactor ID Serial 1: Separate ObjectID and TaskID from UniqueID (#4776)
guoyuhong May 22, 2019
2015085
Fix bug in which actor classes are not exported multiple times. (#4838)
robertnishihara May 23, 2019
ba6c595
Bump Ray master version to 0.8.0.dev0 (#4845)
devin-petersohn May 24, 2019
4e281ba
Add section to bump version of master branch and cleanup release docs…
devin-petersohn May 24, 2019
71f95e1
Fix import
stefanpantic May 24, 2019
be1850f
Merge branch 'unstable' of github.com:wingman-ai/ray into unstable
stefanpantic May 24, 2019
49fe894
Export remote functions when first used and also fix bug in which rem…
robertnishihara May 24, 2019
a7d01ab
Update wheel versions in documentation to 0.8.0.dev0 and 0.7.0. (#4847)
devin-petersohn May 24, 2019
0ce0ecb
[tune] Later expansion of local_dir (#4806)
richardliaw May 25, 2019
7237ea7
[rllib] [RFC] Deprecate Python 2 / RLlib (#4832)
ericl May 25, 2019
ea8d7b4
Fix a typo in kubernetes yaml (#4872)
ikedaosushi May 26, 2019
6703519
Move global state API out of global_state object. (#4857)
robertnishihara May 26, 2019
7a78e1e
Install bazel in autoscaler development configs. (#4874)
robertnishihara May 26, 2019
574e1c7
[tune] Fix up Ax Search and Examples (#4851)
richardliaw May 27, 2019
a45c61e
[rllib] Update concepts docs and add "Building Policies in Torch/Tens…
ericl May 27, 2019
d7be5a5
[rllib] Fix error getting kl when simple_optimizer: True in multi-age…
ericl May 28, 2019
fa0892f
Replace ReturnIds with NumReturns in TaskInfo to reduce the size (#4854)
guoyuhong May 28, 2019
64a01b2
Update deps commits of opencensus to support building with bzl 0.25.x…
jovany-wang May 28, 2019
0bcc589
Merge with master
stefanpantic May 28, 2019
64eb7b3
Upgrade arrow to latest master (#4858)
pcmoritz May 28, 2019
acee89b
[tune] Auto-init Ray + default SearchAlg (#4815)
richardliaw May 29, 2019
a218a14
Bump version from 0.8.0.dev0 to 0.7.1. (#4890)
robertnishihara May 29, 2019
2dd0beb
[rllib] Allow access to batches prior to postprocessing (#4871)
ericl May 30, 2019
3f4d37c
[rllib] Fix Multidiscrete support (#4869)
ericl May 30, 2019
b7c284a
Refactor redis callback handling (#4841)
jovany-wang May 30, 2019
2912a7c
Initial high-level code structure of CoreWorker. (#4875)
raulchen May 30, 2019
4e0be8b
Drop duplicated string format (#4897)
suquark May 30, 2019
1f0809e
Refactor ID Serial 2: change all ID functions to `CamelCase` (#4896)
May 31, 2019
0066d7c
Hotfix for change of from_random to FromRandom (#4909)
May 31, 2019
1c073e9
[rllib] Fix documentation on custom policies (#4910)
ericl Jun 1, 2019
9aa1cd6
[rllib] Allow Torch policies access to full action input dict in extr…
ericl Jun 1, 2019
88bab5d
[tune] Pretty print params json in logger.py (#4903)
hartikainen Jun 1, 2019
c2ade07
[sgd] Distributed Training via PyTorch (#4797)
pschafhalter Jun 2, 2019
665d081
[rllib] Rough port of DQN to build_tf_policy() pattern (#4823)
ericl Jun 2, 2019
d86ee8c
fetching objects in parallel in _get_arguments_for_execution (#4775)
ajgokhale Jun 2, 2019
99eae05
[tune] Disallow setting resources_per_trial when it is already config…
ericl Jun 2, 2019
7501ee5
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)
ericl Jun 2, 2019
084b221
Fix local cluster yaml (#4918)
richardliaw Jun 3, 2019
89722ff
[tune] Directional metrics for components (#4120) (#4915)
hershg Jun 3, 2019
b674c4a
[Core Worker] implement ObjectInterface and add test framework (#4899)
zhijunfu Jun 3, 2019
c2253d2
[tune] Make PBT Quantile fraction configurable (#4912)
timonbimon Jun 4, 2019
d106283
Better organize ray_common module (#4898)
raulchen Jun 5, 2019
649af18
Merge branches 'master' and 'unstable' of github.com:wingman-ai/ray i…
stefanpantic Jun 5, 2019
d7680ab
Merge with ray master
stefanpantic Jun 5, 2019
ffaae1c
Fix error
stefanpantic Jun 5, 2019
b2581c4
Merge branch 'master' of github.com:wingman-ai/ray into unstable
stefanpantic Jun 5, 2019
2702b15
[tune] Add requirements-dev.txt and update docs for contributing (#4925)
timonbimon Jun 5, 2019
82b3972
Fix compute actions return value
stefanpantic Jun 6, 2019
a0f14e9
Bump version from 0.7.1 to 0.8.0.dev1. (#4937)
robertnishihara Jun 6, 2019
c3f8fc1
Update version number in documentation after release 0.7.0 -> 0.7.1 a…
robertnishihara Jun 7, 2019
cbc67fc
[doc] Update developer docs with bazel instructions (#4944)
stephanie-wang Jun 7, 2019
5eff47b
[C++] Add hash table to Redis-Module (#4911)
Jun 7, 2019
873d45b
Flush lineage cache on task submission instead of execution (#4942)
stephanie-wang Jun 7, 2019
9e328fb
[rllib] Add docs on how to use TF eager execution (#4927)
ericl Jun 7, 2019
77689d1
[rllib] Port remainder of algorithms to build_trainer() pattern (#4920)
ericl Jun 7, 2019
a82e811
Fix resource bookkeeping bug with acquiring unknown resource. (#4945)
robertnishihara Jun 8, 2019
85b82b2
Update aws keys for uploading wheels to s3. (#4948)
robertnishihara Jun 8, 2019
ec8aaf0
Upload wheels on Travis to branchname/commit_id. (#4949)
robertnishihara Jun 8, 2019
671c0f7
[Java] Fix serializing issues of `RaySerializer` (#4887)
jovany-wang Jun 8, 2019
4f8e100
fix (#4950)
ericl Jun 10, 2019
e6baffb
[Java] Add inner class `Builder` to build call options. (#4956)
jovany-wang Jun 10, 2019
6f48992
Make release stress tests work and improve them. (#4955)
robertnishihara Jun 11, 2019
1e2b649
Use proper session directory for debug_string.txt (#4960)
pcmoritz Jun 11, 2019
ebb3b3b
[core] Use int64_t instead of int to keep track of fractional resourc…
pcmoritz Jun 11, 2019
472c36e
[core worker] add task submission & execution interface (#4922)
zhijunfu Jun 12, 2019
e0e52f1
[sgd] Add non-distributed PyTorch runner (#4933)
pschafhalter Jun 12, 2019
89ca5ee
Flush all tasks from local lineage cache after a node failure (#4964)
stephanie-wang Jun 12, 2019
d2f5b71
Remove typing from setup.py install_requirements. (#4971)
robertnishihara Jun 12, 2019
ef1af49
[Java] Fix bug of `BaseID` in multi-threading case. (#4974)
jovany-wang Jun 13, 2019
fa1d4c9
[rllib] Fix DDPG example (#4973)
ericl Jun 13, 2019
3c92b2e
Upgrade CI clang-format to 6.0 (#4976)
raulchen Jun 14, 2019
37abdb2
[Core worker] add store & task provider (#4966)
zhijunfu Jun 14, 2019
1b86e55
Fix bugs in the a3c code template. (#4984)
TianhongDai Jun 15, 2019
05e2748
Inherit Function Docstrings and other metedata (#4985)
simon-mo Jun 15, 2019
b08765a
Fix a crash when unknown worker registering to raylet (#4992)
jovany-wang Jun 17, 2019
2bf92e0
[gRPC] Use gRPC for inter-node-manager communication (#4968)
raulchen Jun 17, 2019
d81c126
Merge branches 'master' and 'unstable' of github.com:wingman-ai/ray i…
stefanpantic Jun 18, 2019
2e0eec9
Merge with ray master
stefanpantic Jun 18, 2019
7bda5ed
Fix Java CI failure (#4995)
jovany-wang Jun 19, 2019
e59e807
fix handling of non-integral timeout values in signal.receive (#5002)
rueberger Jun 20, 2019
1d17125
temp fix for build (#5006)
ericl Jun 21, 2019
31b6da1
[tune] Tutorial UX Changes (#4990)
richardliaw Jun 21, 2019
3b23d94
Fix valgrind build by installing new version of valgrind (#5008)
pcmoritz Jun 21, 2019
a7f84b5
Fix no cpus test (#5009)
jiangzihao2009 Jun 21, 2019
2e342ef
Fix tensorflow-1.14 installation in jenkins (#5007)
pcmoritz Jun 21, 2019
e33d0ea
Add dynamic worker options for worker command. (#4970)
jovany-wang Jun 23, 2019
11ccf66
[docs] docs for running Tensorboard without sudo (#5015)
kiddyboots216 Jun 24, 2019
bd8aceb
[ci] Change Jenkins to py3 (#5022)
richardliaw Jun 25, 2019
0131353
[gRPC] Migrate gcs data structures to protobuf (#5024)
raulchen Jun 25, 2019
aa5fc52
[rllib] Add QMIX mixer parameters to optimizer param list (#5014)
ericl Jun 26, 2019
bb8e75b
[grpc] refactor rpc server to support multiple io services (#5023)
zhijunfu Jun 26, 2019
bbe3e5b
[rllib] Give error if sample_async is used with pytorch for A3C (#5000)
pcmoritz Jun 26, 2019
b1827d5
[tune] Update MNIST Example (#4991)
richardliaw Jun 26, 2019
d639737
Add entropy coeff schedule
stefanpantic Jun 26, 2019
108bfa2
Merge with ray master
stefanpantic Jun 26, 2019
92c0f88
Revert "Merge with ray master"
stefanpantic Jun 26, 2019
014cbb7
Revert "Revert "Merge with ray master""
stefanpantic Jun 26, 2019
342854b
Remove entropy decay stuff
stefanpantic Jun 26, 2019
17d1009
Merge branches 'master' and 'unstable' of github.com:wingman-ai/ray i…
stefanpantic Jun 26, 2019
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
96 changes: 47 additions & 49 deletions BUILD.bazel
Original file line number Diff line number Diff line change
@@ -1,22 +1,55 @@
# Bazel build
# C/C++ documentation: https://docs.bazel.build/versions/master/be/c-cpp.html

load("@com_github_grpc_grpc//bazel:grpc_build_system.bzl", "grpc_proto_library")
load("@com_github_grpc_grpc//bazel:cc_grpc_library.bzl", "cc_grpc_library")
load("@build_stack_rules_proto//python:python_proto_compile.bzl", "python_proto_compile")
load("@com_github_google_flatbuffers//:build_defs.bzl", "flatbuffer_cc_library")
load("@//bazel:ray.bzl", "flatbuffer_py_library")
load("@//bazel:cython_library.bzl", "pyx_library")

COPTS = ["-DRAY_USE_GLOG"]

# Node manager gRPC lib.
grpc_proto_library(
name = "node_manager_grpc_lib",
# === Begin of protobuf definitions ===

proto_library(
name = "gcs_proto",
srcs = ["src/ray/protobuf/gcs.proto"],
visibility = ["//java:__subpackages__"],
)

cc_proto_library(
name = "gcs_cc_proto",
deps = [":gcs_proto"],
)

python_proto_compile(
name = "gcs_py_proto",
deps = [":gcs_proto"],
)

proto_library(
name = "node_manager_proto",
srcs = ["src/ray/protobuf/node_manager.proto"],
)

cc_proto_library(
name = "node_manager_cc_proto",
deps = ["node_manager_proto"],
)

# === End of protobuf definitions ===

# Node manager gRPC lib.
cc_grpc_library(
name = "node_manager_cc_grpc",
srcs = [":node_manager_proto"],
grpc_only = True,
deps = [":node_manager_cc_proto"],
)

# Node manager server and client.
cc_library(
name = "node_manager_rpc_lib",
name = "node_manager_rpc",
srcs = glob([
"src/ray/rpc/*.cc",
]),
Expand All @@ -25,7 +58,7 @@ cc_library(
]),
copts = COPTS,
deps = [
":node_manager_grpc_lib",
":node_manager_cc_grpc",
":ray_common",
"@boost//:asio",
"@com_github_grpc_grpc//:grpc++",
Expand Down Expand Up @@ -114,7 +147,7 @@ cc_library(
":gcs",
":gcs_fbs",
":node_manager_fbs",
":node_manager_rpc_lib",
":node_manager_rpc",
":object_manager",
":ray_common",
":ray_util",
Expand Down Expand Up @@ -422,9 +455,11 @@ cc_library(
"src/ray/gcs/format",
],
deps = [
":gcs_cc_proto",
":gcs_fbs",
":hiredis",
":node_manager_fbs",
":node_manager_rpc",
":ray_common",
":ray_util",
":stats_lib",
Expand Down Expand Up @@ -555,46 +590,6 @@ filegroup(
visibility = ["//java:__subpackages__"],
)

flatbuffer_py_library(
name = "python_gcs_fbs",
srcs = [
":gcs_fbs_file",
],
outs = [
"ActorCheckpointIdData.py",
"ActorState.py",
"ActorTableData.py",
"Arg.py",
"ClassTableData.py",
"ClientTableData.py",
"ConfigTableData.py",
"CustomSerializerData.py",
"DriverTableData.py",
"EntryType.py",
"ErrorTableData.py",
"ErrorType.py",
"FunctionTableData.py",
"GcsEntry.py",
"HeartbeatBatchTableData.py",
"HeartbeatTableData.py",
"Language.py",
"ObjectTableData.py",
"ProfileEvent.py",
"ProfileTableData.py",
"RayResource.py",
"ResourcePair.py",
"SchedulingState.py",
"TablePrefix.py",
"TablePubsub.py",
"TaskInfo.py",
"TaskLeaseData.py",
"TaskReconstructionData.py",
"TaskTableData.py",
"TaskTableTestAndUpdate.py",
],
out_prefix = "python/ray/core/generated/",
)

flatbuffer_py_library(
name = "python_node_manager_fbs",
srcs = [
Expand Down Expand Up @@ -679,6 +674,7 @@ cc_binary(
linkstatic = 1,
visibility = ["//java:__subpackages__"],
deps = [
":gcs_cc_proto",
":ray_common",
],
)
Expand All @@ -688,7 +684,7 @@ genrule(
srcs = [
"python/ray/_raylet.so",
"//:python_sources",
"//:python_gcs_fbs",
"//:gcs_py_proto",
"//:python_node_manager_fbs",
"//:redis-server",
"//:redis-cli",
Expand All @@ -710,11 +706,13 @@ genrule(
cp -f $(location //:raylet_monitor) $$WORK_DIR/python/ray/core/src/ray/raylet/ &&
cp -f $(location @plasma//:plasma_store_server) $$WORK_DIR/python/ray/core/src/plasma/ &&
cp -f $(location //:raylet) $$WORK_DIR/python/ray/core/src/ray/raylet/ &&
for f in $(locations //:python_gcs_fbs); do cp -f $$f $$WORK_DIR/python/ray/core/generated/; done &&
mkdir -p $$WORK_DIR/python/ray/core/generated/ray/protocol/ &&
for f in $(locations //:python_node_manager_fbs); do
cp -f $$f $$WORK_DIR/python/ray/core/generated/ray/protocol/;
done &&
for f in $(locations //:gcs_py_proto); do
cp -f $$f $$WORK_DIR/python/ray/core/generated/;
done &&
echo $$WORK_DIR > $@
""",
local = 1,
Expand Down
4 changes: 4 additions & 0 deletions bazel/ray_deps_build_all.bzl
Original file line number Diff line number Diff line change
Expand Up @@ -4,6 +4,8 @@ load("@com_github_jupp0r_prometheus_cpp//:repositories.bzl", "prometheus_cpp_rep
load("@com_github_ray_project_ray//bazel:python_configure.bzl", "python_configure")
load("@com_github_checkstyle_java//:repo.bzl", "checkstyle_deps")
load("@com_github_grpc_grpc//bazel:grpc_deps.bzl", "grpc_deps")
load("@build_stack_rules_proto//java:deps.bzl", "java_proto_compile")
load("@build_stack_rules_proto//python:deps.bzl", "python_proto_compile")


def ray_deps_build_all():
Expand All @@ -13,4 +15,6 @@ def ray_deps_build_all():
prometheus_cpp_repositories()
python_configure(name = "local_config_python")
grpc_deps()
java_proto_compile()
python_proto_compile()

11 changes: 9 additions & 2 deletions bazel/ray_deps_setup.bzl
Original file line number Diff line number Diff line change
Expand Up @@ -105,7 +105,14 @@ def ray_deps_setup():
http_archive(
name = "com_github_grpc_grpc",
urls = [
"https://github.com/grpc/grpc/archive/7741e806a213cba63c96234f16d712a8aa101a49.tar.gz",
"https://github.com/grpc/grpc/archive/76a381869413834692b8ed305fbe923c0f9c4472.tar.gz",
],
strip_prefix = "grpc-7741e806a213cba63c96234f16d712a8aa101a49",
strip_prefix = "grpc-76a381869413834692b8ed305fbe923c0f9c4472",
)

http_archive(
name = "build_stack_rules_proto",
urls = ["https://github.com/stackb/rules_proto/archive/b93b544f851fdcd3fc5c3d47aee3b7ca158a8841.tar.gz"],
sha256 = "c62f0b442e82a6152fcd5b1c0b7c4028233a9e314078952b6b04253421d56d61",
strip_prefix = "rules_proto-b93b544f851fdcd3fc5c3d47aee3b7ca158a8841",
)
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@ pushd "$ROOT_DIR"

python -m pip install pytest-benchmark

pip install -U https://s3-us-west-2.amazonaws.com/ray-wheels/latest/ray-0.8.0.dev1-cp27-cp27mu-manylinux1_x86_64.whl
pip install -U https://ray-wheels.s3-us-west-2.amazonaws.com/latest/ray-0.8.0.dev1-cp36-cp36m-manylinux1_x86_64.whl
python -m pytest --benchmark-autosave --benchmark-min-rounds=10 --benchmark-columns="min, max, mean" $ROOT_DIR/../../../python/ray/tests/perf_integration_tests/test_perf_integration.py

pushd $ROOT_DIR/../../../python
Expand Down
8 changes: 4 additions & 4 deletions ci/jenkins_tests/run_tune_tests.sh
Original file line number Diff line number Diff line change
Expand Up @@ -78,16 +78,16 @@ $SUPPRESS_OUTPUT docker run --rm --shm-size=${SHM_SIZE} --memory=${MEMORY_SIZE}
--smoke-test

# Runs only on Python3
# docker run --rm --shm-size=${SHM_SIZE} --memory=${MEMORY_SIZE} $DOCKER_SHA \
# python3 /ray/python/ray/tune/examples/nevergrad_example.py \
# --smoke-test
$SUPPRESS_OUTPUT docker run --rm --shm-size=${SHM_SIZE} --memory=${MEMORY_SIZE} $DOCKER_SHA \
python /ray/python/ray/tune/examples/nevergrad_example.py \
--smoke-test

$SUPPRESS_OUTPUT docker run --rm --shm-size=${SHM_SIZE} --memory=${MEMORY_SIZE} $DOCKER_SHA \
python /ray/python/ray/tune/examples/tune_mnist_keras.py \
--smoke-test

$SUPPRESS_OUTPUT docker run --rm --shm-size=${SHM_SIZE} --memory=${MEMORY_SIZE} $DOCKER_SHA \
python /ray/python/ray/tune/examples/mnist_pytorch.py --smoke-test --no-cuda
python /ray/python/ray/tune/examples/mnist_pytorch.py --smoke-test

$SUPPRESS_OUTPUT docker run --rm --shm-size=${SHM_SIZE} --memory=${MEMORY_SIZE} $DOCKER_SHA \
python /ray/python/ray/tune/examples/mnist_pytorch_trainable.py \
Expand Down
15 changes: 1 addition & 14 deletions doc/source/conf.py
Original file line number Diff line number Diff line change
Expand Up @@ -23,20 +23,7 @@
"gym.spaces",
"ray._raylet",
"ray.core.generated",
"ray.core.generated.ActorCheckpointIdData",
"ray.core.generated.ClientTableData",
"ray.core.generated.DriverTableData",
"ray.core.generated.EntryType",
"ray.core.generated.ErrorTableData",
"ray.core.generated.ErrorType",
"ray.core.generated.GcsEntry",
"ray.core.generated.HeartbeatBatchTableData",
"ray.core.generated.HeartbeatTableData",
"ray.core.generated.Language",
"ray.core.generated.ObjectTableData",
"ray.core.generated.ProfileTableData",
"ray.core.generated.TablePrefix",
"ray.core.generated.TablePubsub",
"ray.core.generated.gcs_pb2",
"ray.core.generated.ray.protocol.Task",
"scipy",
"scipy.signal",
Expand Down
6 changes: 6 additions & 0 deletions doc/source/tune-usage.rst
Original file line number Diff line number Diff line change
Expand Up @@ -355,6 +355,12 @@ Then, after you run a experiment, you can visualize your experiment with TensorB

$ tensorboard --logdir=~/ray_results/my_experiment

If you are running Ray on a remote multi-user cluster where you do not have sudo access, you can run the following commands to make sure tensorboard is able to write to the tmp directory:

.. code-block:: bash

$ export TMPDIR=/tmp/$USER; mkdir -p $TMPDIR; tensorboard --logdir=~/ray_results

.. image:: ray-tune-tensorboard.png

To use rllab's VisKit (you may have to install some dependencies), run:
Expand Down
2 changes: 1 addition & 1 deletion docker/base-deps/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ RUN apt-get update \
&& apt-get clean \
&& echo 'export PATH=/opt/conda/bin:$PATH' > /etc/profile.d/conda.sh \
&& wget \
--quiet 'https://repo.continuum.io/archive/Anaconda2-5.2.0-Linux-x86_64.sh' \
--quiet 'https://repo.continuum.io/archive/Anaconda3-5.2.0-Linux-x86_64.sh' \
-O /tmp/anaconda.sh \
&& /bin/bash /tmp/anaconda.sh -b -p /opt/conda \
&& rm /tmp/anaconda.sh \
Expand Down
5 changes: 4 additions & 1 deletion docker/examples/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -5,11 +5,14 @@ FROM ray-project/deploy
# This updates numpy to 1.14 and mutes errors from other libraries
RUN conda install -y numpy
RUN apt-get install -y zlib1g-dev
# The following is needed to support TensorFlow 1.14
RUN conda remove -y --force wrapt
RUN pip install gym[atari] opencv-python-headless tensorflow lz4 keras pytest-timeout smart_open
RUN pip install -U h5py # Mutes FutureWarnings
RUN pip install --upgrade bayesian-optimization
RUN pip install --upgrade git+git://github.com/hyperopt/hyperopt.git
RUN pip install --upgrade sigopt
# RUN pip install --upgrade nevergrad
RUN pip install --upgrade nevergrad
RUN pip install --upgrade scikit-optimize
RUN pip install -U pytest-remotedata>=0.3.1
RUN conda install pytorch-cpu torchvision-cpu -c pytorch
2 changes: 1 addition & 1 deletion docker/stress_test/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ FROM ray-project/base-deps

# We install ray and boto3 to enable the ray autoscaler as
# a test runner.
RUN pip install -U https://s3-us-west-2.amazonaws.com/ray-wheels/latest/ray-0.8.0.dev1-cp27-cp27mu-manylinux1_x86_64.whl boto3
RUN pip install -U https://ray-wheels.s3-us-west-2.amazonaws.com/latest/ray-0.8.0.dev1-cp36-cp36m-manylinux1_x86_64.whl boto3
RUN mkdir -p /root/.ssh/

# We port the source code in so that we run the most up-to-date stress tests.
Expand Down
11 changes: 8 additions & 3 deletions docker/tune_test/Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -4,22 +4,27 @@ FROM ray-project/base-deps

# We install ray and boto3 to enable the ray autoscaler as
# a test runner.
RUN pip install -U https://s3-us-west-2.amazonaws.com/ray-wheels/latest/ray-0.8.0.dev1-cp27-cp27mu-manylinux1_x86_64.whl boto3
RUN conda install -y -c anaconda wrapt=1.11.1
RUN conda install -y -c anaconda numpy=1.16.4
RUN pip install -U https://ray-wheels.s3-us-west-2.amazonaws.com/latest/ray-0.8.0.dev1-cp36-cp36m-manylinux1_x86_64.whl boto3
# We install this after the latest wheels -- this should not override the latest wheels.
RUN apt-get install -y zlib1g-dev
# The following is needed to support TensorFlow 1.14
RUN conda remove -y --force wrapt
RUN pip install gym[atari]==0.10.11 opencv-python-headless tensorflow lz4 keras pytest-timeout smart_open
RUN pip install --upgrade bayesian-optimization
RUN pip install --upgrade git+git://github.com/hyperopt/hyperopt.git
RUN pip install --upgrade sigopt
# RUN pip install --upgrade nevergrad
RUN pip install --upgrade nevergrad
RUN pip install --upgrade scikit-optimize
RUN pip install -U pytest-remotedata>=0.3.1
RUN conda install pytorch-cpu torchvision-cpu -c pytorch

# RUN mkdir -p /root/.ssh/

# We port the source code in so that we run the most up-to-date stress tests.
ADD ray.tar /ray
ADD git-rev /ray/git-rev
RUN python /ray/python/ray/rllib/setup-rllib-dev.py --yes
RUN python /ray/python/ray/setup-dev.py --yes

WORKDIR /ray
Loading