TimeoutError from run_cgs_models() in first 20 seconds #127
-
Hi @galvaniquee
Can you make sure that
Best, Seppe
-
These are the databases in this folder: https://resources.aertslab.org/cistarget/databases/
For human, for example, use the ones here: https://resources.aertslab.org/cistarget/databases/homo_sapiens/hg38/screen/mc_v10_clust/region_based/
I hope this helps.
Best, Seppe
-
import ray
It gives me an error like this:
2024-03-11 08:45:53,351 ERROR services.py:1329 -- Failed to start the dashboard , return code -11
2024-03-11 08:45:54,494 INFO worker.py:1724 -- Started a local Ray instance.
Then the kernel restarts. Any thoughts on this? Thanks
-
Hi guys,
I am trying to run the pbmc_multiome_tutorial, and the run_cgs_models() call fails.
Here is the error from it:
TimeoutError Traceback (most recent call last)
File /opt/anaconda3/envs/scenicplus/lib/python3.8/site-packages/ray/_private/node.py:293, in Node.__init__(self, ray_params, head, shutdown_at_exit, spawn_reaper, connect_only, default_worker)
292 try:
--> 293 ray._private.services.wait_for_node(
294 self.gcs_address,
295 self._plasma_store_socket_name,
296 )
297 except TimeoutError:
File /opt/anaconda3/envs/scenicplus/lib/python3.8/site-packages/ray/_private/services.py:436, in wait_for_node(gcs_address, node_plasma_store_socket_name, timeout)
435 time.sleep(0.1)
--> 436 raise TimeoutError("Timed out while waiting for node to startup.")
TimeoutError: Timed out while waiting for node to startup.
During handling of the above exception, another exception occurred:
Exception Traceback (most recent call last)
Cell In[60], line 4
2 cistopic_obj = pickle.load(open(os.path.join(work_dir, 'scATAC/cistopic_obj.pkl'), 'rb'))
3 from pycisTopic.cistopic_class import *
----> 4 models=run_cgs_models(cistopic_obj,
5 n_topics=[2,4,10,16,32,48],
6 n_cpu=1,
7 n_iter=500,
8 random_state=555,
9 alpha=50,
10 alpha_by_topic=True,
11 eta=0.1,
12 eta_by_topic=False,
13 save_path=None,
14 _temp_dir = os.path.join(tmp_dir + 'ray_spill'))
File /opt/anaconda3/envs/scenicplus/lib/python3.8/site-packages/pycisTopic/lda_models.py:156, in run_cgs_models(cistopic_obj, n_topics, n_cpu, n_iter, random_state, alpha, alpha_by_topic, eta, eta_by_topic, top_topics_coh, save_path, **kwargs)
154 region_names = cistopic_obj.region_names
155 cell_names = cistopic_obj.cell_names
--> 156 ray.init(num_cpus=n_cpu, **kwargs)
157 model_list = ray.get(
158 [
159 run_cgs_model.remote(
(...)
174 ]
175 )
176 ray.shutdown()
File /opt/anaconda3/envs/scenicplus/lib/python3.8/site-packages/ray/_private/client_mode_hook.py:105, in client_mode_hook.<locals>.wrapper(*args, **kwargs)
103 if func.__name__ != "init" or is_client_mode_enabled_by_default:
104 return getattr(ray, func.__name__)(*args, **kwargs)
--> 105 return func(*args, **kwargs)
File /opt/anaconda3/envs/scenicplus/lib/python3.8/site-packages/ray/_private/worker.py:1451, in init(address, num_cpus, num_gpus, resources, object_store_memory, local_mode, ignore_reinit_error, include_dashboard, dashboard_host, dashboard_port, job_config, configure_logging, logging_level, logging_format, log_to_driver, namespace, runtime_env, storage, **kwargs)
1409 ray_params = ray._private.parameter.RayParams(
1410 node_ip_address=node_ip_address,
1411 raylet_ip_address=raylet_ip_address,
(...)
1445 node_name=_node_name,
1446 )
1447 # Start the Ray processes. We set shutdown_at_exit=False because we
1448 # shutdown the node in the ray.shutdown call that happens in the atexit
1449 # handler. We still spawn a reaper process in case the atexit handler
1450 # isn't called.
-> 1451 _global_node = ray._private.node.Node(
1452 head=True,
1453 shutdown_at_exit=False,
1454 spawn_reaper=True,
1455 ray_params=ray_params,
1456 )
1457 else:
1458 # In this case, we are connecting to an existing cluster.
1459 if num_cpus is not None or num_gpus is not None:
File /opt/anaconda3/envs/scenicplus/lib/python3.8/site-packages/ray/_private/node.py:298, in Node.__init__(self, ray_params, head, shutdown_at_exit, spawn_reaper, connect_only, default_worker)
293 ray._private.services.wait_for_node(
294 self.gcs_address,
295 self._plasma_store_socket_name,
296 )
297 except TimeoutError:
--> 298 raise Exception(
299 "The current node has not been updated within 30 "
300 "seconds, this could happen because of some of "
301 "the Ray processes failed to startup."
302 )
303 node_info = ray._private.services.get_node_to_connect_for_driver(
304 self.gcs_address,
305 self._raylet_ip_address,
306 )
307 if self._ray_params.node_manager_port == 0:
Exception: The current node has not been updated within 30 seconds, this could happen because of some of the Ray processes failed to startup.
Changing n_cpu does not change the outcome. What should I do about this?
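A side note on the call shown in the traceback, offered as a guess rather than a confirmed cause: `os.path.join(tmp_dir + 'ray_spill')` concatenates the two strings before joining, so if `tmp_dir` has no trailing slash the Ray temp directory becomes a sibling path instead of a subfolder. The traceback also shows run_cgs_models forwarding extra keyword arguments to ray.init, so a minimal sketch of building them could look like the following. The helper name `ray_init_kwargs` and the example path are hypothetical; `include_dashboard` appears in the ray.init signature in the traceback, and switching it off is only an assumption worth testing, not a known fix.

```python
import os


def ray_init_kwargs(tmp_dir, include_dashboard=False):
    """Build keyword arguments meant to be forwarded to ray.init.

    Hypothetical helper: it only assembles a dict and does not start Ray.
    """
    return {
        # Passing two arguments lets os.path.join insert the separator.
        "_temp_dir": os.path.join(tmp_dir, "ray_spill"),
        # Assumption: skipping the dashboard may be worth trying if it
        # fails to start; this is not a confirmed remedy.
        "include_dashboard": include_dashboard,
    }


tmp_dir = "/scratch/tmp"  # hypothetical path without a trailing slash

# Pattern from the traceback: one concatenated argument, no separator added.
concatenated = os.path.join(tmp_dir + "ray_spill")

# Two-argument form: the separator is added between the parts.
joined = ray_init_kwargs(tmp_dir)["_temp_dir"]

print(concatenated)
print(joined)
```

The resulting dict could then be passed along as `run_cgs_models(..., **ray_init_kwargs(tmp_dir))` in place of the `_temp_dir` argument.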