
Make intra-process manager thread safe, rename IPMState to IPMImpl #165

Merged: 2 commits merged into master on Dec 3, 2015

Conversation

jacquelinekay (Contributor)

To implement a lock-free IntraProcessManagerImpl in the future, I will extend from IntraProcessManagerImplBase and use lock-free structures instead of mutexes.
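A rough sketch of the split this change sets up: a common base interface, the mutex-guarded implementation this PR provides, and room to add a lock-free implementation later behind the same base. The method names and members below are hypothetical, chosen for illustration only; the real interface in rclcpp is larger.

// Illustrative sketch only; method names and members are hypothetical.
#include <cstdint>
#include <mutex>
#include <set>

class IntraProcessManagerImplBase
{
public:
  virtual ~IntraProcessManagerImplBase() = default;
  virtual void add_subscription(uint64_t id) = 0;
  virtual void remove_subscription(uint64_t id) = 0;
  virtual bool has_subscription(uint64_t id) const = 0;
};

// Current implementation: every accessor takes a coarse lock.
class IntraProcessManagerImpl : public IntraProcessManagerImplBase
{
public:
  void add_subscription(uint64_t id) override
  {
    std::lock_guard<std::mutex> lock(runtime_mutex_);
    subscription_ids_.insert(id);
  }
  void remove_subscription(uint64_t id) override
  {
    std::lock_guard<std::mutex> lock(runtime_mutex_);
    subscription_ids_.erase(id);
  }
  bool has_subscription(uint64_t id) const override
  {
    std::lock_guard<std::mutex> lock(runtime_mutex_);
    return subscription_ids_.count(id) != 0;
  }

private:
  std::set<uint64_t> subscription_ids_;
  mutable std::mutex runtime_mutex_;
};

// A future lock-free variant would derive from the same base and replace the
// mutex-guarded containers with lock-free data structures.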

jacquelinekay added the "in progress" label (Actively being worked on, Kanban column) on Dec 1, 2015
jacquelinekay self-assigned this on Dec 1, 2015
tfoote added the "in review" label (Waiting for review, Kanban column) and removed the "in progress" label (Actively being worked on, Kanban column) on Dec 1, 2015
@@ -184,6 +186,7 @@ class IntraProcessManagerState : public IntraProcessManagerStateBase
size_t & size
)
{
std::lock_guard<std::mutex> lock(runtime_mutex_);
Member

Can this lock be moved into the next block, where we iterate over publishers_? Or do we need to hold it all the way through the other two blocks?

jacquelinekay (Contributor, Author)

To implement this more optimally, I could have one mutex for publishers_ and then a mutex for each PublisherInfo entry in publishers_. That way one thread could look up an entry in publishers_ while another is looking up something in the map owned by an entry in publishers_ (I believe that would be fine). And yes, that implementation would include moving the mutex for publishers_ into the block where find is invoked on the map.
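A rough sketch of that two-level locking idea (all names and types here are hypothetical, and, as the next reply notes, this finer-grained scheme is not what was merged): the map-level mutex is held only while locating the entry, and each entry carries its own mutex.

#include <cstdint>
#include <map>
#include <memory>
#include <mutex>

// Hypothetical per-publisher record with its own lock.
struct PublisherInfo
{
  std::mutex mutex;
  std::map<uint64_t, uint64_t> sequences;  // placeholder per-publisher state
};

class FineGrainedManager
{
public:
  bool lookup(uint64_t publisher_id, uint64_t key, uint64_t & value)
  {
    std::shared_ptr<PublisherInfo> info;
    {
      // Hold the map-level lock only while finding the entry...
      std::lock_guard<std::mutex> lock(publishers_mutex_);
      auto it = publishers_.find(publisher_id);
      if (it == publishers_.end()) {
        return false;
      }
      info = it->second;
    }
    // ...then work on the entry under its own lock, so another thread can
    // use a different publisher's entry at the same time.
    std::lock_guard<std::mutex> entry_lock(info->mutex);
    auto it = info->sequences.find(key);
    if (it == info->sequences.end()) {
      return false;
    }
    value = it->second;
    return true;
  }

private:
  std::mutex publishers_mutex_;
  std::map<uint64_t, std::shared_ptr<PublisherInfo>> publishers_;
};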

Member

A mutex per item sounds like overkill; I would leave it as is.

I hadn't dug into the data types being handled, and didn't realize that you still need exclusion after pulling the items out of publishers_.

gerkey (Member) commented Dec 3, 2015

Without this change, I pretty reliably get a segfault after a dozen or so iterations of the test added in ros2/system_tests#72. With this change, I'm now at > 100 iterations with no problem.

FYI, the segfault (from a Debug build of ROS 2, but with a non-Debug build of OpenSplice) starts like this:

#0  0x00007f721d621b63 in std::_Rb_tree_increment(std::_Rb_tree_node_base const*) ()
   from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#1  0x00007f721dd4bb49 in std::_Rb_tree_const_iterator<unsigned long>::operator++ (this=0x7f721960b930)
    at /usr/include/c++/4.8/bits/stl_tree.h:270
#2  0x00007f721dd4a05e in std::__find<std::_Rb_tree_const_iterator<unsigned long>, unsigned long> (__first=..., 
    __last=..., __val=@0x7f721960b9b8: 161) at /usr/include/c++/4.8/bits/stl_algo.h:140
#3  0x00007f721dd49280 in std::find<std::_Rb_tree_const_iterator<unsigned long>, unsigned long> (__first=..., 
    __last=..., __val=@0x7f721960b9b8: 161) at /usr/include/c++/4.8/bits/stl_algo.h:4441
#4  0x00007f721dd48865 in rclcpp::intra_process_manager::IntraProcessManagerState<std::allocator<void> >::take_intra_process_message (this=0x973a78, intra_process_publisher_id=158, message_sequence_number=1, 
    requesting_subscriptions_intra_process_id=161, size=@0x7f721960ba68: 0)
    at /home/gerkey/ros2_ws/src/ros2/rclcpp/rclcpp/include/rclcpp/intra_process_manager_state.hpp:209
#5  0x00000000004ed496 in rclcpp::intra_process_manager::IntraProcessManager::take_intra_process_message<test_rclcpp::msg::UInt32_<std::allocator<void> >, std::allocator<void>, std::default_delete<test_rclcpp::msg::UInt32_<std::allocator<void> > > > (this=0x979380, intra_process_publisher_id=158, message_sequence_number=1, 
    requesting_subscriptions_intra_process_id=161, message=...)
    at /home/gerkey/ros2_ws/install/include/rclcpp/intra_process_manager.hpp:316
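For context, frame #2 above is std::find walking a std::set (a red-black tree) while, without a lock, another thread can be inserting into or erasing from the same set; the merged change serializes that access with runtime_mutex_. A minimal sketch of the guarded pattern, with container and member names that are illustrative rather than the exact rclcpp fields:

#include <algorithm>
#include <cstdint>
#include <mutex>
#include <set>

class SubscriptionTargets
{
public:
  void add(uint64_t id)
  {
    std::lock_guard<std::mutex> lock(mutex_);
    ids_.insert(id);
  }

  bool contains(uint64_t id)
  {
    // Without this guard, std::find could traverse tree nodes that another
    // thread is concurrently inserting, erasing, or rebalancing, which is
    // the kind of crash shown in the backtrace above.
    std::lock_guard<std::mutex> lock(mutex_);
    return std::find(ids_.begin(), ids_.end(), id) != ids_.end();
  }

private:
  std::set<uint64_t> ids_;
  std::mutex mutex_;
};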

jacquelinekay (Contributor, Author)

I can run up to iteration 650 of the multithreaded test and then I get an error from OpenSplice:

Description : The Handle Server ran out of handle space

I get a similar error when I repeat single-threaded tests indefinitely, however, so I don't think it's relevant.

gerkey (Member) commented Dec 3, 2015

@jacquelinekay I get the same OpenSplice error, in my case after 116 iterations. Agreed that it's not related to this change (though it may indicate a resource management issue that we'll have to tackle at some point).

gerkey (Member) commented Dec 3, 2015

+1

jacquelinekay added a commit that referenced this pull request Dec 3, 2015
Make intra-process manager thread safe, rename IPMState to IPMImpl
jacquelinekay merged commit f73ebcb into master on Dec 3, 2015
jacquelinekay deleted the intra_process_lock branch on December 3, 2015
jacquelinekay removed the "in review" label (Waiting for review, Kanban column) on Dec 3, 2015
dirk-thomas (Member)

Please create a ticket for the resource problem to keep track of it.

jacquelinekay (Contributor, Author)

ros2/rmw_opensplice#99

nnmm pushed a commit to ApexAI/rclcpp that referenced this pull request Jul 9, 2022
* add timer test

* more tests

* another one just for fun

* uncrustify
DensoADAS pushed a commit to DensoADAS/rclcpp that referenced this pull request Aug 5, 2022
Generate the rclcpp::Node before start_recording, since rclcpp::Node sets the use_sim_time parameter and publishes a message to parameter_events; otherwise this causes a wrong message count in the test.

Signed-off-by: evshary <evshary@gmail.com>