Normalize quaternions from msgs before Ogre use #1179

rhaschke · 2017-12-30T01:24:25Z

This is a follow-up to #1167, which added tests for invalid quaternions that caused crashes in Ogre.
However, instead of rejecting those invalid quaternions, I propose to normalize them (or use identity if a zero quaternion is passed). Actually this was already the default behaviour of FrameManager::transform() which is used by most routines anyway.
However, in rare cases ROS quaternions were used directly to set an Ogre::Quaternion, which then indeed requires normalization and special handling of the zero quaternion.

This PR reverts most changes introduced in #1167 and thus fixes the regression not handling non-normalized quaternions gracefully: #1178, moveit/moveit#732.

davetcoleman · 2018-01-02T00:32:21Z

I just ran into this issue today, spent ~4 hours trying to understand why some machines it was working until I upgraded them all. Thanks for this patch @rhaschke!

davetcoleman

I believe it would it be cleaner if you reverted #1167 then only had your proposed changes here

davetcoleman · 2018-01-02T00:32:50Z

src/rviz/default_plugin/covariance_visual.cpp

-  Ogre::Quaternion ori(pose.pose.orientation.w, pose.pose.orientation.x, pose.pose.orientation.y, pose.pose.orientation.z);
+  Ogre::Quaternion ori;
+  if (!normalizeQuaternion(pose.pose.orientation, ori))
+    ROS_WARN("invalid quaternion (zero length)");


nit: capitalize Invalid here and in rest of this PR

davetcoleman · 2018-01-02T00:35:55Z

src/rviz/validate_quaternions.h

 {
-  return validateQuaternions( msg.orientation );
+  float norm2 = quaternionNorm2(w, x, y, z);
+  if (norm2 < 10e-3f)


can you add a comment here why this threshold was chosen?

davetcoleman · 2018-01-02T00:37:17Z

src/rviz/validate_quaternions.h

+  return validateQuaternion(msg.pose.orientation);
+}
+
+


one line break

davetcoleman · 2018-01-02T02:29:27Z

I've tested this patch locally and it fixes the issue for me, thanks again

tfoote · 2018-01-03T00:19:11Z

I'm not sure that I agree with the logic to just "normalize" the quaternion on input. Sending an unnormalized quaternion is like sending a NaN for a float value. Just pushing it to the nearest valid float real value is not representing the same thing.

When doing quaternion math floating point arithmetic errors can induce quaternions that are non-normalized and they do periodically need to be renormalized. Renormalizing at every step is a very high overhead, normalizing never will end up with NaN values. There is a balance in between to two extremes. This is typically done in a specific time in processing. Often I would expect it to be done shortly before being output from an algorithm not on the input. As the input will not have semantic knowledge of the previous potential cause of non-normalization. And I think it's much more correct for input validation instead of input normalization.

Related to this arbitrarily normalizing all inputs can cause completely invalid quaternions (such as unset) to become valid and be rendered. Normalization can only be considered valid if the original is close to normalized, or in a known state of non-normalization. (for example unitialized with all zeros) I can certainly take 4 numbers and turn them into a normalized quaternion, but if they still represent the same underlying information is not clear. Thus it would be better to reject the invalid data rather than pushing it into the right form.

Another example of this might be if the throttle commands for a vehicle are on the range 0 to 1. If I receive a command of 1.1 what does that mean? It's undefined as it's not in the interval, it could be someone trying to dial it up to 11. Or it could be someone sending 1.1 out of a range on 100. The meaning of a non-normalized quaternions is similarly undefined. I would strongly advocate for resolving this issue by making sure that all published quaternions are valid rather than accepting invalid quaternions by auto-renormalizing them.

rhaschke · 2018-01-03T01:15:42Z

@tfoote To some extend I can agree to your argumentation. However, it's a plain fact, that rviz did (and still does if allowed to) normalize quaternions incoming via ROS msgs and now it simply rejects them. This dramatically changes behaviour of rviz and - besides others - breaks the plain interactive marker tutorial (which could be thought of a best-practice example).
So, I'm fine to change the semantics in some future. But, please give us some time for transition and don't break a released tool.
Also, we should provide some tools to normalize quaternions in a ROS pose msg. Then it becomes feasible to normalize at the output generator (instead of the input in rviz).

tfoote · 2018-01-03T01:28:06Z

We're looking at restoring the permissive behavior to restore existing behavior, but adding warnings. Anywhere that is known to be sending unnormalized quaternions should be ticketed for resolving at the source. Interactive markers tutorials or libraries should not be sending unnormalized quaternions. Please open tickets for anything that's having trouble right now while we're working to resolve this.

wjwwood · 2018-01-03T01:29:37Z

However, it's a plain fact, that rviz did (and still does if allowed to) normalize quaternions incoming via ROS msgs

Is that true? Looking at #1167, I don't see where code that did normalize incoming marker message quaternions was removed. Perhaps you mean in other places?

This dramatically changes behaviour of rviz and - besides others - breaks the plain interactive marker tutorial (which could be thought of a best-practice example).
So, I'm fine to change the semantics in some future. But, please give us some time for transition and don't break a released tool.

I can't speak for @tfoote, but we're on the same page here, the disruptive change in behavior was unexpected and we're going to revert it and do something about it. I think the idea of requiring valid quaternions would be a future change or at least would be opt-in if back ported.

I think it's worth pursuing to get valid quaternions in rviz (either fixed on the rviz side or on the publishing side), because there have been other issues which attribute crashes to invalid quaternions, but we haven't conclusively linked those two things yet, e.g.:

#1082 (comment)

dhood · 2018-01-03T01:40:20Z

I don't believe we ever normalised un-normalised quaternions before 1167, but @rhaschke pointed out that, previously, null quaternions would get set to identity in FrameManager::transform() while now they are rejected before they get to that point, which may be what they were referring to wrt "normalising" in the past

gavanderhoorn · 2018-01-03T08:31:25Z

The unexpected breakage of RViz is not nice, but patching up invalid data just so it can be rendered makes no sense to me, so my vote would be for the warn/error-and-discard option.

That would be similar to TF and other incoming msgs that can't be transformed properly: they're discarded and a warning/error with sufficient visibility is output (ie: "Dropped N% of msgs so far, ..").

Normalising unconditionally will probably just mean that other nodes will crash/error out on the same data, which could be equally confusing ("but RViz does accept my quaternion?").

rhaschke · 2018-01-03T09:44:40Z

To clarify my point: rviz currently adapts invalid quaternions in at least two central places (since ages):

null quaternions are replaced by identity quaternions in rviz::FrameManager::transform
quaternions are normalized in tf::Matrix3x3::setRotation (same in tf2)
called from rviz::FrameManager::transform

So, its not only rviz, which re-normalized quaternions, but that's already part of more basic tf library!

I fully understand the wish to handle normalization once at generation time, but current practice is fundamentally different. I guess, if you enable a warning in tf you will get hundreds of warnings. It's common practice to create quaternions like (1,0,0,1) to represent 90° rotations about some axes and then relying on normalization.

This reverts commit b329145.

Normalization of quaternions usually is done by rviz::FrameManager::transform() which transforms a ROS pose into an Ogre position + orientation. Only in some rare cases, a ROS quaternion was directly used as an Ogre::Quaternion, which then requires handling of null quaternions (as they arise from uninitialized ROS pose msgs).

rhaschke · 2018-01-03T10:18:59Z

I addressed @davetcoleman's comments.

... as suggested in ros-visualization#1182

davetcoleman · 2018-01-03T16:46:53Z

I agree we should require all input to be normalized in the future but for now it should just produce warnings instead of breaking Rviz worldwide. I have found regressions in several different uses of Rviz, including MoveIt!.

dhood · 2018-01-03T23:48:16Z

@rhaschke thank you for clarifying that tf will normalise the quaternions inside FrameManager::transform. In light of this it makes sense to continue normalising in the missing places as you have proposed.

Since we still want warnings for unnormalised quaternions I'll remove the reverting of 1167 from this PR in favour of #1182, and this PR will just focus on normalising the quaternions from ROS msgs that are not otherwise normalised.

Additionally I'll invert the logic to consider null quaternions valid (users being lazy) and unnormalised ones invalid (potentially sign of an error)

…ion#1167)"" This reverts commit 42a4416.

Don't mix logic for what is considered a valid quat into this function; matches Ogre

Logic of what makes a quaternion valid isn't in normalizeQuaternion; also gets the invalid quaternion value logged to console

dhood · 2018-01-04T04:16:09Z

@rhaschke thank you for kick starting this and for giving permission for this branch to be adapted by us.

As I mentioned, this PR now addresses the remaining places in the codebase @rhaschke identified where quaternions may be null/unnormalised. It addresses the issue that initially prompted rejecting invalid quaternions (#1137) and potentially others. It therefore removes the need for us to reject invalid quaternions (we now give warnings instead: #1182)

I've minimised the diff to simplify review, including removing printing of warnings for invalid quaternions that are covered by #1182. normalizeQuaternions still sets null quaternions to identity and normalises all others, but no longer returns a boolean; callers must use validateQuaternions to determine if a warning should be printed (and to get the invalid quaternion's magnitude logged to debug).

@rhaschke The previous state of this PR had map_display rejecting null quaternions which is no longer the case since I imagine it was unintentional, please correct me if that's not the case

dhood · 2018-01-04T04:16:32Z

@ros-pull-request-builder retest this please

dhood · 2018-01-04T04:27:34Z

@ros-pull-request-builder retest this please

(issues with the git refs)

wjwwood · 2018-01-04T08:30:46Z

Obviously need to get the CI passing or verify it locally, otherwise lgtm.

rhaschke

+1
I my proposal, I tried to avoid computing the quaternion norm twice (once for validation and once for normalization). Hence the boolean return value of normalizeQuaternion().
The current proposal explicitly separates validation and normalization which is fine if the additional overhead is accepted. As I understood @tfoote in #1179 (comment), he wanted to reduce the computational overhead of normalization to a minimum ;-)

rhaschke · 2018-01-04T11:19:23Z

src/rviz/validate_quaternions.h

+  if ( 0.0f == x && 0.0f == y && 0.0f == z && 0.0f == w )
+  {
+    w = 1.0f;
+    x = y = z = 0.0f;


This line is superfluous. x,y,z are zero already.

gah, thanks for pointing that out: df6dfd4

rhaschke · 2018-01-04T11:20:07Z

src/rviz/validate_quaternions.h

+  if ( 0.0 == x && 0.0 == y && 0.0 == z && 0.0 == w )
+  {
+    w = 1.0;
+    x = y = z = 0.0;


Superfluous again.

rhaschke · 2018-01-04T11:21:44Z

src/rviz/validate_quaternions.h

 template<typename T>
-inline bool validateQuaternions(const T &vec)
+inline bool validateQuaternions( const std::vector<T> &vec )


I more like the old version as it is more compact and easier to grasp ;-)

yeah, I know, but I'm trying to keep unnecessary changes out for this PR 😃

rhaschke · 2018-01-04T11:32:35Z

@rhaschke The previous state of this PR had map_display rejecting null quaternions which is no longer the case since I imagine it was unintentional, please correct me if that's not the case.

@dhood I think, converting null quaternions in a message to identity is what the user expects here too.

dhood · 2018-01-04T11:58:26Z

The current proposal explicitly separates validation and normalization which is fine if the additional overhead is accepted.

Yeah, for most of these msgs they're not being normalised elsewhere so it's roughly the same overhead as the validateQuaternions + tf normalisation in other places

The devel jobs are behaving now so we'll just wait for that to turn over. Appreciate your contributions/insight on this @rhaschke

VictorLamoine · 2018-01-04T14:36:56Z

I think we should reject quaternions that are not valid in the future.
I understand it will break a lot of code but this clearly needs to be fixed at the source; quaternions must be defined.

I'm in favour of:

Normalizing/initializing incoming quaternions AND printing warnings about invalid quaternions
Reporting and fixing of as much packages as possible during a defined time (6 months? 1 year?)
After this time, rejecting invalid quaternions (hoping that most packages are already fixed)

It will take some time to fix the code base but the effort is worth it.
What is the plan?

VictorLamoine · 2018-01-04T14:42:39Z

Did someone test the overhead of this patch? eg: when displaying a large number of markers?

dhood · 2018-01-05T03:16:29Z

OK, looks like we're all in agreement.

Summarising the thread: invalid quaternions will be permitted for now to not break displays; publishers of invalid quaternions should be reported/updated; invalid quaternions will be rejected in the future.

What is the plan?

Pretty much as you said, @VictorLamoine. This is the timeline to get to the end goal of not normalising inputs:

Now: normalizing/initializing incoming quaternions AND printing warnings about invalid quaternions, with the 'once' log filter to ease into it.
After 6 months: more + stronger warnings to encourage the remaining offenders.
After 1 year: reject invalid quaternions with a warning.

Regarding the overhead, we can consider adding an option to displays to assume valid quaternions if there are use cases for which the overhead needs to be avoided.

rhaschke · 2018-01-05T05:04:25Z

@dhood @VictorLamoine In order to enforce valid quaternions on the generating side, you should provide tools to:

correctly initialize an identity quaternion in corresponding ROS msgs by default. Currently, the default is (0,0,0,0). As long as this is not possible (as pointed out Normalize quaternions from msgs before Ogre use #1179 (comment)), null quaternions should be turned into identity quaternions on the receiving side.
centrally provide a tool function to normalize a quaternion msg. Optimally, this function comes with the geometry_msgs package.
Do you also want to remove the implicit normalization in tf?

wjwwood · 2018-01-05T05:07:34Z

correctly initialize an identity quaternion in corresponding ROS msgs by default. Currently, the default is (0,0,0,0).

That's not possible because there are no default values for generated messages in ROS atm. In ROS 2 we have this (specifically for this case with quaternions), but I don't think there are plans to back port it, but with some community interest and help it might happen.

rhaschke · 2018-01-05T05:09:59Z

correctly initialize an identity quaternion in corresponding ROS msgs by default. Currently, the default is (0,0,0,0).

That's not possible because there are no default values for generated messages in ROS atm.

Actually, I was afraid of this. For me, that's a clear reason that generation-side normalization is currently not feasible in a comfortable fashion.

wjwwood · 2018-01-05T05:18:30Z

That's always been the case though. I don't see why requiring people to initialize their quaternions is a problem. Also for the special case of all zeros, we can continue to change that to identity for people, since that's not as expensive as always normalizing. That case, in my mind, is separate from giving unnormalized quaternions.

rhaschke · 2018-01-05T08:41:46Z

src/rviz/validate_quaternions.h

-  x /= norm2;
-  y /= norm2;
-  z /= norm2;
+  float invnorm = 1.0f / norm2;


What's the benefit of this? Multiplication and division have identical costs on the CPU, haven't they?

rhaschke · 2018-01-05T08:46:18Z

Also for the special case of all zeros, we can continue to change that to identity for people.

Agreed.

davetcoleman mentioned this pull request Jan 2, 2018

Added check for invalid quaternions. #1167

Merged

davetcoleman reviewed Jan 2, 2018

View reviewed changes

wjwwood mentioned this pull request Jan 3, 2018

Arrow Marker broken when using a start and end point instead of pose. #1181

Closed

dhood mentioned this pull request Jan 3, 2018

Warn on unnormalised quaternions instead of rejecting them #1182

Merged

rhaschke mentioned this pull request Jan 3, 2018

Fix crash when setting covariance using null quaternion #1180

Closed

rhaschke added 3 commits January 3, 2018 11:05

Revert "Added checks for invalid quaternions. (ros-visualization#1167)"

42a4416

This reverts commit b329145.

addressed Dave's comments

76e3a52

rhaschke force-pushed the normalize-quaternions branch from 1d30875 to 76e3a52 Compare January 3, 2018 10:17

add more verbose warnings

99d4096

... as suggested in ros-visualization#1182

dhood added 6 commits January 3, 2018 16:06

Revert "Revert "Added checks for invalid quaternions. (ros-visualizat…

5ef2fa4

…ion#1167)"" This reverts commit 42a4416.

Merge branch 'kinetic-devel' into pr/1179

332d24d

Minimise diff

52ddd20

Warning will already be output for invalid quats when msg validated

f16fefb

Return pre-normalised length from normalizeQuat

4eb197e

Don't mix logic for what is considered a valid quat into this function; matches Ogre

Use validateQuaternions for map

f5c7bab

Logic of what makes a quaternion valid isn't in normalizeQuaternion; also gets the invalid quaternion value logged to console

dhood requested a review from wjwwood January 4, 2018 06:38

dhood mentioned this pull request Jan 4, 2018

Fresh git clone required for pr job ros-infrastructure/ros_buildfarm#494

Closed

wjwwood approved these changes Jan 4, 2018

View reviewed changes

rhaschke mentioned this pull request Jan 4, 2018

Interactive marker missing after upgrade moveit/moveit#736

Closed

rhaschke commented Jan 4, 2018

View reviewed changes

Remove unnecessary 0 setting

df6dfd4

VictorLamoine mentioned this pull request Jan 4, 2018

Reset marker should publish initialized quaternion PickNikRobotics/rviz_visual_tools#69

Merged

Reduce number of divisions

498c185

dhood changed the title ~~normalize invalid quaternions instead of rejecting them~~ Normalize quaternions from msgs before Ogre use Jan 5, 2018

dhood merged commit 866d376 into ros-visualization:kinetic-devel Jan 5, 2018

rhaschke commented Jan 5, 2018

View reviewed changes

rhaschke deleted the normalize-quaternions branch January 5, 2018 08:53

rhaschke mentioned this pull request Jan 5, 2018

rviz' stronger handling of marker quaternions breaks interactive markers in plugins moveit/moveit#732

Closed

cassinaj mentioned this pull request Jan 17, 2018

rviz crashes due to invalid quaternion after Interactive Marker Server applyChanges to existing marker #1185

Closed

j-petit mentioned this pull request Apr 2, 2019

Normalize quaternions when adding new or moving collision objects (#1119) moveit/moveit#1420

Merged

3 tasks

Normalize quaternions from msgs before Ogre use #1179

Normalize quaternions from msgs before Ogre use #1179

Conversation

rhaschke commented Dec 30, 2017

davetcoleman commented Jan 2, 2018 • edited Loading

davetcoleman left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

davetcoleman commented Jan 2, 2018

tfoote commented Jan 3, 2018

rhaschke commented Jan 3, 2018

tfoote commented Jan 3, 2018

wjwwood commented Jan 3, 2018

dhood commented Jan 3, 2018

gavanderhoorn commented Jan 3, 2018

rhaschke commented Jan 3, 2018

rhaschke commented Jan 3, 2018

davetcoleman commented Jan 3, 2018

dhood commented Jan 3, 2018

dhood commented Jan 4, 2018

dhood commented Jan 4, 2018

dhood commented Jan 4, 2018

wjwwood commented Jan 4, 2018

rhaschke left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rhaschke commented Jan 4, 2018 • edited Loading

dhood commented Jan 4, 2018

VictorLamoine commented Jan 4, 2018 • edited Loading

VictorLamoine commented Jan 4, 2018

dhood commented Jan 5, 2018

rhaschke commented Jan 5, 2018 • edited Loading

wjwwood commented Jan 5, 2018

rhaschke commented Jan 5, 2018

wjwwood commented Jan 5, 2018

Choose a reason for hiding this comment

rhaschke commented Jan 5, 2018

davetcoleman commented Jan 2, 2018 •

edited

Loading

rhaschke commented Jan 4, 2018 •

edited

Loading

VictorLamoine commented Jan 4, 2018 •

edited

Loading

rhaschke commented Jan 5, 2018 •

edited

Loading