You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

56 lines
2.9 KiB

  1. namespace StormEigen {
  2. /** \eigenManualPage TopicWrongStackAlignment Compiler making a wrong assumption on stack alignment
  3. <h4>It appears that this was a GCC bug that has been fixed in GCC 4.5.
  4. If you hit this issue, please upgrade to GCC 4.5 and report to us, so we can update this page.</h4>
  5. This is an issue that, so far, we met only with GCC on Windows: for instance, MinGW and TDM-GCC.
  6. By default, in a function like this,
  7. \code
  8. void foo()
  9. {
  10. StormEigen::Quaternionf q;
  11. //...
  12. }
  13. \endcode
  14. GCC assumes that the stack is already 16-byte-aligned so that the object \a q will be created at a 16-byte-aligned location. For this reason, it doesn't take any special care to explicitly align the object \a q, as Eigen requires.
  15. The problem is that, in some particular cases, this assumption can be wrong on Windows, where the stack is only guaranteed to have 4-byte alignment. Indeed, even though GCC takes care of aligning the stack in the main function and does its best to keep it aligned, when a function is called from another thread or from a binary compiled with another compiler, the stack alignment can be corrupted. This results in the object 'q' being created at an unaligned location, making your program crash with the \ref TopicUnalignedArrayAssert "assertion on unaligned arrays". So far we found the three following solutions.
  16. \section sec_sol1 Local solution
  17. A local solution is to mark such a function with this attribute:
  18. \code
  19. __attribute__((force_align_arg_pointer)) void foo()
  20. {
  21. StormEigen::Quaternionf q;
  22. //...
  23. }
  24. \endcode
  25. Read <a href="http://gcc.gnu.org/onlinedocs/gcc-4.4.0/gcc/Function-Attributes.html#Function-Attributes">this GCC documentation</a> to understand what this does. Of course this should only be done on GCC on Windows, so for portability you'll have to encapsulate this in a macro which you leave empty on other platforms. The advantage of this solution is that you can finely select which function might have a corrupted stack alignment. Of course on the downside this has to be done for every such function, so you may prefer one of the following two global solutions.
  26. \section sec_sol2 Global solutions
  27. A global solution is to edit your project so that when compiling with GCC on Windows, you pass this option to GCC:
  28. \code
  29. -mincoming-stack-boundary=2
  30. \endcode
  31. Explanation: this tells GCC that the stack is only required to be aligned to 2^2=4 bytes, so that GCC now knows that it really must take extra care to honor the 16 byte alignment of \ref TopicFixedSizeVectorizable "fixed-size vectorizable Eigen types" when needed.
  32. Another global solution is to pass this option to gcc:
  33. \code
  34. -mstackrealign
  35. \endcode
  36. which has the same effect than adding the \c force_align_arg_pointer attribute to all functions.
  37. These global solutions are easy to use, but note that they may slowdown your program because they lead to extra prologue/epilogue instructions for every function.
  38. */
  39. }