You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

50 lines
1.9 KiB

25 years ago
25 years ago
  1. Tips for performance tuning on a specific architecture:
  2. 1. Choose the optimal limb size (intDsize). This is fundamental. On 32-bit
  3. platforms intDsize=32 is best. On 64-bit platforms intDsize=64 may be
  4. better, especially if there is a 64x64-bit multiplication in hardware.
  5. 2. Tune GMP.
  6. 3. The break-even points between several algorithms for the same task
  7. have to be determined experimentally, in the order given below:
  8. multiplication:
  9. cl_DS_mul.cc karatsuba_threshold
  10. cl_DS_mul.cc function cl_fftm_suitable
  11. division:
  12. cl_DS_div.cc function cl_recip_suitable
  13. 2-adic reciprocal:
  14. cl_2DS_recip.cc recip2adic_threshold
  15. 2-adic division:
  16. cl_2DS_div.cc function cl_recip_suitable
  17. square root:
  18. cl_DS_sqrt.cc function cl_recipsqrt_suitable
  19. cl_LF_sqrt.cc "if (len > ...)"
  20. gcd:
  21. cl_I_gcd.cc cl_gcd_double_threshold
  22. binary->decimal conversion:
  23. cl_I_to_digits.cc cl_digits_div_threshold
  24. pi:
  25. cl_LF_pi.cc best of 4 algorithms
  26. exp, log:
  27. cl_F_expx.cc factor limit_slope of isqrt(d)
  28. cl_R_exp.cc inside function exp
  29. cl_R_ln.cc inside function ln
  30. eulerconst:
  31. cl_LF_eulerconst.cc function compute_eulerconst
  32. sin, cos, sinh, cosh:
  33. cl_F_sinx.cc factor limit_slope of isqrt(d)
  34. cl_R_sin.cc inside function sin
  35. cl_R_cos.cc inside function cos
  36. cl_R_cossin.cc inside function cl_cos_sin
  37. cl_F_sinhx.cc factor limit_slope of isqrt(d)
  38. cl_R_sinh.cc inside function sinh
  39. cl_R_cosh.cc inside function cosh
  40. cl_R_coshsinh.cc inside function cl_cosh_sinh
  41. cl_F_atanx.cc factor limit_slope of isqrt(d)
  42. cl_F_atanx.cc inside function atanx
  43. cl_F_atanhx.cc factor limit_slope of isqrt(d)
  44. cl_F_atanhx.cc inside function atanhx