Refactored the CUDA-SCC grouping algorithm as is took 80x longer to calculate the groups than it took to calculate the entire solution. Former-commit-id: 5a5ffabe38
5a5ffabe38