- Reduce the CUDA stack allocation for the BIH unit test
- Add changes from Ken's review
- Suppress ptxas stack size warning for BoundingIntervalHierarchy