Supersized Triangles
Currently triangles with an area greater than 200 pixels are dropped and not rendered. Instead, larger triangles of 32 pixels or more use an entire warp. One option is that shared memory list of large triangles is generated by a warp as it is processing. Once all small triangles are rendered by the warp, it checks for large triangles that it skipped and then uses the entire warp to render those triangles. It might be better to split this into two kernels, I'm not sure.
Edited by Nicolas Pope