I want to sort an array of ints with a length of 1.000.000 to 100.000.000 . I want to run this program on a core2duo computer with 2Mb cache using pthread library. I want the fastest algorithm!
I have written a semi-parallel sort code which uses mergesort algorithm. But it's not fast enough!
___ sort___
/ \
/____ sort ___\ __ merge __
___/ \___/ \___ merge
\ ____ sort ____/ \__ merge __/
\ /
\___ sort __/