Sample: matrixMulDrv
Minimum spec: SM 1.0

This sample implements matrix multiplication and uses the new CUDA 4.0 kernel l

Key concepts:
