Patch flat cs of 16-bit entry points if current %cs is different from compiled value, and retrieve flat ds from a global variable. This should avoid problems with win4lin kernels.