training mixture of B+C based on A checkpoint negate MCQA datasets using GPT4 find more negation datasets