#1 will never work well for the simple fact that when people know that they are being asked to pick something without actually going through the mental process of evaluating options and picking a goal, they will not pick in a way that correlates with a larger population that doesn't know it's being tested.
The best way to test something is to try it on people that actually want something and voluntarily are picking something and going towards a goal... and don't know they're being tested. The proof is in the pudding: if the copy works better than a control, it should produce a sale/signup/whatever.
Asking a bunch of people who are working for pennies if they want to buy some product that they have no clue about and are not in the market for, based on a copy they don't even understand well, is a waste of money and would even do you some damage since your customers might be more sophisticated.
Now, if you positioned it for simple, general websites, it might work better. But then again, those sites are not high paying customers.
Also, you have no way to check answers. MTurk workers might just pick random to increase their earnings. Random results will just create noise. It's even worse if you have just two options.
The best way to test something is to try it on people that actually want something and voluntarily are picking something and going towards a goal... and don't know they're being tested. The proof is in the pudding: if the copy works better than a control, it should produce a sale/signup/whatever.
Asking a bunch of people who are working for pennies if they want to buy some product that they have no clue about and are not in the market for, based on a copy they don't even understand well, is a waste of money and would even do you some damage since your customers might be more sophisticated.
Now, if you positioned it for simple, general websites, it might work better. But then again, those sites are not high paying customers.
PS: Look into Google Optimizer: http://www.google.com/websiteoptimizer