This seems similar to a project I've been working on: https://browserdaemon.com. Regarding your crowdsourced data collection, perhaps you should seed in a hidden percentage of prompts whose correct completions you already know, so you can catch bad actors.
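
To make the idea concrete, here is a rough sketch of how hidden known-answer prompts could work. Everything in it is an assumption for illustration, not how your system is built: the names `build_queue` and `flag_contributors`, the 10% rate, the 80% accuracy cutoff, and the exact-match comparison (real completions would likely need a fuzzier check).

```python
import random
from collections import defaultdict


def build_queue(prompts, gold, gold_fraction=0.1, seed=None):
    """Mix known-answer (gold) prompts into the work queue.

    prompts: list of ordinary prompts with unknown completions.
    gold: dict mapping a prompt to its known correct completion.
    gold_fraction: hypothetical target share of gold prompts in the queue
        (must be less than 1).
    """
    rng = random.Random(seed)
    # Pick enough gold prompts that they make up roughly gold_fraction
    # of the final queue, capped by how many gold prompts exist.
    n_gold = max(1, round(len(prompts) * gold_fraction / (1 - gold_fraction)))
    n_gold = min(n_gold, len(gold))
    chosen = rng.sample(sorted(gold), n_gold)
    queue = list(prompts) + chosen
    rng.shuffle(queue)  # contributors cannot tell gold prompts by position
    return queue


def flag_contributors(submissions, gold, min_accuracy=0.8, min_gold_seen=3):
    """Return contributors whose accuracy on gold prompts is too low.

    submissions: iterable of (contributor, prompt, completion) tuples.
    Contributors who have seen fewer than min_gold_seen gold prompts are
    not judged yet.
    """
    seen = defaultdict(int)
    correct = defaultdict(int)
    for contributor, prompt, completion in submissions:
        if prompt not in gold:
            continue  # ordinary prompt, nothing to check against
        seen[contributor] += 1
        # Assumption: a case-insensitive exact match counts as correct.
        if completion.strip().lower() == gold[prompt].strip().lower():
            correct[contributor] += 1
    return {
        c for c, n in seen.items()
        if n >= min_gold_seen and correct[c] / n < min_accuracy
    }
```

You would call `build_queue` when handing out work and run `flag_contributors` over the collected submissions periodically. Flagged contributors' answers to the non-gold prompts could then be down-weighted or reviewed rather than trusted outright.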