This article seems to cover the various choices for constructing 'contexts' (which include skip-gram and CBOW) pretty well.
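For concreteness, here's a minimal sketch of that distinction (my own toy illustration, not code from the article; the function names are made up): skip-gram turns each (center word, one neighbor) pair into its own training example, while CBOW groups the whole window of neighbors into a single example that predicts the center word.

    def skipgram_pairs(tokens, window=2):
        # Skip-gram: each (center word, one neighboring word) is its own training example.
        for i, center in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    yield center, tokens[j]

    def cbow_examples(tokens, window=2):
        # CBOW: the whole window of neighbors (averaged later) predicts the center word.
        for i, center in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            context = [tokens[j] for j in range(lo, hi) if j != i]
            if context:
                yield context, center

    # e.g. list(skipgram_pairs("the quick brown fox".split()))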
Note that negative-sampling and hierarchical-softmax are actually alternative choices to interpret the hidden-layer and to arrive at error-values to back-propagate. Each can be used completely independently.
If you enable them both, you're training two independent output layers, which then update the same shared input-vectors in an interleaved fashion. (Essentially, each example is trained jointly: the hierarchical-softmax codepath nudges the vectors, then the separate negative-sampling codepath nudges them again.) So combining them doesn't reduce the complexity – it's additive to model state size and training time – and I think most projects with large amounts of data just use one or the other (usually just negative-sampling).
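To make that concrete, here is a toy numpy sketch (my own illustration, not word2vec's actual source; the Huffman path for hierarchical softmax is assumed to be precomputed elsewhere) of one skip-gram training step with both codepaths enabled. Each codepath keeps its own output weight matrix, so enabling both adds state and work, while both error signals flow back into the same shared input vector.

    import numpy as np

    rng = np.random.default_rng(0)

    V, D = 1000, 50                        # toy vocab size and embedding dimension
    W_in = rng.normal(0, 0.01, (V, D))     # shared input vectors (the word embeddings)
    W_ns = np.zeros((V, D))                # output weights for the negative-sampling codepath
    W_hs = np.zeros((V - 1, D))            # output weights for the hierarchical-softmax codepath (inner tree nodes)

    def sigmoid(x):
        return 1.0 / (1.0 + np.exp(-x))

    def train_pair(center, context, hs_path, lr=0.025, k=5):
        """One skip-gram step with BOTH codepaths enabled.
        hs_path: list of (inner_node_index, binary_code) for `context`,
                 assumed to come from a precomputed Huffman tree."""
        h = W_in[center]                   # projection = the center word's input vector
        grad_h = np.zeros(D)

        # --- negative-sampling codepath: true context gets label 1, k noise words get label 0 ---
        targets = [(context, 1.0)] + [(int(rng.integers(V)), 0.0) for _ in range(k)]
        for w, label in targets:
            err = sigmoid(W_ns[w] @ h) - label
            grad_h += err * W_ns[w]
            W_ns[w] -= lr * err * h

        # --- hierarchical-softmax codepath: one logistic decision per inner node on the path ---
        for node, code in hs_path:
            label = 1.0 - code             # label convention used in the original C implementation
            err = sigmoid(W_hs[node] @ h) - label
            grad_h += err * W_hs[node]
            W_hs[node] -= lr * err * h

        # both codepaths nudge the SAME shared input vector
        W_in[center] -= lr * grad_h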
Ah, thank you for pointing that out. I guess I got confused by all the papers I've read on the topic recently. It's hard to get into.
However, I still wouldn't agree that the comment-linked article explaining negative sampling really explains how word2vec works well enough, or maybe I just didn't understand it.