Automated WordNet Construction Using Word Embeddings

Citation:

Khodak, M., et al. “Automated WordNet Construction Using Word Embeddings”. EACL Workshop on Sense, Concept and Entity Representations and their Applications (SENSE) 2017. Web.

Abstract:

We present a fully unsupervised method for automated construction of WordNets based upon recent advances in distributional representations of sentences and word-senses combined with readily available machine translation tools. The approach requires very few linguistic resources and is thus extensible to multiple target languages. To evaluate our method we construct two 600-word test sets for word-to-synset matching in French and Russian using native speakers and evaluate the performance of our method along with several other recent approaches. Our method exceeds the best language-specific and multi-lingual automated WordNets in F-score for both languages. The databases we construct for French and Russian, both languages without large publicly available manually constructed WordNets, will be publicly released along with the test sets.

Publisher's Version

Last updated on 12/04/2017