GitPedia
minimalparts

minimalparts/nonce2vec

Incremental learning of word embeddings with context informativeness.

5 Releases
Latest: 5y ago
v2.0.2Latest
akb89akb89·5y ago·November 2, 2020
GitHub
v2.0.1
akb89akb89·6y ago·January 10, 2020
GitHub

Fixed bug on empty probs in informativeness

v2.0.0
akb89akb89·6y ago·July 29, 2019
GitHub

This is the version accompanying the SRW 2019 paper *Towards Incremental Learning of Word Embeddings Using Context Informativeness* (Kabbach et al., 2019). **Abstract** In this paper, we investigate the task of learning word embeddings from very sparse data in an incremental, cognitively-plausible way. We focus on the notion of informativeness, that is, the idea that some content is more valuable to the learning process than other. We further highlight the challenges of online learning and argue that previous systems fall short of implementing incrementality. Concretely, we incorporate informativeness in a previously proposed model of nonce learning, using it for context selection and learning rate modulation. We test our system on the task of learning new words from definitions, as well as on the task of learning new words from potentially uninformative contexts. We demonstrate that informativeness is crucial to obtaining state-of-the-art performance in a truly incremental setup. **Citation** ```tex @inproceedings{kabbach-etal-2019-towards, title = "Towards Incremental Learning of Word Embeddings Using Context Informativeness", author = "Kabbach, Alexandre and Gulordava, Kristina and Herbelot, Aur{\'e}lie", booktitle = "Proceedings of the 57th Conference of the Association for Computational Linguistics: Student Research Workshop", month = jul, year = "2019", address = "Florence, Italy", publisher = "Association for Computational Linguistics", url = "https://www.aclweb.org/anthology/P19-2022", pages = "162--168" } ```

v1.2.1
akb89akb89·7y ago·September 21, 2018
GitHub

📋 Changes

  • added zen DOI
  • Added info about code location
  • Fixed chimera dataset
Initial release of nonce2vec.v1.0
minimalpartsminimalparts·8y ago·December 31, 2017
GitHub

📦 Citation

  • A. Herbelot and M. Baroni. 2017. High-risk learning: Acquiring new word vectors from tiny data. Proceedings of EMNLP 2017 (Conference on Empirical Methods in Natural Language Processing).