I’m using the Glove embeddings and before using AllenNLP, I have the following usage:
- If a word is not presented in Glove embedding, but its lowercased form is presented in the embedding. I will still use the embedding to initialize.
- During the evaluation (on the development set), if a word is not in the vocabulary in the training data, but it has a representation in the embedding file (Glove embeddings), I will still use this embedding as the word representation for this word.
Is it possible for me to do this in AllenNLP? I don’t mind modifying any part of the code.