CoNLL2020

Paper:

Code used in the paper Analysing Word Representation in the Input and Output Layers of Neural Language Models.

Progress:

This repository contains the majority of the code needed to run the experiments in the paper. A number of legacy models and packages are required to build and run the system, plus a number of benchmarks are required to replicate the results. Here, we use the lm_1b model which has moved to the tensorflow archive.

Model:

https://github.com/tensorflow/models/tree/archive/research/lm_1b

Follow instructions to download model, or run lm_1b.py --mode get_data

Prerequisites:

Install TensorFlow 1.x.
Install Keras
Install pytorch
GenSim
fastText
glove

Benchmarks:

These will be added to the pipeline, but include the following.

vecto -> https://pypi.org/project/vecto/
BrainBench -> http://www.langlearnlab.cs.uvic.ca/brainbench/
SentEval -> https://github.com/facebookresearch/SentEval

While vecto and Brainbench are included in the src, SentEval experiments will require you to clone the repository and run the experiments yourself using the numpy files. The code needed to run evaluate the models on SentEval is included in the src file. SentEval will require you to install pytorch.

Neural Language Model

Requires PennTreeBank dataset and preprocessing. Find the dataset at

Citation

@inproceedings{derby2020analysing,
  title={Analysing Word Representation from the Input and Output Embeddings in Neural Network Language Models},
  author={Derby, Steven and Miller, Paul and Devereux, Barry},
  booktitle={Proceedings of the 24th Conference on Computational Natural Language Learning},
  pages={442--454},
  year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Paper		Paper
src		src
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CoNLL2020

Paper:

Progress:

Model:

Benchmarks:

Neural Language Model

Citation

About

Releases

Packages

Languages

License

stevend94/CoNLL2020

Folders and files

Latest commit

History

Repository files navigation

CoNLL2020

Paper:

Progress:

Model:

Benchmarks:

Neural Language Model

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages