GloVe is an unsupervised learning algorithm for obtaining vector representations of words. Training is performed on aggregated global word-word co-occurrence statistics from a corpus, and the resulting representations exhibit interesting linear substructures of the word vector space.

The links provided contain word vectors obtained from the respective corpora. If you want word vectors trained on massive web datasets, you need only download one of these text files. Pre-trained word vectors are made available under the Public Domain Dedication and License.

If those web datasets don't match the semantics of your end use case, you can train word vectors on your own corpus. The demo.sh script downloads a small corpus consisting of the first 100M characters of Wikipedia. It collects unigram counts, constructs and shuffles co-occurrence data, and trains a simple version of the GloVe model. It also runs a word analogy evaluation script in Python to verify word vector quality.
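To give a feel for how the downloaded text files and the analogy check fit together, here is a minimal sketch. It assumes each line of a vector file holds a word followed by its space-separated float components. The helper names (`parse_vectors`, `cosine`, `analogy`) and the two-dimensional toy vectors are invented for illustration, and the analogy rule shown (nearest neighbour of b - a + c by cosine similarity) is one common formulation, not necessarily what the package's evaluation script does.

```python
import math
from typing import Dict, Iterable, List


def parse_vectors(lines: Iterable[str]) -> Dict[str, List[float]]:
    """Parse lines of the assumed form 'word v1 v2 ... vd' into a dict."""
    vectors: Dict[str, List[float]] = {}
    for line in lines:
        parts = line.rstrip().split(" ")
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def analogy(vectors: Dict[str, List[float]], a: str, b: str, c: str) -> str:
    """Answer 'a is to b as c is to ?' via the nearest neighbour of b - a + c."""
    target = [vb - va + vc
              for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(vectors[w], target))


# Toy data (made up): in practice, pass an open file handle to parse_vectors.
toy_lines = [
    "man 1.0 0.0",
    "woman 1.0 1.0",
    "king 3.0 0.0",
    "queen 3.0 1.0",
    "apple -1.0 -2.0",
]
vecs = parse_vectors(toy_lines)
result = analogy(vecs, "man", "woman", "king")
print(result)
```

With a real file, something like `parse_vectors(open(path, encoding="utf-8"))` would replace the toy list, where `path` points at whichever vector file you downloaded.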

Features

  • Global vectors for word representation
  • Available pre-trained word vectors
  • The GloVe model is trained on the non-zero entries of a global word-word co-occurrence matrix
  • The tools provided in this package automate the collection and preparation of co-occurrence statistics for input into the model
  • GloVe is essentially a log-bilinear model with a weighted least-squares objective
  • The training objective of GloVe is to learn word vectors such that their dot product equals the logarithm of the words' probability of co-occurrence
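The last few points can be illustrated with a small sketch, written in Python for brevity rather than in the package's C. It counts co-occurrences in a symmetric window and evaluates a weighted least-squares loss over the non-zero entries only. Several details are assumptions rather than a description of the package's tools: the 1/distance weighting of pairs, the window size, the weighting function f(x) = (x / x_max)^alpha capped at 1 with x_max = 100 and alpha = 0.75 (the form given in the GloVe paper), and all function names. The bias terms stand in for the normalization that turns a raw count into a co-occurrence probability.

```python
import math
from collections import defaultdict
from typing import Dict, List, Tuple

Pair = Tuple[str, str]


def cooccurrence(tokens: List[str], window: int = 2) -> Dict[Pair, float]:
    """Symmetric co-occurrence counts; a pair d tokens apart adds 1/d (assumed)."""
    counts: Dict[Pair, float] = defaultdict(float)
    for i, word in enumerate(tokens):
        for d in range(1, window + 1):
            j = i + d
            if j >= len(tokens):
                break
            counts[(word, tokens[j])] += 1.0 / d
            counts[(tokens[j], word)] += 1.0 / d
    return dict(counts)


def weight(x: float, x_max: float = 100.0, alpha: float = 0.75) -> float:
    """Weighting function f(x): down-weights rare pairs, caps frequent ones at 1."""
    return (x / x_max) ** alpha if x < x_max else 1.0


def glove_loss(counts: Dict[Pair, float],
               w: Dict[str, List[float]],
               w_ctx: Dict[str, List[float]],
               b: Dict[str, float],
               b_ctx: Dict[str, float]) -> float:
    """Sum over non-zero entries of f(X_ij) * (w_i . w~_j + b_i + b~_j - log X_ij)^2."""
    total = 0.0
    for (i, j), x in counts.items():
        dot = sum(p * q for p, q in zip(w[i], w_ctx[j]))
        diff = dot + b[i] + b_ctx[j] - math.log(x)
        total += weight(x) * diff * diff
    return total


# Toy corpus (made up) with all parameters initialised to zero.
tokens = "the cat sat on the mat".split()
counts = cooccurrence(tokens, window=2)
vocab = sorted(set(tokens))
zeros = {v: [0.0, 0.0] for v in vocab}
zero_bias = {v: 0.0 for v in vocab}
loss = glove_loss(counts, zeros, zeros, zero_bias, zero_bias)
print(len(counts), loss)
```

Because only pairs that actually occur are stored in `counts`, the loop never touches zero entries of the matrix, which is what makes training on the sparse statistics practical.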

License

Apache License V2.0

Additional Project Details

Programming Language

C

Related Categories

C Word Processors, C Distributed Computing Software

Registered

2021-09-30