Version 1.2

@bmschmidt bmschmidt released this 24 Apr 04:42
· 75 commits to master since this release

This release implements a number of incremental improvements and clears up some errors.

  • The package can now read and write the binary word2vec format. Because this is faster and takes much less hard drive space (about 66% less) than writing out floats as text, the package now uses the binary format internally.
  • Several improvements to the C codebase to avoid warnings by @mukul13, described here. (Does this address the long long issue?)
  • Subsetting with [[ now takes an argument average; if false, rather than collapsing the matrix down to a single row, it just extracts the rows that correspond to the words.
  • Added sample data in the object demo_vectors: vectors for the 999 most common words.
  • Began adding examples to the codebase.
  • Tracking build success using Travis.
  • Dodging most warnings from R CMD check.
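
To illustrate the difference the average argument makes when subsetting with [[, here is a minimal base R sketch. It uses a plain matrix with made-up values and hypothetical words rather than a real VSM object, so it only mimics the two behaviours described above; it is not the package's implementation.

```r
# Toy 3-word, 2-dimensional "model": a plain matrix with words as row names.
# (Hypothetical data; the package wraps a matrix like this in a VSM object.)
vectors <- matrix(
  c(1, 0,
    0, 1,
    1, 1),
  ncol = 2, byrow = TRUE,
  dimnames = list(c("good", "bad", "okay"), NULL)
)

words <- c("good", "bad")

# Analogue of average = TRUE: collapse the selected rows into one mean vector.
averaged <- matrix(colMeans(vectors[words, , drop = FALSE]), nrow = 1)

# Analogue of average = FALSE: keep one row per requested word.
extracted <- vectors[words, , drop = FALSE]

dim(averaged)   # one row, two columns
dim(extracted)  # two rows (one per word), two columns
```

Keeping the rows separate is useful when each word's vector is needed on its own, for example to compare the words with one another instead of with their average.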

Bug fixes

  • If the dir.exists function is undefined, the package now defines one for you. This should allow installation on R 3.1 and some lower versions.
  • reject and project are better about returning VSM objects, rather than dropping back into a matrix.
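
The dir.exists fallback mentioned above could look roughly like the following sketch. It assumes the shim is built on file.info from base R; the package's actual definition may differ.

```r
# Define a fallback only when base R does not already provide dir.exists
# (the function was added to base R in version 3.2.0).
if (!exists("dir.exists", mode = "function")) {
  dir.exists <- function(paths) {
    info <- file.info(paths)
    # isdir is NA for paths that do not exist, so treat NA as FALSE.
    !is.na(info$isdir) & info$isdir
  }
}

# Works the same way whether the base version or the fallback is in use.
dir.exists(tempdir())
dir.exists(file.path(tempdir(), "no-such-dir"))
```

Guarding the definition with exists() means newer versions of R keep using the built-in function, and only older versions get the replacement.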