Version 1.2
This release implements a number of incremental improvements and clears up some errors.
- The package can now read and write the binary word2vec format. Because this is faster and takes about 66% less hard drive space than writing out floats as text, the package now uses it internally.
- Several improvements to the C codebase to avoid warnings by @mukul13, described here. (Does this address the `long long` issue?)
- Subsetting with `[[` now takes an argument `average`; if false, rather than collapse a matrix down to a single row, it just extracts the elements that correspond to the words.
- Added sample data in the object `demo_vectors`: the 999 words from the most common vectors.
- Began adding examples to the codebase.
- Tracking build success using Travis.
- Dodging most warnings from R CMD check.
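
To illustrate why the binary format is more compact than text, here is a minimal base-R sketch of the word2vec binary layout as it is commonly described: an ASCII header giving the row and column counts, then each word followed by a space and its values as 4-byte floats. The helpers `write_w2v_binary` and `read_w2v_binary` are hypothetical names made up for this example, not functions exported by the package, and the toy matrix is random illustration data.

```r
# Toy model: 3 words, 4 dimensions, random values for illustration only.
set.seed(1)
vecs <- matrix(rnorm(12), nrow = 3,
               dimnames = list(c("cat", "dog", "fish"), NULL))

# Hypothetical helper: write a matrix in the word2vec binary layout.
write_w2v_binary <- function(m, path) {
  con <- file(path, "wb")
  on.exit(close(con))
  # ASCII header: "<rows> <cols>\n"
  writeBin(charToRaw(sprintf("%d %d\n", nrow(m), ncol(m))), con)
  for (i in seq_len(nrow(m))) {
    writeBin(charToRaw(paste0(rownames(m)[i], " ")), con)
    # Each value is stored as a 4-byte little-endian float.
    writeBin(as.numeric(m[i, ]), con, size = 4, endian = "little")
    writeBin(charToRaw("\n"), con)
  }
  invisible(path)
}

# Hypothetical helper: read the same layout back into a matrix.
read_w2v_binary <- function(path) {
  con <- file(path, "rb")
  on.exit(close(con))
  # Collect bytes up to a delimiter, skipping newlines left over
  # from the end of the previous vector.
  read_until <- function(delim) {
    out <- raw(0)
    repeat {
      b <- readBin(con, what = "raw", n = 1)
      if (length(b) == 0) break
      if (b == delim) break
      if (length(out) == 0 && b == as.raw(0x0a)) next
      out <- c(out, b)
    }
    rawToChar(out)
  }
  dims <- as.integer(strsplit(read_until(as.raw(0x0a)), " ")[[1]])
  m <- matrix(0, nrow = dims[1], ncol = dims[2])
  words <- character(dims[1])
  for (i in seq_len(dims[1])) {
    words[i] <- read_until(as.raw(0x20))
    m[i, ] <- readBin(con, what = "numeric", n = dims[2],
                      size = 4, endian = "little")
  }
  rownames(m) <- words
  m
}

bin_path <- tempfile(fileext = ".bin")
txt_path <- tempfile(fileext = ".txt")
write_w2v_binary(vecs, bin_path)
write.table(vecs, txt_path, quote = FALSE, col.names = FALSE)

back <- read_w2v_binary(bin_path)
# Values survive the round trip up to single-precision rounding.
print(all.equal(back, vecs, tolerance = 1e-6))
# Compare on-disk sizes of the two encodings.
print(c(binary = file.size(bin_path), text = file.size(txt_path)))
```

The saving comes from storing every value in exactly 4 bytes instead of a decimal string. The cost is that values are rounded to single precision, which is why the comparison above uses a tolerance.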
Bug fixes
- If the `dir.exists` function is undefined, it creates one for you. This should allow installation on R 3.1 and some lower versions.
- `reject` and `project` are better about returning VSM objects, rather than dropping back into a matrix.
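
The `dir.exists` fallback can be sketched roughly as follows. This is an assumption about how such a shim might look, not the package's actual code; `dir_exists_fallback` is a hypothetical name used here so the fallback can be tried out even on R versions that already provide `dir.exists` (it was added to base R in 3.2.0).

```r
# Hypothetical fallback: a path counts as a directory when file.info()
# reports isdir = TRUE; missing paths give NA, which is treated as FALSE.
dir_exists_fallback <- function(paths) {
  is_dir <- file.info(paths)$isdir
  !is.na(is_dir) & is_dir
}

# Only define dir.exists when the running R does not already have it.
if (!exists("dir.exists", mode = "function")) {
  dir.exists <- dir_exists_fallback
}

print(dir_exists_fallback(tempdir()))
print(dir_exists_fallback(file.path(tempdir(), "no-such-directory")))
```

Guarding the definition with `exists()` means newer R versions keep using the base implementation, and only older ones pick up the substitute.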