Skip to content

Option to only dump random walks to disk and skip training#5

Open
viveksck wants to merge 6 commits into
phanein:masterfrom
viveksck:master
Open

Option to only dump random walks to disk and skip training#5
viveksck wants to merge 6 commits into
phanein:masterfrom
viveksck:master

Conversation

@viveksck

Copy link
Copy Markdown
Collaborator

gensim version downgraded to 0.10.1 as 0.10.2 does not install via easy_install due to this bug: https://groups.google.com/forum/#!topic/gensim/NSOXuP4IE9Q

Vivek Kulkarni added 2 commits January 26, 2015 14:39
… required version of gensim 0.10.2 cannot be added because of a bug in gensim where easy_install gensim fails for 0.10.2. Refer https://groups.google.com/forum/#!topic/gensim/NSOXuP4IE9Q
@viveksck

Copy link
Copy Markdown
Collaborator Author

In [1]: import gensim

In [2]: gensim.version
Out[2]: '0.10.1'

vvkulkarni@descartes:~/deepwalk$ deepwalk --input ./example_graphs/karate.adjlist --output karate.embeddings
Number of nodes: 34
Number of walks: 340
Data size (walks*length): 13600
Walking...
Training...

vvkulkarni@descartes:~/deepwalk$ ls -ltr karate.embeddings
-rw-rw-r-- 1 vvkulkarni vvkulkarni 20847 Jan 26 15:02 karate.embeddings

@aboSamoor

Copy link
Copy Markdown
Collaborator

This is the solution I used in polyglot
https://github.com/aboSamoor/polyglot/blob/master/setup.py#L20-L22

@viveksck

viveksck commented May 5, 2016

Copy link
Copy Markdown
Collaborator Author

Pushing in changes to only dump walks if needed. Change needed for extended work.

vvkulkarni@curie:/toolkits/viveks_deepwalk/deepwalk$ deepwalk --input example_graphs/karate.adjlist --output karate.embeddings --max-memory-data-size 0
Number of nodes: 34
Number of walks: 340
Data size (walks_length): 13600
Data size 13600 is larger than limit (max-memory-data-size: 0). Dumping walks to disk.
Walking...
Counting vertex frequency...
Training...
vvkulkarni@curie:
/toolkits/viveks_deepwalk/deepwalk$ deepwalk --input example_graphs/karate.adjlist --output karate.embeddings --max-memory-data-size 0 --only-walk
Number of nodes: 34
Number of walks: 340
Data size (walks_length): 13600
Data size 13600 is larger than limit (max-memory-data-size: 0). Dumping walks to disk.
Walking...

@viveksck viveksck changed the title Adding dependencies to be installed in setup.py Option to only dump random walks to disk and skip training May 5, 2016
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants