I wrote this little tool to convert the GermaNet data set (XML files) into formats that network analysis software such as Pajek, igraph, and SNAP can read.
The tool takes the GermaNet XML files and converts their content to one of four formats: a node list ("node node"), CSV ("node,edge,node"), a FANMOD file (each node is represented by an integer: "int int"), or a Pajek .net file.
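To make the four output formats concrete, here is a minimal Python sketch that writes the same small edge list in each of them. It is not the tool's own code: the function names, the sample words, and the relation label "hypernym" are assumptions for illustration, as is the 1-based integer numbering in order of first appearance.

```python
# Illustrative sketch only: these helpers are hypothetical and are not
# taken from GermaNetWorkExtractor.

def index_nodes(edges):
    """Assign each node an integer id (1-based, order of first appearance)."""
    ids = {}
    for source, _relation, target in edges:
        for node in (source, target):
            if node not in ids:
                ids[node] = len(ids) + 1
    return ids


def to_node_list(edges):
    """Node list: one directed edge per line, 'node node'."""
    return "\n".join(f"{s} {t}" for s, _r, t in edges)


def to_csv(edges):
    """CSV: one edge per line, 'node,edge,node'."""
    return "\n".join(f"{s},{r},{t}" for s, r, t in edges)


def to_fanmod(edges):
    """FANMOD-style output: nodes replaced by integers, 'int int'."""
    ids = index_nodes(edges)
    return "\n".join(f"{ids[s]} {ids[t]}" for s, _r, t in edges)


def to_pajek(edges):
    """Pajek .net: a *Vertices section followed by an *Arcs section."""
    ids = index_nodes(edges)
    lines = [f"*Vertices {len(ids)}"]
    lines += [f'{i} "{name}"' for name, i in ids.items()]
    lines.append("*Arcs")
    lines += [f"{ids[s]} {ids[t]}" for s, _r, t in edges]
    return "\n".join(lines)


# Hypothetical sample data: (source, relation, target)
EDGES = [("Hund", "hypernym", "Tier"), ("Katze", "hypernym", "Tier")]

if __name__ == "__main__":
    for writer in (to_node_list, to_csv, to_fanmod, to_pajek):
        print(f"--- {writer.__name__} ---")
        print(writer(EDGES))
```

Note that only the CSV variant keeps the relation name; the other three formats reduce each relation to a pair of nodes.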
./germanet.jar
GermaNetWorkExtractor (Beta)Version 1.0 for GermaNet5.2 by email@example.com
Usage: GermaNetWorkExtractor [options] [GermaNet-Files] [gn_relations.xml]*
Options:
    --lexical:      Extract lexical relations from GermaNet
    --conceptual    Extract conceptual relations from GermaNet
    --all           Extract lexical AND conceptual relations from GermaNet (default)
    --nodeList      Output: list of nodes, directed (node node)
    --csv           Output: comma separated values (node,edge,node)
    --fanmod        Output: output for fanmod (int int)
    --net           Output: pajek-net-file
If you pass only some of the XML files to the tool, the process rate will be below 100%. Some nodes are defined in other XML files; when those files are not available to the tool, every relation that contains such a node is skipped!
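The skipping behaviour can be sketched in a few lines of Python. This is an assumption about how such a filter might work, not the tool's actual implementation; in particular, treating the process rate as the percentage of relations whose two nodes are both known is my reading, and `filter_relations` is a hypothetical name.

```python
# Illustrative sketch only: hypothetical helper, not part of GermaNetWorkExtractor.

def filter_relations(known_nodes, relations):
    """Keep relations whose source and target are both known.

    Returns the kept relations and the share of relations kept, in percent
    (assumed here to correspond to the reported process rate).
    """
    kept = [
        (source, relation, target)
        for source, relation, target in relations
        if source in known_nodes and target in known_nodes
    ]
    rate = 100.0 * len(kept) / len(relations) if relations else 100.0
    return kept, rate


if __name__ == "__main__":
    # Node "c" is defined in a file that was not passed to the tool.
    known = {"a", "b"}
    relations = [("a", "rel", "b"), ("a", "rel", "c")]
    kept, rate = filter_relations(known, relations)
    print(kept, rate)
```

Passing all GermaNet XML files means every node is known, so no relation is dropped and the rate stays at 100%.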