Hi, Some questions:
- My input file is a text file with edges: node1 node2 edgeValue, I figured it out that I should use TextEdgeInputFormat and TextVertexValueInputFormat. But how do these two things fit together? Should I prepare another file that contains only the node informations for VertexValueInputFormat? - If the input file is a sequence file, how should I implement a SequenceEdgeInputFormat or SequenceVertexInputFormat? Or they exist already? - For output part, what I need to do is after the calculation terminates, every vertex need to output many lines. This could be big (for a dataset the output size is 400GB). I found only the TextVertexOuputFormat but it seems to output a single line per vertex. How should I achieve this? Thanks a lot! -- *JU Han* Software Engineer Intern @ KXEN Inc. UTC - Université de Technologie de Compiègne * **GI06 - Fouille de Données et Décisionnel* +33 0619608888