public interface VertexProgram<M> extends Cloneable
VertexProgram
represents one component of a distributed graph computation. Each vertex in the graph
(logically) executes the VertexProgram
instance in parallel. The collective behavior yields
the computational result. In practice, a "worker" (i.e. task, thread, etc.) is responsible for executing the
VertexProgram against each vertex that it has in its vertex set (a subset of the full graph vertex set).
At minimum there is one "worker" for each vertex, though this is impractical in practice and GraphComputer
implementations that leverage such a design are not expected to perform well due to the excess object creation.
Any local state/fields in a VertexProgram is static to the vertices within the same worker set.
It is not safe to assume that the VertexProgram's "worker" state will remain stable between iterations.
Hence, the existence of workerIterationStart(org.apache.tinkerpop.gremlin.process.computer.Memory)
and workerIterationEnd(org.apache.tinkerpop.gremlin.process.computer.Memory)
.Modifier and Type | Interface and Description |
---|---|
static interface |
VertexProgram.Builder |
static interface |
VertexProgram.Features |
Modifier and Type | Field and Description |
---|---|
static String |
VERTEX_PROGRAM |
Modifier and Type | Method and Description |
---|---|
VertexProgram<M> |
clone()
When multiple workers on a single machine need VertexProgram instances, it is possible to use clone.
|
static <V extends VertexProgram> |
createVertexProgram(Graph graph,
org.apache.commons.configuration.Configuration configuration)
A helper method to construct a
VertexProgram given the content of the supplied configuration. |
void |
execute(Vertex vertex,
Messenger<M> messenger,
Memory memory)
This method denotes the main body of the computation and is executed on each vertex in the graph.
|
default Set<String> |
getElementComputeKeys()
The
Element properties that will be mutated during the computation. |
default VertexProgram.Features |
getFeatures() |
default Set<MapReduce> |
getMapReducers()
The set of
MapReduce jobs that are associated with the VertexProgram . |
default Set<String> |
getMemoryComputeKeys()
The
Memory keys that will be used during the computation. |
default Optional<MessageCombiner<M>> |
getMessageCombiner()
Combine the messages in route to a particular vertex.
|
Set<MessageScope> |
getMessageScopes(Memory memory)
This method returns all the
MessageScope possibilities for a particular iteration of the vertex program. |
GraphComputer.Persist |
getPreferredPersist() |
GraphComputer.ResultGraph |
getPreferredResultGraph() |
default void |
loadState(Graph graph,
org.apache.commons.configuration.Configuration configuration)
When it is necessary to load the state of the VertexProgram, this method is called.
|
void |
setup(Memory memory)
The method is called at the beginning of the computation.
|
default void |
storeState(org.apache.commons.configuration.Configuration configuration)
When it is necessary to store the state of the VertexProgram, this method is called.
|
boolean |
terminate(Memory memory)
The method is called at the end of each iteration to determine if the computation is complete.
|
default void |
workerIterationEnd(Memory memory)
This method is called at the end of each iteration of each "computational chunk."
The set of vertices in the graph are typically not processed with full parallelism.
|
default void |
workerIterationStart(Memory memory)
This method is called at the start of each iteration of each "computational chunk."
The set of vertices in the graph are typically not processed with full parallelism.
|
static final String VERTEX_PROGRAM
default void storeState(org.apache.commons.configuration.Configuration configuration)
configuration
- the configuration to store the state of the VertexProgram in.default void loadState(Graph graph, org.apache.commons.configuration.Configuration configuration)
graph
- the graph that the VertexProgram will run againstconfiguration
- the configuration to load the state of the VertexProgram from.void setup(Memory memory)
GraphComputer
and as such, is not called for each vertex.
During this stage, the Memory
should be initialized to to its "start state."memory
- The global memory of the GraphComputervoid execute(Vertex vertex, Messenger<M> messenger, Memory memory)
Memory
is read, it is according to the aggregated state yielded in the previous iteration.
When the Memory
is written, the data will be aggregated at the end of the iteration for reading in the next iteration.vertex
- the Vertex
to execute the VertexProgram
onmessenger
- the messenger that moves data between verticesmemory
- the shared state between all vertices in the computationboolean terminate(Memory memory)
GraphComputer
and as such, is not called for each Vertex
.
The Memory
maintains the aggregated data from the last execute() iteration.memory
- The global memory of the GraphComputer
default void workerIterationStart(Memory memory)
execute(org.apache.tinkerpop.gremlin.structure.Vertex, org.apache.tinkerpop.gremlin.process.computer.Messenger<M>, org.apache.tinkerpop.gremlin.process.computer.Memory)
method.
The default implementation is a no-op.memory
- The memory at the start of the iteration.default void workerIterationEnd(Memory memory)
execute(org.apache.tinkerpop.gremlin.structure.Vertex, org.apache.tinkerpop.gremlin.process.computer.Messenger<M>, org.apache.tinkerpop.gremlin.process.computer.Memory)
method.
The default implementation is a no-op.memory
- The memory at the end of the iteration.default Set<String> getElementComputeKeys()
Element
properties that will be mutated during the computation.
All properties in the graph are readable, but only the keys specified here are writable.
The default is an empty set.default Set<String> getMemoryComputeKeys()
Memory
keys that will be used during the computation.
These are the only keys that can be read or written throughout the life of the GraphComputer
.
The default is an empty set.default Optional<MessageCombiner<M>> getMessageCombiner()
Set<MessageScope> getMessageScopes(Memory memory)
MessageScope
possibilities for a particular iteration of the vertex program.
The returned messages scopes are the scopes that will be used to send messages during the stated iteration.
It is not a requirement that all stated messages scopes be used, just that it is possible that they be used during the iteration.memory
- an immutable form of the Memory
default Set<MapReduce> getMapReducers()
MapReduce
jobs that are associated with the VertexProgram
.
This is not necessarily the exhaustive list over the life of the GraphComputer
.
If MapReduce jobs are declared by GraphComputer.mapReduce(), they are not contained in this set.
The default is an empty set.MapReduce
jobs associated with this VertexProgram
VertexProgram<M> clone()
storeState(org.apache.commons.configuration.Configuration)
and loadState(org.apache.tinkerpop.gremlin.structure.Graph, org.apache.commons.configuration.Configuration)
model.
The default implementation simply returns the object as it assumes that the VertexProgram instance is a stateless singleton.GraphComputer.ResultGraph getPreferredResultGraph()
GraphComputer.Persist getPreferredPersist()
static <V extends VertexProgram> V createVertexProgram(Graph graph, org.apache.commons.configuration.Configuration configuration)
VertexProgram
given the content of the supplied configuration.
The class of the VertexProgram is read from the VERTEX_PROGRAM
static configuration key.
Once the VertexProgram is constructed, loadState(org.apache.tinkerpop.gremlin.structure.Graph, org.apache.commons.configuration.Configuration)
method is called with the provided graph and configuration.V
- The vertex program typegraph
- The graph that the vertex program will execute againstconfiguration
- A configuration with requisite information to build a vertex programdefault VertexProgram.Features getFeatures()
Copyright © 2013–2015 Apache Software Foundation. All rights reserved.