public class CheckpointTupleForwarder extends BaseStatefulBoltExecutor
IRichBolt
and forwards checkpoint tuples in a stateful topology.
When a storm topology contains one or more IStatefulBolt
all non-stateful bolts are wrapped in CheckpointTupleForwarder
so that the checkpoint tuples can flow through the entire topology DAG.
BaseStatefulBoltExecutor.AnchoringOutputCollector
collector
Constructor and Description |
---|
CheckpointTupleForwarder(IRichBolt bolt) |
Modifier and Type | Method and Description |
---|---|
void |
cleanup()
Called when an IBolt is going to be shutdown.
|
void |
declareOutputFields(OutputFieldsDeclarer declarer)
Declare the output schema for all the streams of this topology.
|
Map<String,Object> |
getComponentConfiguration()
Declare configuration specific to this component.
|
protected void |
handleCheckpoint(Tuple checkpointTuple,
CheckPointState.Action action,
long txid)
Forwards the checkpoint tuple downstream.
|
protected void |
handleTuple(Tuple input)
Hands off tuple to the wrapped bolt to execute.
|
void |
prepare(Map<String,Object> topoConf,
TopologyContext context,
OutputCollector outputCollector)
Called when a task for this component is initialized within a worker on the cluster.
|
declareCheckpointStream, execute, init
public CheckpointTupleForwarder(IRichBolt bolt)
public void prepare(Map<String,Object> topoConf, TopologyContext context, OutputCollector outputCollector)
IBolt
This includes the:
topoConf
- The Storm configuration for this bolt. This is the configuration provided to the topology merged in with cluster
configuration on this machine.context
- This object can be used to get information about this task's place within the topology, including the task id and
component id of this task, input and output information, etc.outputCollector
- The collector is used to emit tuples from this bolt. Tuples can be emitted at any time, including the prepare and
cleanup methods. The collector is thread-safe and should be saved as an instance variable of this bolt object.public void cleanup()
IBolt
Config.SUPERVISOR_WORKER_SHUTDOWN_SLEEP_SECS
setting controls how long orderly shutdown is allowed to take.
There is no guarantee that cleanup will be called if shutdown is not orderly, or if the shutdown exceeds the time limit.
The one context where cleanup is guaranteed to be called is when a topology is killed when running Storm in local mode.
public void declareOutputFields(OutputFieldsDeclarer declarer)
IComponent
declarer
- this is used to declare output stream ids, output fields, and whether or not each output stream is a direct streampublic Map<String,Object> getComponentConfiguration()
IComponent
TopologyBuilder
protected void handleCheckpoint(Tuple checkpointTuple, CheckPointState.Action action, long txid)
handleCheckpoint
in class BaseStatefulBoltExecutor
checkpointTuple
- the checkpoint tupleaction
- the action (prepare, commit, rollback or initstate)txid
- the transaction id.protected void handleTuple(Tuple input)
Right now tuples continue to get forwarded while waiting for checkpoints to arrive on other streams after checkpoint arrives on one of the streams. This can cause duplicates but still at least once.
handleTuple
in class BaseStatefulBoltExecutor
input
- the input tupleCopyright © 2023 The Apache Software Foundation. All rights reserved.