Supported tool types
Data Management supports four tool types:
All tools extend net.redpoint.dataflow.transform.Tool
, and implement the interface appropriate to their tool type.
Input tool
Input tools implement the net.redpoint.dataflow.transform.InputTool
interface. Input tools are record producers. Typically, they read records from an external source (files, databases, queues, or APIs) and send them to a single output connector. Input tools can also generate records algorithmically and independent of any external source; for example, they can generate random numbers, primary keys, or mock data for testing.
The InputTool
interface is reactive, meaning that its nextRecord()
method will be called by RPDM whenever the next record is wanted.
Output tool
Output tools implement the net.redpoint.dataflow.transform.OutputTool
interface. Output tools are record consumers. They generally read records from a single connector and write records to a sink such as a file, database, queue, or web service. Output tools can also consume records without emitting them to a sink.
The OutputTool
interface is reactive, meaning that its nextRecord()
method will be called by RPDM whenever the next record is available for output.
Transform tool
Transform tools implement the net.redpoint.dataflow.transform.TransformTool
interface. They accept an input record, transform that record (or more accurately, a copy of that record) in some way, and emit an output record.
The TransformTool
interface is reactive, meaning that its nextRecord()
method will be called by RPDM with an input and output record, whenever the next input record is available.
General tool
General tools implement the net.redpoint.dataflow.transform.GeneralTool
interface. They follow a more procedural model, running in their own thread, reading records from any number of inputs, and writing records to any number of outputs.
Implementing a general tool tends to be more complicated; they should only be used when the desired functionality can't be easily achieved by an input, output, or transform tool.
The GeneralTool
interface is proactive or procedural, meaning that its run()
method will completely process all input and output records before returning. The run()
method is also "in charge" as opposed to responding to RPDM's callbacks.
Because of the GeneralTool
's procedural model, there are cases where it is easier to implement GeneralTool
than the other tool types. For example, if you are using a SAX-based XML parser or other library that expects your code to be reactive, you may find it difficult to marry the reactive requirements of RPDM and the library.