Category: Pentaho
-

Stream Data from Twitter API with OAuth using Kettle
Streaming data from Twitter Api is really important from the data analytic perspective. Getting the pulse of your user community on the web and across…
-

TRANSFORM CONSTANTS in Modified JavaScript | Pentaho Data Integration
In Modified JavaScript Step in Pentaho, you will notice that there are few pre-defined variable or constants defined. Check the image. What is interesting here…
-

Partitioning in Pentaho Data Integration
[Update 2023]: The Steps mentioned in this post is also applicable for the latest versions Pentaho Data Integration (version 8, 9 and above). Let us…
-

Hadoop Wordcount using Pentaho Data Integration/Kettle
This blog demonstrate the use of big data and Hadoop using Pentaho Data Integration. I will explain the basic hadoop-wordcount example using PDI. Prerequisite Steps…
-

Passing Parameters to Kettle Using Java | Pentaho Data Integration
The blog post provides instructions on how to pass parameters in Pentaho Data Integration/Kettle, both in transformations (.ktr) and jobs (.kjb) using a Java code.…
-

Inserting XML Node into a XML Source Data using Pentaho Data Integration | Handling Complex XML Structures
Inserting a new xml node into a complex XML data source will fail with the approach provided in my previous blog. This is because handling…
-

Inserting XML Node into a XML Source Data using Pentaho Data Integration
Let us suppose, we have a XML data source as below: Now if we want to insert a new XML Node in between the <Node></Node>…
-

Using External Jars import in Pentaho Data Integration
[Update 2023]: This blog is now applicable for the older versions of Pentaho Data Integration. Pentaho version 8, 9 and above are not applicable and…
-

Java and Pentaho Kettle | Executing a Kettle File using Java
The blog provides steps for executing transformation files in Java using Pentaho Data Integration. It guides through creating Maven Project, adding dependencies, creating a sample…
-

Special Character Remover | version 1.1.0 | Pentaho Kettle Step Plugin
This is a blog on an upgraded version of the Special Character Remover Pentaho Kettle Plugin. Please read the Version 1.0.0 of this plugin before…
-

Special Character Remover | Clean your data of special characters | Pentaho Kettle Step Plugin
Problem Statement When handling data especially in a data warehousing environment, developers tends to face serious issues with the data quality issue. Though there are…
-

“Copy Data” and “Distribute Data” in Pentaho Data Integration (Kettle)
In Pentaho DI, the data flow direction is denoted by “Hops.” Data movement can occur in either “Copy Data” or “Distribute Data” modes. Copy Data…
