Category: Pentaho+
-
Building Enterprise Data Lineage & Provenance
Data lineage is crucial for data management and governance, tracking data from its origin to destination. It ensures quality control and compliance, benefiting data engineers…
-
Building OpenAI GPT Assistant Framework with Pentaho
This blog post discusses building an OpenAI GPT Assistant Framework using Pentaho Data Integration. It covers the integration of OpenAI GPT…
-
Working with Pentaho Carte Server
This post dives into the use of Pentaho Carte, a lightweight web server for running and managing remote Pentaho data transformations and jobs. The post…
-
Connect to MarkLogic database using Pentaho DI
MarkLogic is a NoSQL database that allows third-party tools to connect using its REST API. This blog aims to explain how to connect to…
-
Load Balancing across Slaves in Pentaho Data Integration
Kettle Load Balancing Framework – A sample introduction
-
Loading Data from S3 to Redshift | Pentaho Data Integration
The blog details steps for loading data from Amazon S3 to Redshift using PDI. The process includes creating a table in the Redshift cluster, executing…
-
Loading Data to AWS S3 Bucket | Pentaho Data Integration
Loading large volumes of data into Amazon Redshift using Pentaho may initially present performance issues due to Redshift treating each data row as a separate…
-
Open Source BI Stack
The use of data by people and businesses around the world is on the rise. Almost everyone involved in work is nowadays looking for a…
-
Setting up Amazon Redshift Cluster and accessing using Pentaho Kettle
Amazon Redshift is a fully managed and highly scalable data warehouse service in the cloud. You can start with a few hundred GB of data and scale…
-
Stream Data from Twitter API with OAuth using Kettle
Streaming data from the Twitter API is really important from a data analytics perspective. Getting the pulse of your user community on the web and across…