Skip to Content
Back to templates
// Template · Governance

Database Impact Report

Scans every pipeline and workflow in your project using Apache Hop's built-in RDBMS Impact transform, then writes a structured report to a PostgreSQL table — so you know exactly which pipelines read from or write to a given table before you drop it, rename it, or migrate it.

Uses RDBMS Impact TableOutput
Governance Data Quality DevOps 4 transforms Apache Hop Native only
Download .zip
Apache Hop pipeline
Created May 12, 2026
Hop 2.x+
Pipeline
Hop
RDBMS-Impact
RDBMS_IMPACT
filter-nulls
FilterRows
select-fields
SelectValues
write-report
TableOutput
How it works

This pipeline uses Apache Hop's native RDBMS Impact transform to introspect your entire project — no manual auditing, no grepping through XML files. It finds every database reference across all pipelines and workflows, filters out irrelevant rows, renames the fields for clarity, and writes a clean report to a PostgreSQL table you can query any time.

No Putki plugins required. This template runs entirely on Apache Hop's built-in transforms — RDBMS Impact, FilterRows, SelectValues, and TableOutput. A Putki subscription gives you the scheduler and monitoring, but the pipeline itself needs no extra plugins.
1
Scan every pipeline and workflow in the project
The RDBMS Impact transform reads all .hpl and .hwf files under ${PROJECT_HOME} and emits one row per database reference it finds — covering TableInput, TableOutput, OdooInput, and any other transform that touches a database connection.
2
Drop rows with no table reference
Many transforms reference a connection without targeting a specific table. FilterRows discards any row where table is null or empty, keeping the report focused on actual table-level dependencies.
3
Select and rename fields for the report
SelectValues picks the eight relevant fields and renames them for clarity: filename becomes pipeline, schema becomes schema_name, table becomes table_name. The raw column field is dropped — too granular for a table-level impact report.
4
Write the report to PostgreSQL
TableOutput truncates and rewrites ${OUTPUT_TABLE} on every run so the report always reflects the current state of the project. Fields are mapped by name automatically. Create the table first using the DDL in rdbms-impact-report.sql.
Output fields
FieldDescription
connectionName of the database connection defined in the Hop project metadata
schema_nameDatabase schema (e.g. public), if specified in the transform
table_nameThe target table or view being read from or written to
pipelineFull path of the .hpl or .hwf file that contains the reference
file_typeWhether the file is a Pipeline or a Workflow
item_nameName of the specific transform or action inside the pipeline
item_typeTransform type — e.g. TableInput, TableOutput, OdooInput
sql_queryThe SQL query configured in the transform, when available
What you need
·
Apache Hop 2.x+
The RDBMS Impact transform is included in Hop 2.x and above. No additional plugins required.
·
A PostgreSQL database connection
Configure a connection in your Hop project metadata, or update the write-report transform to point to your connection.
·
The report table created in advance
Run the DDL in rdbms-impact-report.sql (included in the download) to create the table before the first execution.
Parameters
OUTPUT_TABLE
Target table where the report rows are written
default: rdbms_impact_report
REPORT_DB
Database connection name where the report table lives
default: apachehop
Details
Type
Pipeline

Files
database-impact-report.hpl
rdbms-impact-report.sql

Transforms
4

Putki version
Any (native Hop only)

Created
May 12, 2026
Use case
Audit your entire Hop project for database dependencies. Run before dropping or renaming a table, migrating a connection, or changing a schema — and know exactly what breaks before it does.
When to run
On demand, before any schema change or migration. Optionally schedule weekly to keep an up-to-date snapshot in the report table.