Data Lineage

Automated SQL Lineage

Blindata SQL Lineage helps you effortlessly track and manage data movements within your database. The SQL Lineage module uses schema metadata and extracted SQL statements to infer data flows and transformations, including standard database objects such as views and routines, query logs, and scripts generated by ELT tools.


With Blindata SQL Lineage, you can easily manage and monitor your data movements, ensuring adherence to regulations and mitigating data security risks. The software module is user-friendly and highly configurable, making it easy to customize to your unique needs. Plus, being open API compliant allows you to easily integrate Blindata SQL Lineage into your existing tech stack.


The automated SQL parser generates a SQL syntax tree displaying data flows and transformations in statements. Then it further simplifies this representation by removing transformations, creating a lineage graph connecting only tables and columns.

With drill-down capabilities embedded in the lineage visualization, you can quickly find out the script or routine that generates a given dataflow or analyze transformation details with just a few clicks.

Blindata provides automated SQL parsing through its SQL Lineage module. The SQL Lineage module uses schema metadata and SQL statements to build the SQL syntax tree, which displays the exact data flows and transformations within the statements.

This allows users to easily understand the data flows and transformations within their SQL statements.

Blindata also offers automatic, manual, and markup comment-driven approaches for analyzing SQL scripts. With these strategies, Blindata can effectively identify SQL statements in scripts. Users can choose to either generate data flows at the catalog level instantly or review each routine or statement manually, depending on their unique needs. This flexibility ensures that users can select the most suitable approach for their requirements.

The complete lineage representation offers an exhaustive perspective of data lineage, encompassing all the alterations carried out in a SQL statement. Such a representation can enhance the comprehensibility of convoluted statements and expedite problem-solving. On the other hand, the compressed lineage representation prioritizes dependencies by eliminating transformations and constructing a lineage graph that links tables and columns. Such a representation offers a more lucid view of data lineage, streamlines the analysis process, and enables quicker identification of issues that require attention.

Thanks to its extensible preprocessing rules, Blindata’s parser can effortlessly handle uncommon keywords and vendor-specific syntax, delivering a personalized experience that perfectly aligns with your specific needs.

Blindata prioritizes transparency, providing users with access to all aspects of the analysis process, including configurable metadata crawling, syntax tree reconstruction, and data lineage generation. Our approach ensures that all results, successful or not, are provided to our users, giving them complete control over their data and the ability to make informed decisions. With Blindata, users can trust that they have access to advanced tools to optimize their analysis process and get the most value from their metadata.

The SQL Lineage module offers transparent analysis with inspectable outputs and seamless integration with external tools through REST API.

How to

Data Catalog Crawling

Extraction and loading of schema information: tables and columns definitions.

Analyze Views

Analysis of views definitions through SQL automatic parsing.

Analyze Procedures and Query Log

Analysis of routines and sql statements form executions logs, through automatic query parsing.

Augment data lineage

Fill the gap between different systems through the link between different data sources.