Azure Purview — Cataloging Delta Lake Assets using Apache Atlas API
Azure Purview, one of the latest tools from Microsoft, helps govern a customer's Data Lake and integrates well with various Azure services. Because it supports the Apache Atlas API, the same data governance service can also be extended to non-Azure components. In my earlier blog, we saw how to use the API to catalog Apache Hive assets and capture their lineage. In this blog, we'll see how to register Delta Lake assets in Purview.
Scanning Azure Data Lake with Purview already identifies the schema of Delta Lake tables. A few screenshots are shown below.
Although this is sufficient for most cases, there may be specific use cases where we need to take advantage of Delta Lake metadata to catalog Delta assets explicitly and to store their lineage information. To achieve this, we need to create a new type…
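To make the idea of a new type more concrete, here is a minimal sketch of what an Atlas entity type definition payload could look like. The type name `delta_table` and the attributes `location`, `format`, and `partition_columns` are hypothetical choices for illustration, not the names used later in this blog. The sketch only builds the JSON body and makes no network call. With the Apache Atlas v2 REST API, such a body is normally sent with a POST to the `/api/atlas/v2/types/typedefs` path; the host name and authentication depend on your Purview account and are left out here.

```python
import json


def make_attribute(name, type_name="string", optional=True):
    """Build one Atlas attribute definition with conservative defaults."""
    return {
        "name": name,
        "typeName": type_name,
        "isOptional": optional,
        "cardinality": "SINGLE",
        "valuesMinCount": 0 if optional else 1,
        "valuesMaxCount": 1,
        "isUnique": False,
        "isIndexable": False,
    }


def build_delta_table_typedef(type_name="delta_table"):
    """Return a typedefs payload holding a single custom entity type.

    The type name and attribute names are assumptions for this sketch.
    Deriving from the built-in Atlas type "DataSet" lets the new type
    take part in lineage as a process input or output.
    """
    entity_def = {
        "category": "ENTITY",
        "name": type_name,
        "description": "Delta Lake table (illustrative custom type)",
        "typeVersion": "1.0",
        "superTypes": ["DataSet"],
        "attributeDefs": [
            make_attribute("location"),           # hypothetical: storage path of the table
            make_attribute("format"),             # hypothetical: e.g. "delta"
            make_attribute("partition_columns"),  # hypothetical: comma-separated column names
        ],
    }
    return {"entityDefs": [entity_def]}


if __name__ == "__main__":
    # Print the JSON body that would be submitted to the typedefs endpoint.
    print(json.dumps(build_delta_table_typedef(), indent=2))
```

Keeping the payload construction separate from the HTTP call makes it easy to inspect or version the type definition before registering it in Purview.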