You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
For a PoC, we are testing the Lineage Connector and are aiming to create lineage between Databricks Notebooks and external tables stored in ADLS, basically similar to the lineage graph that's shown in the readme (See image below).
Our goal is to make this connection between ADLS -> Notebook -> ADLS, but we are not able to consistently get this. In most cases, the lineage graph will be as shown in the screenshot below, where the output is the dummy entity. In one attempt, we created the table, then ran the ADLS scan so the Resource Set can be found, and then we ran the script again, which transformed the dummy entity to the ADLS resource set. Unfortunately, after trying out several times after that, we have not been able to reproduce that result. Below is a simple example for what we're trying to achieve.
We do see that there's an open PR related to this #69 , but this has apparently not been updated for a couple of months, so we're wondering what's the progress on this.
Alternatively, if you know how to achieve our goal in a different way, we are happy to hear it.
To Reproduce
Steps to reproduce the behavior:
Create a .csv file with just some random integers as keys and store this in ADLS.
Run below code in Notebook %sql CREATE TABLE IF NOT EXISTS customer_keys (key bigint) LOCATION 'abfss://<...>@<..>.dfs.core.windows.net/customer_keys
%sql INSERT INTO customer_keys SELECT * FROM new_keys
3. Run ADLS scan in Purview so the customer_keys asset is found as ADLS asset
4. Run the code above again
Expected behavior
Expected is that the dummy entity is replaced by the ADLS asset, as we have achieved once, but unfortunately have not been able to reproduce.
Desktop (please complete the following information):
@svnnl thank you for your interest in the solution accelerator!
Just to be sure, customer_keys is a Delta table? That would be the default for DBR 11.3 but I wanted to confirm with you.
We are working on a release that improves the matching to existing assets and it should be available by Monday.
If you wanted to try it out, please see the PR #124 and you could deploy the azure function via VS code to your existing azure function application and see if the newest changes corrects your scenario.
Describe the bug
![image](https://user-images.githubusercontent.com/8983696/204802058-dedcda13-60fa-420f-967a-ec8bc285d08f.png)
For a PoC, we are testing the Lineage Connector and are aiming to create lineage between Databricks Notebooks and external tables stored in ADLS, basically similar to the lineage graph that's shown in the readme (See image below).
Our goal is to make this connection between ADLS -> Notebook -> ADLS, but we are not able to consistently get this. In most cases, the lineage graph will be as shown in the screenshot below, where the output is the dummy entity. In one attempt, we created the table, then ran the ADLS scan so the Resource Set can be found, and then we ran the script again, which transformed the dummy entity to the ADLS resource set. Unfortunately, after trying out several times after that, we have not been able to reproduce that result. Below is a simple example for what we're trying to achieve.
We do see that there's an open PR related to this #69 , but this has apparently not been updated for a couple of months, so we're wondering what's the progress on this.
Alternatively, if you know how to achieve our goal in a different way, we are happy to hear it.
To Reproduce
Steps to reproduce the behavior:
%sql CREATE TABLE IF NOT EXISTS customer_keys (key bigint) LOCATION 'abfss://<...>@<..>.dfs.core.windows.net/customer_keys
new_keys = spark.read.format('csv').option('header', 'true').load('abfss://<...>@<..>.dfs.core.windows.net/new_keys.csv') new_keys.createOrReplaceTempView('new_keys')
%sql INSERT INTO customer_keys SELECT * FROM new_keys
3. Run ADLS scan in Purview so the customer_keys asset is found as ADLS asset
4. Run the code above again
Expected behavior
Expected is that the dummy entity is replaced by the ADLS asset, as we have achieved once, but unfortunately have not been able to reproduce.
Desktop (please complete the following information):
The text was updated successfully, but these errors were encountered: