DataFactory: Problem using DatabricksSparkPython activity in pipeline #8596

Closed
maximauro opened this issue Nov 12, 2019 · 4 comments
Labels
customer-reported: Issues that are reported by GitHub users external to the Azure organization.
Data Factory
Mgmt: This issue is related to a management-plane library.
Service Attention: Workflow: This issue is responsible by Azure service team.

Comments

@maximauro

Hello,

When I try to create a new pipeline in DataFactory with a DatabricksSparkPython activity, I get the following error:

Subtype value DatabricksSparkPython has no mapping, use base class Activity.
Traceback (most recent call last):
  File "/usr/lib/python3.7/runpy.py", line 193, in _run_module_as_main
    "__main__", mod_spec)
  File "/usr/lib/python3.7/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/usr/local/deployer/main.py", line 5, in <module>
    DataDeployer().deploy_all()
  File "/usr/local/deployer/data_deployer.py", line 37, in deploy_all
    adf.deploy_all_to_adf()
  File "/usr/local/deployer/services/adf_service.py", line 65, in deploy_all_to_adf
    self.deploy_all_pipelines(adf_client)
  File "/usr/local/deployer/services/adf_service.py", line 120, in deploy_all_pipelines
    self.deploy_pipeline(adf_client, entry.path)
  File "/usr/local/deployer/services/adf_service.py", line 134, in deploy_pipeline
    p = adf_client.pipelines.create_or_update(rg_name, df_name, p_name, pipeline_definition)
  File "/usr/lib/python3.7/site-packages/azure/mgmt/datafactory/operations/pipelines_operations.py", line 163, in create_or_update
    body_content = self._serialize.body(pipeline, 'PipelineResource')
  File "/usr/lib/python3.7/site-packages/msrest/serialization.py", line 578, in body
    raise errors[0]
  File "/usr/lib/python3.7/site-packages/msrest/serialization.py", line 220, in validate
    Serializer.validate(value, debug_name, **self._validation.get(attr_name, {}))
  File "/usr/lib/python3.7/site-packages/msrest/serialization.py", line 661, in validate
    raise ValidationError("required", name, True)
msrest.exceptions.ValidationError: Parameter 'Activity.type' can not be None.
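
Editor's note: the warning and the final error seem to be two halves of one failure. The sketch below is a simplified, standard-library-only model of that behaviour, not msrest's actual code. The class layout, `subtype_map`, `classify`, and `validate` are all hypothetical names. It assumes the installed SDK resolves the activity's `type` string to a model class, falls back to the base class when the string is unknown, and that the base class leaves `type` unset.

```python
import logging

logger = logging.getLogger(__name__)


class Activity:
    """Base class: 'type' is required, but only subclasses fill it in."""

    # Discriminator value -> subclass name. Hypothetical stand-in for an
    # older SDK release that does not know DatabricksSparkPython yet.
    subtype_map = {"Copy": "CopyActivity"}

    def __init__(self, name):
        self.name = name
        self.type = None  # the base class leaves the discriminator unset


class CopyActivity(Activity):
    def __init__(self, name):
        super().__init__(name)
        self.type = "Copy"


KNOWN_CLASSES = {"CopyActivity": CopyActivity}


def classify(data):
    """Pick the model class for a raw activity dict, falling back to the base."""
    subtype = data.get("type")
    class_name = Activity.subtype_map.get(subtype)
    if class_name is None:
        logger.warning(
            "Subtype value %s has no mapping, use base class Activity.", subtype
        )
        return Activity(data["name"])
    return KNOWN_CLASSES[class_name](data["name"])


def validate(activity):
    """Raise if the required 'type' attribute is missing."""
    if activity.type is None:
        raise ValueError("Parameter 'Activity.type' can not be None.")
    return activity
```

Under these assumptions, `validate(classify({"name": "Python Hello Test", "type": "DatabricksSparkPython"}))` raises, while the same call with `"type": "Copy"` passes. That would be consistent with the problem lying in the installed SDK version and not in the pipeline JSON.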

Here is my pipeline configuration:

{
  "name": "Pipeline Test",
  "type": "Microsoft.DataFactory/factories/pipelines",
  "properties": {
    "activities": [
      {
        "name": "Python Hello Test",
        "description": "test",
        "type": "DatabricksSparkPython",
        "typeProperties": {
          "pythonFile": "dbfs:/jobs/test/hello.py"
        },
        "linkedServiceName": {
          "referenceName": "lsName",
          "type": "LinkedServiceReference"
        }
      }
    ]
  }
}
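
Editor's note: a pre-flight check on a definition like this one could surface the unsupported activity before the SDK call fails. The following is a minimal sketch using only the standard library. `unsupported_activities` is a hypothetical helper, and the `supported` set is an assumed, hand-maintained list, not something read from the SDK.

```python
import json


def unsupported_activities(pipeline, supported_types):
    """Return (name, type) pairs for activities whose type is not supported."""
    activities = pipeline.get("properties", {}).get("activities", [])
    return [
        (act.get("name"), act.get("type"))
        for act in activities
        if act.get("type") not in supported_types
    ]


# Trimmed copy of the pipeline definition from this issue.
pipeline = json.loads("""
{
  "name": "Pipeline Test",
  "properties": {
    "activities": [
      {"name": "Python Hello Test", "type": "DatabricksSparkPython"}
    ]
  }
}
""")

# Hypothetical set of types the installed SDK is assumed to understand.
supported = {"Copy", "DatabricksNotebook"}

for name, kind in unsupported_activities(pipeline, supported):
    print(f"{name}: activity type {kind!r} is not in the supported set")
```

Running such a check in the deployer before `create_or_update` would give a clearer message than the serializer's `Activity.type` error, at the cost of keeping the supported set up to date by hand.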

Could someone help me with a solution?

Thank you!

@kaerm added the customer-reported, Data Factory, Mgmt, and Service Attention labels Nov 12, 2019
@kaerm
Contributor

kaerm commented Nov 12, 2019

@maximauro thanks for reporting this. I'm tagging the right team to take a look.

@maximauro
Author

Here is my Pipfile:

[[source]]
name = "pypi"
url = "https://pypi.org/simple"
verify_ssl = true

[dev-packages]
auto-changelog = "*"
coverage = "*"
flake8 = "*"
pylint = "*"
pytest = "*"
pytest-cov = "*"
python-githooks = "*"
python-semantic-release = "*"
flask = "*"

[packages]
azure-common = "*"
azure-mgmt = "*"
msrest = "*"
requests = "*"
cffi = "*"
envsubst = "*"

[requires]
python_version = "3.7"

The latest azure-mgmt version (v4.0.0) includes azure-mgmt-datafactory v0.6.0, which is not the latest.
The latest version of azure-mgmt-datafactory is v0.8.0, but its release notes do not mention that the DatabricksSparkPython activity has been implemented...

@maximauro
Author

Upgrading azure-mgmt-datafactory to v0.8.0 solved the problem.
However, I had to remove the azure-mgmt package from my dependencies to do so.
Closing the issue ;)
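
Editor's note: to catch this kind of mismatch earlier, a deployer could check the installed package version at startup. The following is a rough sketch under stated assumptions. The helper names are hypothetical, the parser only handles plain dotted versions such as `0.8.0` (not pre-release tags), and `importlib.metadata` needs Python 3.8 or newer, so on the Python 3.7 used in this thread `installed_version` would simply return `None`.

```python
def parse_version(text):
    """Turn a plain dotted version such as '0.8.0' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def meets_minimum(installed, minimum="0.8.0"):
    """True if the installed version is at least the minimum version."""
    return parse_version(installed) >= parse_version(minimum)


def installed_version(package="azure-mgmt-datafactory"):
    """Return the installed version string, or None if it cannot be found."""
    try:
        from importlib.metadata import PackageNotFoundError, version  # 3.8+
    except ImportError:
        return None
    try:
        return version(package)
    except PackageNotFoundError:
        return None


print(meets_minimum("0.6.0"))  # False: the version bundled with azure-mgmt 4.0.0
print(meets_minimum("0.8.0"))  # True: the version that resolved this issue
```

A deployer could call `installed_version()` once at startup and stop with a clear message when `meets_minimum` returns `False`.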

@ghost

ghost commented Nov 13, 2019

Thanks for working with Microsoft on GitHub! Tell us how you feel about your experience using the reactions on this comment.

@github-actions github-actions bot locked and limited conversation to collaborators Apr 12, 2023