-
-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create subsets of pyarrow package with pyarrow-core < pyarrow < pyarrow-all and update to Arrow v16.0.0 #1255
Conversation
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( |
I had thought the idea was to just let |
From the discussions and the conversations I've had with @jorisvandenbossche I thought we wanted to provide both Currently the error on
I see your point and I am ok with both approaches, probably I would agree that 2 is a better future approach:
@jorisvandenbossche what are your thoughts? |
I haven't yet looked in detail, but some quick drive-by thoughts on the package naming:
A potential alternative is something in between:
|
Currently we can't import pyarrow without libparquet:
I've opened apache/arrow#39006 |
I think some degree of breakage is unavoidable if we want to introduce a minimal pyarrow and default to it at some point. The question is whether the error messages that people encounter would be self-explanatory enough to trivially fix things.
If we really want to have not just It wouldn't be the first package with such a setup though, for example ray has So I guess it'd be reasonable to do |
Hi guys! So how's this progressing? 👍 |
I am currently working on releasing Arrow 15.0.0 which includes a fix to be able to extract |
c5a9ac2
to
ecf8418
Compare
Hi! This is the friendly automated conda-forge-linting service. I wanted to let you know that I linted all conda-recipes in your PR ( Here's what I've got... For recipe:
|
Hi! This is the friendly automated conda-forge-linting service. I just wanted to let you know that I linted all conda-recipes in your PR ( |
@jorisvandenbossche @h-vetinari I have rebased this PR with 15.0.0 so
I think this is good for a review at the moment. |
7f62f77
to
5958885
Compare
@jorisvandenbossche @raulcd I think we need to have a discussion about this on the Arrow mailing lists before we push this change. The Arrow community currently has little awareness that this change is planned. I worry that there could be much surprise and frustration if we do not inform the community in advance. |
@ianmcook I think this is a valid concern but the only affected users will be those using
We could add a patch like: diff --git a/python/pyarrow/flight.py b/python/pyarrow/flight.py
index b183690..12eef59 100644
--- a/python/pyarrow/flight.py
+++ b/python/pyarrow/flight.py
@@ -65,5 +65,6 @@ try:
)
except ImportError as exc:
raise ImportError(
- f"The pyarrow installation is not built with support for 'flight' ({str(exc)})"
+ "The 'pyarrow' installation is not built with support for " +
+ f"'flight'. Please install the conda-forge 'pyarrow-all' package. ({str(exc)})"
) from None and users will get prompted something like:
to be even more explicit. We can send an email to the mailing list in order to give a heads up but in my opinion we could merge. We did the split of |
Thanks @raulcd — a message to the user@ and dev@ mailing lists would be great. The patch to give a more helpful error message would also be great. |
5958885
to
3c6cc03
Compare
@h-vetinari I was having a chat with @xhochy at PyCon DE and decided to add the Arrow update to v16.0.0 in the same PR so there is not a build without the split. I hope this is also good with you. |
That's exactly how I would have suggested doing it! :) |
2999bbe
to
6e21a4f
Compare
It seems we hit this issue: protocolbuffers/protobuf#14576 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's go!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is getting close but still needs a bit of work. I also want to do #1375 first to uncouple the aws-migrations from the arrow bump. I'm happy to rebase this PR on top afterwards (commit history needs some cleanup as well), and apply the necessary clean-ups.
Also, this hasn't been rerendered yet for the new outputs, which is going to further increase the rerender time and effectively break the bot from opening new migrations (see conda-forge/conda-forge-pinning-feedstock#5815). This is not an issue with this PR itself, but something we should fix beforehand, or at the very least ASAP afterwards.
b4195b8
to
031b47d
Compare
Hi! This is the friendly automated conda-forge-linting service. I was trying to look for recipes to lint for you, but it appears we have a merge conflict. Please ping the 'conda-forge/core' team (using the @ notation in a comment) if you believe this is a bug. |
031b47d
to
6166f4c
Compare
6166f4c
to
24fe27f
Compare
I've completely messed up with the rebase to update the commit history which has closed the PR. I've created a new PR here: #1376 |
@raulcd is |
The compute kernels itself ( |
(we have an issue about splitting the compute kernels into its own shared library (apache/arrow#25025), and that would allow it here to be installed separately as well, but until then I think the compute kernels are all included (or not, if one would disable building then) in the main libarrow.so) |
Checklist
0
(if the version changed)conda-smithy
(Use the phrase@conda-forge-admin, please rerender
in a comment in this PR for automated rerendering)As discussed in #1201 this PR tries to add a new
pyarrow-base
that only depends onlibarrow
andlibparquet
.