[SPARK-30627][SQL] Disable all the V2 file sources by default #27348
@@ -1728,7 +1728,7 @@ object SQLConf {
       "implementation class names for which Data Source V2 code path is disabled. These data " +
       "sources will fallback to Data Source V1 code path.")
     .stringConf
-    .createWithDefault("kafka")
+    .createWithDefault("kafka,parquet,orc,json,csv,text,avro")
Shall we set this alphabetically at this time, @gengliangwang ?
ok, sure
+1, LGTM. (Pending Jenkins. Hopefully, we didn't miss anything in test suites)
Test build #117326 has finished for PR 27348 at commit
LGTM pending Jenkins
Test build #117327 has finished for PR 27348 at commit
@@ -86,6 +87,11 @@ class FileDataSourceV2FallBackSuite extends QueryTest with SharedSparkSession {
   private val dummyReadOnlyFileSourceV2 = classOf[DummyReadOnlyFileDataSourceV2].getName
   private val dummyWriteOnlyFileSourceV2 = classOf[DummyWriteOnlyFileDataSourceV2].getName

+  override protected def sparkConf: SparkConf =
indentation?
Updated, thanks!
Test build #117329 has finished for PR 27348 at commit
+1, LGTM again. Thank you, @gengliangwang and @gatorsmile .
All tests passed and the last commit is only indentation changes.
Merged to master.
Test build #117334 has finished for PR 27348 at commit
What changes were proposed in this pull request?
Disable all the V2 file sources in Spark 3.0 by default.
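For illustration only, here is a minimal sketch of how a comma-separated fallback list like the one in the `SQLConf` diff might be consulted. The object `V1FallbackSketch` and its methods are hypothetical and are not Spark's actual lookup code; only the two default strings are taken from the diff.

```scala
import java.util.Locale

// Hypothetical helper, NOT Spark's implementation: models how a comma-separated
// "use V1" list could decide which code path a data source takes.
object V1FallbackSketch {
  // Default values copied from the diff (before and after this PR).
  val oldDefault = "kafka"
  val newDefault = "kafka,parquet,orc,json,csv,text,avro"

  // Split the config value into a set of normalized (lower-case, trimmed) names.
  def parseList(confValue: String): Set[String] =
    confValue
      .toLowerCase(Locale.ROOT)
      .split(",")
      .map(_.trim)
      .filter(_.nonEmpty)
      .toSet

  // True when the named source is listed, i.e. it would fall back to the V1 path.
  def usesV1(source: String, confValue: String): Boolean =
    parseList(confValue).contains(source.toLowerCase(Locale.ROOT))

  def main(args: Array[String]): Unit = {
    Seq("parquet", "orc", "kafka").foreach { s =>
      val before = usesV1(s, oldDefault)
      val after = usesV1(s, newDefault)
      println(s"$s: V1 with old default = $before, V1 with new default = $after")
    }
  }
}
```

Under this model, a file source such as `parquet` takes the V2 path with the old default and the V1 path with the new one, while `kafka` uses V1 in both cases.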
Why are the changes needed?
There are still some missing parts in the file source V2 framework:
FileSourceScanExec. This requires another patch in the data source V2 framework. Tracked by SPARK-30362.
As we are going to code freeze on Jan 31st, this PR proposes to disable all the V2 file sources in Spark 3.0 by default.
Does this PR introduce any user-facing change?
No
How was this patch tested?
Existing tests.