We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
I'm leveraging spark to download overture data from S3. Sedona is incompatible with recent spark versions.
Example
Psuedocode
import pyspark.sql.functions as F from pyspark.sql import SparkSession spark = SparkSession.builder.getOrCreate() OVERTURE_RELEASE = "2025-01-22.0" COUNTRY_CODES_OF_INTEREST = ["US", "GH"] SOURCE_DATA_URL = f"s3a://overturemaps-us-west-2/release/{OVERTURE_RELEASE}/theme=places/type=place" TARGET_TABLE = "my_catalog.my_schema.my_table" country_overlap_condition = F.arrays_overlap( F.col("addresses.country"), F.array(*[F.lit(x.upper()) for x in COUNTRY_CODES_OF_INTEREST]), ) source_df = ( spark.read.format("parquet") .load(SOURCE_DATA_URL) .filter(country_overlap_condition) .withColumn("_overture_release_version", F.lit(OVERTURE_RELEASE)) .withColumn("_ingest_timestamp", F.current_timestamp()) ) source_df.write.mode("append").format("delta").saveAsTable(TARGET_TABLE)
The text was updated successfully, but these errors were encountered:
Yes! You are welcome to submit a Spark example. Thank you. You're right, we definitely need to update that Sedona page.
Sorry, something went wrong.
No branches or pull requests
I'm leveraging spark to download overture data from S3. Sedona is incompatible with recent spark versions.
Example
like the one below?Psuedocode
The text was updated successfully, but these errors were encountered: