Add BigEarthNet Version2 #2531

Merged
merged 33 commits into from
Feb 19, 2025

Collaborator

@nilsleh nilsleh commented Jan 24, 2025

Supersedes #2371 after talking to Ando.

After taking a look, I found that the new version comes with a metadata.parquet file, which makes data handling quite a bit more straightforward. With a version=2 argument, I felt there would be many nested if statements, so it seemed cleaner to do it this way. If there were a similar metadata.parquet file for V1, this could be made more condensed.
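
To illustrate why a single metadata table simplifies data handling, here is a minimal sketch of building a per-split index from metadata rows. The column names `patch_id` and `split`, the helper `index_by_split`, and the sample rows are assumptions made for illustration; they are not taken from this PR or from the dataset's actual schema.

```python
from collections import defaultdict
from typing import Iterable, Mapping


def index_by_split(rows: Iterable[Mapping[str, str]]) -> dict[str, list[str]]:
    """Group patch identifiers by their split label (hypothetical helper)."""
    index: dict[str, list[str]] = defaultdict(list)
    for row in rows:
        # 'split' and 'patch_id' are assumed column names.
        index[row["split"]].append(row["patch_id"])
    return dict(index)


# Hypothetical rows standing in for the contents of metadata.parquet.
rows = [
    {"patch_id": "patch_a", "split": "train"},
    {"patch_id": "patch_b", "split": "validation"},
    {"patch_id": "patch_c", "split": "train"},
]
split_index = index_by_split(rows)
```

If pandas and a parquet engine are installed, the rows could instead come from `pd.read_parquet(path).to_dict("records")`, so that one lookup replaces any per-version branching on file layout.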

[Image: example_benv2]

@nilsleh nilsleh marked this pull request as draft January 24, 2025 19:00
@nilsleh nilsleh added this to the 0.7.0 milestone Jan 24, 2025
@github-actions github-actions bot added documentation Improvements or additions to documentation datasets Geospatial or benchmark datasets testing Continuous integration testing labels Jan 24, 2025
@github-actions github-actions bot added the dependencies Packaging and dependencies label Jan 27, 2025
@nilsleh nilsleh marked this pull request as ready for review January 27, 2025 18:49
@nilsleh nilsleh marked this pull request as draft January 27, 2025 19:11
@nilsleh nilsleh marked this pull request as ready for review January 28, 2025 16:16
Collaborator Author

nilsleh commented Jan 28, 2025

@ando-shah in case you want to have a look, since you already have experience with the dataset and might spot something.

Collaborator

@adamjstewart adamjstewart left a comment

Once this is done I would love to squash this on top of #2371 so that @ando-shah gets some credit.

@github-actions github-actions bot removed the dependencies Packaging and dependencies label Feb 19, 2025
adamjstewart previously approved these changes Feb 19, 2025
Collaborator

@adamjstewart adamjstewart left a comment

Minor comments, can probably merge if you're in a rush.

self,
root: Path = 'data',
split: str = 'train',
bands: str = 'all',
Collaborator

People have complained that some datasets have (B1, B2, etc.) and others have s1, s2, etc. I know you're just matching the existing dataset, but should we change both to the former?

Collaborator

@adamjstewart adamjstewart left a comment

I'm happy if you're happy.

@adamjstewart adamjstewart merged commit 7071ecd into microsoft:main Feb 19, 2025
22 checks passed