This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

[DataCap Application] National Herbarium of NSW #2067

Closed
1 of 2 tasks
Hugh-Top opened this issue Jun 26, 2023 · 112 comments

Comments

@Hugh-Top

Hugh-Top commented Jun 26, 2023

Data Owner Name

Royal Botanic Gardens and Domain Trust

What is your role related to the dataset

Storage provider filling out application on behalf of the data owner

Data Owner Country/Region

United States

Data Owner Industry

Not-for-Profit

Website

https://www.rbgsyd.nsw.gov.au/science/national-herbarium-of-new-south-wales

Social Media

N/A

Total amount of DataCap being requested

1PiB

Expected size of single dataset (one copy)

160TiB

Number of replicas to store

10

Weekly allocation of DataCap requested

100TiB

On-chain address for first allocation

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1974

We are a small Filecoin team.
We operate one node, f01969779.
We will continue to participate in Filecoin and seal more data.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The National Herbarium of New South Wales is one of the most significant scientific, cultural and historical botanical resources in the Southern Hemisphere. The 1.43 million preserved plant specimens have been captured as high-resolution images, and the biodiversity metadata associated with each image has been captured in digital form. Botanical specimens date from 1770 to today and form voucher collections that document the distribution and diversity of the world's flora through time, particularly that of NSW, Australia and the Pacific. The data is used in biodiversity assessment, systematic botanical research, ecosystem conservation and policy development. The data is used by scientists, students and the public.

aws s3 ls --no-sign-request --recursive --human-readable --summarize s3://herbariumnsw-pds/ | grep "Total Size:"
   Total Size: 103.9 TiB
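
As a rough cross-check of the figures in this application, the sketch below multiplies the single-copy size by the replica count and compares the result with the requested DataCap. All numbers are copied from the form fields and the `aws s3 ls` output above; the variable and function names are illustrative, and the calculation ignores any padding or packaging overhead added during data preparation.

```python
# Figures copied from the application form; names are illustrative only.
TIB_PER_PIB = 1024

measured_size_tib = 103.9   # "Total Size" reported by `aws s3 ls --summarize`
declared_size_tib = 160.0   # "Expected size of single dataset (one copy)"
replicas = 10               # "Number of replicas to store"
requested_tib = 1 * TIB_PER_PIB  # "Total amount of DataCap being requested" (1 PiB)


def total_needed(single_copy_tib: float, copies: int) -> float:
    """Raw storage needed for `copies` replicas, with no overhead applied."""
    return single_copy_tib * copies


measured_total = total_needed(measured_size_tib, replicas)
declared_total = total_needed(declared_size_tib, replicas)

print(f"Requested DataCap:         {requested_tib} TiB")
print(f"10 x measured size:        {measured_total:.1f} TiB")
print(f"10 x declared single copy: {declared_total:.1f} TiB")
```

Under these assumptions, ten copies of the measured bucket size come out slightly above the 1 PiB request, while ten copies of the declared 160 TiB size would exceed it by a wide margin.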

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

aws s3 ls --no-sign-request s3://herbariumnsw-pds/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

Less than 1 year

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

HTTP or FTP server

How do you plan to choose storage providers

Slack, Partners

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

SP | Region
f02145020 | CN
f02301 | US
f03223 | US
f01969779 (ours) | US
f020522 | DE
f02093396 | Singapore

How do you plan to make deals to your storage providers

Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

@large-datacap-requests

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

@Sunnyiscoming
Collaborator

What percentage of the DataCap will you store on your own nodes?

@Sunnyiscoming
Collaborator

What is your role at the company that is behind this project?
How are you connected to the data set? The website isn't yours. Do you work for the listed organization?
How are you finding SPs? List a detailed plan.

@Hugh-Top
Author

What percentage of the DataCap will you store on your own nodes?

@Sunnyiscoming I have only one node, f01969779, which will store one copy of the dataset (less than 20%). I promise that no SP will store more than 30%. I am looking for other SPs.

Thank you.

@Hugh-Top
Author

What is your role at the company that is behind this project? How are you connected to the data set? The website isn't yours. Do you work for the listed organization? How are you finding SPs? List a detailed plan.

@Sunnyiscoming hello.
This dataset is sourced from https://registry.opendata.aws/nsw-herbarium/
I am not affiliated with this organization in any way. However, this is a CC-licensed public open dataset, so anyone can store it. I believe storing this dataset on the Filecoin network has value.

We already have some cooperating SPs, and we will continue to look for new SPs through Slack. I promise that no SP will store more than 30%.

Thank you.

@Sunnyiscoming
Collaborator

Is this storage node operated by your company? What's the name of your company? What is your role in the company?

@Hugh-Top
Author

Is this storage node operated by your company? What's the name of your company? What is your role in the company?

@Sunnyiscoming We have not set up a company. We are a few friends who operate a node and participate in Filecoin. I am mainly responsible for operation and maintenance.

Thank you.

@herrehesse

@Hugh-Top can you show me retrievability for your selected SPs?

SP | Region
f02145020 | CN
f02301 | US
f03223 | US
f01969779 (ours) | US
f020522 | DE
f02093396 | Singapore

@cryptowhizzard

Hi @Hugh-Top

I see SPs from topblocks in your application. Although as applicant you are allowed to store one copy of your data, I wonder who the 3 other applicants are to store the other 3 replicas of this dataset?

@Hugh-Top
Author

Hugh-Top commented Jul 3, 2023

Hi @Hugh-Top

I see SPs from topblocks in your application. Although as applicant you are allowed to store one copy of your data, I wonder who the 3 other applicants are to store the other 3 replicas of this dataset?

@cryptowhizzard topblocks is our partner. Is it necessary to provide the name of each SP?

SP | Region | Org
f02145020 | CN | harry
f02301 | US | topblocks
f03223 | US | topblocks
f01969779 | US | ours
f020522 | DE | phantom
f02093396 | Singapore | STRAITDEER PTE. LTD.

@Sunnyiscoming
Collaborator

Datacap Request Trigger

Total DataCap requested

1 PiB

Expected weekly DataCap usage rate

100 TiB

Client address

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

@large-datacap-requests

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

DataCap allocation requested

50TiB

Id

f079365e-cf87-437e-af0c-c763204f1f66

@ipollo00

ipollo00 commented Jul 5, 2023

@Hugh-Top Do your cooperating SPs receive your data?

@Hugh-Top
Author

Hugh-Top commented Jul 5, 2023

@Hugh-Top Do your cooperating SPs receive your data?

@ipollo00 yes

Contributor

Fatman13 commented Jul 5, 2023

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceckmghbj64w6by55o7owvoi4rvsso5z5hfpaii74w2lpqd6j6e3g6

Address

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

Datacap Allocated

50.00TiB

Signer Address

f1j3u7crhjzwb2cj5mq7vodlt4o66yoyci7lhcauy

Id

f079365e-cf87-437e-af0c-c763204f1f66

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceckmghbj64w6by55o7owvoi4rvsso5z5hfpaii74w2lpqd6j6e3g6


zcfil commented Sep 19, 2023

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceb6566fwhm33q4osu5hpphojagcmtgihrghzvuwkxost5phn2f2vu

Address

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

Datacap Allocated

400.00TiB

Signer Address

f1cjzbiy5xd4ehera4wmbz63pd5ku4oo7g52cldga

Id

5b837390-6256-4efe-8ad9-e7f562e7dc34

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb6566fwhm33q4osu5hpphojagcmtgihrghzvuwkxost5phn2f2vu

@Hugh-Top
Author

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 73.73%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 31.56% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@github-actions

github-actions bot commented Oct 7, 2023

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@Hugh-Top
Author

Hugh-Top commented Oct 7, 2023

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 72.00%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 41.04% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@Hugh-Top
Author

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 68.04%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 44.44% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@Hugh-Top
Author

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 65.78%
  • Overall HTTP retrieval success rate: 0.00%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@Hugh-Top
Author

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@spaceT9

spaceT9 commented Oct 31, 2023

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@AlanGreaterheat

checker:manualTrigger


DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...


Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecuuiinfxmxvfkb3ml2iuojimwbhd2ag5ng5nxhtdpzdcg5jwcjt6

Address

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

Datacap Allocated

400.00TiB

Signer Address

f1pnmzlxj7cfeo2v6oj5nco46hkg2l46wj7o4xxui

Id

5b837390-6256-4efe-8ad9-e7f562e7dc34

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecuuiinfxmxvfkb3ml2iuojimwbhd2ag5ng5nxhtdpzdcg5jwcjt6

@DaYouGroup

checker:manualTrigger


DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@DaYouGroup

The project is very healthy and I am willing to support this round.


Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecd2fx5u23vdkyl3fzicqj3bp4d4btdh4mhhza6famqxosnumpapm

Address

f1nrnyrttz53iau77m7sbk56pxij5g7mmi4afk6pq

Datacap Allocated

400.00TiB

Signer Address

f1nwjsd2mc6hu4qrwnmd6ukrfkuu4h5fhs7u3exii

Id

5b837390-6256-4efe-8ad9-e7f562e7dc34

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecd2fx5u23vdkyl3fzicqj3bp4d4btdh4mhhza6famqxosnumpapm

@Hugh-Top
Author

Hugh-Top commented Nov 3, 2023

checker:manualTrigger


DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...


This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@Hugh-Top
Author

checker:manualTrigger


DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@ghost

ghost commented Nov 20, 2023

From CID report:
f01877184 | Singapore, Singapore, SG | Alibaba (US) Technology Co., Ltd. | 13.97 TiB | 1.71% | 13.97 TiB | 0.00%
f02252173 | Tokyo, Tokyo, JP | Awesomecloud Limited | 25.91 TiB | 3.18% | 25.91 TiB | 0.00%
f02230941 | Central, Central and Western, HK | BIH-Global Internet Harbor | 27.55 TiB | 3.38% | 26.33 TiB | 4.42%
f02230375 | Central, Central and Western, HK | BIH-Global Internet Harbor | 26.33 TiB | 3.23% | 26.33 TiB | 0.00%
f02230935 | Central, Central and Western, HK | BIH-Global Internet Harbor | 26.23 TiB | 3.22% | 26.23 TiB | 0.00%
f02032191 | Hangzhou, Zhejiang, CN | China Mobile communications corporation | 75.72 TiB | 9.29% | 75.72 TiB | 0.00%
f02808800 | Hangzhou, Zhejiang, CN | China Mobile communications corporation | 51.50 TiB | 6.32% | 51.50 TiB | 0.00%
f02519323 | Yinchuan, Ningxia Hui Autonomous Region, CN | CHINANET NINGXIA province ZHONGWEI IDC network | 32.38 TiB | 3.97% | 32.38 TiB | 0.00%
f02808888 | Shenzhen, Guangdong, CN | CHINANET-BACKBONE | 26.13 TiB | 3.20% | 26.13 TiB | 0.00%
f01836766 | Shenzhen, Guangdong, CN | CHINANET-BACKBONE | 21.59 TiB | 2.65% | 21.59 TiB | 0.00%
f0240185 | Clifton, New Jersey, US | DigitalOcean, LLC | 46.22 TiB | 5.67% | 44.91 TiB | 2.84%
f0143858 | Clifton, New Jersey, US | DigitalOcean, LLC | 41.09 TiB | 5.04% | 39.88 TiB | 2.97%
f03223 | Clifton, New Jersey, US | DigitalOcean, LLC | 26.31 TiB | 3.23% | 26.31 TiB | 0.00%
f02301 | Clifton, New Jersey, US | DigitalOcean, LLC | 25.25 TiB | 3.10% | 25.25 TiB | 0.00%
f01969779 | Clifton, New Jersey, US | DigitalOcean, LLC | 23.81 TiB | 2.92% | 23.81 TiB | 0.00%
f02816666 | Hohhot, Inner Mongolia, CN | Hangzhou Alibaba Advertising Co.,Ltd. | 25.44 TiB | 3.12% | 25.44 TiB | 0.00%
f02062851 | Los Angeles, California, US | NetLab Global | 19.94 TiB | 2.45% | 19.94 TiB | 0.00%
f02240216 | Tokyo, Tokyo, JP | PCCW Global, Inc. | 81.28 TiB | 9.97% | 81.25 TiB | 0.04%
f02228866 | Tokyo, Tokyo, JP | PCCW Global, Inc. | 67.56 TiB | 8.29% | 67.56 TiB | 0.00%
f02093396 | Singapore, Singapore, SG | Starhub Ltd | 78.58 TiB | 9.64% | 78.58 TiB | 0.00%
f02259777 | Singapore, Singapore, SG | Starhub Ltd | 52.66 TiB | 6.46% | 52.66 TiB | 0.00%

From registration form:

SP 1 | f02032191 | treal | NA | NA | zhejiang(CN) | No | NA
f02230375,f02230941,f02230939,f02230935 | chael | NA | NA | HongKong(CN) | No | NA
f01836766 | Huang | NA | NA | Guangzhou(CN) | No | NA
f02093396 | Lingchao Xu | | STRAITDEER PTE. LTD | Singapore | No | xlcbd
f02301,f03223,f0143858,f0240185 | HarryM | harry.ma@topblocks.io | topblocks | Santa Clara(US) | No | HarryM

The SPs listed in the registration form do not match the SPs in the actual deal storage. Closing for violation of 922.
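
The mismatch described in this comment can be reproduced with a simple set comparison. The sketch below is illustrative only: the provider IDs are copied from the two lists in this comment, the variable names are made up for the example, and it is not the tooling the reviewer actually used. Note that it relies solely on the registration list quoted here, which does not include the applicant's own node f01969779 even though that node appears in the original application.

```python
# Provider IDs copied from the "CID report" list in the comment above.
deal_providers = {
    "f01877184", "f02252173", "f02230941", "f02230375", "f02230935",
    "f02032191", "f02808800", "f02519323", "f02808888", "f01836766",
    "f0240185", "f0143858", "f03223", "f02301", "f01969779",
    "f02816666", "f02062851", "f02240216", "f02228866", "f02093396",
    "f02259777",
}

# Provider IDs copied from the "registration form" list in the same comment.
registered_providers = {
    "f02032191",
    "f02230375", "f02230941", "f02230939", "f02230935",
    "f01836766",
    "f02093396",
    "f02301", "f03223", "f0143858", "f0240185",
}

# Providers that received deals but were never registered.
unregistered = sorted(deal_providers - registered_providers)
# Providers that were registered but received no deals.
unused = sorted(registered_providers - deal_providers)

print(f"{len(unregistered)} of {len(deal_providers)} deal providers are not registered:")
for sp in unregistered:
    print("  ", sp)
print("Registered but unused:", unused)
```

With these inputs, roughly half of the providers holding deals do not appear on the quoted registration list, which is consistent with the reason given for closing the application.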

ghost closed this as completed on Nov 20, 2023
@Hugh-Top
Author

checker:manualTrigger


DataCap and CID Checker Report Summary [1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

This issue was closed.