Skip to content
This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

[DataCap Application]EMPIAR(1/3) #2152

Closed
1 of 2 tasks
TOPPOOL-LEE opened this issue Aug 15, 2023 · 104 comments
Closed
1 of 2 tasks

[DataCap Application]EMPIAR(1/3) #2152

TOPPOOL-LEE opened this issue Aug 15, 2023 · 104 comments

Comments

@TOPPOOL-LEE
Copy link

Data Owner Name

EMPIAR

What is your role related to the dataset

Data Preparer

Data Owner Country/Region

United Kingdom

Data Owner Industry

Life Science / Healthcare

Website

https://www.ebi.ac.uk/empiar/

Social Media

https://www.ebi.ac.uk/empiar/

Total amount of DataCap being requested

15PiB

Expected size of single dataset (one copy)

1P

Number of replicas to store

10

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f1dwes3ykaliqbvfgp6x4a22uhiepy24ruti3ohli

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

  • Use Custom Multisig

Identifier

No response

Share a brief history of your project and organization

EMPIAR, the Electron Microscopy Public Image Archive, is a public resource for raw images underpinning 3D cryo-EM maps and tomograms (themselves archived in EMDB). EMPIAR also accommodates 3D datasets obtained with volume EM techniques and soft and hard X-ray tomography. More ...
As of 2023-08-15, EMPIAR contains 1391 entries, taking up 3.15 PB of storage.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Founded in 1956, NRAO provides the most advanced radio telescope facilities and information to the international scientific community.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (City and Country)

china

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

No response

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

https://www.ebi.ac.uk/empiar/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

  • I confirm

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives, Lotus built-in data transfer

How do you plan to choose storage providers

Slack, Filmine

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

@large-datacap-requests
Copy link

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

@Sunnyiscoming
Copy link
Collaborator

  • Can you introduce you or your organization?
  • Have you prepared enough token for sector pledge?
  • Are you a data preparer? What is your previous experience as a data-preparer? List previous applications and client IDs
  • How will the data be prepared? Please include tooling used and technical details
  • If you are not preparing the data, who will prepare the data? (Name and Business)
  • Has this dataset been stored on Filecoin before? If so, why are you choosing to store it again?
  • Best practice for storing large datasets includes ideally, storing it in 3 or more regions, with 4 or more storage provider operators or owners.You should list Miner ID, Business Entity, Location of sps you will cooperate with.
  • Per the Modification: Changes required to public open dataset application flow notary-governance#922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be triggered for notary review. Let us know if you have any questions.

@herrehesse
Copy link

herrehesse commented Aug 15, 2023

Hello there! I've noticed that you're a new client seeking an extreme volume of DC (15PiB). Could you provide us with some insight into your decision to not consider building your reputation with a smaller quantity initially? Appreciate it!

@Sunnyiscoming
Copy link
Collaborator

Any update here?

@TOPPOOL-LEE
Copy link
Author

We are still actively communicating with SP, it may take some time, when we are sure, we will fill out the form and reply to your questions.

@herrehesse
Copy link

@liou38469 I would appreciate if you could answer my question above. Thank you!

@zcfil
Copy link

zcfil commented Aug 24, 2023

In my opinion, there are many datacaps that apply for 15PB. People need to have a comprehensive understanding of Filecoin, data packaging, data distribution, and all the requirements that come with it. Are you a novice in the Filecoin field, or have you previously applied for datacap on different Github account names?

@TOPPOOL-LEE
Copy link
Author

Sorry for the late reply!
We are http://poolhub.top, the premier resource optimization and allocation platform connecting blockchain and Metaverse industry participants. We joined BTC mining in 2016 and Filecoin mining in 2020. As a technical service provider of the block chain, we have established a hubpool to serve SPs around the world.For 3 months, we have actively communicated with SP. We have contacted 15 SPs, including geographical location, pledge preparation, data transmission method (hard disk), etc. Now, we are ready and expect to get your help and support ,Thanks! @Sunnyiscoming @herrehesse @zcfil

@TOPPOOL-LEE
Copy link
Author

We filled out the form Fil+ registration form , thank you

@Sunnyiscoming
Copy link
Collaborator

Please list Miner ID, Business Entity, Location of sps you will cooperate with.

@TOPPOOL-LEE
Copy link
Author

TOPPOOL-LEE commented Aug 30, 2023

We have submitted it, thank you, we have contacted many, many sps, there may be some changes in the final cooperation, we will inform you in time, so, can you pass our application?
WX20230905-192215@2x

@Sunnyiscoming
Copy link
Collaborator

Datacap Request Trigger

Total DataCap requested

15PiB

Expected weekly DataCap usage rate

1PiB

Client address

f1dwes3ykaliqbvfgp6x4a22uhiepy24ruti3ohli

@large-datacap-requests
Copy link

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1dwes3ykaliqbvfgp6x4a22uhiepy24ruti3ohli

DataCap allocation requested

512TiB

Id

ee000d3c-6173-4603-b08b-d403a19233bf

@TOPPOOL-LEE
Copy link
Author

checker:manualTrigger

Copy link

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 49.66% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Copy link

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 49.66% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@sxxfuture-official
Copy link

Need to increase the number of data replicas, everything else seems to be fine

Copy link

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecboijnq3fz6ohspi5ffrl356fw55unrrw2aqlcirzw65hinhdr7k

Address

f1dwes3ykaliqbvfgp6x4a22uhiepy24ruti3ohli

Datacap Allocated

1.00PiB

Signer Address

f1foiomqlmoshpuxm6aie4xysffqezkjnokgwcecq

Id

a30ab29e-a496-4c88-9272-47b5c11e67a6

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecboijnq3fz6ohspi5ffrl356fw55unrrw2aqlcirzw65hinhdr7k

Copy link

mikezli commented Nov 28, 2023

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceba2newjsvzf26y543i5j22gvml2hzj5aafzbshlgklhgvqhumy4o

Address

f1dwes3ykaliqbvfgp6x4a22uhiepy24ruti3ohli

Datacap Allocated

1.00PiB

Signer Address

f1dnb3uz7sylxk6emti3ififcvu3nlufnnsjui6ea

Id

a30ab29e-a496-4c88-9272-47b5c11e67a6

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceba2newjsvzf26y543i5j22gvml2hzj5aafzbshlgklhgvqhumy4o

@TOPPOOL-LEE
Copy link
Author

checker:manualTrigger

Copy link

DataCap and CID Checker Report Summary1

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 47.59% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients2

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests