This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

[DataCap Application] Sinso #961

Closed
Sinsoteam opened this issue Sep 13, 2022 · 104 comments

Comments

@Sinsoteam

Sinsoteam commented Sep 13, 2022

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

  • Organization Name: Sinso
  • Website / Social Media: http://www.sinso.io/
  • Total amount of DataCap being requested (between 500 TiB and 5 PiB): 5PiB
  • Weekly allocation of DataCap requested (usually between 1-100TiB): 100TiB
  • On-chain address for first allocation: f1lozjhyay3heeav3wm4ttycoaumjgtgrp452woki

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

**Organization**
The Sinso team was established in October 2020. As a leading medical imaging SaaS cloud service provider, its core members have served more than 500 medical institutions and more than 80,000 medical imaging doctors.
Sinso is building the Sinso DAC ecosystem on Web3 technology to help move society into the era of decentralized medical care.
**Our project**
Our project also participated in the Filecoin Frontier Accelerator.
Sinso is a medical image data aggregator for telemedicine and AI diagnosis. It provides a basis for remote diagnosis services by addressing the authenticity of patient data. Through a customized NFT release template for medical data, users take an active part in data confirmation, which strengthens both data collection and the conversion of data into assets, and so improves the flow of medical data. Sinso also supports issuing NFT-like virtual assets to further help doctors move towards free practice.
Users collect data through Sinso Getway, mint health and medical data as NFTs in the Sinso DAPP, and trade them on the Sinso Doctors Network to realize the value transfer of medical data.
We also achieved a good ranking in the acceleration camp.
The following link is about our project:
https://www.bilibili.com/video/BV1PV411J7ma/

What is the primary source of funding for this project?

A: At present, funding comes mainly from Filecoin ecosystem investment, for example the whylab investment fund.

What other projects/ecosystem stakeholders is this project associated with?

A: IPFS/Filecoin, Polkadot, Ethereum

Use-case details

Describe the data being stored onto Filecoin

A: Mainly medical record data, XML and medical image DICOM3.0 files

Where was the data in this dataset sourced from?

A: Uploaded from patients, doctors and hospitals

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

A: Please refer to the attachment for details.
https://drive.google.com/file/d/1JTFp0BYAMygmRHHckv3cBFaS9UonRMIF/view?usp=sharing

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes, confirmed. (Our data uses SDM, static data masking, and DDM, dynamic data masking.)

What is the expected retrieval frequency for this data?

A: The retrieval frequency is relatively high in the first month. After three months, the retrieval probability drops by 80%.

For how long do you plan to keep this dataset stored on Filecoin?

A: Permanent storage as a patient's long-term personal health asset

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

Mainly China or other countries in Asia.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

I will transmit the data to the miners both online and offline.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

The main factors we consider are the following:
1. Location (near China)
2. Experience in handling verified data
3. More than 10 PiB of total raw power

How will you be distributing deals across storage providers?

We will follow the rules for large datasets, and we will ensure fair distribution by limiting the number of deals sent to

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes

@large-datacap-requests

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

@large-datacap-requests

Thanks for your request!
Everything looks good. 👌

A Governance Team member will review the information provided and contact you back pretty soon.

@UnionLabs2020

Could you send an email to filplus-app-review@fil.org with your official domain in order to confirm your identity?

@Sinsoteam
Author

mmexpor
Yes, here's email.

> Could you send an email to filplus-app-review@fil.org with your official domain in order to confirm your identity?

@Sunnyiscoming
Collaborator

What's the relationship between you and the organization?
Can you provide more detailed information about the other storage providers participating in this program? For example, can you list the SPs you have contacted so far?

simonkim0515 self-assigned this Nov 17, 2022

@Sinsoteam
Author

@Sunnyiscoming I am the employee in charge of requesting storage capacity in our company, and we have finished KYC. #961 (comment)
We are ready to work with f0108979, f01210794, and f0524489, and we'll contact more SPs for deals after we get the first round of DataCap. Thank you for your questions.

@simonkim0515
Collaborator

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f1lozjhyay3heeav3wm4ttycoaumjgtgrp452woki
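
As a rough reading of the figures in this trigger, the requested total and the weekly rate together imply about a year of onboarding. The sketch below works that out, assuming the weekly rate stays constant (the thread does not say it will). The function name `weeks_to_onboard` is made up for illustration.

```python
TIB_PER_PIB = 1024  # binary units: 1 PiB = 1024 TiB


def weeks_to_onboard(total_pib: float, weekly_tib: float) -> float:
    """Weeks needed to use `total_pib` of DataCap at a constant `weekly_tib` rate."""
    return total_pib * TIB_PER_PIB / weekly_tib


# Figures from the request above: 5 PiB total, 100 TiB per week.
print(weeks_to_onboard(5, 100))  # 5120 TiB / 100 TiB per week
```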

@large-datacap-requests

large-datacap-requests bot commented Nov 22, 2022

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1lozjhyay3heeav3wm4ttycoaumjgtgrp452woki

DataCap allocation requested

50TiB

Id

2a7f3bf1-8385-4b7e-912c-727feb7e0715

@ghost

ghost commented Aug 16, 2023

Hello @Sinsoteam per the new guidelines filecoin-project/notary-governance#922 for Open Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity toward the Fil+ guideline of a distributed storage plan and SPs posted in the comments here. Let us know if you have any questions.

github-actions bot removed the Stale label Aug 17, 2023

@Sinsoteam
Author

> This client is actively stalling HTTP retrievals and has blocked HTTP range requests with a reverse proxy to prevent its data from being investigated.
>
> It works as follows:
>
> A bandwidth limit is set with NGINX on the HTTP retrieval. After a random amount of data, the limit is set to zero, which makes the transfer time out. Because range retrieval is disabled in NGINX, one cannot pick up where the transfer left off and has to start all over again.
>
> Log can be found at http://datasetcreators.com/downloadedcarfiles/logs/961.log

No. We have communicated with the SPs and made sure they are not using NGINX. I am not sure whether some of your behavior triggered the network's security defenses.

@cryptowhizzard

> > This client is actively stalling HTTP retrievals and has blocked HTTP range requests with a reverse proxy to prevent its data from being investigated.
> > It works as follows:
> > A bandwidth limit is set with NGINX on the HTTP retrieval. After a random amount of data, the limit is set to zero, which makes the transfer time out. Because range retrieval is disabled in NGINX, one cannot pick up where the transfer left off and has to start all over again.
> > Log can be found at http://datasetcreators.com/downloadedcarfiles/logs/961.log
>
> No. We have communicated with the SPs and made sure they are not using NGINX. I am not sure whether some of your behavior triggered the network's security defenses.

Again:

Boost supports range retrievals. Put simply, if a download breaks due to a timeout, it should pick up again where it left off. This function is disabled on your side, making retrieval impossible.

Fix it.
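
The resume behavior described above relies on HTTP range requests: after a stall, the client asks for `bytes=<offset>-` and a server that supports ranges answers with 206 Partial Content. The sketch below illustrates that client-side loop in Python using only the standard library. It is not Boost's or any notary's actual tooling; the names `http_range_fetch` and `resumable_download`, the chunk size, and the retry count are assumptions made for illustration.

```python
import http.client
import urllib.request
from typing import Callable, Iterator


def http_range_fetch(url: str, timeout: float = 30.0) -> Callable[[int], Iterator[bytes]]:
    """Return a fetcher that streams `url` from a byte offset using a Range header."""
    def fetch(offset: int) -> Iterator[bytes]:
        req = urllib.request.Request(url, headers={"Range": f"bytes={offset}-"})
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            # 206 means the range was honored. A 200 on a resumed request
            # means the server ignored it and restarted the body at byte 0.
            if offset > 0 and resp.status != 206:
                raise RuntimeError("server ignored the Range header; cannot resume")
            while chunk := resp.read(1 << 20):
                yield chunk
    return fetch


def resumable_download(fetch: Callable[[int], Iterator[bytes]],
                       total_size: int, max_attempts: int = 5) -> bytes:
    """Download `total_size` bytes, resuming from the last received byte after a stall."""
    data = bytearray()
    for _ in range(max_attempts):
        try:
            for chunk in fetch(len(data)):
                data.extend(chunk)
        except (OSError, http.client.HTTPException):
            # Timeouts and dropped connections: keep the bytes received so far;
            # the next attempt starts at offset len(data).
            pass
        if len(data) >= total_size:
            return bytes(data)
    raise RuntimeError(f"gave up after {max_attempts} attempts at {len(data)} bytes")


# Hypothetical usage (placeholder URL, not a real endpoint):
# car = resumable_download(http_range_fetch("http://sp.example/piece/<pieceCID>"), total_size)
```

When the server does not honor ranges, a resumed request in this sketch raises instead of silently restarting from byte 0, so a transfer that stalls partway can never complete.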

@TakiChain

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 0.00%
  • Overall HTTP retrieval success rate: 30.63%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 61.76% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...
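
The replication warning in the report is a share of deals whose data sits with fewer than four distinct storage providers. The checker's exact method (for example, whether deals are weighted by size) is not described in this thread, so the following is only a sketch that counts every deal equally. The function name, the `(piece_cid, provider)` input shape, and the sample IDs are all hypothetical.

```python
from collections import defaultdict
from typing import Iterable, Tuple


def low_replication_share(deals: Iterable[Tuple[str, str]], min_providers: int = 4) -> float:
    """Fraction of deals whose piece is stored with fewer than `min_providers` distinct SPs."""
    deals = list(deals)
    if not deals:
        return 0.0
    providers_by_piece = defaultdict(set)
    for piece_cid, provider in deals:
        providers_by_piece[piece_cid].add(provider)
    low = sum(1 for piece_cid, _ in deals
              if len(providers_by_piece[piece_cid]) < min_providers)
    return low / len(deals)


# Made-up sample: piece A has 4 providers, B has 2, C has 1 (stored twice by the same SP).
sample = [
    ("A", "sp1"), ("A", "sp2"), ("A", "sp3"), ("A", "sp4"),
    ("B", "sp1"), ("B", "sp2"),
    ("C", "sp1"), ("C", "sp1"),
]
print(f"{low_replication_share(sample):.2%}")  # 4 of 8 deals fall below the threshold
```

Repeated deals with the same provider do not raise a piece's replica count here, which is why piece C still counts as under-replicated.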

@TakiChain

Retrieval report seems normal. Please keep your request in accordance with the principles of the program and in line with their allocation strategy. @Sinsoteam

Copy link

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecclvi2iekxqbcnnhiy6utc4ulrjrvqjeva22c2ahrt2s4n6a6wsk

Address

f1lozjhyay3heeav3wm4ttycoaumjgtgrp452woki

Datacap Allocated

400.00TiB

Signer Address

f15impf3j2zcaex4lhyxndxswuuhv24vzstuqtxsi

Id

818faae9-8fc2-4c9f-a626-a7096df46c9f

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecclvi2iekxqbcnnhiy6utc4ulrjrvqjeva22c2ahrt2s4n6a6wsk

@raghavrmadya
Collaborator

@TakiChain , I see that another notary is actively conducting due diligence on this application. Can you share evidence of what kind of due diligence you performed? Looking at the bot report is not enough. Did you attempt retrieval sampling?

This application has been under dispute - https://www.notion.so/filecoin/LDN-signed-without-retrieval-aebb6f0a736549ae85e03d7b2d411f0a?pvs=4

Client must show evidence of retrievability to continue

@TakiChain

@raghavrmadya Isn't the retrieval report valid evidence? Why not consider upgrading your report?

@cryptowhizzard

Dear Sinsoteam,

As a notary, I am doing due diligence on your LDN. I could not get retrieval to work. Can you please upload the CAR file of CID baga6ea4seaqjhbh6emggbx2zlxaaiqjcrci5ujmjnq6n6lh6kk3vwmn5de56gjq?

You can use our upload system at http://send.datasetcreators.com. Please select 7 days for the system to keep the file and post the link you received here so I (and other notaries) can download your content.

@Sinsoteam
Author

@raghavrmadya This can show that we support retrieval.
image

@raghavrmadya
Collaborator

Thanks @Sinsoteam

@github-actions

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

--
Commented by Stale Bot.

@Casey-PG

checker:manualTrigger

@filplus-checker-app

DataCap and CID Checker Report Summary [1]

Retrieval Statistics

  • Overall Graphsync retrieval success rate: 0.00%
  • Overall HTTP retrieval success rate: 35.38%
  • Overall Bitswap retrieval success rate: 0.00%

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 61.76% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients [2]

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.
Click here to view the Retrieval Dashboard.
Click here to view the Retrieval report.

Footnotes

  1. To manually trigger this report, add a comment with text checker:manualTrigger

  2. To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

@Casey-PG

Report LGTM, willing to support.

Copy link

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacec6bz5mhmsdrc7dktos2k24wlacgfmq24swskhm33kjlh46gq6ksc

Address

f1lozjhyay3heeav3wm4ttycoaumjgtgrp452woki

Datacap Allocated

400.00TiB

Signer Address

f1d4yb3wags3mtddzesxoo63jv7dmlec3bq4yteni

Id

818faae9-8fc2-4c9f-a626-a7096df46c9f

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacec6bz5mhmsdrc7dktos2k24wlacgfmq24swskhm33kjlh46gq6ksc

@kevzak
Collaborator

kevzak commented Sep 18, 2023

closing until @Sinsoteam provides a clear list of SP miner IDs, entities and locations storing replicas

kevzak closed this as completed Sep 18, 2023

@herrehesse

@Casey-PG @TakiChain can these notaries be removed immediately?
