-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About CAMUS dataset #4
Comments
@dengxl0520 do you have any update on this? I was shocked by the amount of over-engineering of their projects. |
@dengxl0520 I have just read through the dataset_generator.py and it looks like the team could access to a CAMUS dataset with fully annotated sequence from ED to ES, it seems confuse to me because the public training set of CAMUS does not contain any of this information. |
@gungui98 About the dataset, you can see the paper https://arxiv.org/abs/2112.02102 |
@dengxl0520 Seem like they extend the original dataset to full cycle by manual annotation. I have email the author for the dataset but haven't receive the response yet. |
@gungui98 you can download it from here (https://humanheart-project.creatis.insa-lyon.fr/ted.html) |
@dengxl0520 I have successfully processed the dataset and got h5 file, first you have to run script with full cycle option, where the input dataset is from your provided link. I have also skip the k-fold part where I simply split the dataset into 80/10/10 for train test val for the function def get_fold_subset_from_file(
cls, data: Path, fold: int, subset: Literal["training", "validation", "testing"]
) -> List[str]:
"""Reads patient ids for a subset of a cross-validation configuration.
Args:
data: Path to the CAMUS root directory, under which the patient directories are stored.
fold: ID of the test set for the cross-validation configuration.
subset: Name of the subset for which to fetch patient IDs for the cross-validation configuration.
Returns:
IDs of the patients that are included in the subset of the fold.
"""
# list_fn = data / "listSubGroups" / f"subGroup{fold}_{subset}.txt"
# # Open text file containing patient ids (one patient id by row)
# with open(str(list_fn), "r") as f:
# patient_ids = [line for line in f.read().splitlines()]
import glob
patient_ids = glob.glob(str(data / "*"))
# patient_ids = sorted(patient_ids)
train_set = patient_ids[:int(len(patient_ids) * 0.8)]
test_set = patient_ids[int(len(patient_ids) * 0.8):int(len(patient_ids) * 0.9)]
val_set = patient_ids[int(len(patient_ids) * 0.9):]
if subset == "training":
return train_set
if subset == "testing":
return test_set
return val_set I will try to implement the correct and fixed k-fold part but this simply made thing run at first. |
@gungui98 I try to modify the python file
|
@dengxl0520 not really sure about your problem, but this is code that I have used, it could come from reading the image data: https://gist.github.com/gungui98/364e8f77930880132dee9704aca9a90d |
I want to use the CAMUS dataset on this project, but I have some problems:
I use the castor/vital/vital/data/camus/dataset_generator.py to generate the HDF5 file, but I got an error:
how can I get the 'subGroup1_training.txt'?
The text was updated successfully, but these errors were encountered: