-
Notifications
You must be signed in to change notification settings - Fork 237
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Integrate another bioMLmodel (biomd0000001076) #1371
Merged
bgruening
merged 18 commits into
bgruening:master
from
anuprulez:integrate_BIOMD0000001076
Jan 5, 2024
Merged
Changes from all commits
Commits
Show all changes
18 commits
Select commit
Hold shift + click to select a range
614f988
Add tool BIOMD0000001076
anuprulez c4306b7
update tool desc
anuprulez 47a8deb
update tool desc
anuprulez 345988c
add version
anuprulez 67346d8
update
anuprulez b50b15f
update test
anuprulez 4b6e947
remove print
anuprulez cabdbe7
add creator
bgruening 4c27f2d
update tool
anuprulez 1ed1b14
update doi
anuprulez 672437a
Update tools/biomodelsML/biomodels_BIOMD0000001076.xml
anuprulez ee9cc55
Merge branch 'bgruening:master' into integrate_BIOMD0000001076
anuprulez 48bf298
remove xlsx
anuprulez 27adac4
test bibtext to fix test
anuprulez 8df994c
revert to type=doi
anuprulez 9fa0e1c
remove creator tag
anuprulez 86695d5
update profile
anuprulez 80e6548
update embedded script name
anuprulez File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,93 @@ | ||
<tool id="biomodels_BIOMD0000001076" name="Prediction of Breast Cancer (BC)" version="@VERSION@" profile="23.05"> | ||
<description>proteins involved in cancer immunotherapy using molecular descriptors and Multi Layer Perceptron (MLP) neural network</description> | ||
<macros> | ||
<token name="@VERSION@">1</token> | ||
</macros> | ||
<creator> | ||
<organization name="EMBL's European Bioinformatics Institute (EMBL-EBI)" url="https://www.ebi.ac.uk/biomodels/" email="biomodels-net-support@lists.sf.net" /> | ||
<person name="Anup Kumar" email="kumara@informatik.uni-freiburg.de" /> | ||
</creator> | ||
<requirements> | ||
<container type="docker">docker.io/anupkumar/biomodels_biomd0000001076:@VERSION@</container> | ||
</requirements> | ||
<command><![CDATA[ | ||
cp /home/\$NB_USER/MLP_model3.onnx ./ && | ||
python '$biom_script_BIOMD0000001076' | ||
]]> | ||
</command> | ||
<configfiles> | ||
<configfile name="biom_script_BIOMD0000001076"> | ||
<![CDATA[ | ||
import os | ||
import numpy | ||
import onnxruntime as rt | ||
import pandas as pd | ||
|
||
#if $input_file.ext == "tabular": | ||
data_test = pd.read_csv('$input_file', sep="\t") | ||
#elif $input_file.ext == "csv": | ||
data_test = pd.read_csv('$input_file', sep=",") | ||
#end if | ||
|
||
Xdatap = data_test.iloc[:,:-1].values | ||
sess = rt.InferenceSession(os.getcwd() + "/MLP_model3.onnx") | ||
input_name = sess.get_inputs()[0].name | ||
label_name = sess.get_outputs()[0].name | ||
pred_onx = sess.run([label_name], {input_name: Xdatap.astype(numpy.float32)})[0] | ||
data_test["Predictions"] = pred_onx | ||
data_test.to_csv('$output_file', sep="\t", index=None) | ||
|
||
]]> | ||
</configfile> | ||
</configfiles> | ||
<inputs> | ||
<param name="input_file" type="data" label="Test data" format="tabular,csv" multiple="false" /> | ||
</inputs> | ||
<outputs> | ||
<data format="tabular" name="output_file" label="Predicted data"></data> | ||
</outputs> | ||
<tests> | ||
<test> | ||
<param name="input_file" value="test_data_BIOMD0000001076.tabular" ftype="tabular" /> | ||
<output name="output_file" file="pred_data_BIOMD0000001076.tabular"> | ||
<assert_contents> | ||
<has_n_columns n="302" /> | ||
<has_n_lines n="401" /> | ||
</assert_contents> | ||
</output> | ||
</test> | ||
<test> | ||
<param name="input_file" value="test_data_BIOMD0000001076.csv" ftype="csv" /> | ||
<output name="output_file" file="pred_data_BIOMD0000001076.tabular"> | ||
<assert_contents> | ||
<has_n_columns n="302" /> | ||
<has_n_lines n="401" /> | ||
</assert_contents> | ||
</output> | ||
</test> | ||
</tests> | ||
<help> | ||
<![CDATA[ | ||
**What it does** | ||
|
||
The tool makes prediction of breast cancer-related proteins: non-breast cancer related protein (0) or related to breast cancer protein (1). 300 features are used to train a Multi Layer Perceptron (artificial neural network) for such a prediction. The model is available via a Docker container. The original GitHub repository can be accessed at https://github.com/muntisa/neural-networks-for-breast-cancer-proteins. | ||
|
||
|
||
**Description** | ||
|
||
This study introduces a predictive classifier for breast cancer-related proteins, utilising a combination of protein sequence descriptors and machine learning techniques. The best-performing classifier is a Multi Layer Perceptron (artificial neural network) with 300 features, achieving an average Area Under the Receiver Operating Characteristics (AUROC) score of 0.984 through 3-fold cross-validation. Notably, the model identified top-ranked cancer immunotherapy proteins associated with breast cancer that should be studied for further biomarker discovery and therapeutic targeting. Please note that in this model, the output '0' means BC non-related protein and '1' means BC related protein. | ||
|
||
**Input file** | ||
|
||
Provide a tabular file (as tabular or csv) containing all the features as mentioned above. | ||
|
||
**Output file** | ||
|
||
Returns predicted data as a tabular file - the predictions are concatenated as a last column of the test data. | ||
|
||
]]> | ||
</help> | ||
<citations> | ||
<citation type="doi">10.1038/s41598-020-65584-y</citation> | ||
</citations> | ||
</tool> |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is that ok for you?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, it works. Thanks a lot :)