GitHub - belaboe97/AdaRank_LOINC: Proof of Concept for an Implementation of search engine for LOINC

##Readme LOINC Ranking Project

This is a fork of githubuser rueychang https://github.com/rueycheng/AdaRank

General Information:

The Loinc Folder consists of various steps taken in order to make the lsitwisem approach: -labelprep: Scripts to do an automated labeling and extension of given LOINC dataset - MQ2007: Benchmark Dataset for ranking with over 40 Parameters (works with implemented AdaRank Library @rueycheng)
For testing download the dataset @ https://www.microsoft.com/en-us/research/project/letor-learning-rank-information-retrieval/letor-4-0/
@ https://onedrive.live.com/?authkey=%21ACnoZZSZVfHPJd0&id=8FEADC23D838BDA8%21107&cid=8FEADC23D838BDA8
- resources: usefull papers about AdaRank and Letor 4 - results: results of the validation dataset after training AdaRank -Files: - test.py, utils.py ,metrics.py => part of reuycheng fork of AdaRank algorithm implementation - loinc_dataset_original, loinc_dataset_extended => Basic and modified version of loinc dataset - IPYNB File for preprocessing data

Installation: - pandas
- numpy
  - nltk
- os
- re
- math
- spacy
- sklearn
(files alread included)
- navigate to the folder and start jupyter notebook
- Run Preprocess Dataset in order to achive files for AdaRank Algorithm implemented by @rueycheng

These are the following batch commands for evaluating the results:

["glucose in blood","bilirubin in plasma","White blood cells count"] Query 1: Glucose in Blood

Org_DS -> python test.py data/original_ds/q1/train.txt data/original_ds/q1/test.txt data/original_ds/q1/vali.txt -o=results/original_ds/gib
.txt Ext_DS -> python test.py data/extend_ds/q1/train.txt data/extend_ds/q1/test.txt data/extend_ds/q1/vali.txt -o=results/extend_ds/gib.txt

Query 2: bilirubin in plasma

Org_DS -> python test.py data/original_ds/q2/train.txt data/original_ds/q2/test.txt data/original_ds/q2/vali.txt -o=results/original_ds/bip.txt
Ext_DS -> python test.py data/extend_ds/q2/train.txt data/extend_ds/q2/test.txt data/extend_ds/q2/vali.txt -o=results/extend_ds/bip.txt

Query 3: White blood cells count

Org_DS -> python test.py data/original_ds/q3/train.txt data/original_ds/q3/test.txt data/original_ds/q3/vali.txt -o=results/original_ds/
wbcc.txt Ext_DS -> python test.py data/extend_ds/q3/train.txt data/extend_ds/q3/test.txt data/extend_ds/q3/vali.txt -o=results/extend_ds/wbcc.txt

These commands are usefull to run the Letor Benchmark Ranking Dataset:

-python test.py MQ2007/Fold1/{train,vali,test}.txt
- More Information: https://github.com/rueycheng/AdaRank/issues/1

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.ipynb_checkpoints		.ipynb_checkpoints
MQ2007		MQ2007
__pycache__		__pycache__
data		data
labelprep		labelprep
resources		resources
results		results
Lab ML Ranking Assignment.pdf		Lab ML Ranking Assignment.pdf
PreprocessDataset.ipynb		PreprocessDataset.ipynb
adarank.py		adarank.py
authors.txt		authors.txt
loinc_dataset_extended.xlsx		loinc_dataset_extended.xlsx
loinc_dataset_original.xlsx		loinc_dataset_original.xlsx
metrics.py		metrics.py
readme.md		readme.md
test.py		test.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

General Information:

These are the following batch commands for evaluating the results:

These commands are usefull to run the Letor Benchmark Ranking Dataset:

About

Releases

Packages

Languages

belaboe97/AdaRank_LOINC

Folders and files

Latest commit

History

Repository files navigation

General Information:

These are the following batch commands for evaluating the results:

These commands are usefull to run the Letor Benchmark Ranking Dataset:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages