You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Graph construction was implemented using rustworkx in MRG: Add graph-based clustering #234, I want to mention that the rustworkx python interface is remarkably optimized. Using Python, we can build all graph-downstream analyses after the initial creation of the graph (I suppose an undirected graph).
BiPartite: As visualized in DBRetina bipartite, this can be useful in so many applications (maybe like):
metagenomes compositional analysis.
pangenome-like signatures relations with genomes.
Host-Pathogen Interactions.
Strain-level analysis.
Community Detection: The current clustering algorithm is weakly_connected_component,@bluegenes tried it before with kSpider, and -as far as I remember- it did a great job in the ANI-based clustering of the GTDB-207. Here, I propose adopting community detection methods, which have been proven very useful in DBRetina, but I haven't tried them on DNA data.
Note: RustworkX currently lacks variability in graph algorithms, unlike NetworkX.
Leiden: An extension to the popular Louvain algorithm, they provide an excellent python-c++ package.
k-mer graph 🌟: : Here the graph will consists of k-mer hashes as nodes, and genomes/metagenomes/etc.. as edges, with abundance as edge-weight. This also can be useful for God knows how many applications (maybe like):
Biomarkers detection
Evolutionary and taxonomy analysis
Low-complexity k-mers detection and removal
and more ...
Interactive Dashboard: In DBRetina, I implemented a JS-based dashboard that loads the graphs and allows interactive researching by filtering/querying the graph with many features/thresholds/etc.. it was super helpful. Previously, this was done by exporting the graph to a graph database like Neo4J or memgraph, but it will not help software users.
(maybe odds ratio & p-value): In the pairwise script, we can allow an optional calculation of the similarity significance by calculating the odds ratio and p-value. But I will need to think more about it in this context.
The text was updated successfully, but these errors were encountered:
rustworkx
in MRG: Add graph-based clustering #234, I want to mention that therustworkx
python interface is remarkably optimized. Using Python, we can build all graph-downstream analyses after the initial creation of the graph (I suppose an undirected graph).weakly_connected_component,
@bluegenes tried it before with kSpider, and -as far as I remember- it did a great job in the ANI-based clustering of the GTDB-207. Here, I propose adopting community detection methods, which have been proven very useful in DBRetina, but I haven't tried them on DNA data.The text was updated successfully, but these errors were encountered: