Skip to content

alienobserver/datengeist

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Datengeist

Application for easy understanding of unstructured data

Datengeist is a streamlit built application which is made to understand unstructured data through visualization of its components. Datengeist is working with .csv files. Datengeist has this key functionalities:

  1. Categorization of features
  2. Visualization of distributions
  3. Convenient handling of missing data
  4. Tools for feature comparison

To run datengeist you can install via pip

$ pip install datengeist
$ datengeist start

Or you can create a virtual environment and then run it (recommended)

$ python3 -m venv datengeist_env
$ source datengeist_env/bin/activate

$ pip install datengeist

1. Sample the Dataset

Sample the Dataset is where you can sample data, load it and have your first overview of the data screenshot

2. General Info

General Info is where you can divide your features into corresponding categories and view your missing values in each feature

screenshot

3. Feature Info

Feature Info is where you can view your features more closely, the distributions and missing value percentage screenshot screenshot

4. Relate Features

Relate Features is where you can view the correlation between your features and relate them via box plotting screenshot

License

Apache 2.0

About

Application for easy understanding of unstructured data

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages