A partitioned GPU-backed dataframe, using Dask.
Setup from source repo:
- Follow the instructions in https://github.com/gpuopenanalytics/pygdf#setup to set up a conda environment for pygdf.
- Activate pygdf environment:
source activate pygdf_dev
- Clone dask_gdf repo:
git clone https://github.com/gpuopenanalytics/dask_gdf path/to/dask_gdf
cd path/to/dask_gdf
- Install additional dependencies:
conda install dask distributed
Installing (Optional):
- Run:
cd path/to/dask_gdf
pip install .
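After setup or installation, a quick way to confirm the environment is usable is to check that the required packages can be found by Python. The following is a minimal sketch, not part of the repo: the helper name check_modules is hypothetical, and the import names pygdf and dask_gdf are assumed to match the package names used above. It only looks the modules up and does not import them, so it does not exercise the GPU.

```python
import importlib.util


def check_modules(names):
    """Return a dict mapping each module name to True if Python can locate it."""
    # find_spec returns None when a top-level module cannot be found.
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumed import names for the packages installed in the steps above.
    required = ["dask", "distributed", "pygdf", "dask_gdf"]
    for name, found in check_modules(required).items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Run it with the pygdf_dev environment active. Any module reported as MISSING suggests the corresponding setup step should be repeated.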
Testing:
- Ensure pytest is available:
conda install pytest
cd path/to/dask_gdf
- Run all tests with:
pytest dask_gdf
- Or, run individual tests with:
pytest dask_gdf/tests/test_file.py