Experimental layered distributed peer to peer overlay filesystem. Based on distributed hash tables (DHTs) and implemented as chord ring using C++20 and GRPC - work in progress.

winternet/chord


     _/_/_/  _/                                  _/
  _/        _/_/_/      _/_/    _/  _/_/    _/_/_/
 _/        _/    _/  _/    _/  _/_/      _/    _/
_/        _/    _/  _/    _/  _/        _/    _/
 _/_/_/  _/    _/    _/_/    _/          _/_/_/



Overview

This is an experimental, layered, distributed peer-to-peer overlay filesystem based on distributed hash tables (DHTs). The implementation is heavily based on the paper "Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications" by Ion Stoica et al. See the Wikipedia article on Chord for more information.
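
As a rough illustration of the idea from the Chord paper, a key is stored on its successor, i.e. the first node whose identifier is equal to or follows the key on the identifier ring. The sketch below is not taken from this project's sources: the 256-bit ring size, the use of SHA-256 and the names `key_id` and `successor` are assumptions made for the example.

```python
import hashlib
from bisect import bisect_left

M = 256          # identifier bits (assumption for this sketch)
RING = 2 ** M    # size of the identifier ring


def key_id(name: str) -> int:
    """Hash a name onto the ring (SHA-256 is an illustrative choice)."""
    return int.from_bytes(hashlib.sha256(name.encode()).digest(), "big") % RING


def successor(node_ids: list[int], key: int) -> int:
    """Return the first node id >= key, wrapping around to the smallest id."""
    ids = sorted(node_ids)
    i = bisect_left(ids, key)   # index of the first id that is >= key
    return ids[i % len(ids)]    # i == len(ids) means we wrap around the ring


# Four hypothetical nodes spread evenly over the ring.
nodes = [0, RING // 4, RING // 2, 3 * RING // 4]
for name in ("chord:///etc/hosts", "chord:///etc/hostname"):
    owner = successor(nodes, key_id(name))
    print(name, "-> node", nodes.index(owner))
```

A real Chord node does not know all node ids; it keeps a finger table and forwards lookups in O(log N) hops. The sorted list here only shows which node ends up responsible for a key.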

Architectural Overview

Fuse Adapter

The project comes with an optional, incomplete and even more experimental adapter for FUSE (cf. libfuse) to mount the distributed filesystem.

Disclaimer

This software is highly experimental and data loss is very likely - do NOT use this project except for experiments or educational purposes (see LICENSE). Contributions that make the project more resilient and stable are welcome.

Installation

Pre-built packages

The easiest way to get started is to download a package from the Releases page and install it using your favorite package manager, e.g.:

$ # debian / ubuntu / ... (use either apt or apt-get)
$ apt install ./chord*.deb libfuse3-3
$ apt-get install ./chord*.deb libfuse3-3
$ # fedora / centos / ... (use one of dnf, rpm or yum)
$ dnf install chord*.rpm
$ rpm -i chord*.rpm
$ yum install chord*.rpm

Otherwise download and untar the tgz:

$ tar xvzf chord*.tgz

Docker

Another way to try out chord is to use the small docker image (~20 MB), built from scratch, that is provided by the repository winternet1337/chord. The images are automatically built and pushed to Docker Hub after each commit, provided all tests pass. To pull the image, issue:

$ docker pull winternet1337/chord:latest

See INSTALL.md for more detailed installation instructions, e.g. how to build the project from sources.

Build from Source

The build process requires conan. After installing conan, build chord using the following commands:

$ conan remote add bincrafters https://bincrafters.jfrog.io/artifactory/api/conan/public-conan
$ git clone https://github.com/winternet/chord.git
$ cd chord && mkdir build && cd build
$ conan install .. --build=missing && cmake .. && cmake --build . -j4

Usage

Mount the Filesystem (fuse)

Mounting the experimental overlay filesystem requires a configuration file. The installer packages provide two example configurations, /etc/node.yml and /etc/fuse_node0.yml. The latter can be used in conjunction with FUSE.

$ chord_fuse -s <absolute-path> -- --config /etc/fuse_node0.yml

Note that the configuration has to be adapted for further nodes (uuid, join-addr, data-directory, meta-directory).

After mounting the filesystem, optionally joining some more nodes to the ring, you should be able to issue basic file system operations in the mounted folder using bash or your favorite file explorer (e.g. nautilus).

Chord Client

To run a node in interactive mode and clean up automatically afterwards, issue $ docker run -ti --rm winternet1337/chord. This command bootstraps the container with a random node uuid on the default port 50050. To stop the container, press Ctrl+C.

In order to print chord's help just append the --help argument (-h for short):

 $ docker run -ti --rm winternet1337/chord --help
 [program options]:
  -h [ --help ]           produce help message
  -c [ --config ] arg     path to the yaml configuration file.
  -j [ --join ] arg       join to an existing address.
  -b [ --bootstrap ]      bootstrap peer to create a new chord ring.
  -n [ --no-controller ]  do not start the controller.
  -u [ --uuid ] arg       client uuid.
  --bind arg              bind address that is promoted to clients.

To configure our node we could pass some of the arguments to the container, however, it is far more convenient and powerful to use a configuration file. For this to work, we create one on our docker host machine and mount the volume within the docker container.

The node configuration

Sample configurations can be found in the source code repository under the config folder. Either use these samples or copy & paste the one below to e.g. /tmp/chord/config/node0.yml:

version: 1

## data
data-directory: "/data"
meta-directory: "/meta"

## networking
bind-addr: "0.0.0.0:50050"
join-addr: "0.0.0.0:50051"
bootstrap: Yes
no-controller: No

## details
stabilize_ms: 10000
check_ms: 10000

## replication / striping
replication-count: 1

uuid: "0"

## logging
logging:
  level: trace
  sinks:
    CONSOLE_SINK:
      type: "console"
    FILE_SINK:
      type: "file-rotating"
      path: "/logs/chord0.log"
  loggers:
    CHORD_LOG:
      sinks: [FILE_SINK]
      filter: "^chord[.](?!fs)"
    CHORD_FS_LOG:
      sinks: [CONSOLE_SINK]
      filter: "^chord[.]fs"
      level: trace

The configuration should be quite self-explanatory. A more detailed description of the configuration will be provided in the wiki.

Starting configured node

To start the node with the yaml configuration file, we need to mount it to the container.

 $ docker run -ti --rm \
        -v /tmp/chord/config:/etc/chord \
        winternet1337/chord -c /etc/chord/node0.yml
[<timestamp>] [chord.fs.metadata.manager] [trace] [ADD] chord:///
[<timestamp>] [chord.fs.metadata.manager] [trace] [ADD] .

Since we restricted the console logging to the filesystem part, we are greeted by the metadata manager creating the root of our p2p filesystem - waiting for something to happen.
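
To see why only filesystem messages reach the console, the two `filter` expressions from the sample configuration can be tried against logger names. This is a minimal sketch that assumes each filter is applied as a regular expression to the logger name; how chord evaluates the filters internally is not described here, and `chord.router` is a hypothetical logger name used only for contrast.

```python
import re

# Filter expressions copied from the sample configuration above.
# Assumption: a logger receives a message when its filter matches the name.
FILTERS = {
    "CHORD_LOG": r"^chord[.](?!fs)",   # anything under chord. except chord.fs...
    "CHORD_FS_LOG": r"^chord[.]fs",    # only names starting with chord.fs
}


def matching_loggers(logger_name: str) -> list[str]:
    """Return the configured loggers whose filter matches the given name."""
    return [name for name, pattern in FILTERS.items()
            if re.search(pattern, logger_name)]


for logger_name in ("chord.fs.metadata.manager", "chord.router"):
    print(logger_name, "->", matching_loggers(logger_name))
```

Under that assumption the negative lookahead `(?!fs)` keeps the two filters disjoint, so a message ends up either in the rotating file or on the console, never in both.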

Setup chord cluster

The next section describes how to set up a small local cluster consisting of two nodes. The section closes with storing a folder within our cluster.

Forwarding ports and bind address

To wire our different nodes locally we will exploit docker's --net=host option. Note that this is not the preferred way, but it is OK for showing the basic concepts. We start by mounting more volumes so that all (meta-)data is written to the docker host filesystem.

 $ docker run -ti --rm --net=host \
        -v /tmp/chord/config:/etc/chord \
        -v /tmp/chord/data0:/data \
        -v /tmp/chord/meta0:/meta \
        -v /tmp/chord/logs0:/logs \
        winternet1337/chord -c /etc/chord/node0.yml

Copy /tmp/chord/config/node0.yml to /tmp/chord/config/node1.yml and change the uuid to a value near (2^256)/2 so that files are distributed evenly across the two nodes. Also change the bind address to bind-addr: "0.0.0.0:50051" and the join address to join-addr: "0.0.0.0:50050".
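
The value (2^256)/2 is the point opposite uuid 0 on the identifier ring, so each of the two nodes is responsible for half of the key space. The following sketch computes that value and applies the three changes to a configuration text. It assumes the uuid is written as a decimal string, as in the sample (`uuid: "0"`); the helper `derive_node1` is hypothetical and not part of chord.

```python
RING = 2 ** 256
node1_uuid = RING // 2  # halfway around the ring from uuid 0


def derive_node1(node0_yaml: str) -> str:
    """Rewrite uuid, bind-addr and join-addr; all other lines stay unchanged."""
    replacements = {
        "uuid:": f'uuid: "{node1_uuid}"',  # decimal notation is an assumption
        "bind-addr:": 'bind-addr: "0.0.0.0:50051"',
        "join-addr:": 'join-addr: "0.0.0.0:50050"',
    }
    out = []
    for line in node0_yaml.splitlines():
        for prefix, new_line in replacements.items():
            if line.startswith(prefix):
                line = new_line
                break
        out.append(line)
    return "\n".join(out) + "\n"


# Shortened stand-in for node0.yml; pass the real file content instead.
sample = 'bind-addr: "0.0.0.0:50050"\njoin-addr: "0.0.0.0:50051"\nuuid: "0"\n'
print(derive_node1(sample))
```

Editing the copied file by hand works just as well; the script only makes the three changed keys explicit.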

On a different shell start another docker instance with our second configuration and different (meta-)data directories.

 $ docker run -ti --rm --net=host \
        -v /tmp/chord/config:/etc/chord \
        -v /tmp/chord/data1:/data \
        -v /tmp/chord/meta1:/meta \
        -v /tmp/chord/logs1:/logs \
        winternet1337/chord -c /etc/chord/node1.yml

After a few seconds the nodes should synchronise and build a small cluster. To query the root of the distributed filesystem chord:///:

$ docker run -ti --rm --net=host winternet1337/chord dir chord:///
d---------   .

To upload new files to the distributed filesystem issue a put <node_local> chord:/// command. Examining the root afterwards reveals that a new directory /etc has been added.

$ docker run -ti --rm --net=host winternet1337/chord put /etc chord:///
$ docker run -ti --rm --net=host winternet1337/chord dir chord:///
d---------   .
d---------   etc
$ docker run -ti --rm --net=host winternet1337/chord dir chord:///etc
d---------   .
-rwxrwxrwx   hostname
-rwxrwxrwx   hosts

Note that put commands are currently always sent to 127.0.0.1:50050 and executed locally on that node, i.e. the uploaded files are read from that node's filesystem, since only nodes participating in the cluster can upload their directories and files.

Since the (meta-)data directories are mounted from the host filesystem, the files hosted on the DHT are located in the host's /tmp/chord/data? directories (data0 and data1). The database storing the metadata is located under /tmp/chord/meta?.

Packaging

Build tgz package

$ cd build
$ cmake -G Ninja -DCMAKE_BUILD_TYPE=Release ..
$ cpack -G TGZ
$ tar -tf ../packages/chord_*.tgz

Build deb package

Requires the dpkg executable to be installed (Arch Linux: community/dpkg).

$ cd build
$ cmake -DCPACK_PACKAGING_INSTALL_PREFIX=/ -G Ninja -DCMAKE_BUILD_TYPE=Release ..
$ cpack -G DEB
$ dpkg-deb -c ../packages/chord_*.deb

Build rpm package

Requires the rpmbuild executable to be installed (Arch Linux: community/rpm-tools).

$ cd build
$ cmake -DCPACK_PACKAGING_INSTALL_PREFIX=/ -G Ninja -DCMAKE_BUILD_TYPE=Release ..
$ cpack -G RPM
$ rpm -qlp ../packages/chord_*.rpm
