This repository contains code to generate a Filecoin Commp for a file as part of the Filecoin DumboDrop project. The DumboDrop project runs in AWS which utilizes Lambda serverless functions for processing logic. This leads to the following constraints that the design of this project take into account:
- Input files of maximum size 1GB
- AWS Lambda Limits (Maximum RAM of 3008MB and 512MB of Disk)
- Deployment size < 50 MB
- High performance (to keep costs low)
The design takes these requirements into account by utilizing the Rust implementation of Filecoin Commp generation configured for stream parsing of input data and an in memory backing store. The Filecoin Rust implementation is an ideal match given its high performance, stream parsing and in memory backing store. The use of an in memory backing store does limit the size of files that can be processed using this project to those that fit in available memory.
A Filecoin Commp (Piece Commitment) is a Merkle hash of a block of data that's (a) padded out to the next nearest base2 size and internally padded according to the storage proofs requirements (currently an extra 2 bits per 254 bits). Commp is a 32 byte value pacakged in a CID with a custom multi-format.
- Linux 64 bit (may work on other platforms but has not been tested)
- Rust 1.43.1 (and probably later...)
- OpenCL libraries (apt-get -y install ocl-icd-opencl-dev)
This project supports containerized development using Visual Studio Code Remote Containers extension. You can read more about this here: https://code.visualstudio.com/docs/remote/containers
make commp_local
Builds a binary in target/release/commp
that can be run against a file. There is a sample car file in tests/fixtures you can try it on:
target/release/commp tests/fixtures/bafyreidigczbx3d3fbpabihjh3lmeoppdlriaipuityslbl4kgaud6bkci.car
To run this in AWS Lambda, a custom runtime must be which requires building it to run inside the amazonlinux:1 docker image. This docker image is based on CentOS 6 which is quite old (released in 2011). This process is automated by building the binary inside a docker image based on amazonlinux. To build this build image:
make docker_image
Once this image is built, execute a build of commp inside it:
make commp_lambda
Which will result in the file commp_lambda.zip in the root directory.
- Log into AWS Console
- Navigate to Lambda Service -> Functions
- Click "Create Function" button
- Give the function a name "e.g. CommP"
- For "Runtime" choose "Custom Runtime/Provide your own bootstrap"
- Click "Create Function"
- Under "Basic settings" click "Edit"
- Set "Handler" to 'commp_lambda'
- Set memory to max (3008 MB)
- Set timeout to max (15 min)
- Choose an execution role (or create one) that grants access to the S3 bucket commp will read from
- ClicK "Save"
- In "Function code" click "Actions->Upload a .zip file". Navigate to commp_lambda.zip and click OK