Items to build, research, test or evaluate in creating the services outlined above:
-
metadata storage system - a resilent service which can serve and store data about each file uploaded and at which stage of processing the file has progressed.
- Determine where or how to store process metadata
- cloudsql - seems heavyweight?
- firebase real-time db - is this firestore?
- cloud bigtable
- Determine where or how to store process metadata
-
Cloud Storage bucket (https://storage.cloud.google.com/archive-routeviews/helowurld.txt) NOTE: the referenced path is not public, it should be.
-
BigQuery schema for the data to be loaded
- Decide what parts of the MRT content to store in bigquery
- This may take longer than just storing the data and making that available.
- Decide what parts of the MRT content to store in bigquery
-
CLI Client to read a file, package that file in a protobuf and send to a gRPC endpoint.
-
Server infrastructure to accept the gRPC request, process the included file and provide status.
-
Metrics to collect on the server portion of the processing (prometheus metrics)
-
Finding a golang MRT reader is also a required work item,
-
Integrate RouteViews data with MLab data in BigQuery?