Hello 👋
I'm a software engineer with reverse-engineering chops.
I specialise in hard-to-crack data acquisition problems at scale (i.e, reliably bypassing anti-scraping systems at scale)
I am available for consulting at the price of 150$/hour.
Project name | Description | Language | Notes |
---|---|---|---|
llmify | A simple tool to transform a subset of a git repository into documentation easily readable by a LLM (Large Language Model) | Rust | |
things3-server | A Things3 server with a focus on preserving your privacy. | TypeScript | |
postgresql-obsidian | An Obsidian plugin to upload your notes' metadata to your database. | TypeScript | |
3proxy-docker | A Dockerfile for building 3proxy | Dockerfile | |
soundcloud-scraper | A time-machine for SoundCloud | TypeScript | WIP |
AutoSQLi | An automatic SQL Injection tool | Python | Never quite worked |
typescript-boilerplate | TypeScript | My evergreen TypeScript scraping project template. I will update it regularly, as I implement new tools into my workflow. | |
typescript-boilerplate-old | TypeScript | An archived general-purpose TypeScript boilerplate. | |
puppeteer-boiler | TypeScript | A deprecated browser-automation oriented TypeScript boilerplate. |
Project name | Description | Language | First commit | Last commit |
---|---|---|---|---|
meowssl | A BoringSSL fork that allows more flexibility for TLS fingerprinting evasions | C++ | Sep 11 2024 | ongoing |
mitmproxy-meow | A Mitmproxy fork with TLS and h2 mimicking capabilities | Python | Sep 11 2024 | ongoing |
browserless-stealth | Browserless with evasions | TypeScript | Nov 10 2021 | ongoing |
requests-stealth | Go stealth HTTP request engine featuring an experimentation engine and TLS emulation | Go | February 14 2022 | ongoing |
simkhey | A SMS sending network that uses SIM800C USB modules and Raspberry Pis. Cheap alternative to Twilio | TypeScript | Jan 30 2022 | May 4 2022 |
OpinionShift | A Twitter bot | TypeScript | April 13 2021 | June 2 2022 |
Pumpbot | A cryptocurrency pump-and-dump exploitation tool | TypeScript | ||
QuizBot | Reverse engineered API client for a cash-winning quiz app | Go | ||
Instabot | Instagram auto-post and engagement bot | Go |
... and other projects that I forgot about
- You cannot improve what you cannot measure.
- Use the right tools for the job.
- Decouple flaky software
- Your database should mirror the reality
- Store everything; storage is cheap but data is not
- Evaluate each of your actions through the lens of your current goal
- Don't obsesss too much over optimization; CPU-time is cheap, brain-time is not.
- Reverse-engineer life
- Have boring code
- Pick boring technology
- Be humble; there is always something to learn by listening to others
- Take a walk when the next step is not clear.
- Great work is done in isolation.
- Pen&paper can do wonders
Of course, picking the right tools for the job is the most important part.
But here is a list of technologies that I use and love along with a small comment.
- Telemetry: Grafana, VictoriaMetrics, ClickHouse, (Loki — unfortunately)
- Orchestration:
- Kubernetes (on Hetzner to save $$$)
- Good old makefiles with
scp
- Programming languages
- TypeScript
- Python
- Go
- Kotlin
- Bash (... always forgetting the syntax, though...)
- SQL
- Rust (❤️)
- Databases:
- PostgreSQL
- ClickHouse
- Redis/Dragonfly
- Queues:
- AMQP (RabbitMQ)
- BullMQ
These are technologies I wish to adopt but couldn't get the chance to play around with them enough yet.
- Beeminder - to force me to do the boring tasks
- Todoist — to remember me to do the boring tasks
- A good old notebook! — to help me reason about the interesting tasks
- CommonCog
- LessWrong
- Beeminder's blog
- Sam Altman's blog
- Paul Graham's blog
- Antoine Vastel's blog
- TimescaleDB's blog
- puppeteer-extra's wiki
- the puppeteer-extra community
- Prescience's dark-knowledge (and all Prescience's work)
- niespod's browser-fingerprinting cheatsheet
Telegram: @clouedoc
Mail: clouedoc@tutanota.com
Mr. Hedghehog says: "thanks for the visit!"