CCBot
Verified · Common Crawl · Model training
Builds the open Common Crawl corpus, the dataset behind many LLM training runs. Blocking CCBot keeps your pages out of future Common Crawl crawls, and so out of most open training data built on them.
Visits (30d): 948
First seen: Feb 26
Last seen: 9m ago
Respects robots.txt: Yes
Verification: UA pattern + reverse DNS (operator domain)
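
The verification method above (UA pattern plus reverse DNS against the operator domain) can be sketched roughly as follows. This is a minimal illustration, not the dashboard's actual implementation: the function names, the `CCBot` token match, and the `.commoncrawl.org` suffix are assumptions made for the example, and the operator's own documentation should be checked for the real hostnames. The sketch also confirms the reverse lookup with a forward lookup, since a PTR record alone can be spoofed.

```python
import socket

# Assumption: crawler hostnames end in this suffix; confirm with the operator's docs.
OPERATOR_SUFFIX = ".commoncrawl.org"
# Assumption: the user-agent string contains this token.
UA_TOKEN = "CCBot"


def ua_matches(user_agent: str) -> bool:
    """Cheap first check: does the user-agent claim to be the bot?"""
    return UA_TOKEN.lower() in user_agent.lower()


def host_in_domain(hostname: str, suffix: str = OPERATOR_SUFFIX) -> bool:
    """True if hostname is the operator domain or a subdomain of it."""
    host = hostname.rstrip(".").lower()
    return host.endswith(suffix) or host == suffix.lstrip(".")


def verify_agent(ip, user_agent,
                 reverse=socket.gethostbyaddr,
                 forward=socket.gethostbyname_ex) -> bool:
    """UA pattern + forward-confirmed reverse DNS.

    `reverse` and `forward` are injectable so the logic can be tested
    without network access. Note: gethostbyname_ex resolves IPv4 only.
    """
    if not ua_matches(user_agent):
        return False
    try:
        hostname = reverse(ip)[0]            # PTR lookup: IP -> hostname
        if not host_in_domain(hostname):
            return False
        addresses = forward(hostname)[2]     # A lookup: hostname -> IPs
    except OSError:                          # herror / gaierror: lookup failed
        return False
    return ip in addresses                   # forward result must include the IP
```

A request would count as verified only when all three checks pass: the user-agent matches, the PTR hostname falls under the operator domain, and that hostname resolves back to the same IP.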
Daily visits, last 30 days (chart)

| Most-read pages | Reads |
|---|---|
| /docs/quickstart | 178 |
| /docs/api/meters | 115 |
| /docs/api/tariffs | 108 |
| /docs/sdks/python | 95 |
| /docs/api/carbon | 90 |
| / | 74 |
| /pricing | 50 |
| /customers | 38 |
| /pricing-2024 | 36 |
| /blog/carbon-api-launch | 31 |