About Stats
We have so many plans for the Stats area of the site. For now, here are a few high-level data tables that contain the type of analyses we plan to release with our first full site version.
If you have any incredible ideas for some aggregated information you'd like to see here, please do get in touch.
Note. These tables are reports of what we see in robots.txt files. As these files are rarely validated or checked, there will be misspellings and incorrectly formatted crawler names. So, bear in mind, something that you see labelled as a "bot" may be a common typo, rather than a real crawler.
Browse Some Stats
User-Agents mentioned in robots.txt in all analysed sites
Other interesting stats
These files are an early-release sample of the data we will be providing. It is very likely that layout and column headers will change in the future.
Newest Bots (With >500 mentions)
User-Agent | First Seen | Mentions | |
1 | Neilpatelbot | 2025-05-08 | 574 |
2 | MistralAI-User | 2025-04-22 | 13762 |
3 | VolcEngineSpider | 2025-04-17 | 4045 |
4 | NovaAct | 2025-04-16 | 17273 |
5 | symbolicator | 2025-04-15 | 1210 |
6 | AITraining | 2025-03-20 | 741 |
7 | test123 | 2025-03-06 | 1375 |
8 | TikTokSpider | 2025-02-28 | 16620 |
9 | Perplexity AI Bot | 2025-02-19 | 1615 |
10 | Anthropic AI Bot | 2025-02-19 | 1619 |