CodeSkills
← 返回发现

Scrape

Legal web scraping with robots.txt compliance, rate limiting, and GDPR/CCPA-aware data handling.

来自 GitHub查看原文 →2026-03-24

## Pre-Scrape Compliance Checklist

Before writing any scraping code:

1. **robots.txt** — 获取 `{域名}/robots.txt`, check if target 路径 is disallowed. If yes, 停止. 2. **Terms of 服务** — Check `/terms`, `/tos`, `/legal`. Explicit scraping prohibition = need 权限. 3. **Data 类型** — Public factual data (prices

相关 Skills