Weather
一个面向 Data & APIs 场景的 Agent 技能。原始说明:Get current weather and forecasts (no API key required).
一个面向 Data & APIs 场景的 Agent 技能。原始说明:Multi-platform social media data collection and aggregation for content performance tracking. Use when: (1) collecting engagement metrics (views/likes/commen...
name: social-media-data-collector
description: |
Multi-platform social media data collection and aggregation for content performance tracking.
Use when: (1) collecting engagement metrics (views/likes/comments/shares) across multiple platforms,
(2) filling bitable/spreadsheet with social media performance data, (3) tracking content distribution
results across 10+ platforms, (4) need to scrape platforms without APIs.
Covers: Douyin, Weibo, Kuaishou, Bilibili, Toutiao, Xiaohongshu, WeChat Video (视频号),
Autohome (汽车之家), Yiche (易车), Baijiahao (百家号), Douyu (斗鱼), Pipixia (皮皮虾),
Dongchedi (懂车帝), TikTok, YouTube.
NOT for: posting content, account management, or social listening/monitoring.
Collect engagement metrics from 13+ platforms, aggregate into structured format (飞书多维表格/CSV).
Three-tier approach: API first → browser scrape fallback → manual flag.
| Tier | Platforms | Method |
|------|-----------|--------|
| API-first | 抖音, 微博, 快手, B站, 今日头条, 小红书 | TikHub API / BlueAI Crawler |
| Browser-scrape | 百家号, 汽车之家, 易车, 视频号, 斗鱼, 皮皮虾 | Playwright headless |
| API+scrape | 懂车帝 | TikHub (limited) + scrape |
Using opus/sonnet for the entire pipeline wastes tokens on mechanical tasks.
| Phase | Model | Why |
|-------|-------|-----|
| Planning & classification | opus/sonnet | Needs reasoning |
| API calls & JSON parsing | haiku/flash | Mechanical, no reasoning needed |
| Browser text extraction | Code (no LLM) | Pure Python, no model call |
| Data normalization | haiku/flash | Simple mapping |
| Report/summary | sonnet | Needs synthesis |
scripts/collect_api.py for API tier — zero LLM tokens (pure code)scripts/collect_browser.py for browser tier — zero LLM tokens (pure code)# Full collection run
python3 scripts/collect_api.py --config /tmp/sm-collect/config.json
# Browser scrape specific platforms
python3 scripts/collect_browser.py --platforms "百家号,汽车之家,视频号"
# Write to bitable
python3 scripts/write_bitable.py --app-token XXX --table-id YYY --data /tmp/sm-collect/results.json
# Cleanup
rm -rf /tmp/sm-collect/ /tmp/screenshots/
| 多维表格字段 | 类型 | 说明 |
|-------------|------|------|
| 播放量 | text | 带"万"后缀的文本 |
| 点赞 | number | 纯数字 |
| 评论 | number | 纯数字 |
| 分享 | number | 纯数字 |
| 收藏 | number | 纯数字 |
| 互动量合计 | text | 带"万"后缀的文本 |
| 数据统计日期 | text | 格式 "2026.5.15" |
⚠️ 注意 播放量 和 互动量合计 是 text 类型,不是 number!传数字会报 TextFieldConvFail。
After each collection run, delete:
/tmp/sm-collect/ (intermediate JSON)/tmp/screenshots/ (browser screenshots)/tmp/subagent-out/ (if spawned sub-agents).json temp files in workspacewait_until="domcontentloaded" See references/platform-guide.md for detailed per-platform experience including: