newsminimalist-rss/README.md

41 lines
1.0 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# NewsMinimalist RSS
从 [newsminimalist.com](https://www.newsminimalist.com) 抓取 Gemini AI 评分的新闻,生成 RSS Feed + HTML 页面。
## 架构
```
browserless (Chrome headless)
↓ /content API
scraper.py → 解析 <details> → 提取 [score] title (source) link
↓ JSON 缓存
server.py → RSS 2.0 + HTML
↓ NPM 反代
https://rsshub.arabiancloud.online/newsminimalist
```
## 文件
| 文件 | 说明 |
|:-----|:-----|
| `scraper.py` | 爬虫browserless 渲染 → BeautifulSoup 解析 → Google 翻译中文 → JSON 缓存 |
| `server.py` | 服务:读 JSON 缓存 → 输出 RSS 2.0 + Atom + HTML |
## 部署
```bash
# 1. 拉取浏览器镜像
docker pull browserless/chrome
# 2. 构建并运行
docker build -t newsminimalist-rss .
docker run -d --name newsminimalist-rss -p 1202:1202 \
--network rsshub_default \
-v /root/news_cache.json:/root/news_cache.json \
newsminimalist-rss
# 3. 定时抓取(建议 UTC 02:50, 14:50
crontab -e
50 2,14 * * * docker exec newsminimalist-rss python3 /app/scraper.py
```