Step 12: RSS integration + feedreader filtering

In the 3-phase model, RSS feeds are pre-filtered automatically before you see them — the feedreader (feedreader-score.py) is part of phase 1, not a separate phase. It scores feeds daily so that your feed reader only shows items that are likely relevant to your research. You then browse this curated selection and send interesting items to Zotero _inbox. Only in phase 2 do you decide what goes into the vault.

12a. Feedreader — Automatic relevance filtering

feedreader-score.py runs daily via launchd and produces a filtered, scored Atom feed and HTML reader from your RSS subscriptions. It uses the same ChromaDB preference profile as index-score.py — items are scored by semantic similarity to your existing library.

Install dependencies (if not already present from step 10):

~/.local/share/uv/tools/zotero-mcp-server/bin/pip install feedparser sentence-transformers youtube-transcript-api

youtube-transcript-api is used to fetch transcripts for YouTube items in your feeds. These transcripts enrich the relevance score (instead of scoring on the title alone) and are cached in .claude/transcript_cache/ so they are only fetched once per video.

Configure your feeds — add one URL per line to .claude/feedreader-list.txt:

https://arxiv.org/rss/econ.GN
https://www.skipr.nl/feed/
http://onlinelibrary.wiley.com/rss/journal/10.1002/(ISSN)1099-1050

Create and load the launchd agents (run once after installation):

Both the HTTP server and the nightly batch jobs run as LaunchDaemons in /Library/LaunchDaemons/ — they fire even without an active user session, which is required when the Mac wakes from a scheduled pmset power-on.

Create the HTTP server daemon in /Library/LaunchDaemons/:

<!-- /Library/LaunchDaemons/nl.researchvault.feedreader-server.plist -->
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN"
  "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
  <key>Label</key>
  <string>nl.researchvault.feedreader-server</string>
  <key>EnvironmentVariables</key>
  <dict>
    <key>HOME</key>
    <string>/Users/YOUR_USERNAME</string>
  </dict>
  <key>ProgramArguments</key>
  <array>
    <string>/Users/YOUR_USERNAME/.local/share/uv/tools/zotero-mcp-server/bin/python3</string>
    <string>/Users/YOUR_USERNAME/path/to/ResearchVault/.claude/feedreader-server.py</string>
  </array>
  <key>RunAtLoad</key>
  <true/>
  <key>KeepAlive</key>
  <true/>
  <key>StandardOutPath</key>
  <string>/Users/YOUR_USERNAME/Library/Logs/feedreader-server.log</string>
  <key>StandardErrorPath</key>
  <string>/Users/YOUR_USERNAME/Library/Logs/feedreader-server.log</string>
</dict>
</plist>

Note: the daemon runs as root (no UserName key). The HOME environment variable must point to your user's home directory so that Python tools can find their configuration files. Without HOME, tools like rclone and zotero-mcp will fail to locate their config paths during headless runs.

The nightly batch jobs run via ~/bin/nachtelijke-taken.sh, called from a LaunchDaemon in /Library/LaunchDaemons/. Because it is a system-level daemon running as root, it fires at 06:00 even without an active user session — which is required when the Mac wakes from a scheduled pmset power-on. The script runs sequentially in 6 steps: Zotero DB update → feedreader-score → FreshRSS actualize → feedreader-learn → proton-backup → proton-mirror → shutdown. The FreshRSS actualize step runs immediately after feedreader-score so that FreshRSS fetches the freshly generated feeds before the Mac shuts down. Without this step, FreshRSS would not update until the next time the Mac is awake and the FreshRSS actualize is triggered again. See Step 12c for how the actualize step differs between setup options.

A second LaunchDaemon (nl.pietstam.overdagtaken) runs the same steps 1–4 (Zotero → feedreader-score → FreshRSS actualize → feedreader-learn) at 09:00, 12:00, 15:00, 18:00, and 21:00, keeping feeds fresh throughout the day. After the 21:00 run the Mac also shuts down — but only if no user is logged in at the console. If you have manually turned on the Mac and are logged in (even with the screen locked), shutdown is skipped and a message is logged. The check uses stat -f%Su /dev/console: if it returns anything other than root, a user session is active. If the Mac was off when a scheduled time passed, launchd fires the missed job once immediately at next boot — not once per missed interval. The remaining scheduled times then run normally.

<!-- /Library/LaunchDaemons/nl.pietstam.nachtelijke-taken.plist -->
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN"
  "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
<plist version="1.0">
<dict>
  <key>Label</key>
  <string>nl.pietstam.nachtelijke-taken</string>
  <key>EnvironmentVariables</key>
  <dict>
    <key>HOME</key>
    <string>/Users/YOUR_USERNAME</string>
    <key>LAUNCHD_RUN</key>
    <string>1</string>
  </dict>
  <key>ProgramArguments</key>
  <array>
    <string>/bin/bash</string>
    <string>/Users/YOUR_USERNAME/bin/nachtelijke-taken.sh</string>
  </array>
  <key>StartCalendarInterval</key>
  <dict>
    <key>Hour</key>
    <integer>6</integer>
    <key>Minute</key>
    <integer>0</integer>
  </dict>
  <key>WorkingDirectory</key>
  <string>/Users/YOUR_USERNAME</string>
  <key>RunAtLoad</key>
  <false/>
</dict>
</plist>

Note: the daemon runs as root (no UserName key) to avoid sudo/PAM issues when the script shuts down the Mac at the end of the run. LAUNCHD_RUN=1 signals the script that it was launched by launchd — the shutdown at the end of the script only fires when this variable is set, so manual runs are safe. The script logs to a file via an internal exec >> redirect, so no StandardOutPath is needed in the plist.

Install and load both daemons as root:

# feedreader-server
sudo cp /path/to/nl.researchvault.feedreader-server.plist /Library/LaunchDaemons/
sudo chown root:wheel /Library/LaunchDaemons/nl.researchvault.feedreader-server.plist
sudo chmod 644 /Library/LaunchDaemons/nl.researchvault.feedreader-server.plist
sudo launchctl load /Library/LaunchDaemons/nl.researchvault.feedreader-server.plist

# nachtelijke-taken
sudo cp /path/to/nl.pietstam.nachtelijke-taken.plist /Library/LaunchDaemons/
sudo chown root:wheel /Library/LaunchDaemons/nl.pietstam.nachtelijke-taken.plist
sudo chmod 644 /Library/LaunchDaemons/nl.pietstam.nachtelijke-taken.plist
sudo launchctl load /Library/LaunchDaemons/nl.pietstam.nachtelijke-taken.plist

macOS sleep/wake settings — configure a scheduled wake so the Mac powers on automatically before the 06:00 batch run:

Scheduled wake for the nightly daemon — set a recurring daily wake time at least 30 minutes before the batch job so that UserEventAgent-System has time to fully initialize before the 06:00 trigger fires:
```
sudo pmset repeat wakeorpoweron MTWRFSU 05:55:00
```
The Mac wakes at 05:55, the daemon fires at 06:00, and the Mac shuts down automatically at the end of the script. Check the schedule with pmset -g sched; cancel with sudo pmset repeat cancel. If the wake time is too close to 06:00 (< 30 min), UserEventAgent-System may not be ready in time and launchd silently skips the trigger — the job will then only run the next day.
Network wake for iPhone/iPad access — in System Settings → Energy (Dutch: Energie-instellingen), enable "Schakel sluimerstand uit voor netwerktoegang". Despite the wording ("disable sleep for network access"), this puts the Mac into a lighter sleep state rather than deep sleep, keeping the network interface active so the HTTP server on port 8765 remains reachable from iPhone or iPad. The Mac still saves significant power compared to staying fully awake.

With both settings active you can put the Mac mini to sleep at the end of the day instead of just locking the screen.

Run manually (first time, or on demand):

~/.local/share/uv/tools/zotero-mcp-server/bin/python3 .claude/feedreader-score.py

Access the filtered feeds:

HTML reader (Mac/iPhone/iPad): http://localhost:8765/filtered.html
Atom feed — web articles (NetNewsWire): http://[mac-ip]:8765/filtered-webpage.xml
Atom feed — YouTube (NetNewsWire): http://[mac-ip]:8765/filtered-youtube.xml
Atom feed — podcasts (NetNewsWire): http://[mac-ip]:8765/filtered-podcast.xml

Three separate Atom feeds are produced — one per source type. Each feed only contains items of that type, which lets you organise them into folders in NetNewsWire and apply different reading flows per type. Use the Mac's LAN IP instead of localhost so the feeds also work on iPad and iPhone.

The HTML reader includes an ⌨️ terminal button in the header. Clicking it opens an embedded terminal panel (powered by ttyd — see Step 17) alongside the article list, so you can run Phase 2 (Claude Code) without leaving the browser tab. The terminal works both on Mac (via localhost) and on iPad (via the Mac's local IP address).

YouTube articles: clicking a YouTube headline in the HTML reader opens a generated reading article at http://localhost:8765/article/{video_id} instead of going to YouTube. The article (Introduction + Key Points + Conclusion, written in the original video language) is generated locally by qwen2.5:7b via Ollama. The first visit takes 30–60 seconds; a loading page refreshes automatically every 5 seconds until it is ready. Subsequent visits are instant (cached in .claude/article_cache/).

Podcast articles: clicking a podcast headline opens a similar generated article at http://localhost:8765/article/podcast/{episode_id}, based on the episode's show notes rather than an audio transcript. Only episodes with show notes of at least 200 characters get an article page; episodes with thinner show notes link directly to the source. Generation follows the same async pattern as YouTube articles.

The article page (for both YouTube and podcast) injects the full article text as rft.description in a COinS span (<span class="Z3988">), so the text appears automatically in the Abstract field when you save the page via the Zotero Connector.

Serve directory: the HTTP server serves files from ~/.local/share/feedreader-serve/, not from ~/Documents/, because macOS TCC prevents system Python from accessing the Documents folder when launched via launchd.

Learning loop — feedreader-learn.py runs daily as part of the nightly batch job. It reads NNW stars and read items from FreshRSS, matches Zotero additions by URL and title, and processes the 👎 skip queue — labelling each item in score_log.jsonl as positive or negative. After ≥30 positives it prints a threshold recommendation. Once the threshold is stable, activate score filtering in feedreader-score.py by adjusting THRESHOLD_GREEN and THRESHOLD_YELLOW.

Privacy note: feedreader-score.py runs entirely locally. Feed URLs are fetched directly from the source; no feed content is sent to any cloud service.

12b. RSS feeds via NetNewsWire

NetNewsWire is a free, open-source RSS reader for macOS and iOS. Rather than subscribing to individual feeds, you subscribe to the three filtered feeds produced by the feedreader. This way your reading list only contains items that are likely relevant, sorted by relevance score.

Install:

brew install --cask netnewswire

Or download via netnewswire.com.

Connect NetNewsWire to FreshRSS (not directly to the feeds) — this is required for cross-device read/unread sync. See Step 12c for FreshRSS setup. Once FreshRSS is running, add a FreshRSS account in NetNewsWire on each device:

NetNewsWire → Settings → Accounts → + → FreshRSS

API URL: http://[mac-ip]:8080/api/greader.php (full path required — base URL alone does not work)
Username + API password

NetNewsWire will then show the three filtered feeds (webpage, YouTube, podcast) via FreshRSS, with read/unread status synced across all devices.

Titles are prefixed with score and label (🟢 54 | Title…). Items in each Atom feed are sorted by relevance score: the feedreader assigns a synthetic publication time within today's date (higher score = later time) so that higher-scoring items appear first in NetNewsWire's Newest First sort order. The server always responds with HTTP 200 for feed requests (never 304 Not Modified), ensuring NetNewsWire refreshes the feed contents on every poll.

Each Atom feed is limited to the top 300 items by score. Items older than 30 days (web articles, podcasts, YouTube) or 365 days (academic journal feeds) are automatically excluded.

Enable JavaScript in NetNewsWire — required for the 👎 action button to work:

NetNewsWire → Settings → Article Content
Check "Enable JavaScript"

Action buttons in the article view — each item in NetNewsWire has three buttons at the bottom of the article:

Button	Action
👎 Overslaan	Marks the item as skipped — sends a negative signal to the learning loop

The button fires a silent HTTP request to the local feedreader server via new Image().src — no browser tab opens. The item's URL and title are written to skip_queue.jsonl; feedreader-learn.py processes the queue the next morning.

Note: the ✅ and 📖 Zotero action buttons have been removed. NNW stars (via FreshRSS) now serve as the primary positive signal to the learning loop. Only the 👎 skip button remains active — it is fully local and does not require Zotero.

Auto-starring — items with a Bayesian score ≥70 (THRESHOLD_STAR) are automatically starred in FreshRSS/NNW. feedreader-score.py writes their URLs to /tmp/feedreader-star-queue.txt; feedreader-learn.py reads that queue each morning and stars the items via the FreshRSS GReader API before processing any other signals. This means high-confidence items appear starred in NetNewsWire without any manual action.

Add your source feeds to .claude/feedreader-list.txt instead of directly to NetNewsWire. Useful sources:

Journal RSS (e.g. BMJ, NEJM, Wiley Health Economics)
PubMed searches: https://pubmed.ncbi.nlm.nih.gov/rss/search/?term=[searchterm]&format=abstract
Policy sites and government newsletters
Trade blogs (e.g. Zorgvisie, Skipr)
Substack: [name].substack.com/feed

Phase 1: Items in the filtered feed have not yet been saved — they only exist in your feed reader. You browse through them and scan the scored headlines. Only what is truly relevant gets forwarded to Zotero. That is the phase 1 moment.

From NetNewsWire to the vault (phase 1 → phase 2 → phase 3):

Interesting articles are saved via the Zotero browser extension or iOS app: open the article in a browser, click the Zotero icon → item is saved with full metadata to Zotero _inbox. Use this route when you want annotation capabilities or need richer metadata.

Privacy note: NetNewsWire stores feed data locally. No reading habits are sent to external servers.

12c. FreshRSS — read/unread sync across devices

Without a sync backend, read/unread status in NetNewsWire is device-local: marking an item as read on your Mac is not visible on your iPad. FreshRSS is a self-hosted RSS sync server that solves this — it stores the feed items centrally so all devices stay in sync. There are three ways to set this up, each with different trade-offs.

Option A: external sync service (simplest)

Services such as Inoreader or Feedly support the Google Reader API and work with NetNewsWire out of the box. Add the three filtered Atom feeds produced by the feedreader as subscriptions in the service:

http://[mac-ip]:8765/filtered-webpage.xml
http://[mac-ip]:8765/filtered-youtube.xml
http://[mac-ip]:8765/filtered-podcast.xml

Then connect NetNewsWire to the service account (Settings → Accounts → select the service).

Trade-offs: zero infrastructure to manage, works immediately. The downside is that your filtered feed content — scored headlines and summaries — is sent to an external server. This conflicts with the privacy-first design of the rest of the workflow.

Option B: FreshRSS on Mac mini (Docker)

FreshRSS runs in a Docker container on the same Mac mini that generates the feeds. This keeps everything local.

Advantages over Option A: feed content never leaves your machine. FreshRSS is free and open-source. No account or subscription required.

Install Docker Desktop:

brew install --cask docker

Open Docker Desktop from Launchpad and wait until the whale icon in the menu bar stops animating.

Start FreshRSS:

docker run -d \
  --name freshrss \
  -p 8080:80 \
  --restart unless-stopped \
  freshrss/freshrss

Open http://localhost:8080 in your browser and complete the FreshRSS installation wizard. Create an admin account.

Set API password (required for NetNewsWire):

FreshRSS → click your username top-right → Profile → scroll to API Management → set an API password.

Add the three feeds in FreshRSS (use the Mac mini's LAN IP, not localhost, so other devices can reach the feedreader server):

http://[mac-ip]:8765/filtered-webpage.xml
http://[mac-ip]:8765/filtered-youtube.xml
http://[mac-ip]:8765/filtered-podcast.xml

FreshRSS actualize step in nachtelijke-taken.sh — the script triggers FreshRSS to fetch the freshly generated feeds immediately after feedreader-score.py finishes, by running actualize_script.php directly inside the Docker container on the Mac mini:

docker exec freshrss php /var/www/FreshRSS/app/actualize_script.php

Connect NetNewsWire on each device (Mac, iPad, iPhone):

NetNewsWire → Settings → Accounts → + → FreshRSS

API URL: http://[mac-ip]:8080/api/greader.php (full path required — base URL alone does not work)
Username + API password from above

Remote access outside your home network — use Tailscale Funnel to expose FreshRSS publicly over HTTPS without requiring Tailscale on the iPhone:

tailscale funnel --bg http://localhost:8080

This makes FreshRSS available at https://[machine-name].[tailnet].ts.net/. Tailscale Funnel is free and does not require port forwarding on the router. The Funnel URL is publicly accessible — protect FreshRSS with a strong password.

Trade-off: FreshRSS is only available when the Mac mini is awake. Since the Mac shuts down after the nightly run, NetNewsWire on iPhone or iPad cannot sync until the Mac is awake again (during one of the daytime runs, or when you wake it manually).

Note: FreshRSS may show a "Niet ingedeeld" (Uncategorised) folder in NetNewsWire alongside any custom categories. This is a known limitation of the Google Reader API integration — the folder is empty and does not affect functionality.

Option C: FreshRSS on Home Assistant Green (current setup)

FreshRSS runs in a Docker container on a Home Assistant Green device, which runs 24/7 independently of the Mac mini. The Mac mini generates the filtered feeds as before; after generating them it triggers FreshRSS on the HA Green via an HTTP actualize request (over Tailscale) to fetch them. From that point the Mac can shut down — FreshRSS keeps the feeds available all day from the HA Green.

Advantages over Option B: FreshRSS is available 24/7 even when the Mac mini is off. NetNewsWire can sync at any time, not only during Mac awake windows. Docker Desktop is not needed on the Mac mini. Remote access requires only Tailscale (no public Funnel URL).

Prerequisites:

Home Assistant Green (or any always-on Linux host with Docker)
SSH add-on installed on HA Green, with an SSH key from the Mac mini authorised
Tailscale running on both the Mac mini and the HA Green (so the Mac mini can reach HA Green by its Tailscale IP, and iPhone/iPad can reach it too)

Start FreshRSS on HA Green — SSH into HA Green and run:

docker run -d \
  --name freshrss \
  --network host \
  --restart unless-stopped \
  freshrss/freshrss

--network host maps port 80 directly to the host, making FreshRSS reachable at http://[ha-green-ip]:80. Open that address in your browser and complete the installation wizard.

Set API password — same as Option A: FreshRSS → Profile → API Management.

Add the three feeds in FreshRSS on HA Green — use the Mac mini's Tailscale IP so HA Green can reach the feedreader server even outside the home LAN:

http://[mac-mini-tailscale-ip]:8765/filtered-webpage.xml
http://[mac-mini-tailscale-ip]:8765/filtered-youtube.xml
http://[mac-mini-tailscale-ip]:8765/filtered-podcast.xml

FreshRSS actualize step in nachtelijke-taken.sh — the script triggers FreshRSS on HA Green via HTTP immediately after feedreader-score.py finishes, while the Mac mini is still awake and the feedreader server is still running:

# Store credentials in ~/bin/.freshrss-env (chmod 600):
# FRESHRSS_USER=your_username
# FRESHRSS_TOKEN=your_master_auth_token   # FreshRSS → Settings → Profile → Authentication token
source ~/bin/.freshrss-env
curl -s --max-time 60 \
  "http://[ha-green-tailscale-ip]:8080/i/?c=feed&a=actualize&ajax=1&maxFeeds=50&user=${FRESHRSS_USER}&token=${FRESHRSS_TOKEN}"

The flow has two directions. First, the Mac mini initiates: it sends a curl request to FreshRSS on HA Green ("please actualize"). Then the direction reverses: FreshRSS on HA Green pulls the XML feeds from the Mac mini's feedreader server (port 8765, reachable via Tailscale Funnel). The Mac mini must therefore still be running when the actualize step fires — and it is, because shutdown only happens after all pipeline steps complete.

The full sequence: feedreader-score generates the XML files → Mac mini's HTTP server serves them on port 8765 → curl tells FreshRSS on HA Green to actualize → FreshRSS pulls the feeds from the Mac mini → FreshRSS stores the items internally → Mac mini shuts down. After shutdown, FreshRSS on HA Green continues to serve the stored items to NetNewsWire.

Connect NetNewsWire on each device — use the HA Green's Tailscale IP:

NetNewsWire → Settings → Accounts → + → FreshRSS

API URL: http://[ha-green-tailscale-ip]:80/api/greader.php
Username + API password from above

Tailscale must be installed and active on iPhone and iPad for the Tailscale IP to be reachable outside the home network.

SSH add-on Protection Mode: Protection Mode on the HA SSH add-on is enabled. The actualize step uses a direct HTTP call to FreshRSS instead of sudo docker exec, so no Docker socket access is required. When SSH-ing into HA Green for manual Docker management, use docker without sudo — the hassio user has direct Docker group access.

12d. Feedback signals: training the scoring

feedreader-learn.py runs daily and labels each item in score_log.jsonl based on signals from FreshRSS, Zotero, and the skip queue. The labels are used to calibrate the scoring threshold.

Signal hierarchy — positive (highest priority first):

#	Signal	Source	Recorded as
1	⭐ Starred in NetNewsWire	FreshRSS GReader API	`added_to_zotero: true`, `starred_in_freshrss: true`
2	URL found in Zotero library	Zotero SQLite	`added_to_zotero: true`
3	Title match in Zotero library	Zotero SQLite	`added_to_zotero: true`

Signal hierarchy — negative:

#	Signal	Source	Recorded as
4	Read in NNW, not in Zotero after N days	FreshRSS GReader API	`added_to_zotero: false`, `read_in_nnw: true`
5	No action after 3 days (timeout)	score_log age	`added_to_zotero: false` — strongest negative

Explicit negative (tracked separately):

#	Signal	Source	Recorded as
6	👎 pressed in NetNewsWire	feedreader-server skip queue	`skipped: true` immediately

Ambiguity: items labelled by timeout (type 5) are indistinguishable from items you never saw. Only types 1–4 and 6 carry a clear intent signal. feedreader-learn.py reports all categories separately so you can track signal quality over time.

How to use the ⭐ star:

Star an item in NetNewsWire to send the strongest positive signal. feedreader-learn.py reads starred items from FreshRSS each morning.
You do not need to save the item to Zotero for the star to count — though saving it to Zotero (signal 2) also works.

How to use the 👎 button:

When a headline is clearly off-topic, press 👎 in NetNewsWire — no need to open the article.
The rejection is queued in skip_queue.jsonl; feedreader-learn.py processes it the next morning.

Future use of explicit negatives: once enough skipped: true items have accumulated, they can be used to build a negative profile that penalises similarity to rejected content: score = sim(item, positive_profile) − λ × sim(item, negative_profile). The learning loop will advise when enough data is available.

12e. Academic feeds: from NetNewsWire to Zotero

For academic articles from NetNewsWire, the recommended route is to always add them to Zotero first before having them processed. This way you have BibTeX metadata and annotation capabilities available:

Open the article in a browser → click the Zotero icon → item is saved to _inbox with richer metadata
In the next research session, run index-score.py and process items from _inbox via the skill

Local Research Workflow