Why can't ChatGPT browse or Claude WebFetch read my paid Substack subscriptions?

Both use server-side fetchers from datacenter IPs without your authentication. They see the paywall HTML, not the article. Even if you paste your Substack cookies into the prompt (don't do this), automation triggers anti-abuse detection that locks the account.

What does a clean Substack article Markdown look like?

Title and subtitle at the top, author and date, full article body as Markdown with images replaced by alt text, footnotes preserved as a numbered references section, embedded video/audio replaced by captioned placeholders. Typical 3,000-word Substack post comes out as ~8-10k tokens of clean Markdown.

Is converting paid Substack content for personal AI use allowed?

Reading content you legitimately subscribe to and converting it to your own preferred format for personal use falls under normal subscriber rights. The boundary is redistribution — don't share the Markdown with others, don't use it for commercial training, don't repost. Personal AI prompts and your own archive are fine.

Can I bulk-archive a year of Substack subscriptions for an end-of-year synthesis?

Yes. Web2MD's queue feature handles this well — open each post in a tab, queue, bulk-export as one Markdown file. 50 substantive Substack posts is ~150k tokens; Claude Opus 4.7's 1M context easily handles cross-newsletter synthesis questions like 'what were the 3 big themes in tech writing this quarter?'

What about Substack's official archive feature?

Substack lets you download your subscription archive as JSON via account settings. Useful for backup, but the JSON format is not directly Claude-readable and requires reformatting. Web2MD's per-article extraction produces immediately usable Markdown, and works for posts you've already read but didn't archive.

Does this work for Substack comments?

Web2MD's Substack extractor pulls the article and the top comments. Comments on substantive Substacks often contain corrections, counterarguments, and links to related work — useful research signal alongside the article. Author replies are included.

Substack Article to Markdown for AI: Reading Paid Newsletters with Claude (2026)

Substack is where the most considered writing on technology, economics, geopolitics, and culture happens in 2026. The best Substacks — Lenny's Newsletter, Stratechery (now on Substack), Doomberg, Slow Boring — have replaced what magazines used to do, for an order of magnitude lower subscription price.

Most of it is paid. The paywall is good — it filters who's there. But it makes feeding Substack content into Claude harder than it should be.

This post is the workflow that works.

Why your AI tools can't read paid Substacks

Standard server-side fetchers (ChatGPT browse, Claude WebFetch, Jina Reader, Firecrawl) all hit Substack URLs from their own servers without your authentication. Substack returns the paywall HTML:

<div class="paywall">
  <h2>This is a subscriber-only post.</h2>
  <p>Subscribe to read the rest...</p>
</div>

The AI then "summarizes" what's there — which is the paywall, not the article. Useless.

Even if you wanted to share your Substack auth cookies with these tools (you shouldn't), automation triggers anti-abuse detection. Substack's anti-bot system can lock the subscribing account.

The only category that works: read the article in your browser (where you're authenticated), extract the rendered content there.

What clean Substack Markdown looks like

After running through a Substack-aware extractor:

# What's actually changed in the tech labor market

**Author**: Lenny Rachitsky · **Published**: 2026-05-28
**Subscribers-only**: Yes (you have access)
**Source**: https://www.lennysnewsletter.com/p/tech-labor-market-2026

> Over the past 6 months, hiring patterns at tech companies have shifted in
> three ways that don't get talked about enough.

## The headline numbers everyone is missing

[Article body with proper paragraphs]

## What this means for individual contributors

[Continues...]

## What this means for managers

[Continues...]

## Footnotes

[1] Layoffs.fyi tracker — https://layoffs.fyi
[2] Pragmatic Engineer survey — https://newsletter.pragmaticengineer.com/

## Top comments

- **@reader1**: "Important caveat to point 2: the engineering manager numbers
  also include..."
- **@author_reply (Lenny)**: "Great point — I should have included that."
- **@reader2**: "The contrast with 2021 is striking..."

Token-efficient, properly structured, ready to paste into Claude as research input.

The workflow

Two paths:

Path 1: Web2MD Substack extractor (interactive)

Open the paid Substack article in Chrome (where you're already logged in as a subscriber). Click Web2MD. The Substack-specific extractor:

Reads the rendered article DOM in your authenticated session
Strips the Substack chrome (subscriber count badges, share buttons, related posts widget)
Captures title, subtitle, author, publish date
Preserves footnotes as a numbered references section
Pulls top comments and author replies
Outputs clean Markdown ready for Claude or your archive

End-to-end: ~5 seconds per article. The extension reads what's already on your screen — no cookie sharing, no third-party authentication.

Path 2: Browser-side bookmarklet for one-offs

If you don't want a full extension:

// Save as bookmarklet, click on a Substack article page
javascript:(function(){
  const title = document.querySelector('h1')?.innerText || '';
  const author = document.querySelector('.author-name')?.innerText || '';
  const body = document.querySelector('.post-body, .available-content')?.innerText || '';
  const md = `# ${title}\n\n**Author**: ${author}\n\n${body}`;
  navigator.clipboard.writeText(md);
  alert('Markdown copied to clipboard');
})();

Simpler than Web2MD but loses footnote handling, comment extraction, and doesn't generalize to other sites.

Each quarter I do a "what did the best tech newsletters say?" synthesis. Sources:

Lenny's Newsletter (PM/growth)
Stratechery (tech strategy)
Pragmatic Engineer (eng management)
Doomberg (commodities, but useful tech context)
Slow Boring (politics, edge cases for tech regulation)

5 newsletters × ~12 posts each over a quarter = ~60 posts. Web2MD queue + bulk export: ~40 minutes including skim-reading. Combined corpus: ~180k tokens.

Pasted into Claude Opus 4.7 with: "These are 60 newsletter posts from Q2 2026. Identify the 4 themes that multiple newsletters converge on, and the 2 where there's substantive disagreement. Quote specific posts and authors."

Output: a 6-page synthesis with quote-level citations. Total time: ~75 minutes for what's effectively a quarter-end industry recap.

What this is not

Honest about limits:

Not a substitute for reading. The synthesis is useful for high-level pattern recognition. For load-bearing decisions, read the actual posts.
Not for redistribution. Personal AI use of content you subscribe to is fine; sharing the Markdown with others violates your subscriber agreement.
Not for free-tier abuse. If you cancel a Substack and use Web2MD to keep accessing old posts you saved while subscribed, that's the kind of edge case authors will rightfully push back on. The intent is "feed your subscriptions into your AI workflow," not "save everything before unsubscribing."
Not for newsletter platforms other than Substack (without testing). Beehiiv, Ghost, Buttondown all have different structures. Web2MD handles each separately.

Pairing with other workflows

Substack content composes well with:

Reddit-to-Claude: Substack for considered analysis, Reddit for user reactions
Hacker News threads: HN for engineering perspectives on what Substacks discuss
Fill 1M context window: 60 Substack posts is comfortably within Claude's context budget
LinkedIn posts: LinkedIn for the same authors' shorter takes

The big picture

Paid Substacks are the modern equivalent of subscribing to The Economist or The Atlantic — a curated stream of considered analysis you pay for because the quality is worth it. The benefit of feeding them into Claude is the same benefit clipping articles into a notebook always was: you process more, you connect ideas across sources, you build a research base.

The mechanical step — extracting clean Markdown from a paywalled SPA — has been the missing piece. With a browser-side extractor, it's now a 5-second click.

Install

Web2MD on the Chrome Web Store →

Free tier: 3 conversions/day. Pro at $9/mo unlocks unlimited + queue + bulk export + dedicated Substack extractor that handles footnotes, author replies, and your subscription archive.

Substack Article to Markdown for AI: Reading Paid Newsletters with Claude (2026)

Substack Article to Markdown for AI: Reading Paid Newsletters with Claude (2026)

Why your AI tools can't read paid Substacks

What clean Substack Markdown looks like

The workflow

Path 1: Web2MD Substack extractor (interactive)

Path 2: Browser-side bookmarklet for one-offs

What this is not

Pairing with other workflows

The big picture

Install

Related Articles

Hacker News Thread to Markdown for Claude Research (2026)

Extend Perplexity Research With Your Sources

".md This Page": How to Turn the Page You're On Into Markdown Instantly

Most Read

Latest Articles

Substack Article to Markdown for AI: Reading Paid Newsletters with Claude (2026)

Why your AI tools can't read paid Substacks

What clean Substack Markdown looks like

The workflow

Path 1: Web2MD Substack extractor (interactive)

Path 2: Browser-side bookmarklet for one-offs

A real workflow: quarterly tech newsletter synthesis

What this is not

Pairing with other workflows

The big picture

Related

Install

Related Articles

Hacker News Thread to Markdown for Claude Research (2026)

Extend Perplexity Research With Your Sources

".md This Page": How to Turn the Page You're On Into Markdown Instantly

Most Read

Latest Articles