feed // cutthecrap

`j` / `↓`	next row
`k` / `↑`	prev row
`o` / `Enter`	open focused row
`1`..`5`	jump to category
`c`	toggle channels
`?`	toggle this overlay
`Esc`	close overlay

this week

Four-step audit for subtle AI hallucinations in high-stakes docs: 1) finish AI draft, 2) extract claims to table (w/ sources), 3) validate vs source via 4 labels (supported/conflicts/no-proof/needs-human), 4) rewrite—all in fresh chats. Prompts in presentation.
4-Step AI Audit Catches 'Almost Right' Errors
2dDylan DavisAI
Live demo of Archon using "harness engineering": YAML DAG workflows, git worktrees for isolated parallel agents, and auto-loading skills with Claude Code for more consistent PRs. Model quality still matters, and workflows need upfront design.
Archon Makes AI Coding Agents Deterministic via Harness Engineering
2dBetter StackAI
Breakdown of Codex agent scaffolding: prompts for one-offs, skills as markdown "house styles" for reuse, plugins packaging full workflows with integrations, plus MCPs/hooks/scripts—decision tree in the article.
AI Agents Need Scaffolding: Prompts to Plugins
2dNate B JonesAI
Walkthrough of installing vidIQ MCP connector in Claude Code to audit channel metrics (VPH trends, outliers), compare to competitors like Nate O'Brien, spot title/thumbnail gaps, and prompt a live analytics dashboard.
vidIQ MCP enables Claude to audit YouTube channels
2dDuncan Rogoff | AI AutomationAI
YC Founder Firesides chat with Trigger.dev co-founders on three product pivots: v1 "Zapier for devs," v2 embedded async tasks, v3 hosted SDK execution—hitting PMF as AI agents drove 90% usage.
Trigger.dev Pivots to AI Agents, Hits PMF with 90% Usage
2dY CombinatorDev Tooling
Walkthrough of Cursor's /goal command best practices: enable via /features enable goal, prompt with explicit verifiable "done" states (e.g., Playwright visual checks, quantified targets like "20 issues"), initial alignment chats, and scaffold via npx goalbody for goal.md + state.yaml files.
Codex /goal tips: Define verifiable 'done' states
2dAI JasonAI
Walkthrough and demo of Symphony, OpenAI's Elixir tool for spinning up Codex agents on Linear issues — highlights prompting an agent with a 2k-line spec to build a custom version (e.g., Python), plus hooks for repo cloning/PRs. Compares to MultiOn/Conductor but skips real-project readiness details.
OpenAI's Symphony: Autonomous Codex Agents for Linear Issues
2dBetter StackAI
Verdent Manager overview: coordinates AI agents by decomposing ideas into parallel tasks with long-term memory of your stack/preferences, specialist skills for testing/deployment, and Slack/Telegram messaging — invite-only, with Eco Mode/BYOK for costs.
Verdent Manager Coordinates Idea-to-Deployed App Builds
2dAICodeKingAI
Walkthrough of Printing Press, a Go-based CLI factory/library with 50+ pre-builts (ESPN, Craigslist, etc.) that agents like Claude Code invoke token-efficiently vs MCPs (35x fewer tokens, 100% vs 72% reliability) or raw APIs. Demos setup and custom CLI for Skool.
Printing Press: CLI Factory for AI Agents
2dNate Herk | AI AutomationAI
Weekly AI podcast recaps essays from Ezra Klein/Alex Ess/A16Z on relational job growth over apocalypse fears, plus Anthropic/OpenAI enterprise JVs and Wall Street data center bets as maturation signs.
AI Shifts: No Job Doom, Infra Boom Ahead
2dThe AI Daily BriefAI
Demo of oMLX using MLX's Two-Tier KV cache to page inactive context to SSD on M2 MacBook Pro: 47 t/s and 89% cache efficiency with Qwen 3.6 (vs LM Studio's 16 t/s), enables multitasking, but hits occasional 400 context errors.
oMLX SSD KV Cache Enables 3x Faster LLMs on M2 Macs
2dBetter StackAI
Interview with Claude platform leads Angela Jiang and Katelyn Lesse on Managed Agents primitives (messages API, code execution sandbox, file systems, skills), production scaling hurdles, and a future of outcome+budget goals with self-writing harnesses.
Claude Managed Agents: Production-Ready AI Infra from Anthropic
2dEveryAI

$ keymap

channels

today

Vori's AI OS Digitizes $1.5T US Grocery Retail

Agent Judge Layer Guards Production Actions

Gemini File Search Adds Multimodal RAG

5 Levels to Eliminate Bash Risk in AI Agents

AI Expands Economy's Demand Frontier, Creating Human-Premium Jobs

Okara AI CMO deploys site-analyzing marketing agents

90M Falcon Runs on 2014 Raspberry Pi

Agentic Coding Trap: Cognitive Debt Hits Hard

yesterday

Lily Hack: AI Procurement Ignores Agent Realities

Codex Chrome Extension Enables Signed-In Browser Tasks

Codex /goal: Simple Harness for Hour-Long AI Coding Agents

Build Self-Improving Hermes AI Agent on VPS

Pomelli Catalog Imports Products for Scaled Campaigns

this week

4-Step AI Audit Catches 'Almost Right' Errors

Archon Makes AI Coding Agents Deterministic via Harness Engineering

AI Agents Need Scaffolding: Prompts to Plugins

vidIQ MCP enables Claude to audit YouTube channels

Trigger.dev Pivots to AI Agents, Hits PMF with 90% Usage

Codex /goal tips: Define verifiable 'done' states

OpenAI's Symphony: Autonomous Codex Agents for Linear Issues

Verdent Manager Coordinates Idea-to-Deployed App Builds

Printing Press: CLI Factory for AI Agents

AI Shifts: No Job Doom, Infra Boom Ahead

oMLX SSD KV Cache Enables 3x Faster LLMs on M2 Macs

Claude Managed Agents: Production-Ready AI Infra from Anthropic