Distillery

Created By
norrietaylor2 months ago
Persistent shared context for AI coding sessions — capture, search, synthesize with 18 MCP tools and ambient intelligence
Overview

Distillery

Distillery

Team Knowledge, Distilled
Capture, classify, connect, and surface team knowledge through conversational commands.

Documentation · Skills · Quick Start · Roadmap · Slides

PyPI version PyPI downloads License Python version


What is Distillery?

Distillery is a team knowledge base accessed through Claude Code skills. It refines raw information from working sessions, meetings, bookmarks, and conversations into concentrated, searchable knowledge — stored as vector embeddings in DuckDB and retrieved through natural language. Runs locally over stdio or as a hosted HTTP service with GitHub OAuth for team access.

Distillery captures the highest-value transformation — from noise to signal — and makes it a tool the whole team can use.

Full documentation: norrietaylor.github.io/distillery

Distillery demo — /distill captures a decision, /pour synthesizes it

Skills

Distillery provides 14 Claude Code slash commands:

SkillPurposeExample
/distillCapture session knowledge with dedup detection/distill "We decided to use DuckDB for local storage"
/recallSemantic search with provenance/recall distributed caching strategies
/pourMulti-entry synthesis with citations/pour how does our auth system work?
/bookmarkStore URLs with auto-generated summaries/bookmark https://example.com/article #caching
/minutesMeeting notes with append updates/minutes --update standup-2026-03-22
/classifyClassify entries and triage review queue/classify --inbox
/watchManage monitored feed sources/watch add github:duckdb/duckdb
/radarAmbient feed digest with source suggestions/radar --days 7
/tuneAdjust feed relevance thresholds/tune relevance 0.4
/digestTeam activity summary from internal entries/digest --days 7 --project myapp
/gh-syncSync GitHub issues/PRs into the knowledge base/gh-sync owner/repo --issues
/investigateDeep context builder with relationship traversal/investigate distributed caching
/briefingTeam knowledge dashboard with metrics/briefing --days 7
/setupOnboarding wizard for MCP connectivity and config/setup

Quick Start

Step 1: Install the Plugin

claude plugin marketplace add norrietaylor/distillery
claude plugin install distillery

This installs all 14 skills. The plugin defaults to a hosted demo server — you can start using Distillery immediately.

Demo Server: distillery-mcp.fly.dev is for evaluation only. Do not store sensitive or confidential data.

For a private knowledge base, run the MCP server locally with uvx — no persistent install needed:

# Get a free API key from jina.ai, then:
export JINA_API_KEY=jina_...

Add to ~/.claude/settings.json (overrides the plugin's demo server):

{
  "mcpServers": {
    "distillery": {
      "command": "uvx",
      "args": ["distillery-mcp"],
      "env": {
        "JINA_API_KEY": "${JINA_API_KEY}"
      }
    }
  }
}

Restart Claude Code and run the onboarding wizard:

/setup

See the Local Setup Guide for full configuration options, or deploy your own instance for team use.

Development

uv pip install -e ".[dev]"
# or
pip install -e ".[dev]"
pytest                              # run tests
mypy --strict src/distillery/       # type check
ruff check src/ tests/              # lint

See Contributing for the full guide.

License

Apache 2.0 — see LICENSE for details.

Server Config

{
  "mcpServers": {
    "distillery": {
      "url": "https://distillery-mcp.fly.dev/mcp",
      "transport": "http"
    }
  }
}
Project Info
Created At
2 months ago
Updated At
2 months ago
Author Name
norrietaylor
Star
-
Language
-
License
-
Category

Recommend Servers

View All
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

13 hours ago
Shippo
@Shippo

21 hours ago