HelloGrowth CRM MCP

Created By

MeruLocal, 9 days ago

HelloGrowth CRM MCP is an AI-powered CRM integration platform that enables secure AI access to CRM data, lead and customer management, deal tracking, task automation, and business workflow orchestration through a standardized MCP interface.

# crm

# ai


Overview

mcp-bot-crawler

An MCP (Model Context Protocol) server that helps you discover, identify, and govern every bot interacting with your website: search engines, AI crawlers, SEO tools, social-preview fetchers, security scanners, and the long tail of suspicious scripts. Plug it into any MCP-capable client (Claude Desktop, Cursor, Claude Code, a custom Agent SDK app, etc.) and ask natural-language questions about your traffic.

It is polite by design: it respects robots.txt, rate-limits its own fetches, advertises an honest User-Agent, and never tries to bypass any control.

Features

Eight MCP tools covering the full bot-governance lifecycle (scan, analyze, verify, list, generate, suggest, export).
Curated database of 55+ well-known bots — Googlebot, Bingbot, GPTBot, ChatGPT-User, ClaudeBot, PerplexityBot, Amazonbot, Google-Extended, Applebot-Extended, FacebookBot, LinkedInBot, AhrefsBot, SemrushBot, Bytespider, and more — each tagged with category, operator, baseline risk, and reverse-DNS verification suffixes.
Behavioural risk scoring: combines UA matching, robots.txt compliance, error rate, request rate, and unique-path fan-out into a 0–100 score and a recommended action (allow / monitor / rate-limit / block / verify-identity).
Strong identity verification via forward-confirmed reverse DNS, i.e. a PTR lookup plus forward DNS (same method documented by Google, Microsoft, OpenAI).
robots.txt + sitemap.xml parser with proper longest-match Allow/Disallow semantics.
Reports in Markdown, JSON, and CSV.
TypeScript-first, modular file layout, zero unsafe parsing.
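The longest-match Allow/Disallow semantics mentioned in the feature list can be illustrated with a short sketch. This is not the code in src/core/robots-parser.ts: the `Rule` shape and the `isAllowed` name are hypothetical, and the sketch assumes plain prefix matching with no `*` or `$` wildcards.

```typescript
// Hypothetical rule shape; the real parser may model rules differently.
type Rule = { type: "allow" | "disallow"; path: string };

// Decide whether `urlPath` may be fetched under one user-agent group's rules.
// The longest matching rule wins; on an exact-length tie, Allow wins.
function isAllowed(rules: Rule[], urlPath: string): boolean {
  let best: Rule | undefined;
  for (const rule of rules) {
    // An empty pattern matches nothing, so "Disallow:" with no path blocks nothing.
    if (rule.path === "" || !urlPath.startsWith(rule.path)) continue;
    const longer = best === undefined || rule.path.length > best.path.length;
    const tieGoesToAllow =
      best !== undefined &&
      rule.path.length === best.path.length &&
      rule.type === "allow";
    if (longer || tieGoesToAllow) best = rule;
  }
  // No matching rule means the path is allowed by default.
  return best === undefined || best.type === "allow";
}

const rules: Rule[] = [
  { type: "disallow", path: "/private/" },
  { type: "allow", path: "/private/press/" },
];
console.log(isAllowed(rules, "/private/press/kit.html")); // the longer Allow rule wins
console.log(isAllowed(rules, "/private/notes.html"));     // only the Disallow rule matches
```

Rule order in the file does not matter under this scheme, which is why a narrow Allow can carve an exception out of a broad Disallow.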

Repository layout

mcp-bot-crawler/
├─ src/
│  ├─ index.ts                # entrypoint: loads .env, starts the MCP server
│  ├─ server.ts               # wires tools into ListTools / CallTool
│  ├─ tools/                  # one file per MCP tool
│  │  ├─ scan-website-bots.ts
│  │  ├─ analyze-access-logs.ts
│  │  ├─ verify-bot-identity.ts
│  │  ├─ list-allowed-bots.ts
│  │  ├─ list-blocked-bots.ts
│  │  ├─ generate-robots-txt.ts
│  │  ├─ suggest-bot-policy.ts
│  │  ├─ export-bot-report.ts
│  │  ├─ tool-types.ts
│  │  └─ index.ts
│  ├─ core/                   # detection engine
│  │  ├─ bot-detector.ts
│  │  ├─ aggregator.ts
│  │  ├─ log-parser.ts
│  │  ├─ robots-parser.ts
│  │  ├─ reverse-dns.ts
│  │  └─ crawler.ts           # polite HTTP client
│  ├─ data/known-bots.ts      # signature database
│  ├─ reports/report-generator.ts
│  └─ utils/                  # types, logger, rate limiter
├─ samples/
│  ├─ access.log              # realistic mixed-bot traffic
│  ├─ robots.txt
│  └─ sitemap.xml
├─ examples/usage.md
├─ reports/                   # generated reports land here
├─ .env.example
├─ package.json
├─ tsconfig.json
└─ README.md

Quick start

# From the hellocrmwebsite repo root:
cd mcp-bot-crawler

cp .env.example .env          # already pre-configured for hellogrowthcrm.com

npm install
npm run build

Run it on stdio:

node dist/index.js

Or run it in dev mode (no build step, uses tsx):

npm run dev

The server speaks MCP over stdio. Any MCP-capable client can launch it.

Claude Desktop / Claude Code

Add the following to your claude_desktop_config.json (or the equivalent mcpServers block in your client):

{
  "mcpServers": {
    "bot-crawler": {
      "command": "node",
      "args": ["/absolute/path/to/hellocrmwebsite/mcp-bot-crawler/dist/index.js"],
      "env": {
        "DEFAULT_TARGET_URL": "https://hellogrowthcrm.com",
        "DEFAULT_ACCESS_LOG": "/var/log/nginx/access.log",
        "CRAWLER_USER_AGENT": "mcp-bot-crawler/1.0 (+https://hellogrowthcrm.com/bot-info)"
      }
    }
  }
}

The eight MCP tools

Tool	What it does
`scan_website_bots`	Polite live scan: robots.txt + sitemap + sample pages, correlated with your access log.
`analyze_access_logs`	Parses Apache/Nginx Combined-format logs and returns per-bot summaries with risk scores.
`verify_bot_identity`	PTR + forward DNS verification of a specific `(ip, userAgent)` pair.
`list_allowed_bots`	Bots permitted under the current policy (default curated, or live robots.txt).
`list_blocked_bots`	Bots blocked under the current policy (default high-risk, or live robots.txt).
`generate_robots_txt`	Policy-driven robots.txt generator (block AI / SEO / scrapers / security scanners, declare sitemaps, set Crawl-delay).
`suggest_bot_policy`	For each bot observed in a log, recommends allow / monitor / rate-limit / block with a rationale and a ready-to-paste nginx snippet.
`export_bot_report`	Writes a Markdown / JSON / CSV report under `REPORT_OUTPUT_DIR`.

Full payload examples live in examples/usage.md.
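To make the input of analyze_access_logs concrete, here is a minimal sketch of parsing one Combined-format line. The `LogEntry` shape, the `parseCombinedLine` name, and the regular expression are illustrative assumptions, not the contents of src/core/log-parser.ts, and the sample line is made up.

```typescript
// Hypothetical entry shape for illustration.
interface LogEntry {
  ip: string;
  time: string;
  method: string;
  path: string;
  status: number;
  bytes: number;
  referer: string;
  userAgent: string;
}

// Combined format: host ident user [time] "request" status bytes "referer" "user-agent"
const COMBINED =
  /^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+)[^"]*" (\d{3}) (\S+) "([^"]*)" "([^"]*)"$/;

// Returns null for lines that do not fit the format (for example a malformed request field).
function parseCombinedLine(line: string): LogEntry | null {
  const m = COMBINED.exec(line.trim());
  if (!m) return null;
  return {
    ip: m[1],
    time: m[2],
    method: m[3],
    path: m[4],
    status: Number(m[5]),
    bytes: m[6] === "-" ? 0 : Number(m[6]), // "-" means no body was sent
    referer: m[7],
    userAgent: m[8],
  };
}

const sample =
  '66.249.66.1 - - [10/Oct/2025:13:55:36 +0000] "GET /pricing HTTP/1.1" 200 5120 "-" ' +
  '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"';
const entry = parseCombinedLine(sample);
console.log(entry?.path, entry?.status, entry?.userAgent);
```

Per-bot summaries would then come from grouping such entries by the matched bot signature.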

Full MCP tool catalog (81 tools)

Beyond the original eight bot-governance tools, this server exposes the entire hellogrowthcrm.com website — every module, feature, product, pricing table, AI agent, and integration — as MCP tools. Live content (blog, help, newsletter, forms, social proof) is served from Supabase; everything else is a read-mirror of the website source files (see WEBSITE_DATA_TOOLS.md).

Category	Tools
Bot governance (8)	`scan_website_bots`, `analyze_access_logs`, `verify_bot_identity`, `list_allowed_bots`, `list_blocked_bots`, `generate_robots_txt`, `suggest_bot_policy`, `export_bot_report`
Blog (7)	`blog_list`, `blog_get`, `blog_search`, `blog_create`, `blog_update`, `blog_revalidate`, `blog_get_categories`
Help center (6)	`help_list_categories`, `help_list_articles`, `help_get_article`, `help_search`, `help_create_article`, `help_update_article`
Newsletter (4)	`newsletter_subscribe`, `newsletter_unsubscribe`, `newsletter_get_subscribers`, `newsletter_get_stats`
Contact forms (4)	`forms_submit`, `forms_list_submissions`, `forms_get_submission`, `forms_export_csv`
Static content (6)	`content_list_case_studies`, `content_list_comparisons`, `content_get_comparison`, `content_list_industries`, `content_list_tools`, `content_get_seo_rules`
Pricing (5)	`pricing_get_plans`, `pricing_get_addons`, `pricing_get_faq`, `pricing_compare_plans`, `pricing_get_country_plans`
Features (3)	`features_list`, `features_get`, `features_list_products`
Analytics (1)	`analytics_social_proof`
Countries (2)	`countries_list`, `country_get`
Company (2)	`company_get_profile`, `company_get_contacts`
SEO (5)	`seo_get_site_config`, `seo_get_hreflang`, `seo_get_canonical`, `seo_get_sitemaps`, `seo_get_schema`
Products (2)	`products_list`, `product_get`
Integrations (3)	`integrations_list`, `integrations_get`, `integrations_list_categories` — 397-entry catalog, 55 categories
AI Agents / Agentic AI (4)	`agents_list`, `agents_get`, `agents_get_autonomy_levels`, `agents_list_comparisons` — 12 agents, autonomy matrix, 4 vs-competitor pages
Glossary (2)	`glossary_list_terms`, `glossary_get_term` — 44 terms
Templates (2)	`templates_list`, `templates_get` — 42 templates in 7 categories
Feature guides (2)	`guides_list`, `guides_get` — 32 guides
Alternatives & migration (4)	`alternatives_list`, `alternatives_get`, `switch_list_competitors`, `switch_get_guide` — 42 alternatives pages, 26 switch-from guides
Changelog (2)	`changelog_list_releases`, `changelog_get_release` — 6 releases
Site FAQs (1)	`faqs_get_site`
Media (2)	`media_list_videos`, `media_list_testimonials`
Partner program (2)	`partners_get_program`, `partners_get_application_schema`
Solutions (2)	`solutions_list_whatsapp_use_cases`, `solutions_get_managed_revops` — incl. 9 market variants + 25 US city pages

All mirror tools carry synced_at provenance (last sync: 2026-06-11) and validate inputs with zod; unknown slugs return a clear error listing valid values. Run npm run build && node test-tools.mjs for 62 smoke assertions across the catalog.

How detection works

User-Agent matching. The signature database in src/data/known-bots.ts defines each known bot with one or more case-insensitive UA patterns. The first match wins, so more specific signatures come first (e.g. Googlebot-Image before generic Googlebot).
Generic heuristics. If no signature hits, we look for automation hints (bot, crawler, spider, python-requests, headless, …) and classify the source as unknown — flagged for verification.
Behavioural enrichment. When access logs are available, the aggregator (src/core/aggregator.ts) computes hit count, unique IPs, error rate, request rate, unique paths, and how many requests hit paths Disallowed in robots.txt for that UA. These signals nudge the risk score and emit human-readable notes.
Identity verification. For high-trust signatures we keep documented PTR suffixes (.googlebot.com, .search.msn.com, etc.). verify_bot_identity runs reverse DNS, checks the suffix, then forward-resolves to ensure the IP matches. Spoofed Googlebots show up as spoofed.
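The identity-verification step (reverse DNS, suffix check, forward resolution) can be sketched as follows. This is an illustration under stated assumptions, not the code in src/core/reverse-dns.ts: the `Resolver` interface, the `verifyIdentity` name, and the "verified" label are hypothetical (the README only names the spoofed outcome). The default resolver uses Node's `dns.promises` API.

```typescript
import { promises as dns } from "node:dns";

// Small resolver surface so the check can be exercised without network access.
interface Resolver {
  reverse(ip: string): Promise<string[]>;
  forward(hostname: string): Promise<string[]>;
}

// Default resolver backed by the system DNS configuration.
const systemResolver: Resolver = {
  reverse: (ip) => dns.reverse(ip),
  forward: async (hostname) => {
    const [v4, v6] = await Promise.all([
      dns.resolve4(hostname).catch(() => [] as string[]),
      dns.resolve6(hostname).catch(() => [] as string[]),
    ]);
    return [...v4, ...v6];
  },
};

type Verdict = "verified" | "spoofed";

async function verifyIdentity(
  ip: string,
  suffixes: string[],
  resolver: Resolver = systemResolver,
): Promise<Verdict> {
  let hostnames: string[];
  try {
    hostnames = await resolver.reverse(ip); // step 1: PTR lookup
  } catch {
    return "spoofed"; // no PTR record at all
  }
  for (const host of hostnames) {
    const name = host.toLowerCase().replace(/\.$/, "");
    // step 2: the PTR name must end in a documented suffix
    if (!suffixes.some((s) => name.endsWith(s.toLowerCase()))) continue;
    // step 3: the name must resolve back to the same IP
    const addresses = await resolver.forward(name);
    if (addresses.includes(ip)) return "verified";
  }
  return "spoofed";
}
```

The forward step matters because anyone who controls the reverse zone for an IP can set its PTR record to any name. Note that the plain string comparison here may miss equivalent IPv6 spellings; a fuller version would normalise addresses first.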

Risk scoring

Baseline risk per bot lives in the signature DB (0 = trusted search engine, 100 = hostile scraper). The aggregator adds penalty points for:

Bot ignoring robots.txt (+20)
Very high request rate (>1000 req/hr, +25; >300 req/hr, +10)
Error rate >50% — probing behaviour (+15)
Touching >5000 unique paths (+10)

The recommended action is derived from the final score plus the category:

search/social ≤ 25 → allow
ai ≤ 40 → monitor
Score ≥ 70 → block
Score ≥ 45 → rate-limit
unknown → verify-identity

Tune these thresholds in src/core/bot-detector.ts if your environment is more or less permissive.
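The scoring rules above can be read as the following sketch. It is not the code in src/core/bot-detector.ts. The `Signals` shape and function names are hypothetical, and three points are assumptions the text does not settle: that the score is clamped to 100, that the rules are evaluated in the order listed, and that cases no rule covers (for example a search bot scoring 30) fall back to monitor.

```typescript
type Action = "allow" | "monitor" | "rate-limit" | "block" | "verify-identity";

// Hypothetical behavioural signals computed from an access log.
interface Signals {
  ignoresRobotsTxt: boolean;
  requestsPerHour: number;
  errorRate: number; // fraction of requests with an error status, 0..1
  uniquePaths: number;
}

function riskScore(baselineRisk: number, s: Signals): number {
  let score = baselineRisk;
  if (s.ignoresRobotsTxt) score += 20;
  if (s.requestsPerHour > 1000) score += 25;
  else if (s.requestsPerHour > 300) score += 10;
  if (s.errorRate > 0.5) score += 15;
  if (s.uniquePaths > 5000) score += 10;
  return Math.min(100, score); // assumption: clamp to the documented 0-100 range
}

function recommend(category: string, score: number): Action {
  if ((category === "search" || category === "social") && score <= 25) return "allow";
  if (category === "ai" && score <= 40) return "monitor";
  if (score >= 70) return "block";
  if (score >= 45) return "rate-limit";
  if (category === "unknown") return "verify-identity";
  return "monitor"; // assumption: the README does not specify this case
}

const noisyAiBot: Signals = {
  ignoresRobotsTxt: true,
  requestsPerHour: 1500,
  errorRate: 0.1,
  uniquePaths: 200,
};
const score = riskScore(30, noisyAiBot); // 30 + 20 + 25 = 75
console.log(score, recommend("ai", score));
```

Under these assumptions an AI crawler with a baseline of 30 that ignores robots.txt and exceeds 1000 requests per hour lands at 75 and is recommended for blocking.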

Security & politeness

Respects robots.txt for outbound fetches. scan_website_bots will not retrieve paths Disallowed for its own UA.
Per-host rate limiter (CRAWL_DELAY_MS, default 1 s).
Hard cap on sitemap pages (MAX_SITEMAP_PAGES, default 25).
HTTP timeout (HTTP_TIMEOUT_MS, default 10 s).
No content storage: only URL + HTTP status is recorded from sampled fetches.
Honest User-Agent with a contact URL — change it via CRAWLER_USER_AGENT.
stdout reserved for MCP: all logs go to stderr.

The tools never attempt to bypass authentication, CAPTCHAs, paywalls, WAFs, or any other access control. They also never accept arbitrary code from inputs.
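A per-host rate limiter of the kind listed above could look roughly like this. The class and method names are hypothetical and the real limiter under src/utils/ may work differently. The clock is passed in so the behaviour is easy to check.

```typescript
// Tracks, per host, the earliest time the next request may start.
class PerHostRateLimiter {
  private readonly delayMs: number;
  private readonly nextFreeAt = new Map<string, number>();

  constructor(delayMs: number) {
    this.delayMs = delayMs;
  }

  // Reserves the next slot for `host` and returns how many ms the caller should wait.
  reserve(host: string, now: number = Date.now()): number {
    const startAt = Math.max(this.nextFreeAt.get(host) ?? now, now);
    this.nextFreeAt.set(host, startAt + this.delayMs);
    return startAt - now;
  }
}

const limiter = new PerHostRateLimiter(1000); // mirrors the CRAWL_DELAY_MS default
console.log(limiter.reserve("example.com", 0));   // first request: no wait
console.log(limiter.reserve("example.com", 100)); // second request must wait for the slot
console.log(limiter.reserve("example.org", 100)); // other hosts are independent
```

A caller would sleep for the returned number of milliseconds before issuing the request, so consecutive fetches to one host stay at least CRAWL_DELAY_MS apart while other hosts are unaffected.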

Configuration

All knobs live in .env (see .env.example):

Variable	Default	Purpose
`DEFAULT_ACCESS_LOG`	`./samples/access.log`	Fallback log path.
`DEFAULT_TARGET_URL`	`https://example.com`	Fallback site for scans.
`MAX_SITEMAP_PAGES`	`25`	Hard cap per scan.
`CRAWL_DELAY_MS`	`1000`	Per-host delay.
`HTTP_TIMEOUT_MS`	`10000`	Per-request timeout.
`CRAWLER_USER_AGENT`	`mcp-bot-crawler/1.0 (+...)`	Outbound UA.
`REPORT_OUTPUT_DIR`	`./reports`	Where exports land.
`LOG_LEVEL`	`info`	`error` / `warn` / `info` / `debug`.
`ENABLE_MCP_ANALYTICS`	`false`	Master switch for MCP/SSE analytics — must be `true` to send.
`GA4_MEASUREMENT_ID`	—	GA4 stream id for MCP/SSE analytics (optional).
`GA4_API_SECRET`	—	GA4 Measurement Protocol API secret (optional).

Privacy-first MCP/SSE usage analytics (connections, requests, tools, bots) are emitted to GA4 only when ENABLE_MCP_ANALYTICS=true and the GA4_* vars are set; otherwise the analytics code is a silent no-op. No raw IP, User-Agent, request body, or tool arguments are ever tracked. See docs/MCP_ANALYTICS.md.
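As a rough illustration of how these variables and defaults fit together, the sketch below reads them from an environment-like object. The `Config` shape and the `loadConfig` and `intFrom` names are hypothetical, and the server's real loader may differ. The default for CRAWLER_USER_AGENT is elided in the table, so it is left unset here rather than guessed.

```typescript
type Env = Record<string, string | undefined>;

interface Config {
  defaultAccessLog: string;
  defaultTargetUrl: string;
  maxSitemapPages: number;
  crawlDelayMs: number;
  httpTimeoutMs: number;
  crawlerUserAgent?: string; // table default is elided, so none is filled in
  reportOutputDir: string;
  logLevel: string;
  analyticsEnabled: boolean;
}

// Parses a non-negative integer, falling back when the value is missing or invalid.
function intFrom(value: string | undefined, fallback: number): number {
  const n = Number.parseInt(value ?? "", 10);
  return Number.isFinite(n) && n >= 0 ? n : fallback;
}

function loadConfig(env: Env): Config {
  const levels = ["error", "warn", "info", "debug"];
  const level = env.LOG_LEVEL ?? "info";
  return {
    defaultAccessLog: env.DEFAULT_ACCESS_LOG ?? "./samples/access.log",
    defaultTargetUrl: env.DEFAULT_TARGET_URL ?? "https://example.com",
    maxSitemapPages: intFrom(env.MAX_SITEMAP_PAGES, 25),
    crawlDelayMs: intFrom(env.CRAWL_DELAY_MS, 1000),
    httpTimeoutMs: intFrom(env.HTTP_TIMEOUT_MS, 10000),
    crawlerUserAgent: env.CRAWLER_USER_AGENT,
    reportOutputDir: env.REPORT_OUTPUT_DIR ?? "./reports",
    logLevel: levels.includes(level) ? level : "info",
    // Analytics need the master switch AND both GA4 values; anything else is a no-op.
    analyticsEnabled:
      env.ENABLE_MCP_ANALYTICS === "true" &&
      Boolean(env.GA4_MEASUREMENT_ID) &&
      Boolean(env.GA4_API_SECRET),
  };
}

console.log(loadConfig({ CRAWL_DELAY_MS: "2500" }).crawlDelayMs);
```

In a Node process this would be called as `loadConfig(process.env)` after the .env file has been loaded.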

Extending

Add a new bot:

// src/data/known-bots.ts
{
  name: "MyCorpBot",
  category: "search",
  operator: "MyCorp",
  userAgentPatterns: [/MyCorpBot/i],
  verifiedHostnameSuffixes: [".mycorp.com"],
  respectsRobotsTxt: true,
  baselineRisk: 10,
  description: "MyCorp search index crawler.",
}
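Because the first matching signature wins, the position of a new entry in the array matters. The sketch below shows why, using the field names from the entry above. The `BotSignature` interface is trimmed to three fields and the `matchBot` helper is a hypothetical stand-in for the real detector.

```typescript
// Trimmed, illustrative version of a signature entry.
interface BotSignature {
  name: string;
  userAgentPatterns: RegExp[];
  baselineRisk: number;
}

// Returns the first signature with any pattern matching the User-Agent.
// Patterns should not use the `g` flag: RegExp.test() is stateful with it.
function matchBot(userAgent: string, signatures: BotSignature[]): BotSignature | undefined {
  return signatures.find((sig) => sig.userAgentPatterns.some((p) => p.test(userAgent)));
}

// More specific signatures come first, otherwise the generic one would shadow them.
const signatures: BotSignature[] = [
  { name: "Googlebot-Image", userAgentPatterns: [/Googlebot-Image/i], baselineRisk: 0 },
  { name: "Googlebot", userAgentPatterns: [/Googlebot/i], baselineRisk: 0 },
  { name: "MyCorpBot", userAgentPatterns: [/MyCorpBot/i], baselineRisk: 10 },
];

console.log(matchBot("Googlebot-Image/1.0", signatures)?.name);
console.log(matchBot("Mozilla/5.0 (compatible; MyCorpBot/2.0)", signatures)?.name);
```

If the two Google entries were swapped, an image-crawler User-Agent would match the generic `/Googlebot/i` pattern first and be reported as plain Googlebot.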

Add a new tool:

Create src/tools/<name>.ts exporting { definition, schema, handle }.
Drop it into the tools array in src/tools/index.ts.

Everything else (registration, schema validation, error handling) is automatic.
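A tool file might look roughly like the sketch below. The contracts are guesses: the real types live in src/tools/tool-types.ts and may differ. The `check_automation_hints` tool itself is made up for illustration, and a hand-written `parse` stands in for a zod schema so the example has no dependencies. The hint list comes from the generic heuristics described earlier.

```typescript
// Hypothetical contracts for illustration only.
interface ToolDefinition {
  name: string;
  description: string;
  inputSchema: Record<string, unknown>;
}
interface Schema<T> {
  parse(input: unknown): T; // same surface as a zod schema's parse()
}

type Args = { userAgent: string };

// In a real tool file these three would be exported.
const definition: ToolDefinition = {
  name: "check_automation_hints",
  description: "Reports which generic automation hints appear in a User-Agent string.",
  inputSchema: {
    type: "object",
    properties: { userAgent: { type: "string" } },
    required: ["userAgent"],
  },
};

const schema: Schema<Args> = {
  parse(input) {
    const ua = (input as { userAgent?: unknown } | null)?.userAgent;
    if (typeof ua !== "string" || ua.length === 0) {
      throw new Error("userAgent must be a non-empty string");
    }
    return { userAgent: ua };
  },
};

const HINTS = ["bot", "crawler", "spider", "python-requests", "headless"];

async function handle(args: Args): Promise<{ hints: string[] }> {
  const ua = args.userAgent.toLowerCase();
  return { hints: HINTS.filter((hint) => ua.includes(hint)) };
}

handle(schema.parse({ userAgent: "python-requests/2.31" })).then((result) =>
  console.log(definition.name, result.hints),
);
```

Keeping validation in `schema` and logic in `handle` means the handler can assume well-formed arguments.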

Development

npm run dev          # run with tsx, no build needed
npm run typecheck    # strict TS check
npm run build        # compile to dist/
npm test             # (add your own tests under src/__tests__/)

License

MIT — see LICENSE.

Disclaimer

This project helps you observe and govern bots interacting with your own website. Do not use it to crawl, scrape, or analyze third-party sites without permission. Always respect robots.txt, terms of service, and applicable law.


Server Config

{
  "mcpServers": {
    "hellogrowth-crm": {
      "url": "https://mcp.hellogrowthcrm.com/sse"
    }
  }
}

Project Info

Created At

9 days ago

Updated At

10 hours ago

Author Name

MeruLocal
