File Search Assistant with LLM Integration

Created By
Code-Treesa year ago
Learn how to: ✅ Build a file-search AI using natural language queries ✅ Create embeddings from local Linux files using Hugging Face models ✅ Integrate Gemini API (Google AI Studio) into your local apps ✅ Use MCP to control multiple agents with server-client architecture ✅ Apply cosine similarity, asyncio Python, and more!
Overview

Agentic_search is a file search assistant that integrates Large Language Models (LLM) to enable intelligent file searching using natural language queries in Linux systems.

To use Agentic_search, clone the repository, install the dependencies, configure your environment variables with the Gemini API key, and run the main application to start searching for files using natural language queries.

  • Natural language file search queries
  • Semantic search using BERT embeddings
  • Integration with Gemini LLM for enhanced query understanding
  • MCP server for efficient file system operations
  • Automatic inference of file extensions
  1. Finding specific files based on natural language descriptions.
  2. Searching for documents or scripts in a Linux environment.
  3. Enhancing productivity by quickly locating files without remembering exact names.
  • What programming language is used for Agentic_search?

The project is developed in Python.

  • Do I need a GPU to run Agentic_search?

A CUDA-compatible GPU is optional but recommended for faster processing.

  • How does the semantic search work?

It uses BERT embeddings to understand the context of the search queries.

Project Info
Created At
a year ago
Updated At
a year ago
Author Name
Code-Trees
Star
0
Language
Python
License
-

Recommend Servers

View All
Tatsu55

4 hours ago
Docwand

2 days ago
Wpnews

a day ago
Orkestr

a day ago
Numox

4 hours ago
Numox

5 hours ago
//beforeyouship — LLM Cost Modeling From Your Editor
@Indiegoing

Query realistic LLM cost models without leaving your editor. beforeyouship models the **true monthly cost** of an LLM app architecture — retries, prompt caching, batch discounts, infra overhead, and 3×/10× growth — across GPT-5.x, Claude, Gemini, DeepSeek, and more. Not a token calculator: a planning tool for the design phase, before you commit to a stack. **No API key needed to try it** — demo mode covers the six free-tier models. A Pro key from [beforeyouship.dev](https://beforeyouship.dev) unlocks the full 18-model catalog. ## What you can ask - "How much will a RAG chatbot cost at 10,000 requests/day?" - "Compare Claude Haiku vs Gemini Flash pricing for my workload" - "What's the cheapest model for a multi-step agent at scale?" - "Show me current per-token prices for Anthropic models" ## Tools ### `estimate_cost` Full cost model for an architecture at a given usage level. Returns Naive / Realistic / Worst Case monthly cost per model, 3×/10× growth scenarios, and an opinionated recommendation with reasoning. ### `get_model_prices` Current per-1M-token pricing — input, output, cached input, batch — with context windows and staleness metadata. ### `list_archetypes` Seven preset architecture patterns (simple chatbot, chatbot with history, RAG pipeline, multi-model router, coding assistant, document processor, multi-step agent) used as starting points for estimates. ## Setup **Claude Code:** ​```bash claude mcp add --transport http beforeyouship https://beforeyouship.dev/api/mcp ​``` **Cursor / other clients** — add a remote server: ​```json { "mcpServers": { "beforeyouship": { "type": "streamable-http", "url": "https://beforeyouship.dev/api/mcp" } } } ​``` Add an `Authorization: Bearer bys_...` header with a Pro key for the full catalog. ## Try it > Estimate the monthly cost of a RAG pipeline at 10,000 requests/day

2 days ago