Ollama Client - Chat with Local LLM Models

ollamaclient.in

4.6(

15 ratings

)

ExtensionTools2,000 users

Item media 5 (screenshot) for Ollama Client - Chat with Local LLM Models

Item media 6 (screenshot) for Ollama Client - Chat with Local LLM Models

Item media 2 (screenshot) for Ollama Client - Chat with Local LLM Models

Item media 3 (screenshot) for Ollama Client - Chat with Local LLM Models

Item media 4 (screenshot) for Ollama Client - Chat with Local LLM Models

Overview

Local-first Chrome extension for private LLM chat with Ollama, LM Studio, and llama.cpp, including local RAG workflows.

Ollama Client – Local-First AI Chat in Your Browser Ollama Client is a privacy-focused browser extension for chatting with local and user-configured AI providers. Connect directly to your preferred model server, manage conversations, use local knowledge, and access AI tools from the browser side panel. Verified Built-In Providers • Ollama • LM Studio • llama.cpp Beta Custom Providers • OpenAI-compatible servers and services, including vLLM, LocalAI, KoboldCPP, and OpenRouter • Anthropic through the native Claude Messages API • Other compatible local, LAN, or remote endpoints Features • Connect and manage multiple AI providers • Discover available models and configure custom model IDs • Switch models and monitor provider connection status • Stream chat responses with stop, retry, edit, regenerate, and fork controls • View reasoning and thinking traces from supported models • Use native and prompt-based tool calling • Search the web through your configured SearXNG, Brave, or Tavily provider • Attach local files and images to conversations • Build reusable local knowledge from uploaded documents • Add optional webpage and browser-tab context • Use selected-text actions from the right-click context menu • Save memories and reusable context • Organise conversations with search, tags, pinning, and session management • Create custom prompt templates and system prompts • Adjust model parameters and capability overrides • Export, import, and reset locally stored data • Access privacy, permission, storage, and diagnostic controls • Use the browser side panel for a focused desktop workflow • Multi-language interface: English, German, Spanish, French, Hindi, Italian, Japanese, Russian, and Simplified Chinese Privacy • Chats, session history, settings, knowledge, and vector embeddings are stored locally in your browser • Provider API keys are stored only on your device and excluded from backups • Requests are sent directly to the provider you configure; Ollama Client does not operate an intermediary inference server • Local providers can keep inference entirely on your device or local network • Remote providers receive content only when you choose to use them • Web search is off by default and connects only to the search provider you configure • Page, tab, file, and browser context is shared with the selected model only when you include or request that context • Optional browser permissions can be reviewed and revoked from the extension settings • Local data can be exported, deleted, or fully reset at any time Who It’s For • Developers building with local and self-hosted AI models • Researchers testing different LLM providers and model capabilities • Students learning local and offline AI workflows • Teams using private LAN-hosted model servers • Privacy-conscious users who want control over providers and stored data Setup 1. Install the extension 2. Start Ollama, LM Studio, llama.cpp, or another supported provider 3. Open the extension settings and connect using a localhost, LAN, or remote endpoint 4. Select a model and start chatting Important Notes • Ollama Client is a frontend client and does not include AI models • Local inference performance depends on your hardware, selected model, and backend configuration • Custom and remote provider support is marked Beta because compatibility can vary between services • Remote providers may have their own privacy policies, pricing, authentication, and data-retention rules Page Context and Script Injection When you ask the model to use the current page, Ollama Client first contacts its content script on that tab. If the content script is unavailable—for example, because the tab was opened before the extension was installed or updated—the extension uses the scripting permission to inject its content script into that specific tab. This injection is: • Triggered only by a user-requested page-context action • Scoped to the active tab involved in the request • Limited to extracting readable page content needed for that request • Never performed continuously, on a schedule, or across every open tab • Not used on protected browser pages where extensions cannot run Useful Links Chrome Web Store: https://chromewebstore.google.com/detail/ollama-client/bfaoaaogfcgomkjfbmfepbiijmciinjl Provider Setup Guide: https://www.ollamaclient.in/guides/provider-setup Website: https://www.ollamaclient.in GitHub: https://github.com/Shishir435/ollama-client

4.6 out of 5
15 ratings
Learn more about results and reviews.

Details

Version
0.12.4
Updated
July 26, 2026
Flag concern
Size
4.18MiB
Languages
9 languages
Developer
Shishir
Taramani Chennai, Tamil Nadu 600036 IN
Website
Email
shishirchaurasiya435@gmail.com
Non-trader
This developer has not identified itself as a trader. For consumers in the European Union, please note that consumer rights do not apply to contracts between you and this developer.

Privacy

Manage extensions and learn how they're being used in your organization

The developer has disclosed that it will not collect or use your data. To learn more, see the developer’s privacy policy.

This developer declares that your data is

Not being sold to third parties, outside of the approved use cases
Not being used or transferred for purposes that are unrelated to the item's core functionality
Not being used or transferred to determine creditworthiness or for lending purposes

Support

For help with questions, suggestions, or problems, visit the developer's support site