Item logo image for Local Nano

Local Nano

ExtensionTools
Item media 3 (screenshot) for Local Nano
Item media 1 (screenshot) for Local Nano
Item media 2 (screenshot) for Local Nano
Item media 3 (screenshot) for Local Nano
Item media 1 (screenshot) for Local Nano
Item media 1 (screenshot) for Local Nano
Item media 2 (screenshot) for Local Nano
Item media 3 (screenshot) for Local Nano

Overview

Local Nano is a private AI assistant that lives one click away on every page. The language model runs entirely inside your browser.…

Local Nano is a private AI assistant that lives one click away on every page. The language model runs entirely inside your browser. No account, no API key, no servers, no data collection, and no remote code. Everything that runs ships inside the package. How to open it Click the Local Nano icon next to the address bar. Or press Ctrl+Shift+K (Cmd+Shift+K on Mac). On a fresh install the shortcut may need to be bound once at chrome://extensions/shortcuts. The icon's tooltip shows the current binding, and the icon works without one. What you can do - Ask about the page you're on. Your question is answered using the page's visible text as context. The chat is DOM aware. - Rewrite selected text in place. Highlight a sentence, tell Local Nano how to change it, and the rewrite streams directly back into the page. One click Undo restores the original. - Choose your model. A gear popover offers a curated catalog of on-device models. Pick one, click Load to switch. Your choice persists across sessions. - Reclaim memory when you walk away. After a configurable idle period (5, 15, 60 minutes, or Never) the model is released from VRAM. The next use re-warms automatically. How it stays private All inference runs locally via WebGPU, with an automatic CPU/WASM fallback. The runtime is Transformers.js and ONNX Runtime Web, both bundled inside the extension. The only network request Local Nano ever makes is the one-time download of the open model weights from Hugging Face, which are cached on your device afterward. Your page content, prompts, and chat history never leave your machine. No telemetry. No analytics. No remote scripts. What to expect The first run downloads the model (a few GB) with a real-time percentage indicator and takes roughly 30 to 90 seconds. After that it loads from cache. A modern GPU with a few GB of memory is recommended. If the model can't load at full quality, Local Nano automatically tries lighter precision and CPU modes. If your device can't run it at all, it tells you clearly and gives you a copyable diagnostic. Nothing in the diagnostic leaves your device automatically. Open and on-device. For anyone who wants a quick AI helper on the pages they're reading without handing their browsing or their text to a cloud service.

Details

  • Version
    0.4.3
  • Updated
    June 4, 2026
  • Offered by
    Gemenie Labs
  • Size
    18.24MiB
  • Languages
    English
  • Developer
    Email
    gemenielabs@gmail.com
  • Non-trader
    This developer has not identified itself as a trader. For consumers in the European Union, please note that consumer rights do not apply to contracts between you and this developer.

Privacy

Manage extensions and learn how they're being used in your organization
The developer has disclosed that it will not collect or use your data. To learn more, see the developer’s privacy policy.

This developer declares that your data is

  • Not being sold to third parties, outside of the approved use cases
  • Not being used or transferred for purposes that are unrelated to the item's core functionality
  • Not being used or transferred to determine creditworthiness or for lending purposes

Support

For help with questions, suggestions, or problems, visit the developer's support site

Google apps