Ultimate AI Connector for WebLLM
The Ultimate AI Connector for WebLLM brings browser-native AI inference to your WordPress multisite network. It runs large language models entirely in the browser using WebLLM and the MLC engine: no API keys, no calls to an external inference service, and no prompt data leaving the user's device.
Key Features
- Browser-side inference: LLM runs locally in the visitor's browser via WebLLM/MLC — no server GPU required
- Floating chat widget: Logged-in users can prompt the browser-side LLM directly from the front end
- Admin-bar status indicator: Real-time status of the WebLLM engine visible in the WordPress admin bar
- SharedWorker runtime: Multiple browser tabs share one GPU session instead of fighting over GPU resources
- apiFetch middleware: WordPress REST requests matching the AI Client SDK pattern are transparently routed to the local WebLLM broker — no loopback HTTP round-trip
- Widget settings UI: Connector panel settings to toggle the chat widget and configure auto-prompt behaviour
- IndexedDB cache: Model weights are cached in IndexedDB, so downloads survive CDN redirects that break the default Cache API path
- wpai filter integration: Hooks into the wpai_preferred_text_models filter so the AI Experiments feature routes to the browser engine when configured
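
To illustrate the apiFetch middleware feature listed above, here is a minimal sketch of how a request could be diverted to a local broker instead of the network. It uses the (options, next) middleware shape that @wordpress/api-fetch accepts. The route pattern AI_ROUTE, the localBroker object and its handle() method are hypothetical stand-ins invented for this example; the plugin's actual pattern and broker interface are not described in this document.

```javascript
// Hypothetical route pattern; the plugin's real AI Client SDK pattern is not given here.
const AI_ROUTE = /^\/?wp-ai\/v1\//;

// Hypothetical stand-in for the local WebLLM broker.
const localBroker = {
  async handle(options) {
    // A real broker would forward the prompt to the in-browser engine.
    return { handledBy: 'webllm-broker', path: options.path };
  },
};

// Middleware in the (options, next) shape used by @wordpress/api-fetch.
function webllmMiddleware(options, next) {
  if (typeof options.path === 'string' && AI_ROUTE.test(options.path)) {
    // Matching requests are answered in the browser; no HTTP request is made.
    return localBroker.handle(options);
  }
  // Everything else continues down the normal middleware chain.
  return next(options);
}

// In a WordPress page this would be registered with:
//   wp.apiFetch.use(webllmMiddleware);
```

Because the middleware returns before calling next() for matching paths, those requests never reach the fetch handler, which is what avoids the loopback HTTP round-trip. Requests that do not match are passed through unchanged.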