Ollama v0.30.8 fixes launch provider selection, improves prompt caching by decoupling it from context shifts for better KV cache reuse, and enhances…
AI Intel
Last 90 days · 2733 total
llama.cpp's b9669 release introduces backend sampling support for eagle3 and specifies various build configurations for macOS (Apple Silicon and Inte…
vLLM has released version 0.23.0, featuring 408 commits from 200 contributors, though Minimax M3 support is not yet included and requires following s…
A Reddit user, previously skeptical due to poor OpenRouter performance reports, now asserts that Nex-N2 Pro is a strong performer, intrigued by compa…
MakersClaw is introduced as a service allowing users to "hire AI employees" that integrate directly into Slack, Teams, and Telegram.
GoogleAI has announced that Nano Banana 2 and Nano Banana Pro are now Generally Available (GA) through the Gemini Enterprise Agent Platform, Gemini A…
The Grok Build Plugin Marketplace is now in beta, enabling developers to build with plugins for MongoDB, Vercel, Sentry, Cloudflare, and Chrome DevTo…
Grok Voice is highlighted for its state-of-the-art performance, featuring human-like timing, tone, and warmth, while being offered at a fraction of c…
Mistral Vibe has shipped, positioning itself as an AI agent for long-horizon productivity and coding, complete with Work mode, Code mode, a CLI, and…
Mistral AI has launched its Connectors API into Public Preview, allowing developers to register connectors once and reuse them across Le Chat, AI Stu…