Ollama v0.30.9 has been released, adding support for the Cohere2Moe architecture and fixing issues with the LFM2 parser and incorrect output in codin…
OriginalAI Intel
Last 90 days · 2691 total
llama.cpp b9733 has been released, adding adapter toggles for F16 on Vulkan + NVIDIA for ggml-webgpu and improving support for macOS Apple Silicon an…
OriginalvLLM has officially released version v0.23.0, incorporating 408 commits from 200 contributors, though Minimax M3 is not yet supported in this version.
OriginalA Reddit discussion highlights that the economics of AI are increasingly favoring open models, shifting away from the previous reliance on closed API…
OriginalOhio State University's NLP team released QUEST-35B, an open-source Deep Research agent trained on approximately 32 H100 GPUs, along with its trainin…
OriginalAPI to MCP is a tool designed to convert any existing API into an MCP (Multi-Agent Communication Protocol) server for AI agents.
OriginalThe Zernio WhatsApp API offers a single interface for WhatsApp messaging, calling, and integration with AI agents.
OriginalNew research introduces SPSD (Social-Semantic Prompt Disentanglement) for edge-based prompt compression to reduce energy consumption during the prefi…
OriginalAccess to Google Project Genie is expanding, with all global Google AI Ultra 5X subscribers now able to access the project.
OriginalClaude Code Artifacts allows users to preview and share their coding work live as it happens.
Original