Gemma 4 E2B has demonstrated in-browser inference at 255 tokens/second on an M4 Max, utilizing WebGPU kernels optimized with assistance from Fable 5.
OriginalAI Intel
Last 90 days · 2711 total
New research on arXiv presents "When Rules Learn," a self-evolving agent designed to improve legal case retrieval by addressing the complexities of l…
OriginalPhoneHarness is a new framework that enables phone-use agents to complete mobile workflows through a mix of GUI, CLI, and tool actions, addressing li…
OriginalGLM-5.2 has achieved a significant milestone by becoming the first open-weights model to score over 80% on Terminal-Bench, demonstrating frontier-lev…
OriginalOllama's v0.30.9 update introduces support for the Cohere2Moe architecture and resolves issues with LFM2 parser/render and coding agent outputs.
OriginalClaude Code's v2.1.178 update introduces advanced Tool(param:value) syntax for permission rules, allowing more granular control over tool input param…
OriginalLocus Founder is a new AI agent product that allows users to build and operate their businesses by simply texting the AI.
OriginalDeep Work Plan highlights the critical importance of providing AI agents with context and structured plans to enhance their performance, beyond just…
OriginalTyto, developed by ai-coustics, offers audio insights capable of predicting the performance of voice AI systems.
OriginalvLLM has released version v0.23.0, incorporating 408 commits from 200 contributors, though Minimax M3 is not yet supported.
Original