AI Intel

Last 90 days · 2691 total

Jun 19, 2026www.greybeam.ai

趋势

搞钱

DuckDB Internals: Why Is DuckDB Fast? (Part 1)

This article delves into the internal mechanisms and optimization techniques of the DuckDB database, explaining the reasons behind its high performan…

Original

Jun 19, 2026developers.google.com

重点

搞钱

Gemini Code Assist Updates: Gemini 2.5 Pro/Flash GA, Agent Mode, AI Exclusion Config

Gemini Code Assist rolled out several updates, including the General Availability of Gemini 2.5 Pro and Flash models, an agent mode for individuals i…

Original

Jun 19, 2026arxiv.org

趋势

Bridging Social-Semantic Gap with SPSD for Edge-Based Prompt Compression in Cloud LLM Inference

A new paper introduces SPSD, an edge-based prompt compression method for cloud LLM inference, aiming to reduce the significant energy costs incurred…

Original

Jun 19, 2026arxiv.org

趋势

How LLMs Fail and Generalize in RTL Coding for Hardware Design?

This research investigates the challenges and generalization capabilities of large language models when translating sequential programming logic into…

Original

Jun 19, 2026github.com

重点

搞钱

Ollama v0.30.9 Release: Cohere2Moe Support & Bug Fixes

Ollama released version 0.30.9, introducing support for the Cohere2Moe architecture and fixing issues with LFM2 parser/renderer and coding agent outp…

Original

Jun 19, 2026www.reddit.com

趋势

搞钱

Fearless Concurrency on GPU: Safe Rust Inference Rivaling vLLM/SGLang

A new paper introduces "Fearless Concurrency on the GPU," demonstrating safe GPU inference in Rust with cuTile Rust, positioning it as a competitor t…

Original

Jun 19, 2026github.com

重点

Claude Code v2.1.178 Update: Enhanced Tool Permission Rules & Skill Loading

Claude Code released version 2.1.178, adding new Tool(param:value) syntax for permission rules with wildcard support and enabling skills to load from…

Original

Jun 19, 2026www.reddit.com

趋势

搞钱

LQ50/LQ50-24GB GPU Price Alert: Approximately $1200 on Taobao

A Reddit user reported finding LQ50/LQ50-24GB GPUs for approximately $1200 on Taobao, noting the high price.

Original

Jun 19, 2026github.com

重点

llama.cpp Release b9707: Server Schema/Validation & macOS Improvements

llama.cpp released version b9707, which adds server schema and validation, improves error messages, and notes various macOS/iOS build statuses.

Original

Jun 19, 2026www.reddit.com

趋势

GLM-5.2 Free Inference on Hugging Face for 6 Hours

GLM-5.2 model inference was offered for free on Hugging Face for a limited six-hour period, providing an opportunity for users to try it out.

Original

Previous68 / 270 · 2691 totalNext