AI Model Updates

GitHub: OpenAI Codex: rust-v0.119.0-alpha.5
Release v5.5.0
Gemma 4: Byte for byte, the most capable open models
Gemma 4: Our most intelligent open models to date, purpose-built for advanced reasoning and agentic workflows.
0.119.0-alpha.4
GitHub: Kimi CLI: 1.30.0
GitHub: Kimi CLI: pykaos-0.9.0
GitHub: Kimi CLI: kosong-0.48.0
0.119.0-alpha.3
GitHub: Kimi CLI: 1.29.0
GitHub: Kimi CLI: pykaos-0.8.0
0.119.0-alpha.2
GitHub: Qwen: Show HN: SwiftLM – Qwen Chat on iPhone, 100B+ Moe on M5 Pro 64GB (Native Swift)
<p>Article URL: <a href="https://github.com/SharpAI/SwiftLM">https://github.com/SharpAI/SwiftLM</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47597181">https://news.ycombinator.com/item?id=47597181</a></p> <p>Points: 1</p> <p># Comments: 2</p>
0.119.0-alpha.1
0.118.0
0.118.0-alpha.5
0.118.0-alpha.4
GitHub: Qwen: Qwen 3.6 Plus Preview Available for Free for a Limited Time
<p>Article URL: <a href="https://twitter.com/OpenRouter/status/2038701599175196715">https://twitter.com/OpenRouter/status/2038701599175196715</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47578983">https://news.ycombinator.com/item?id=47578983</a></p> <p>Points: 5</p> <p># Comments: 0</p>
GitHub: Kimi CLI: 1.28.0
GitHub: Kimi CLI: kosong-0.47.0
Google AI Blog: Google Released Lyria 3 Pro to Google AI Studio
<p>Article URL: <a href="https://aistudio.google.com/app/new_music?model=lyria-3-pro-preview">https://aistudio.google.com/app/new_music?model=lyria-3-pro-preview</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47566347">https://news.ycombinator.com/item?id=47566347</a></p> <p>Points: 2</p> <p># Comments: 0</p>
GitHub: Qwen: Show HN: Qwen Meetup Presentation, Function Calling Harness, 6.75% to 100%
<p>I was personally invited by the Qwen team to speak at Qwen Meetup Korea, and got to present locally here in Korea yesterday. Pretty honored to have been reached out to directly.<p>The talk was about how I got function calling to work reliably on deeply recursive union types, the stuff the industry generally says doesn't work. With qwen3-coder-next, first-try success rate was 6.75%. And the entire Qwen 3.5 model family was hitting 0% on union types due to a consistent double-stringify b...
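The excerpt above is cut off before it explains how the double-stringify issue was handled, so the following is only a generic sketch of one common way to tolerate tool-call arguments that arrive JSON-encoded more than once. It is not the speaker's harness; the function name `unwrap_arguments` and the `max_depth` limit are hypothetical, and only the Python standard library is used.

```python
import json


def unwrap_arguments(raw, max_depth=3):
    """Decode a value that may have been JSON-encoded more than once.

    Hypothetical helper: keeps calling json.loads while the value is still
    a string that parses as JSON, up to max_depth times.
    """
    value = raw
    for _ in range(max_depth):
        if not isinstance(value, str):
            # Already a dict, list, number, etc.: nothing left to decode.
            return value
        try:
            value = json.loads(value)
        except json.JSONDecodeError:
            # A plain string that is not JSON: return it unchanged.
            return value
    return value


if __name__ == "__main__":
    args = {"shape": {"kind": "circle", "radius": 2}}
    double_encoded = json.dumps(json.dumps(args))  # simulates the symptom
    print(unwrap_arguments(double_encoded))
```

One caveat with this approach: a string argument whose content happens to be valid JSON (for example `"123"`) would also be decoded, so a real harness would more likely decode against the expected schema type instead of unwrapping blindly.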
0.118.0-alpha.3
0.118.0-alpha.2
GitHub: Kimi CLI: 1.27.0
GitHub: Kimi CLI: 1.27
rust-v0.118.0-alpha.1
GitHub: Qwen: 1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM
<p>Article URL: <a href="https://medium.com/google-cloud/1-million-tokens-per-second-qwen-3-5-27b-on-gke-with-b200-gpus-161da5c1b592">https://medium.com/google-cloud/1-million-tokens-per-second-qwen-3-5-27b-on-gke-with-b200-gpus-161da5c1b592</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47542691">https://news.ycombinator.com/item?id=47542691</a></p> <p>Points: 3</p> <p># Comments: 0</p>
Release v5.4.0: PaddlePaddle models 🙌, Mistral 4, PI0, VidEoMT, UVDoc, SLANeXt, Jina Embeddings v3
Gemini 3.1 Flash Live: Making audio AI more natural and reliable
Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.
Protecting people from harmful manipulation
Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.
Lyria 3 Pro: Create longer tracks in more
Introducing Lyria 3 Pro, which unlocks longer tracks with structural awareness. We’re also bringing Lyria to more Google products and surfaces.
GitHub: Kimi CLI: 1.26.0
Google AI Blog: Ask HN: Is Google AI overview good enough now?
<p>I think it's much better than before, and might even replace Perplexity.</p> <hr> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47513343">https://news.ycombinator.com/item?id=47513343</a></p> <p>Points: 7</p> <p># Comments: 3</p>
GitHub: Qwen: Run Qwen 3.5 Locally with Claude Code
<p>Article URL: <a href="https://gist.github.com/kibotu/a009f00414b7c10fb1c74e603d7838c0">https://gist.github.com/kibotu/a009f00414b7c10fb1c74e603d7838c0</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47499324">https://news.ycombinator.com/item?id=47499324</a></p> <p>Points: 2</p> <p># Comments: 0</p>
GitHub: Qwen: Autoresearching Apple's "LLM in a Flash" to run Qwen 397B locally
<p>Article URL: <a href="https://twitter.com/danveloper/status/2034353876753592372">https://twitter.com/danveloper/status/2034353876753592372</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47498414">https://news.ycombinator.com/item?id=47498414</a></p> <p>Points: 2</p> <p># Comments: 1</p>
Google AI Blog: Google's AI Studio now integrates with Firebase for vibe coding production apps
<p>Article URL: <a href="https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/">https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47450049">https://news.ycombinator.com/item?id=47450049</a></p> <p>Points: 2</p> <p># Comments: 3</p>
GitHub: MiniMax: MiniMax Launches M2.7 Model on MiniMax Agent and APIs
<p>Article URL: <a href="https://www.testingcatalog.com/minimax-launches-m2-7-model-on-minimax-agent-and-apis/">https://www.testingcatalog.com/minimax-launches-m2-7-model-on-minimax-agent-and-apis/</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47446452">https://news.ycombinator.com/item?id=47446452</a></p> <p>Points: 2</p> <p># Comments: 0</p>
Google AI Blog: New vibe coding experience – Google AI Studio
<p>Article URL: <a href="https://twitter.com/GoogleAIStudio/status/2034654985850659149">https://twitter.com/GoogleAIStudio/status/2034654985850659149</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47442421">https://news.ycombinator.com/item?id=47442421</a></p> <p>Points: 6</p> <p># Comments: 0</p>
GitHub: Qwen: Qwen-ASR-CLI – local Qwen ASR CLI written in pure Rust
<p>Article URL: <a href="https://github.com/huanglizhuo/QwenASR">https://github.com/huanglizhuo/QwenASR</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47438212">https://news.ycombinator.com/item?id=47438212</a></p> <p>Points: 1</p> <p># Comments: 0</p>
GitHub: Qwen: Autoresearching Apple's "LLM in a Flash" to run Qwen 397B locally with low RAM
<p>Article URL: <a href="https://simonwillison.net/2026/Mar/18/llm-in-a-flash/">https://simonwillison.net/2026/Mar/18/llm-in-a-flash/</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47438166">https://news.ycombinator.com/item?id=47438166</a></p> <p>Points: 3</p> <p># Comments: 0</p>
GitHub: MiniMax: MiniMax-M2.7 Announced
<p>Article URL: <a href="https://old.reddit.com/r/LocalLLaMA/comments/1rwvn6h/minimaxm27_announced/">https://old.reddit.com/r/LocalLLaMA/comments/1rwvn6h/minimaxm27_announced/</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47434629">https://news.ycombinator.com/item?id=47434629</a></p> <p>Points: 1</p> <p># Comments: 0</p>
GitHub: MiniMax: Ask HN: Are MiniMax Models Scams?
<p>I kept trying to use their M2.5 model and now they released M2.7, but they are TERRIBLE.<p>See this comparison I made:<p>https://aibenchy.com/compare/minimax-minimax-m2-7-medium/minimax-minimax-m2-5-medium/z-ai-glm-5-medium/google-gemini-3-1-flash-lite-preview-medium/<p>Not only that, but M2.5 is #1 on OpenRouter, which is crazy: https://openrouter.ai/rankings<p>I think the only reason why it is #1 is because it is a scam. In the comparison you can see it had over 200k reasoning tokens, wher...
GitHub: MiniMax: MiniMax M2.7 (200K context, $0.30/1.20) released
<p>Article URL: <a href="https://openrouter.ai/minimax/minimax-m2.7">https://openrouter.ai/minimax/minimax-m2.7</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47425518">https://news.ycombinator.com/item?id=47425518</a></p> <p>Points: 7</p> <p># Comments: 1</p>
Measuring progress toward AGI: A cognitive framework
We’re introducing a framework to measure progress toward AGI, and launching a Kaggle hackathon to build the relevant evaluations.
GitHub: Qwen: Beating Aider's 20% pass rate on local Qwen 32B using deterministic RAG
<p>Article URL: <a href="https://fararoni.dev/publicacion/caso-estudio-qwen">https://fararoni.dev/publicacion/caso-estudio-qwen</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47401192">https://news.ycombinator.com/item?id=47401192</a></p> <p>Points: 1</p> <p># Comments: 1</p>
Google AI Blog: Google AI Pro users getting locked out of Antigravity
<p>Article URL: <a href="https://discuss.ai.google.dev/t/google-ai-pro-subscription-antigravity-quota-not-working-as-advertised-10-day-lockout-instead-of-5-hour-reset/118505">https://discuss.ai.google.dev/t/google-ai-pro-subscription-antigravity-quota-not-working-as-advertised-10-day-lockout-instead-of-5-hour-reset/118505</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47374098">https://news.ycombinator.com/item?id=47374098</a></p> <p>Points: 1</p> <p># Comments: 0</p>
GitHub: MiniMax: MiniMax M2.5 is trained by Claude Opus 4.6?
<p>I was chatting with MiniMax M2.5 on OpenRouter and it suddenly, mysteriously, kept repeating "I'm Claude, an AI assistant created by Anthropic - not a "language" ". Heh, wut?</p> <hr> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47372273">https://news.ycombinator.com/item?id=47372273</a></p> <p>Points: 12</p> <p># Comments: 11</p>
Google AI Blog: Users report drastic quota reductions on Google AI Pro
<p>Article URL: <a href="https://twitter.com/antigravity/status/2031835833716625883">https://twitter.com/antigravity/status/2031835833716625883</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47348613">https://news.ycombinator.com/item?id=47348613</a></p> <p>Points: 3</p> <p># Comments: 0</p>
GitHub: MiniMax: 15 Cloud/local LLMs benchmarked on 38 real tasks. MiniMax and Kimi tied for 2nd
<p>Article URL: <a href="https://ianlpaterson.com/blog/llm-benchmark-2026-38-actual-tasks-15-models-for-2-29/">https://ianlpaterson.com/blog/llm-benchmark-2026-38-actual-tasks-15-models-for-2-29/</a></p> <p>Comments URL: <a href="https://news.ycombinator.com/item?id=47324730">https://news.ycombinator.com/item?id=47324730</a></p> <p>Points: 3</p> <p># Comments: 1</p>
GitHub: Qwen: Show HN: TubeTrim – A local YouTube summarizer using Qwen in pure Python
<p>I wanted a way to summarize YouTube videos without paying for a SaaS or leaking my viewing history to someone. TubeTrim is a Python-based tool that runs LLMs locally to process transcripts. No API keys, no subscriptions, no tracking.<p>It uses the transformers library with a device-aware backend: it will prioritize CUDA, then MPS (for Mac users), and finally fall back to CPU. I've found that Qwen 2.5-1.5B provides a good balance between speed and summary quality for this specific task.<p>How ...
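The device priority described in the post (CUDA first, then MPS, then CPU) could look roughly like the sketch below. This is not TubeTrim's actual code: the function names `pick_device` and `detect_device` are hypothetical, and it assumes PyTorch is the backend underneath the transformers library. If PyTorch is not installed, the sketch simply reports `"cpu"`.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Return a device string using the priority CUDA > MPS > CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


def detect_device() -> str:
    """Probe PyTorch for available accelerators (assumed backend)."""
    try:
        import torch
    except ImportError:
        # No PyTorch available: nothing to probe, so report CPU.
        return "cpu"

    cuda = torch.cuda.is_available()
    # Older PyTorch builds may lack the MPS backend, so guard the lookup.
    mps_backend = getattr(torch.backends, "mps", None)
    mps = bool(mps_backend is not None and mps_backend.is_available())
    return pick_device(cuda, mps)


if __name__ == "__main__":
    print(detect_device())
```

Keeping the priority logic in a pure function separate from the hardware probe makes the ordering easy to check without needing a GPU. The resulting string can then be passed wherever a device is expected, for example to a model's `.to(device)` call.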