AI Model Updates

Article URL: <a href="https://github.com/SharpAI/SwiftLM">https://github.com/SharpAI/SwiftLM</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47597181">https://news.ycombinator.com/item?id=47597181</a> Points: 1 # Comments: 2

0.119.0-alpha.1

GitHub: OpenAI Codex | Mar 31, 2026

0.118.0

GitHub: OpenAI Codex | Mar 31, 2026

0.118.0-alpha.5

GitHub: OpenAI Codex | Mar 31, 2026

0.118.0-alpha.4

GitHub: OpenAI Codex | Mar 31, 2026

GitHub: Qwen: Qwen 3.6 Plus Preview Available for Free for a Limited Time

GitHub: Qwen | Mar 30, 2026

Article URL: <a href="https://twitter.com/OpenRouter/status/2038701599175196715">https://twitter.com/OpenRouter/status/2038701599175196715</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47578983">https://news.ycombinator.com/item?id=47578983</a> Points: 5 # Comments: 0

GitHub: Kimi CLI: 1.28.0

GitHub: Kimi CLI | Mar 30, 2026

GitHub: Kimi CLI: kosong-0.47.0

GitHub: Kimi CLI | Mar 30, 2026

Google AI Blog: Google Released Lyria 3 Pro to Google AI Studio

Google AI Blog | Mar 29, 2026

Article URL: <a href="https://aistudio.google.com/app/new_music?model=lyria-3-pro-preview">https://aistudio.google.com/app/new_music?model=lyria-3-pro-preview</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47566347">https://news.ycombinator.com/item?id=47566347</a> Points: 2 # Comments: 0

GitHub: Qwen: Show HN: Qwen Meetup Presentation, Function Calling Harness, 6.75% to 100%

GitHub: Qwen | Mar 28, 2026

I was personally invited by the Qwen team to speak at Qwen Meetup Korea, and got to present locally here in Korea yesterday — pretty honored to have been reached out to directly.The talk was about how I got function calling to work reliably on deeply recursive union types — the stuff the industry generally says doesn't work. With qwen3-coder-next, first-try success rate was 6.75%. And the entire Qwen 3.5 model family was hitting 0% on union types due to a consistent double-stringify b...

0.118.0-alpha.3

GitHub: OpenAI Codex | Mar 27, 2026

0.118.0-alpha.2

GitHub: OpenAI Codex | Mar 27, 2026

GitHub: Kimi CLI: 1.27.0

GitHub: Kimi CLI | Mar 27, 2026

GitHub: Kimi CLI: 1.27

GitHub: Kimi CLI | Mar 27, 2026

rust-v0.118.0-alpha.1

GitHub: OpenAI Codex | Mar 27, 2026

GitHub: Qwen: 1M Tokens/s: Scaling Qwen 3.5 27B on 96 B200 GPUs with vLLM

GitHub: Qwen | Mar 27, 2026

Article URL: <a href="https://medium.com/google-cloud/1-million-tokens-per-second-qwen-3-5-27b-on-gke-with-b200-gpus-161da5c1b592">https://medium.com/google-cloud/1-million-tokens-per-second-qwen-3-5-27b-on-gke-with-b200-gpus-161da5c1b592</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47542691">https://news.ycombinator.com/item?id=47542691</a> Points: 3 # Comments: 0

Release v5.4.0: PaddlePaddle models 🙌, Mistral 4, PI0, VidEoMT, UVDoc, SLANeXt, Jina Embeddings v3

GitHub: Hugging Face Transformers | Mar 27, 2026

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

DeepMind Blog | Mar 26, 2026

Our latest voice model has improved precision and lower latency to make voice interactions more fluid, natural and precise.

Protecting people from harmful manipulation

DeepMind Blog | Mar 25, 2026

Google DeepMind researches AI's harmful manipulation risks across areas like finance and health, leading to new safety measures.

Lyria 3 Pro: Create longer tracks in more

DeepMind Blog | Mar 25, 2026

Introducing Lyria 3 Pro, which unlocks longer tracks with structural awareness. We’re also bringing Lyria to more Google products and surfaces.

GitHub: Kimi CLI: 1.26.0

GitHub: Kimi CLI | Mar 25, 2026

Google AI Blog: Ask HN: Is Google AI overview good enough now?

Google AI Blog | Mar 25, 2026

I think it's much better than before, and might even replace perplexity. <hr> Comments URL: <a href="https://news.ycombinator.com/item?id=47513343">https://news.ycombinator.com/item?id=47513343</a> Points: 7 # Comments: 3

GitHub: Qwen: Run Qwen 3.5 Locally with Claude Code

GitHub: Qwen | Mar 24, 2026

Article URL: <a href="https://gist.github.com/kibotu/a009f00414b7c10fb1c74e603d7838c0">https://gist.github.com/kibotu/a009f00414b7c10fb1c74e603d7838c0</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47499324">https://news.ycombinator.com/item?id=47499324</a> Points: 2 # Comments: 0

GitHub: Qwen: Autoresearching Apple's "LLM in a Flash" to run Qwen 397B locally

GitHub: Qwen | Mar 24, 2026

Article URL: <a href="https://twitter.com/danveloper/status/2034353876753592372">https://twitter.com/danveloper/status/2034353876753592372</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47498414">https://news.ycombinator.com/item?id=47498414</a> Points: 2 # Comments: 1

Google AI Blog: Google's AI Studio now integrates with Firebase for vibe coding production apps

Google AI Blog | Mar 20, 2026

Article URL: <a href="https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/">https://blog.google/innovation-and-ai/technology/developers-tools/full-stack-vibe-coding-google-ai-studio/</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47450049">https://news.ycombinator.com/item?id=47450049</a> Points: 2 # Comments: 3

GitHub: MiniMax: MiniMax Launches M2.7 Model on MiniMax Agent and APIs

GitHub: MiniMax | Mar 19, 2026

Article URL: <a href="https://www.testingcatalog.com/minimax-launches-m2-7-model-on-minimax-agent-and-apis/">https://www.testingcatalog.com/minimax-launches-m2-7-model-on-minimax-agent-and-apis/</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47446452">https://news.ycombinator.com/item?id=47446452</a> Points: 2 # Comments: 0

Google AI Blog: New vibe coding experience – Google AI Studio

Google AI Blog | Mar 19, 2026

Article URL: <a href="https://twitter.com/GoogleAIStudio/status/2034654985850659149">https://twitter.com/GoogleAIStudio/status/2034654985850659149</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47442421">https://news.ycombinator.com/item?id=47442421</a> Points: 6 # Comments: 0

GitHub: Qwen: Qwen-ASR-CLI – local Qwen ASR CLI written in pure Rust

GitHub: Qwen | Mar 19, 2026

Article URL: <a href="https://github.com/huanglizhuo/QwenASR">https://github.com/huanglizhuo/QwenASR</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47438212">https://news.ycombinator.com/item?id=47438212</a> Points: 1 # Comments: 0

GitHub: Qwen: Autoresearching Apple's "LLM in a Flash" to run Qwen 397B locally with low RAM

GitHub: Qwen | Mar 19, 2026

Article URL: <a href="https://simonwillison.net/2026/Mar/18/llm-in-a-flash/">https://simonwillison.net/2026/Mar/18/llm-in-a-flash/</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47438166">https://news.ycombinator.com/item?id=47438166</a> Points: 3 # Comments: 0

GitHub: MiniMax: MiniMax-M2.7 Announced

GitHub: MiniMax | Mar 19, 2026

Article URL: <a href="https://old.reddit.com/r/LocalLLaMA/comments/1rwvn6h/minimaxm27_announced/">https://old.reddit.com/r/LocalLLaMA/comments/1rwvn6h/minimaxm27_announced/</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47434629">https://news.ycombinator.com/item?id=47434629</a> Points: 1 # Comments: 0

GitHub: MiniMax: Ask HN: Are MiniMax Models Scams?

GitHub: MiniMax | Mar 18, 2026

I kept trying to use their M2.5 model and now they released M2.7, but they are TERRIBLE.See this comparison I made:https://aibenchy.com/compare/minimax-minimax-m2-7-medium/minimax-minimax-m2-5-medium/z-ai-glm-5-medium/google-gemini-3-1-flash-lite-preview-medium/Not only that, but M2.5 is #1 on OpenRouter, which is crazy: https://openrouter.ai/rankingsI think the only reason why it is #1 is because it is a scam. In the comparison you can see it had over 200k reasoning tokens, wher...

GitHub: MiniMax: MiniMax M2.7 (200K context, $0.30/1.20) released

GitHub: MiniMax | Mar 18, 2026

Article URL: <a href="https://openrouter.ai/minimax/minimax-m2.7">https://openrouter.ai/minimax/minimax-m2.7</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47425518">https://news.ycombinator.com/item?id=47425518</a> Points: 7 # Comments: 1

Measuring progress toward AGI: A cognitive framework

DeepMind Blog | Mar 17, 2026

We’re introducing a framework to measure progress toward AGI, and launching a Kaggle hackathon to build the relevant evaluations.

GitHub: Qwen: Beating Aider's 20% pass rate on local Qwen 32B using deterministic RAG

GitHub: Qwen | Mar 16, 2026

Article URL: <a href="https://fararoni.dev/publicacion/caso-estudio-qwen">https://fararoni.dev/publicacion/caso-estudio-qwen</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47401192">https://news.ycombinator.com/item?id=47401192</a> Points: 1 # Comments: 1

Google AI Blog: Google AI Pro users getting locked out of Antigravity

Google AI Blog | Mar 14, 2026

Article URL: <a href="https://discuss.ai.google.dev/t/google-ai-pro-subscription-antigravity-quota-not-working-as-advertised-10-day-lockout-instead-of-5-hour-reset/118505">https://discuss.ai.google.dev/t/google-ai-pro-subscription-antigravity-quota-not-working-as-advertised-10-day-lockout-instead-of-5-hour-reset/118505</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47374098">https://news.ycombinator.com/item?id=47374098</a> Points: 1 # Comments: 0

GitHub: MiniMax: MiniMax M2.5 is trained by Claude Opus 4.6?

GitHub: MiniMax | Mar 14, 2026

I was chatting with MiniMax M2.5 in OpenRouter and suddenly he mysteriously repeated on "I'm Claude, an AI assistant created by Anthropic - not a "language" ", heh wut? <hr> Comments URL: <a href="https://news.ycombinator.com/item?id=47372273">https://news.ycombinator.com/item?id=47372273</a> Points: 12 # Comments: 11

Google AI Blog: Users report drastic quota reductions on Google AI Pro

Google AI Blog | Mar 12, 2026

Article URL: <a href="https://twitter.com/antigravity/status/2031835833716625883">https://twitter.com/antigravity/status/2031835833716625883</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47348613">https://news.ycombinator.com/item?id=47348613</a> Points: 3 # Comments: 0

GitHub: MiniMax: 15 Cloud/local LLMs benchmarked on 38 real tasks. MiniMax and Kimi tied for 2nd

GitHub: MiniMax | Mar 10, 2026

Article URL: <a href="https://ianlpaterson.com/blog/llm-benchmark-2026-38-actual-tasks-15-models-for-2-29/">https://ianlpaterson.com/blog/llm-benchmark-2026-38-actual-tasks-15-models-for-2-29/</a> Comments URL: <a href="https://news.ycombinator.com/item?id=47324730">https://news.ycombinator.com/item?id=47324730</a> Points: 3 # Comments: 1

GitHub: Qwen: Show HN: TubeTrim – A local YouTube summarizer using Qwen in pure Python

GitHub: Qwen | Mar 9, 2026

I wanted a way to summarize YouTube videos without paying for a SaaS or leaking my viewing history to someone. TubeTrim is a Python-based tool that runs LLMs locally to process transcripts. No API keys, no subscriptions, no tracking.It uses the transformers library with a device-aware backend: it will prioritize CUDA, then MPS (for Mac users), and finally fallback to CPU. I've found that Qwen 2.5-1.5B provides a good balance between speed and summary quality for this specific task.How ...

AI Model Updates

Latest Releases