每日 AI 简报

2026-08-03（内容获取于 08/03 04:49）

OpenAI 推出 Presence，旨在为企业落地 AI 智能体

The Decoder · 08/02 21:10

OpenAI 推出新的企业级产品 Presence，旨在帮助企业将 AI 智能体投入实际生产环境，用于客户服务和内部工作流程。与现有的 Workspace Agents 不同，Presence 主要面向外部部署，且 OpenAI 工程师团队将为复杂案例提供支持，以确保 AI 智能体的商业化应用。

OpenAIAI Agent企业服务

推荐理由：OpenAI 致力于解决 AI Agent 在企业环境中的实际部署难题，Presence 的推出有望加速 AI 智能体在商业领域的落地。

欧盟 AI 法案正式生效，对 AI 模型产生何种影响？

Hacker News · 08/03 03:40

欧盟的 AI 法案已正式生效，该法案旨在规范人工智能的开发和部署，对高风险 AI 系统提出了严格要求，并明确了 AI 透明度义务。这标志着全球首个全面 AI 监管框架进入实施阶段，将对开发商和用户产生深远影响，尤其是在数据治理和算法责任方面。

AI监管政策法规欧盟

推荐理由：欧盟AI法案作为全球首个综合性AI监管框架，其生效标志着AI治理新时代的开启，对所有AI参与者都至关重要。

MoonPay 发布 PayBox：首个专为 AI 智能体设计的支付保管库

X 推文 (AttentionVC), X 创作者 (AttentionVC) · 08/02 21:18

金融科技公司 MoonPay 推出 PayBox，这是首个专为 AI 设计的支付保管库。它旨在让用户的 AI 代理管理和执行支付，实现法币与数字资产的无缝价值转移，进一步推动 AI 在金融领域的应用和自动化，为 AI 智能体赋予更强的商业操作能力。（多家报道）

AI Agent金融科技支付

推荐理由：PayBox 为 AI 智能体提供了金融操作能力，预示着 AI 在金融领域将扮演更核心的角色，具有前瞻性。

AirLLM：单张 4GB 显存 GPU 运行 70B 大模型推理

GitHub Trending

AirLLM 项目通过优化模型加载、量化和计算方法，显著降低了大型语言模型（LLM）推理的硬件门槛，使得 700 亿参数级别的模型能够在仅有 4GB 显存的单张 GPU 上高效运行。这对于在本地设备、低成本硬件或边缘设备部署复杂 AI 应用的开发者和研究人员极为有用。

大模型开源推理优化

推荐理由：提供了在资源受限环境下部署大型语言模型的创新解决方案，对边缘AI和个人开发者具有极高实用价值。

Lumichats：免费离线代码助手，Claude Code 终端替代品

Product Hunt · 08/03 04:39

Lumichats 是一款免费的离线代码助手，定位为 Claude Code 的替代品。它专为不习惯使用终端的用户设计，提供了一个无需命令行即可进行代码交互和辅助的环境，旨在提升开发便利性和效率。支持离线使用，增强了数据隐私性。

开发工具代码助手离线AI

推荐理由：对于不熟悉终端的开发者，Lumichats 提供了一个友好且免费的离线代码辅助方案，值得尝试。

英伟达研究：AI 仅模仿人类不足以实现高级目标

Two Minute Papers · 08/02 23:01

英伟达（NVIDIA）发布一项 AI 研究指出，人工智能在某些复杂任务中，仅通过模仿人类行为难以达到最佳效果。研究可能深入探讨了 AI 需要发展出超越简单模仿的深层次学习、决策或创造性机制，以实现更高级别的智能和自主性。

AI研究NVIDIA深度学习

推荐理由：揭示了 AI 发展中一个关键的挑战，即如何从模仿走向真正的智能和自主，对AI研究具有指导意义。

AI 生成艺术：补偿艺术家能否化解伦理争议？

The Verge · 08/02 21:00

插画师们对生成式 AI 未经许可使用其作品训练模型长期表示担忧，认为此举涉嫌盗窃。当前讨论聚焦于，如果 AI 初创公司向艺术家支付报酬，是否足以说服他们接受 AI 技术，以及如何平衡版权保护、伦理考量与技术创新之间的关系，为 AI 艺术发展探索可持续路径。

AI伦理版权生成艺术

推荐理由：探讨了 AI 艺术发展中的核心伦理和版权问题，对行业健康发展至关重要，是值得持续关注的议题。

开源项目为何不利用 AI 加速开发？社区热议

V2EX · 08/02 13:43

V2EX 社区有用户发帖讨论，为何许多开源项目作者似乎没有或不愿利用 AI 工具来加速自身开发流程。讨论围绕 AI 在代码生成、测试、文档编写等方面的应用潜力和局限性展开，以及开源社区对 AI 辅助开发的接受度与伦理考量。

开发效率AI应用社区讨论

推荐理由：探讨了 AI 在软件开发领域的应用现状与未来潜力，对开发者和开源项目贡献者具有参考价值。

Fender’s CEO seems to think your bandmates are just analog AI

The Verge · 08/03 03:36

Edward “Bud” Cole speaks in Japan in 2023. | Image: Jun Sato/WireImage Fender CEO Edward "Bud" Cole gave an interview to T3 in May celebrating the 75th anniversary of the Telecaster with comments on AI and music that initially flew under the radar. But it has started making the rounds recently, pour

音乐AI观点

中文介绍 Fender 首席执行官 Edward "Bud" Cole 在 5 月份庆祝 Telecaster 吉他问世 75 周年的 T3 采访中，就人工智能与音乐发表了看法。Cole 将乐队成员比作「模拟 AI」，此番言论最初未引起广泛关注，但现在开始引发讨论。

Malaysia is reportedly shutting down Balaji Srinivasan’s Network School

TechCrunch · 08/03 01:05

Let's see how this "frontier community for techno-optimists" is doing ...

教育政策区块链

中文介绍消息称，马来西亚正在关闭由 Balaji Srinivasan 创办的 Network School。这所学校被称为「技术乐观主义者的前沿社区」，其具体关闭原因和后续影响尚待进一步报道。此举可能对数字教育和去中心化学习模式产生影响。

Xbox prices are increasing by up to €200 or £170

The Verge · 08/03 00:14

When Microsoft announced its latest round of Xbox price bumps in June, it only gave US pricing. Now we know the pricing increases for the EU and UK, and they're dramatic. Depending on the model, Xbox prices are increasing by up to €200 or £170. The 1TB Xbox Series X with a disc drive is going up by

游戏主机涨价微软

中文介绍微软在 6 月份公布 Xbox 在美国市场的涨价信息后，现已确认欧盟和英国地区的 Xbox 价格也将大幅上涨。根据不同型号，涨幅最高可达 200 欧元或 170 英镑。其中，配备光驱的 1TB Xbox Series X 涨价尤为显著。

TechCrunch Mobility: Two roads diverged — for robotaxis

TechCrunch · 08/03 00:05

Welcome back to TechCrunch Mobility, your hub for the future of transportation and now, more than ever, the role AI is playing in it.

自动驾驶AI交通

中文介绍 TechCrunch Mobility 栏目指出，在未来交通领域，尤其是机器人出租车的发展上，目前出现了两条不同的道路。文章将探讨人工智能在塑造这一新兴产业中的关键作用，并分析自动驾驶技术面临的挑战与机遇。

These App Store hidden gems prove there’s still room for great software in the AI era

TechCrunch · 08/02 23:23

Despite predictions that AI agents could make traditional apps obsolete, developers are shipping new software faster than ever. From smarter bookmarking tools and neighborhood marketplaces to digital pen pals and nature journals, here are the latest App Store finds worth adding to your Home Screen.

App StoreAI软件开发

中文介绍尽管有预测认为 AI 智能体可能取代传统应用程序，但开发者仍在以前所未有的速度推出新软件。App Store 上涌现出许多“隐藏佳作”，包括更智能的书签工具、邻里市场、数字笔友和自然日记等，证明在 AI 时代，优秀软件仍有广阔的发展空间。

Skylight’s smart calendars are up to $90 off during its back-to-school sale

The Verge · 08/02 23:00

The 15-inch Skylight Calendar 2 can stand on a table or mounted to the wall. | Image: Skylight The start of a new school year can feel hectic, as parents juggle their kids’ classes, extracurriculars and sports on top of work, appointments, and other responsibilities. It’s easy for things to slip thr

智能家居促销日历

中文介绍在返校季促销活动期间，Skylight 的智能日历产品提供高达 90 美元的折扣优惠。例如，15 英寸的 Skylight Calendar 2 兼具桌面放置和壁挂功能，旨在帮助家庭更好地管理开学季繁忙的课程、课外活动和日常安排。

HP’s HyperX Omen 15 isn’t quite the budget-friendly gaming laptop its predecessor was

The Verge · 08/02 22:00

Tough sell. | Photo: Antonio G. Di Benedetto / The Verge The HP HyperX Omen 15, which I first saw at CES, replaces the HP Victus 15, a longtime bestselling budget gaming laptop. The Victus cost just $800, or less when on sale, and was a good entry point to decent laptop gaming. The Omen 15 has nice

笔记本电脑游戏硬件惠普

中文介绍惠普的 HyperX Omen 15 游戏笔记本电脑在 CES 上首次亮相，旨在取代畅销的惠普 Victus 15。与前代 Victus 15 曾以 800 美元左右的低价成为入门级游戏本首选不同，新款 HyperX Omen 15 似乎不再主打高性价比，其定价策略有所调整。

OpenAI Presence wants to make AI agents production-ready for businesses

The Decoder · 08/02 21:10

OpenAI's new enterprise offering, Presence, is designed to get AI agents into production for customer service and internal workflows. Unlike the existing Workspace Agents, Presence targets external deployments. For complex cases, OpenAI's own engineers step in. The article OpenAI Presence wants to m

OpenAIAI智能体企业服务

中文介绍 OpenAI 推出新的企业级产品 Presence，旨在帮助企业将 AI 智能体投入实际生产环境，用于客户服务和内部工作流程。与现有的 Workspace Agents 不同，Presence 主要面向外部部署。对于复杂案例，OpenAI 的工程师团队也将提供介入支持。

Is paying artists enough to convince them to embrace AI?

The Verge · 08/02 21:00

Illustrators have spent years sounding the alarm about generative artificial intelligence startups training their models on artists' work without permission. They've pointed out how the practice is tantamount to theft, and in response, many gen AI boosters have argued that it's necessary for the tec

AI艺术版权伦理

中文介绍插画师们多年来一直对生成式人工智能初创公司未经许可使用艺术家作品训练模型表示担忧，认为这种行为等同于盗窃。围绕艺术家是否会因获得报酬而接受 AI 技术展开讨论，探讨如何平衡版权保护与技术发展之间的关系，以及报酬能否解决艺术家们的伦理和经济顾虑。

Meta AI uses a second AI agent as a memory coach to keep long tasks on track

The Decoder · 08/02 20:57

Meta AI wants to stop AI agents from forgetting errors they've already diagnosed and repeating failed steps during complex tasks. A separate memory agent maintains a structured memory bank and decides when to remind the main agent and when to stay silent. The system improved scores by up to 8.3 perc

MetaAI模型智能体

中文介绍 Meta AI 正在探索利用第二个 AI 智能体作为“记忆教练”，以防止主智能体在处理复杂任务时忘记已诊断的错误并重复失败步骤。这个独立的记忆智能体负责维护结构化的记忆库，并根据需要提醒主智能体，显著提升了任务完成表现。

A real macOS flaw worth $200K went unreported because Apple's bug bounty inbox was full of AI slop

The Decoder · 08/02 20:42

Apple's bug bounty program is drowning in AI-generated bug reports. The company has capped submissions per researcher because fabricated reports are clogging the review pipeline. As a result, Italian startup Bynario was initially unable to report a serious macOS vulnerability worth up to $200,000 on

网络安全苹果AI滥用

中文介绍苹果公司的漏洞赏金计划因充斥大量 AI 生成的虚假报告而陷入困境，导致该公司对每位研究人员的提交数量进行了限制。结果，意大利初创公司 Bynario 最初未能提交一个价值 20 万美元的严重 macOS 漏洞，凸显了 AI 滥用对网络安全报告流程的负面影响。

Foldables are sort of boring now — and that’s great news for Apple

The Verge · 08/02 20:00

This is The Stepback, a weekly newsletter breaking down one essential story from the tech world. For more on smartphones and Android, follow Dominic Preston. The Stepback arrives in our subscribers' inboxes on Sunday at 8AM ET. Opt in for The Stepback here. How it started Believe it or not, it didn'

折叠屏手机苹果市场分析

中文介绍一篇评论文章指出，可折叠手机如今的市场表现趋于平淡，缺乏革命性创新，这对于尚未推出可折叠设备的苹果公司来说，反而是一个利好消息。文章认为，当前折叠屏市场缺乏新意，为苹果提供了充足的时间来完善自身的技术并观察市场发展。

AI finds plenty of security flaws, but almost none of them get exploited

The Decoder · 08/02 18:09

VulnCheck counted how often security flaws found by AI actually get exploited. Out of 1,061 AI-discovered vulnerabilities in the first half of 2026, just 14 saw confirmed attacks. That's 1.3 percent, the same rate as vulnerabilities overall. But exploits are landing faster, with the median dropping

AI安全漏洞网络安全

中文介绍据 VulnCheck 统计，在 2026 年上半年由 AI 发现的 1,061 个安全漏洞中，仅有 14 个遭到实际利用，利用率约为 1.3%，与总体漏洞利用率持平。这表明 AI 尽管能识别大量安全缺陷，但其中极少一部分会成为现实攻击的目标。然而，漏洞被利用的速度正在加快。

Europeans Are About to Find Out How Entrenched AI Is in Their Daily Lives

Wired AI · 08/02 18:00

New EU rules stipulate that people must be told when they’re interacting with AI or looking at AI-generated or -edited content, leading to fear of “disclosure fatigue.”

欧盟AI监管政策

中文介绍欧盟出台新规，要求民众在与 AI 互动或接触由 AI 生成/编辑的内容时必须被告知，这可能让欧洲人意识到 AI 在其日常生活中根深蒂固的程度。然而，该规定也引发了对可能出现「披露疲劳」的担忧，即用户可能因频繁收到提示而感到厌烦。

Claude Opus 5 pushes prompt-to-game AI from rough color blocks to full 3D prototypes with physics and music

The Decoder · 08/02 16:51

Anthropic's Claude Opus 5 generates complete 3D games from single prompts, including a first-person shooter, a kart racer, and a Minecraft clone, all without a single external asset. Geometry, textures, physics, and in some cases music are produced as code and run directly in the browser. In side-by

AI游戏Anthropic大模型

中文介绍 Anthropic 发布的 Claude Opus 5 能够仅凭单个提示词生成完整的 3D 游戏原型，包括第一人称射击、卡丁车竞速和《我的世界》克隆版等。该 AI 模型无需任何外部素材，直接生成游戏的几何结构、纹理、物理效果乃至部分音乐代码，并在浏览器中运行，实现了从概念到可玩原型的飞跃。

microsoft/AI-For-Beginners

Jupyter Notebook · ★ 58,873 · 🍴 11,590 · 📈 2,617 stars today

12 Weeks, 24 Lessons, AI for All!

AI教程学习资源机器学习

中文介绍 Microsoft 的 AI-For-Beginners 是一个为期12周、包含24节课程的全面人工智能学习路径，旨在普及 AI 知识，面向所有希望入门 AI 的学习者。该课程涵盖人工智能的核心概念、机器学习基础、深度学习、自然语言处理和计算机视觉等关键领域。它通过实践项目和清晰的讲解，帮助初学者逐步建立 AI 知识体系和动手能力，解决了传统 AI 学习门槛高的问题，非常适合学生、软件开发者以及希望转行进入 AI 领域的专业人士。

usekaneo/kaneo

TypeScript · ★ 6,071 · 🍴 513 · 📈 491 stars today

🎯 All you need. Nothing you don't. Open source project management that works for you, not against you.

项目管理开源团队协作

中文介绍 Kaneo 是一个开源的项目管理工具，其核心理念是提供必要功能，去除冗余复杂性，确保工具能够真正服务于用户，而非增加负担。它旨在优化团队协作效率，简化任务分配、进度跟踪和项目规划等管理环节。Kaneo 特别适合那些寻求轻量级、直观且高度可定制的项目管理解决方案的团队和个人，帮助他们在不被工具束缚的情况下高效推进项目。

lyogavin/airllm

Jupyter Notebook · ★ 25,551 · 🍴 2,875 · 📈 963 stars today

AirLLM 70B inference with single 4GB GPU

LLM推理显存优化边缘计算

中文介绍 AirLLM 致力于解决大型语言模型 (LLM) 推理的硬件限制问题，使得 700 亿参数级别的模型能够在仅有 4GB 显存的单张 GPU 上高效运行。它通过优化模型加载、量化和计算方法，显著降低了 LLM 推理的资源门槛。这对于在本地设备、低成本硬件或边缘设备上部署复杂 AI 应用的开发者和研究人员极为有用。

iv-org/invidious

Crystal · ★ 21,928 · 🍴 2,452 · 📈 307 stars today

Invidious is an alternative front-end to YouTube

YouTube隐私保护前端

中文介绍 Invidious 是一个开源的 YouTube 替代前端，旨在提供更注重隐私、无广告且可定制的用户体验。它允许用户在不登录 Google 账号的情况下观看 YouTube 视频，并提供订阅、播放列表等功能，同时避免被追踪。适合追求隐私保护和更自由观看体验的用户。

codecrafters-io/build-your-own-x

Markdown · ★ 534,758 · 🍴 50,545 · 📈 710 stars today

Master programming by recreating your favorite technologies from scratch.

编程学习项目实战计算机基础

中文介绍 build-your-own-x 提供了通过从零开始重新实现流行技术来掌握编程的实践路径。它鼓励学习者通过亲手构建诸如 Git、Docker、Redis 等系统，深入理解其底层原理和工作机制。该项目是为那些寻求通过项目实战巩固计算机科学基础、提升编程技能，并获得更深层次技术洞察力的开发者和自学者设计的。

zhaoxuya520/reverse-skill

PowerShell · ★ 13,135 · 🍴 1,969 · 📈 1,145 stars today

Reverse Engineering / Authorized Penetration Testing / Security Research Skill Router Pack AI-powered routing + On-demand toolchain bootstrapping + Self-evolving knowledge base Supports Claude Code, Kiro, Cursor, Cline, and other AI coding clients 逆向/渗透/安全技能路由包 - AI 自动路由 + 按需自举工具链 + 自动进化经验库 | 支持 Cla

逆向工程安全研究AI工具

中文介绍 reverse-skill 是一个面向逆向工程、授权渗透测试和安全研究的智能技能路由包。它利用 AI 实现工具和知识的智能路由，并支持按需启动工具链，结合自进化的知识库，大大简化了安全分析流程。该项目旨在提升安全专家在漏洞挖掘、恶意软件分析等场景下的效率，并支持集成 Claude Code 等高级 AI 辅助能力。

different-ai/openwork

TypeScript · ★ 20,256 · 🍴 2,084 · 📈 319 stars today

The open-source alternative to Claude Cowork (powered by opencode)

AI助手开源企业协作

中文介绍 openwork 是一个开源项目，旨在作为类似 Claude Cowork 的 AI 协作助手的替代方案。它提供了一个可定制和私有化部署的平台，帮助团队进行内容创作、信息总结、任务管理等，解决了对专有 AI 工具数据隐私、厂商锁定及定制化不足的担忧。主要面向希望构建自主可控 AI 协作环境的企业、开发者或寻求增强数据安全性的团队，支持集成各类开源大语言模型。

microsoft/generative-ai-for-beginners

Jupyter Notebook · ★ 114,686 · 🍴 61,279 · 📈 588 stars today

21 Lessons, Get Started Building with Generative AI

生成式AI学习教程入门

中文介绍该项目是微软推出的一套生成式 AI 入门课程，包含 21 节课。它旨在帮助初学者快速掌握生成式 AI 的核心概念与开发实践，通过实际构建应用来学习相关技术。适合对生成式 AI 感兴趣的开发者、学生或研究人员，是系统学习 AI 应用开发的优秀起点。

Panniantong/Agent-Reach

Python · ★ 64,594 · 🍴 5,341 · 📈 645 stars today

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

AI代理数据获取网络爬虫

中文介绍 Agent-Reach 赋予 AI 代理“观察”整个互联网的能力，使其能够无缝阅读和搜索来自 Twitter、Reddit、YouTube、GitHub、Bilibili、小红书等主流平台的公开信息。该项目通过一个简洁的命令行界面 (CLI)，帮助 AI 代理克服传统 API 限制和高昂费用，以低成本方式获取实时网络数据。它解决了 AI 代理信息获取的瓶颈问题，极大地扩展了代理的应用场景，适用于研究、内容分析、趋势监测等需要广泛网络数据支持的 AI 应用开发者。

TencentCloud/TencentDB-Agent-Memory

TypeScript · ★ 10,838 · 🍴 1,025 · 📈 604 stars today

TencentDB Agent Memory is a team-level memory hub for AI Agents — turning conversations, docs, and code into four reusable memory assets (Chat Memory, Skill, LLM-Wiki, Code-Graph) that are governed, shared, and equipped across agents and frameworks.

AI Agent长期记忆本地化

中文介绍 TencentDB Agent Memory 为 AI Agents 提供本地化的长效记忆解决方案。它采用四层渐进式管道设计，旨在实现无需外部 API 依赖的完全本地化记忆存储与检索，解决了 AI Agent 在需要长期记忆和上下文感知时对外部服务的依赖问题。该项目通过内建机制管理记忆，增强了数据隐私性和系统性能。适用于需要为 AI Agent 构建稳定、独立且具备丰富记忆能力的开发者，例如开发智能客服、个人助理或具备学习能力的自动化系统。

mvanhorn/last30days-skill

Python · ★ 56,827 · 🍴 4,968 · 📈 217 stars today

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

AI 代理信息检索内容摘要

中文介绍这是一个 AI 代理技能，能迅速在 Reddit、X、YouTube、Hacker News、Polymarket 及整个互联网上研究任意话题，并综合生成一份基于事实的摘要。主要用于获取过去 30 天的最新信息，解决信息过载问题，帮助用户快速了解热点动态和事件。适合研究人员和内容创作者。

NomaDamas/k-skill

JavaScript · ★ 6,866 · 🍴 803 · 📈 179 stars today

한국인을 위한 스킬 모음집 - 에이전트를 한국인으로

AI Agent本地化韩国语

中文介绍 `k-skill` 是一个专为韩国用户和开发者设计的 AI Agent 技能集合。其目标是增强 AI Agent 的本地化能力，使其更好地理解、处理韩语文本，并适应韩国文化及特定任务需求。通过集成这些“技能”，可以构建出更符合韩国用户习惯、具备更高本土化智能水平的 AI 应用。

HarbourMasters/Lighthouse

C · ★ 210 · 🍴 15 · 📈 62 stars today

通用工具系统管理开发辅助

中文介绍 HarbourMasters 的 Lighthouse 项目，虽然缺乏具体描述，但其命名暗示它可能是一个用于监测、指引或管理特定系统或资源的工具。如同灯塔为航海提供方向，该项目或旨在提供洞察力、优化性能或确保某种流程的顺畅运行。开发者或系统管理员可能会关注此类项目，以寻求改善其基础设施或应用的可见性和控制能力。

antirez/ds4

C · ★ 19,951 · 🍴 1,772 · 📈 187 stars today

DeepSeek 4 Flash and PRO local inference engine for Metal, CUDA and ROCm

LLM本地推理AI

中文介绍 antirez 推出的 `ds4` 项目是一个高性能的本地推理引擎，专为 DeepSeek 4 Flash 和 PRO 系列大型语言模型设计。它利用先进的硬件加速技术，支持 Apple 的 Metal、NVIDIA 的 CUDA 以及 AMD 的 ROCm 平台，使用户能够在个人设备上高效运行 DeepSeek 4 模型。这为开发者和研究人员提供了在本地进行 LLM 推理的能力，适用于对数据隐私、运行成本或离线可用性有严格要求的场景。

esengine/DeepSeek-Reasonix

Go · ★ 28,987 · 🍴 1,868 · 📈 389 stars today

DeepSeek-native AI coding agent for your terminal. Engineered around prefix-cache stability — leave it running.

AI编程LLM终端工具

中文介绍 `esengine` 推出的 `DeepSeek-Reasonix` 是一个专为终端用户设计的 DeepSeek 原生 AI 编码代理。它旨在为开发者在命令行界面提供智能编程辅助，例如代码生成、补全或问题解答。该项目特别强调了其“前缀缓存稳定性”工程优化，确保代理可以长时间运行并保持高效，减少重复计算。这使得开发者能够无缝地在终端内获取 AI 协助，从而提升编码效率和开发体验。

You already own the most valuable thing in shopping. You just can't reach it.

@fridayresearch_ · 1.9K 粉丝 · 1.6M 阅 · 597 赞 · 451 转

07/31 01:30

Your taste is scattered across forty companies that each own a piece and answer to advertisers. FRIDAY is building the version you own — a personal shopping intelligence that lives on your device and

产品发布购物智能数据隐私

中文介绍 FRIDAY 正构建一款设备端个人购物智能工具，旨在整合用户分散在四十家公司中的购物偏好数据，将其归还给用户。这款工具将帮助用户管理自有品味数据，摆脱广告商影响，实现更自主的购物体验。

How to build an AI video studio in Claude Code:

@EXM7777 · 129.8K 粉丝 · 221.4K 阅 · 504 赞 · 36 转

07/28 22:13

I turned Claude Code into a working film studio and i'm going to hand you the complete system: the skills, the prompts, the loops, and the six-stage pipeline that ties them together each stage leans

教程Claude CodeAI视频

中文介绍博主分享如何在 Claude Code 中构建一个 AI 视频工作室的方法。

22580: From GPT2 to Kimi3, Explained

@waterloo_intern · 10.4K 粉丝 · 215.4K 阅 · 660 赞 · 86 转

07/27 23:22

Twenty-two thousand five hundred and eighty. That’s how many GPT-2 (2019) models fit inside KimiK3 (2026). We scaled up by a factor of 22,580 in seven years. But is it just... scale? In this worklog,

AI发展模型规模行业分析

中文介绍博主分析了从2019年的GPT-2到2026年预测的KimiK3模型，计算出模型规模在7年内增长了22,580倍。推文质疑这种「规模化」是否是唯一的进步指标，并表示会在工作日志中深入探讨AI发展中的其他关键因素。这篇分析旨在超越单纯的参数数量，审视AI技术演进的深层驱动力及潜在影响，鼓励对行业发展进行更全面的思考。

What's gone wrong with AI & labor — a thought experiment

@random_walker · 128.4K 粉丝 · 134.0K 阅 · 552 赞 · 54 转

07/29 01:56

A thought experiment that I think helps explain much of what’s gone wrong with AI and labor: Imagine an alternate universe in which — for whatever reason — no one ever published source code online.

观点AI与劳工社会影响

中文介绍博主通过一个思想实验，探讨 AI 与劳动力之间出现问题的原因。实验设想在一个「没有人在线发布源代码」的平行宇宙中，以此分析当前 AI 发展对劳动市场的影响及潜在困境。

Opus 5 is a really bad model

@HarukaKunori · 241 粉丝 · 129.0K 阅 · 632 赞 · 44 转

07/27 19:07

After trying out Opus 5 for a few hours today, I honestly think Anthropic's benchmark scores are a complete fraud. Sure, the model might have improved in a few areas, but it has regressed unbelievably

模型评测Opus 5用户反馈

中文介绍博主试用Anthropic的Opus 5数小时后，强烈质疑其基准测试分数存在「欺诈」。他认为尽管模型在某些方面有所改进，但在多数方面却出现了令人难以置信的退步，与官方宣传大相径庭。

The harness is all you need (mostly)

@github · 2.7M 粉丝 · 123.1K 阅 · 583 赞 · 69 转

07/29 04:28

A practical GitHub Copilot workflow for prototyping, planning, implementing, and reviewing software - without chasing every new AI tool. By @burkeholland If you’re feeling overwhelmed by AI right now,

工作流GitHub Copilot软件开发

中文介绍 GitHub 分享一个实用的 GitHub Copilot 工作流，涵盖软件原型设计、规划、实现与审查全过程。该方法强调高效利用 Copilot，避免盲目追逐各种新 AI 工具，减轻开发者对 AI 工具的焦虑。

We rewrote our agent to run entirely in a Durable Object with Pi, Agents SDK and Code Mode

@Vercantez · 1.9K 粉丝 · 122.3K 阅 · 567 赞 · 44 转

07/29 00:19

We recently finished moving the camelAI agent off of virtual machines. The agent now runs inside a Cloudflare Durable Object, its filesystem lives in SQLite and R2, and it writes JavaScript instead of

技术实现架构迁移AI代理

中文介绍 camelAI 团队宣布将其 AI 代理从虚拟机迁移至 Cloudflare Durable Object。新架构中，代理的文件系统由 SQLite 和 R2 承载，并使用 JavaScript 进行编写，显著优化了代理的运行效率和部署方式。

Why Software Factories Fail: Benchmarking the new frontier

@dexhorthy · 27.3K 粉丝 · 115.4K 阅 · 500 赞 · 39 转

07/28 01:43

This is a continuation of Parts 1 and 2 of "Why Software Factories Fail" Part 1: the harness is not enough Part 2: turning the lights back on we got better benchmarks Remember when I said this in Part

软件工程基准测试系统设计

中文介绍该帖子是「软件工厂为何失败」系列文章的续篇，深入探讨了构建和评估软件工厂的挑战。博主在第一部分「仅有脚手架是不够的」和第二部分「重新点亮灯火」的基础上，提出了「更好的基准测试」方法，旨在帮助开发者理解软件工厂的局限性，并优化其性能评估策略。内容聚焦如何避免常见陷阱，提升自动化软件开发的成功率。

Here's exactly how to build your company brain (in 5 mins)

@DhravyaShah · 61.2K 粉丝 · 56.0K 阅 · 548 赞 · 36 转

07/28 14:16

Every company will have a company brain, whether they believe it or not. This is the bet that I'm making, and I'm constantly seeing very future forward startups come to us for their own setup. What is

企业AI知识库教程

中文介绍博主提出「公司大脑」是企业发展的必然趋势，并分享了「如何准确地在5分钟内构建公司大脑」的实操指南。他观察到许多前瞻性初创公司正积极寻求建立自己的公司大脑系统。该帖子旨在指导企业快速搭建一个整合知识、驱动智能决策的内部AI平台，强调其对未来公司运营的重要性与即时性，提供实用操作建议。

Getting the most out of GPT-5.6: Sol, Terra, and Luna

@cerebras · 64.6K 粉丝 · 48.8K 阅 · 550 赞 · 40 转

07/28 03:44

Authors: @0xSero & Zhenwei Gao (@zhennydez) Your Codex subscription now comes with 3 main models, Sol, Terra, and Luna, each independently trained and served, with reasoning dials. Together, they

产品发布模型特性AI工具

中文介绍该帖子宣布，Codex订阅服务已新增三款主要模型：Sol、Terra和Luna。这三款模型均独立训练、部署，并配备了独特的「推理调节器」（reasoning dials）。推文旨在指导用户如何充分利用这些新模型，以优化其在GPT-5.6（或高级AI任务）中的表现，帮助用户根据不同任务需求微调模型行为，从而获得更精准和高效的AI输出。

MiniMax H3: An Open Model Breaking the Boundaries Between Tasks and Modalities

@MiniMax_AI · 107.4K 粉丝 · 42.3K 阅 · 696 赞 · 103 转

07/31 09:53

Today, we're launching MiniMax H3, a general-purpose multimodal generation model. H3 understands unified context across text, images, video, and audio, generating video with native stereo sound, up to

模型发布多模态AI视频

中文介绍 MiniMax 推出 H3 通用多模态生成模型，该模型能理解文本、图像、视频、音频的统一上下文，并能生成带有原生立体声音频的视频内容，旨在打破任务与模态之间的界限。

PagedAttention & RadixAttention

@jaga_prasanna · 799 粉丝 · 37.9K 阅 · 502 赞 · 55 转

07/27 17:49

today we look at how two techniques solve memory and compute bottlenecks from two complementary angles virtual memory paging for intra-request memory allocation and prefix tree caching for

技术解析性能优化内存管理

中文介绍帖子深入探讨了两种互补技术 PagedAttention 和 RadixAttention，如何从内存分配和计算瓶颈两方面进行优化。前者利用虚拟内存分页解决请求内的内存分配问题，后者通过前缀树缓存提高效率，旨在提升AI模型运行的性能。

AI News Roundup!

@AITECHio · 455.4K 粉丝 · 32.6K 阅 · 512 赞 · 110 转

07/31 19:58

ChatGPT Can Now Talk While Running Your Computer OpenAI has taken AI agents another step forward. You can now have a natural voice conversation with ChatGPT while it operates your computer in the

AI 新闻ChatGPT智能体

中文介绍 OpenAI 推出 ChatGPT 新功能，用户可与模型进行自然语音对话，同时让它操作电脑。这标志着 AI 智能体技术又向前迈进了一步，增强了人机交互的深度和广度，预示了更高级别自动化应用的可能。

PayBox 101

@moonpay · 421.1K 粉丝 · 32.0K 阅 · 547 赞 · 62 转

08/02 21:18

MoonPay, the global financial technology company powering the movement of value across fiat and digital assets, has launched PayBox, the first payment vault built for AI that lets a person's AI agent

产品发布AI支付金融科技

中文介绍金融科技公司MoonPay推出PayBox，这是首个专为AI设计的支付保管库，旨在让用户的AI代理管理和执行支付，实现法币与数字资产的无缝价值转移，进一步推动AI在金融领域的应用。

Anthropic Are Buying Rare Books, Feeding Them Into AI, Then Destroying Them.

@ActionModelAI · 57.6K 粉丝 · 6.8K 阅 · 521 赞 · 390 转

07/30 21:12

It sounds like the plot of a dystopian film. But according to recently released court documents, internal company communications, and a reported $1.5 billion settlement, it's something that has

AI伦理版权争议数据来源

中文介绍报告指出，Anthropic 被曝购买稀有书籍用于AI训练后销毁，这涉及内部文件和15亿美元的和解金。帖子揭示了AI公司数据获取方式引发的争议，探讨了版权、伦理及数据所有权问题。

PayBox 101

@moonpay · 421.1K 粉丝 · 32.0K 阅 · 7d 曝光 79.8K

08/02 21:18

PayBox 101

FSD v14.3.7: My review

@BLKMDL3 · 93.4K 粉丝 · 108.0K 阅 · 7d 曝光 108.0K

08/02 13:25

FSD v14.3.7: My review

产品评测自动驾驶用户体验

中文介绍博主对特斯拉FSD (Full Self-Driving) 自动驾驶系统的最新版本v14.3.7进行了深度评测，分享了该版本在实际驾驶场景中的表现、功能改进及个人使用体验。

8G显卡本地未审查生图，42个风格我都替你测好了

@davinci_seven · 21.2K 粉丝 · 283.5K 阅 · 7d 曝光 283.5K

08/01 13:56

8G显卡本地未审查生图，42个风格我都替你测好了

AI绘画本地部署风格测试

中文介绍博主实测使用8G显卡在本地生成未审查AI图片的可行性，并详细分享了42种不同风格的生图效果与性能表现，为低配置硬件用户提供本地AI绘画的实用参考。

Deep Dive: What Happens When Cars Drive Themselves

@chamath · 2.3M 粉丝 · 90.8K 阅 · 7d 曝光 90.8K

08/01 00:25

Deep Dive: What Happens When Cars Drive Themselves

自动驾驶深度分析未来趋势

中文介绍博主深入探讨了自动驾驶汽车普及后的潜在影响和未来情景。内容可能涵盖技术挑战、社会伦理、交通模式改变以及法律法规等多个维度，对自动驾驶的深远影响进行了全面分析。

Windows quality: an update on the commitment we made in March

@pavandavuluri · 6.8K 粉丝 · 197.1K 阅 · 7d 曝光 197.1K

08/01 00:00

Windows quality: an update on the commitment we made in March

产品更新Windows微软

中文介绍该推文提供了关于 Windows 系统质量改进的最新进展，回顾了三月份做出的承诺。内容可能涉及微软在提升用户体验和系统稳定性方面的具体措施及成效，旨在回应用户对产品质量的关切。

Clash Verge从0~1零基础教程（界面认识）

@gengdaJ · 47.0K 粉丝 · 86.1K 阅 · 7d 曝光 86.1K

07/31 22:18

Clash Verge从0~1零基础教程（界面认识）

AI News Roundup!

@AITECHio · 455.4K 粉丝 · 32.6K 阅 · 7d 曝光 32.6K

07/31 19:58

AI News Roundup!

最近大火的 AI 岗位，FDE 到底是干嘛的？普通人怎么上车？

@AdrianPunk115 · 23.0K 粉丝 · 406.0K 阅 · 7d 曝光 406.0K

07/31 15:19

最近大火的 AI 岗位，FDE 到底是干嘛的？普通人怎么上车？

职业发展AI 岗位FDE

中文介绍博主探讨近期大火的 AI 岗位「FDE」（AI 功能开发工程师）的具体职责与职业前景，并为普通人提供了进入该领域的上车路径和实用建议，帮助读者了解如何转型或学习相关技能。

MiniMax H3: An Open Model Breaking the Boundaries Between Tasks and Modalities

@MiniMax_AI · 107.4K 粉丝 · 42.3K 阅 · 7d 曝光 42.3K

07/31 09:53

MiniMax H3: An Open Model Breaking the Boundaries Between Tasks and Modalities

ReToken: One Token to Improve Vision-Language Models for Visual Retrieval

👍 5

07/30 08:00

Long visual context poses a challenge for vision-language models: performance degrades as the number of distractors grows, and processing all tokens at once is computationally infeasible under GPU memory constraints. We present ReToken, a single learnable embedding trained as an explicit retrieval t

视觉语言模型视觉检索AI模型

中文介绍视觉语言模型在长视觉上下文和干扰项增多时性能下降，且受限于GPU内存。研究提出ReToken，一个可学习的单一嵌入，旨在改善视觉语言模型在视觉检索任务中的表现，有效解决计算资源瓶颈。

ACE-Data-0: Human-Centric Ambient Capture as Embodied Data Engine

👍 36

07/30 08:00

Embodied intelligence faces a fundamental data bottleneck. Models must capture how first-person perception, whole-body motion, dexterous manipulation, object state, sound, and touch evolve together as humans pursue goals over time. Existing datasets fragment this experience across viewpoints, modali

具身智能数据集AI数据

中文介绍具身智能面临数据瓶颈，现有数据集无法全面捕捉人类第一人称感知、全身运动、灵巧操作、物体状态、声音和触觉等体验。本研究推出ACE-Data-0，一个以人为中心的环境捕捉具身数据引擎，旨在弥合这一数据鸿沟。

PhiZero: A World Model Built Around Physical Language

👍 158

07/30 08:00

We introduce PhiZero, a physical world model built around physical language, a compact discrete representation of world-state transitions. Existing physical world models typically predict future videos directly in pixel space, leaving the underlying world dynamics implicit within high-dimensional vi

世界模型物理语言AI模型

中文介绍现有物理世界模型通常直接在像素空间预测未来视频，隐含了世界动态。研究引入PhiZero，一个基于「物理语言」构建的物理世界模型，该物理语言是一种紧凑的离散表示，能够明确地捕捉世界状态的转换，提升了对世界动态的理解。

AskChem: Claim-Centered Infrastructure for Chemistry Literature Synthesis

👍 292

07/30 08:00

Chemistry literature synthesis often requires assembling specific findings scattered across many publications, yet existing literature-search systems primarily return ranked document lists. As a result, scientists and AI agents need to locate relevant information, verify their provenance, and assemb

化学文献检索AI应用

中文介绍化学文献综合需从多篇论文中汇集特定发现，但现有文献搜索系统仅返回文档列表，效率低下。本研究提出AskChem，一个「以声明为中心」的基础设施，旨在帮助科学家和AI智能体更高效地定位和综合化学文献中的相关信息。

AISPA: User-Centric System Prompt Auditing for Large Language Model Applications

👍 0

07/30 08:00

System prompts are instructions configured by developers to govern the behaviors of foundation models in AI applications. They are used throughout commercial AI products, but are rarely disclosed to the public or regulators, creating a serious trust and accountability gap in the wide deployment of A

大模型系统提示AI审计

中文介绍「AISPA」是一个针对大型语言模型（LLM）应用的「用户中心系统提示审计」系统。系统提示是开发者为基础模型设定的指令，以管理其在AI应用中的行为。这些提示在商业AI产品中普遍使用，但很少向公众或监管机构披露，这导致了严重的信任和问责缺口。AISPA的提出旨在解决这一问题，通过用户中心审计来增强透明度和可信度。

Extraction of σ_{TT} for Proton, Neutron, Deuteron and ^3He from Quasi-real Photon Scattering

👍 0

07/30 08:00

We report on an extraction of the polarized photoproduction cross-section for the proton, deuteron, neutron and ^3He, obtained by extrapolating electron scattering data to the real photon point. The data are from the Jefferson Lab E97-110 (^3He) and CLAS EG4 (proton and deuteron) experiments. Inform

核物理粒子物理实验数据

中文介绍一项研究报告了从准实光子散射中提取质子、中子、氘和氦-3的极化光产生截面σ_{TT}。该结果通过将电子散射数据外推到实光子点获得，数据来自杰斐逊实验室E97-110（用于氦-3）和CLAS EG4（用于质子和氘）实验。

Chimera: Designing and Chinchilla-Scaling Hybrid Visual Diffusion Transformers

👍 17

07/30 08:00

Visual generation increasingly requires high-resolution images, long videos, and multimodal context, making the quadratic cost of full attention prohibitive. We introduce Chimera, a hybrid visual diffusion backbone with a principled scaling recipe. Chimera processes text, image, and video tokens in

视觉生成扩散模型Transformer

中文介绍视觉生成任务日益需要高分辨率图像、长视频及多模态上下文，导致全注意力机制的二次计算成本过高。研究引入Chimera，一种混合视觉扩散骨干网络，具备系统性的扩展方案，能够高效处理文本和图像，克服了计算限制。

Beacon: Knowing When and How to Perform Agentic Visual Reasoning

👍 47

07/30 08:00

The fundamental goal of agentic visual reasoning is to improve the success rate of multimodal large language models (MLLMs) on complex tasks, rather than merely equipping them with a sophisticated yet inefficient reasoning paradigm. In this work, we rethink agentic visual reasoning through two key d

具身智能视觉推理多模态大模型

中文介绍具身视觉推理旨在提升多模态大模型在复杂任务上的成功率。本研究重新审视了具身视觉推理的「何时」以及「如何」执行，并提出了Beacon框架，以期更有效率地实现这一目标，避免仅仅提供复杂但低效的推理范式。

β-OPSD: Deriving with Policy Optimization, Training with Self-Distillation

👍 19

07/30 08:00

On-policy self-distillation (OPSD) is a promising approach to improve reasoning language models, but it remains brittle in practice: making it work reliably often requires substantial engineering effort. We identify a structural source of this difficulty: vanilla OPSD is precisely the β=1 member of

语言模型自蒸馏强化学习

中文介绍策略内自蒸馏（OPSD）是改进推理语言模型的一种有前景方法，但实践中其稳定性不足，需要大量工程投入。本研究识别了其结构性难题，并提出β-OPSD，通过策略优化进行推导并结合自蒸馏训练，以提高方法的可靠性。

ROAD: Reciprocal-Objective Alignment of Discriminative Semantics for 3D Shape Generation

👍 0

07/30 08:00

High-fidelity 3D generation predominantly relies on scaling model capacity and data, which incurs prohibitive computational costs. This paradigm typically requires learning geometry from scratch and overlooks the rich semantic and structural priors already encapsulated in discriminative 3D foundatio

3D生成大模型人工智能

中文介绍高保真3D生成模型过度依赖扩展模型容量和数据，导致计算成本过高。一篇论文提出了ROAD方法，通过辨别性语义的互惠目标对齐，利用现有语义和结构先验来优化3D形状生成，以降低计算成本。

Frontis-MA1: Training an AI4AI Model towards Recursive Self-Improvement in Machine Learning Engineering

👍 168

07/30 08:00

Recursive self-improvement (RSI) requires AI systems that improve the process of building AI (i.e., AI4AI); machine learning engineering (MLE) offers a concrete, executable testbed for studying this capability. We introduce OpenMLE, an open full-stack system for RSI research in MLE, spanning verifia

AI4AI机器学习工程自我改进

中文介绍递归自我改进（RSI）要求AI系统能改进构建AI（即AI4AI）的过程，机器学习工程（MLE）为此提供了具体测试平台。研究引入Frontis-MA1，一个AI4AI模型，并推出OpenMLE，一个用于RSI研究的开放全栈系统，以实现机器学习工程的自我提升。

X-NavDP: Generalizing Navigation Diffusion Policy to Novel Behavior and Embodiments with Group Q-score Reweighted Matching

👍 1

07/30 08:00

Pretraining navigation diffusion policies rely on large-scale expert demonstrations. These data are typically generated by a fully-informed oracle planner suited to a single nominal robot. This limits the policy's generalization to diverse embodiments and challenging scenarios (e.g., escaping dead e

机器人导航扩散策略

中文介绍现有的导航扩散策略预训练依赖大规模专家演示数据，但这些数据通常由针对单一机器人的规划器生成，限制了策略向多样化实体和复杂场景的泛化能力。本研究提出X-NavDP，利用群组Q值重加权匹配来泛化导航策略至新行为和新实体。

RefCaptioner: Multi-Reference Image-Grounded Video Captioning

👍 25

07/30 08:00

Existing video captioning models generate natural descriptions of video content but cannot explicitly ground local visual elements to multiple reference images. We introduce multi-reference image-grounded video captioning, a new task requiring factual video descriptions with phrase-level reference g

One Future, Every Robot: Label-Efficient Collective-State Prediction with Decentralized JEPA

👍 1

07/30 08:00

Can every robot in a swarm predict the same future collective state from only local observations and bandwidth-limited messages? We formulate this as decentralized shared-state prediction and introduce Collective-State JEPA (CS-JEPA), a recurrent joint-embedding predictive architecture whose output

QAdapt: A Noise-Adaptive Neural Pre-Decoding Framework for Quantum Error Correction

👍 0

07/30 08:00

Fault-tolerant quantum computing (FTQC) relies on quantum error correction to suppress physical errors and preserve logical information at scale. In practice, however, performance is constrained not only by physical noise but also by the latency of classical decoders processing rapidly generated syn

Can Large Language Models Execute Parent Orders?

👍 15

07/30 08:00

Parent-order execution is a core problem in algorithmic trading, where the goal is to split a large order into smaller orders while reducing execution costs. Existing approaches either rely on pre-specified market assumptions that may not hold in practice, or require task-specific training that limi

LEDGERMIND: Provenance-Constrained Multimodal Agentic Reasoning with a Structured Evidence Ledger

👍 10

07/30 08:00

Multimodal agents for visual question answering increasingly operate as multi-step trajectories that interleave perception, retrieval, and reasoning, yet evaluation still largely reduces to final-answer accuracy. This aggregate signal cannot tell whether a correct answer was reached through grounded

ShadowDancer: Teaching Video World Models Any Action by Learning Unified Dynamics Representations from a Video and Its Shadow

👍 16

07/30 08:00

We present ShadowDancer, a novel approach to any-action, frame-level control of interactive video world models. The obstacle is representational: existing interfaces either encode an action loosely, leaving how it unfolds for the model to improvise, or encode it exactly through structured signals th

Fairness Pruning: Locating Demographic Bias in GLU-MLP Layers via Differential Activations

👍 2

07/30 08:00

This work presents Fairness Pruning, a lightweight structural intervention method designed for the management and future mitigation of demographic bias in large language models (LLMs). As a foundational empirical validation of this method, this work focuses on causal bias localization. Using minimal

Beyond Geometric Complementarity: Coherent Overlap in Sparse Mixture-of-Experts Routing

👍 2

07/30 08:00

Sparse mixture-of-experts (MoE) language models route each token to multiple experts, suggesting a geometric account of their benefit: co-selected experts should contribute distinct representation directions. Existing evidence often conflates route coherence, candidate quality, and candidate-by-cont

MORFES: A Benchmark for Productive Inflectional Competence in Modern Greek

👍 0

07/30 08:00

Modern Greek is a richly inflected language, yet the language models built for it are evaluated mainly on factual knowledge, and no benchmark is dedicated to their inflectional competence. We introduce MORFES (Morphological Open-class Recognition-and-Formation Evaluation Suite), a benchmark of 500 e

MemHarness: Memory Is Reconstructed, Not Replayed

👍 14

07/30 08:00

Retrieving past experiences has become a common strategy to enhance large language model agents. However, most existing memory-augmented agents treat retrieved experiences as static records to be replayed verbatim, injecting them into the context regardless of whether they align with the agent's cur

TARS: Timestep-Aware Data Scaling for 3D-Free Video Re-Shooting

👍 7

07/30 08:00

Video re-shooting aims to regenerate videos with controllable camera motion and viewpoint. Existing methods rely on explicit 3D priors, which are limited by reconstruction quality and often perform poorly when synthesizing previously unseen regions, or on paired videos with different camera trajecto

Qwen-UI-Agent Technical Report: Toward Next-Generation Real-World Centric Foundation GUI Agents

👍 282

07/30 08:00

GUI agents have the potential to become a general purpose executor over existing digital devices. To advance them toward real-world use, we envision agents that operate reliably on real devices, execute workflows across platforms, combine GUI interaction with CLI execution, complete long-horizon tas

Diversifying Personalized Research Ideation against AI-Induced Homogenization

👍 0

07/30 08:00

AI-assisted research ideation has emerged as a promising paradigm for accelerating scientific discovery, with systems now capable of generating research directions conditioned on papers, topics, or lightweight researcher contexts. Yet current systems largely optimize individual suggestions in isolat

Distilling Answer Set Programming Theories from Large Language Models

👍 0

07/30 08:00

Writing Answer Set Programming (ASP) theories from scratch is a difficult and time-consuming task. We take a neurosymbolic approach to study whether a model can distill complete and correct theories, given a fixed agent harness with the solver in the loop. The protocol is dataset-agnostic: with a si

Echoverse: Deep, Evolving Environments for Training Computer-Use Agents at Scale

👍 9

07/30 08:00

Computer-use agents learn from what their actions change, so training one needs applications it can act on, break and reset. The applications that matter most are login-gated and stateful, so synthetic environments stand in for them. Recent pipelines generate such environments in bulk, which moves t

Flux-OPD: On-Policy Distillation with Evolving Contexts

👍 40

07/30 08:00

Large language model training in open-ended domains lacks verifiable rewards, making task preferences difficult to formalize as effective supervision. Contexts can convey such preferences, yet provide little additional supervision once distilled into the student, motivating contexts that evolve with

Σ-Mem: An Online Reliability Memory for LLM-based Multi-Agent Systems

👍 12

07/30 08:00

Memory is central to long-horizon LLM agents, yet existing memory systems primarily preserve interaction content rather than modeling which agents can be trusted and under what conditions. This limitation is particularly important in multi-agent systems, where a central model may be unable to direct

The Geometric Nature and a Free Proxy for Flow-Matching Uncertainty

👍 0

07/30 08:00

Flow matching (FM) has become a popular action head paradigm for modern embodied models. However, as a conditional generative model, it does not explicitly expose its inherent uncertainty, producing faulty action chunks even when it misinterprets the scene or encounters out-of-distribution (OOD) inp

TimeOS 2.0

08/03 04:40

Work your tasks. Bill your clients with confidence.

任务管理效率工具时间追踪

中文介绍 TimeOS 2.0 是一款终极任务管理器，专为专业人士设计。它能帮助用户高效管理日常任务，并自信地向客户开具账单，简化项目追踪和时间记录流程，从而全面提升用户的工作效率和财务管理能力。

Finamie

08/03 04:40

Speak your expenses and get instant spending insights

财务管理语音输入消费分析

中文介绍 Finamie 是一款个人财务管理工具，允许用户通过语音记录日常开销。它能即时提供消费洞察，帮助用户更好地了解和管理自己的资金流向，实现智能财务追踪，从而提升个人理财效率。

YourSitee

08/03 04:39

Make your bio link worth clicking

个人主页链接聚合营销工具

中文介绍 YourSitee 是一款旨在优化个人简介链接的工具。它帮助用户创建一个更具吸引力且「值得点击」的聚合链接页面，从而提升在社交媒体等平台上的个人或品牌展示效果和用户互动，有效引导流量。

Lumichats(Free)

08/03 04:39

A Claude Code alternative for people who avoid the terminal

代码工具AI助手离线应用

中文介绍 Lumichats 是一款免费的离线代码助手，定位为 Claude Code 的替代品。它专为不习惯使用终端的用户设计，提供了一个无需命令行即可进行代码交互和辅助的环境，旨在提升开发便利性和效率。

Zinley

08/03 04:39

Your Personal AI Representative for calls, email, and tasks

AI助手效率工具自动化

中文介绍 Zinley 是一款个人AI代表，能够协助用户处理电话、电子邮件和日常任务。它作为智能助理，旨在自动化和优化个人沟通及工作流程，提升效率和管理能力，提供全面的AI支持，简化日常运营。

Capptivo

08/03 04:36

Free open-source screen recorder & demo editor

屏幕录制视频编辑开源

中文介绍 Capptivo 是一款免费的开源屏幕录制和演示编辑工具。它允许用户捕捉屏幕活动并对录制的视频进行编辑，非常适合创建教程、产品演示或任何需要屏幕录制的场景，且具备开源特性，灵活易用。

Zen Whisper

08/03 04:30

On-device Mac dictation that types into any app

语音输入Mac应用效率工具

中文介绍 Zen Whisper 是一款专为 Mac 设计的设备端听写工具。它支持在任何应用程序中进行语音输入，将用户的口述内容直接转换为文本，无需依赖云服务，确保了隐私性和快速响应，提升了Mac设备上的文本输入效率。

UniwebPay Skill

08/03 04:29

Financial Infra for the AI era

金融科技AI支付

中文介绍 UniwebPay Skill 旨在为AI时代提供金融基础设施服务。它构建了支持人工智能应用和业务的支付与交易解决方案，致力于提供适应新兴AI技术生态的金融底层支持，以满足未来AI驱动的经济需求。

NudgeForMe

08/03 04:21

AI follow-up agent for missed email opportunities

AI代理邮件工具效率工具

中文介绍 NudgeForMe 是一款 AI 邮件跟进代理，旨在帮助用户处理错过的邮件机会，确保重要信息得到及时关注和回复，从而提升沟通效率。

FreqWave EQ

08/03 04:18

Customize your web audio with a real-time EQ

音频工具浏览器插件音效

中文介绍 FreqWave EQ 是一款浏览器音频均衡器，允许用户实时自定义和调整网页上的音频输出。它提供了强大的均衡功能，使用户能够根据个人喜好优化各种在线内容的音质，提升网页浏览时的听觉体验。

Cursor 3.0 Tutorial For Beginners (Full Course)

07/31 03:41

AI编程工具Cursor教程

中文介绍 YouTube博主Riley Brown发布了一期60分钟的视频教程，旨在帮助用户高效学习AI编程工具Cursor。该教程声称能将超过1000小时的学习内容浓缩呈现，覆盖从初学者到专业水平，使观看者能在短时间内全面掌握Cursor工具的使用技巧。

Claude Code + Codex Can FINALLY Work Together (Buzz AI)

07/30 04:54

AI模型编程AI工具集成

中文介绍 YouTube博主Riley Brown（通过Buzz AI）发布视频，探讨了人工智能模型Claude Code与Codex现已能够协同工作。该视频内容可能关注如何整合并利用这两种AI模型，以实现更强大的编程辅助功能或解决复杂的编码问题，标志着AI工具协作能力的新进展。

Master Claude for Excel in 20 Minutes (Full Guide)

07/28 07:23

ClaudeExcelAI应用

中文介绍由Riley Brown制作发布的这份YouTube视频教程，旨在为初学者提供一份完整指南，详细讲解如何有效利用人工智能模型Claude来辅助和优化Excel电子表格的操作。该视频内容将聚焦于演示Claude在数据处理、分析等方面的实际应用技巧，帮助用户掌握将AI工具与Excel结合使用的基本方法。

Opus 5 Is Here… But NEW Claude Voice Is Even Bigger

07/26 04:28

大模型语音技术产品更新

中文介绍 YouTube 视频指出，「Opus 5」已推出，但新的 Claude 语音功能「Claude Voice」被认为是更重要的进展。视频强调，虽然「Opus 5」已发布，但这一全新的 Claude 语音技术被认为具有更大的影响力。

OpenAI just released Codex Voice (It's basically Jarvis)

07/25 00:09

大模型语音AI产品发布

中文介绍 OpenAI 发布了其名为 Codex Voice 的新产品。该产品被描述为类似电影中「贾维斯」（Jarvis）的人工智能系统，暗示它可能具备先进的语音交互或智能助手功能，代表了AI在自然语言处理和人机互动方面的新进展。

Codex Basically Runs My Company Now. Here’s How.

07/22 06:27

Devin AI: The Full Beginner’s Guide (Better Than Claude Code?)

07/17 02:25

Director by OpenArt

07/17 02:12

Codex New Browser: The Secret Weapon Everyone Should be Using

07/15 21:00

OpenAI Just Merged ChatGPT and Codex. This Changes Everything.

07/13 01:24

What do AI models actually know?

07/24 23:38

AI原理认知科学

中文介绍该视频探讨了人工智能模型实际「知道」什么的核心问题。内容可能涉及AI模型的知识边界、其学习和理解机制与人类认知的异同，以及AI系统如何获取、处理和表达信息。

Why does AI hallucinate?

07/23 23:42

大模型人工智能幻觉

中文介绍这则由Claude发布的YouTube短视频，探讨了人工智能（AI）出现「幻觉」现象的原因。AI幻觉是指大型语言模型在生成文本时，提供看似合理但实际不准确或虚假信息的问题。该视频旨在解释为何AI会生成不符合事实的内容。

How does AI get its character?

07/23 01:05

人工智能AI原理YouTube

中文介绍由Claude在YouTube发布的一则短视频，探讨了人工智能（AI）如何形成其“性格”。视频以此为主题，讨论AI系统在学习和训练过程中发展出独特行为模式的现象。

What is sycophancy?

07/22 02:19

Making New York City miniature with Claude

07/17 04:56

Build data-driven lesson plans with Claude for Teachers

07/15 23:08

Regenerative beekeeping with Claude

07/15 00:00

Plan smarter with Claude for Teachers

07/14 22:55

The Briefing: AI for Science

07/14 00:02

Building the future of agentic infrastructure

07/11 01:41

What do AI models actually know?

07/24 23:38

AI原理认知科学

Why does AI hallucinate?

07/23 23:42

大模型人工智能幻觉

How does AI get its character?

07/23 01:05

What is sycophancy?

07/22 02:19

Making New York City miniature with Claude

07/17 04:56

Build data-driven lesson plans with Claude for Teachers

07/15 23:08

Regenerative beekeeping with Claude

07/15 00:00

Plan smarter with Claude for Teachers

07/14 22:55

The Briefing: AI for Science

07/14 00:02

Building the future of agentic infrastructure

07/11 01:41

NVIDIA's AI Learns Why Copying Humans Isn't Enough

08/02 23:01

人工智能NVIDIA机器学习

中文介绍英伟达（NVIDIA）的人工智能研究指出，仅模仿人类行为不足以实现某些高级AI目标。该研究可能探讨了AI在复杂任务中，需要超越简单模仿，发展更深层次的学习或决策机制。这揭示了AI发展中，对智能自主性的更高要求。

Kimi K3 Just Broke The Economics Of AI

07/29 20:37

大模型AI经济

中文介绍中文AI模型Kimi的K3版本据称「颠覆了AI经济学」。标题表明Kimi K3在人工智能领域可能带来了成本效益或性能上的重大突破，对AI行业的经济模式产生了显著影响。

Claude Just Revealed AI's Biggest Problem

07/16 23:39

Anthropic Found Something That Shouldn't Exist

07/15 21:58

Minecraft Was Missing One Brilliant Idea

07/12 23:48

DeepSeek's Absolutely Insane AI Speed Hack

07/08 00:33

They Said This Will Never Run In Real Time

07/04 01:19

AI Just Entered A New Era

07/01 13:23

DeepSeek Just Solved AI's Billion Dollar Problem

06/22 23:53

Scientists Found A Better Language For AI Agents

06/19 22:06

冒险岛 083 网页版

08/02 19:40

14 回复 · 程序员节点

Clash 的规则集现在都怎么选？ rule-provider 感觉不够用了？

08/02 16:33

18 回复 · 程序员节点

除了刷 leetcode, 平时大家还会古法编程吗？

08/02 15:29

12 回复 · 程序员节点

似乎开源项目作者不知道/不想利用 AI 来加速自己的开发？

08/02 13:43

48 回复 · 程序员节点

Google Play 订阅的 GPT20 X 封号后退款成功，不过客服说这次是破例

08/02 12:01

10 回复 · 程序员节点

请教：除了 api 直连，有没有更便宜也保质量的办法可以用 deepseek-v4-flash？

08/02 11:09

63 回复 · 程序员节点

codex 现在怎么这么慢了，用的 sol 中、高都很慢

08/02 09:09

23 回复 · 程序员节点

App Store 中国区充值返 10%各位充了多少？

07/29 23:53

47 回复 · Apple 节点

M1 Max 能装 macOS 27 Public Beta 么？

07/29 18:57

16 回复 · Apple 节点

微信在 Mac 上频繁闪退（等）

07/29 18:46

8 回复 · Apple 节点

该源今日无内容。

My personal AI benchmark: "Generate an SVG of a frog with a Habsburg jaw."

08/03 03:42

18 points · 4 comments

EU rules on AI models become enforceable. What's going to change?

08/03 03:40

23 points · 13 comments

German carmakers flood jobs market with managers after wielding axe

08/03 03:24

24 points · 9 comments

Europe EV Sales BEVs Jump 50% & Reach 26% Market Share

08/03 03:20

27 points · 5 comments

'Crush this lady': how eBay harassment campaign led to $56M payout

08/03 03:19

42 points · 7 comments

Harvesting SSH Credentials: Insights from My Honeypot Network

08/03 01:45

23 points · 15 comments

Show HN: NixOS-DGX-Spark – Nix and NixOS on the DGX Spark

08/03 01:05

Try DGX Spark playbooks using Nix on DGX OS, or install NixOS on your DGX Spark for the full Nix experience. The repository provides USB images and a NixOS module with settings for DGX Spark systems.This works on the NVIDIA DGX Spark itself and also on the Asus Ascent GX10.See my 5 minute lightning

Show HN: Kakehashi – Experimental userspace to run macOS binaries on Linux ARM

08/03 00:26

108 points · 27 comments

Rooting, firmware analysis and persistent credentials of TP-Link TL-841N

08/03 00:19

60 points · 7 comments

How the words we teach English language learners changed

08/02 23:41

163 points · 90 comments

Twenty Years of RISC OS Open

08/02 20:36

128 points · 20 comments

F*: A general-purpose proof-oriented programming language

08/02 20:31

119 points · 47 comments

Great Question (YC W21) Is Hiring Senior Demand Gen Manager

08/02 20:01

1 points · 0 comments

Meshdiff – visually compare two STL versions in the browser, client-side

08/02 19:34

154 points · 15 comments

Show HN: Fuse – statically typed functional programming language

08/02 19:23

Hi HN! I've been working on the fuse programming language, it's a statically typed purely functional language with higher-kinder types and ad-hoc polymorphism. It compiles to the GRIN whole-program optimizer, producing LLVM-generated native code.Fuse supports ADTs, Generics, Type Methods,

Artificial Intelligence: Ars Notoria and the Promise of Instant Knowledge

08/02 18:18

114 points · 27 comments

Show HN: Bor – Open-source policy management for Linux desktops

08/02 17:06

Hi HN! I've been working on Bor, an open-source system for centralized Linux desktop management.Bor consists of a lightweight Go agent and a central server. Policies are streamed to clients over mTLS/gRPC in real time—no polling—and currently support Firefox, Chrome, KDE, dconf, polkit and

Karpathy’s Pelican

08/02 12:05

https://xcancel.com/karpathy/status/2083749667410727319

Show HN: I'm a 15 Year Old Wannabe Engineer, This Is a Cycloidal Gearbox I Built

08/02 10:07

304 points · 98 comments

Go 1.27 Interactive Tour

08/02 09:35

335 points · 171 comments

Diátaxis

08/02 04:33

509 points · 57 comments

When transit passes were designed by hand (2022)

07/31 21:38

65 points · 18 comments

Holocloth

07/31 06:59

152 points · 29 comments

Schmitt Trigger: Robust Comparator Design with Hysteresis

07/30 23:12

3 points · 0 comments

Developers are attached to tools because tools encode trust

07/29 22:25

76 points · 33 comments

Fasttracker II clone in C using SDL 2

07/29 14:47

96 points · 31 comments

Folding Paper Globes

07/29 13:53

119 points · 27 comments

Note-Taking and Personal Knowledge Management

07/28 22:21

31 points · 7 comments

Turtle-inspired interactive Python project

07/28 21:00

13 points · 3 comments

Norway Salmon

07/28 09:54

116 points · 73 comments

v2.1.220

07/25 09:35

What's changed Bug fixes and reliability improvements

版本更新bug修复Claude

中文介绍 Anthropic 旗下的 Claude Code 项目发布了 v2.1.220 版本。此次更新主要内容为错误修复和可靠性改进，旨在提升软件的稳定性和用户体验。

v2.1.219

07/25 01:14

What's changed Added Claude Opus 5 (claude-opus-5), now the default Opus model — 1M context, fast mode at $10/$50 per Mtok Added sandbox.network.strictAllowlist setting to deny non-allowlisted hosts for sandboxed commands without prompting Added DirectoryAdded hook that fires after /add-dir or the S

大模型软件更新安全

中文介绍 Anthropic的Claude-code项目发布v2.1.219更新。此版本引入了Claude Opus 5（claude-opus-5）作为默认Opus模型，具备1M上下文。其快速模式定价为每百万token $10/$50。同时，更新新增“sandbox.network.strictAllowlist”设置，增强沙盒命令的网络安全。

v2.1.218

07/23 05:24

What's changed Changed /code-review to run as a background subagent, so review work no longer fills your conversation and keeps stacked slash commands as its review target Added screen-reader announcements of deleted text for word and line deletions (Option+Delete, Ctrl+W, Cmd+Backspace, Ctrl+U, Ctr

v2.1.217

07/22 05:35

What's changed Added emoji shortcode autocomplete in the prompt input: type :heart: to insert ❤️, or :hea for suggestions — disable with the emojiCompletionEnabled setting Added warnings when transcript writes are failing (e.g. disk full) or when session saving is off due to an inherited environment

v2.1.216

07/21 06:14

What's changed Added sandbox.filesystem.disabled setting to skip filesystem isolation while keeping network egress control Fixed a slowdown in long sessions where message normalization cost grew quadratically with the number of turns, causing multi-second stalls and slow resumes Fixed auto mode deny

v2.1.215

07/19 10:56

What's changed Claude no longer runs the /verify and /code-review skills on its own; invoke them with /verify or /code-review when you want them

v2.1.214

07/18 09:20

What's changed Fixed single-segment dir/** allow rules like Edit(src/**) auto-approving writes to nested dir/ directories anywhere in the tree instead of only /dir Fixed a permission-check bypass affecting commands run in Windows PowerShell 5.1 sessions Fixed Bash permission checks to fail closed on

v2.1.212

07/17 08:26

What's changed /fork now copies your conversation into a new background session (its own row in claude agents) while you keep working; the in-session subagent it used to launch is now /subtask Added claude auto-mode reset to restore the default auto-mode configuration, with a confirmation prompt (pa

v2.1.211

07/16 07:02

What's changed Added --forward-subagent-text flag and CLAUDE_CODE_FORWARD_SUBAGENT_TEXT environment variable to include subagent text and thinking in stream-json output Fixed permission previews relayed to chat channels not neutralizing bidirectional-override, zero-width, and look-alike quote charac

v2.1.210

07/15 07:45

What's changed Added a live elapsed-time counter to the collapsed tool summary line so long-running tool calls visibly tick instead of looking stuck Added a startup warning for Write(path), NotebookEdit(path), and Glob(path) permission rules — use Edit(path) or Read(path) instead Fixed isolation: 'w

0.147.0-alpha.4

08/01 01:58

Release 0.147.0-alpha.4

版本发布OpenAICodex

中文介绍 OpenAI Codex发布了其Rust组件的0.147.0-alpha.4预览版本更新。这是该项目在GitHub上的一个新发行版，属于alpha测试阶段。

0.147.0-alpha.3

07/31 23:39

Release 0.147.0-alpha.3

版本发布OpenAICodex

中文介绍 OpenAI Codex发布了其Rust组件的0.147.0-alpha.3预览版本更新。此发行版在GitHub上公布，标志着该组件的alpha测试进展。

0.147.0-alpha.1.1

07/31 17:51

Release 0.147.0-alpha.1.1

版本发布OpenAICodex

中文介绍 OpenAI Codex发布了其Rust组件的0.147.0-alpha.1.1预览版本更新。此版本已在GitHub上发布，是Codex Rust组件的alpha测试版本。

0.147.0-alpha.2

07/30 09:07

Release 0.147.0-alpha.2

软件发布OpenAICodex

中文介绍 OpenAI旗下的Codex项目近日发布了其Rust语言版本的最新更新，版本号为v0.147.0-alpha.2。此次发布代表了该项目在Rust生态系统中的持续迭代和发展。

0.146.0-alpha.9.2

07/30 07:57

Release 0.146.0-alpha.9.2

软件发布OpenAICodex

中文介绍 OpenAI旗下的Codex项目发布了其Rust语言版本的更新，版本号为v0.146.0-alpha.9.2。此次发布是该项目在Rust生态系统中的又一次迭代。

0.146.0-alpha.9.1

07/30 07:00

Release 0.146.0-alpha.9.1

软件发布OpenAICodex

中文介绍 OpenAI旗下的Codex项目发布了其Rust语言版本的更新，版本号为v0.146.0-alpha.9.1。这标志着Codex项目在Rust生态系统中的持续发展。

0.147.0-alpha.1

07/29 17:16

Release 0.147.0-alpha.1

软件发布测试版OpenAI

中文介绍 OpenAI Codex近日发布了其`rust-v0.147.0-alpha.1`版本。此为该项目的一个早期测试阶段性发布，通常用于内部测试或有限用户试用，以收集反馈并进一步完善功能。

0.146.0

07/29 09:44

New Features Name new sessions with /new or /clear, pin important threads, and switch between side conversations without closing them. (#34605, #34840, #35011) Support Agent Plugins manifests, workspace plugin publishing, and additional plugin marketplaces for Amazon Bedrock and Claude Code. (#35105

软件发布新功能插件

中文介绍 OpenAI Codex正式发布`rust-v0.146.0`版本，带来多项新功能。用户现在可通过`/new`或`/clear`命令命名新会话、置顶重要线程，并在不关闭侧边对话的情况下进行切换。新版本还支持Agent插件清单、工作区插件发布，并增加了对Amazon Bedrock等额外插件市场的支持。

rusty-v8-v150.4.0

07/29 08:28

Update rusty_v8 to 150.4.0 (#35831) ## What changed - Upgrade the Rust `v8` crate to `150.4.0` and the Bazel V8 source to `15.0.245.2`. - Refresh the prebuilt archives, checksums, LLVM source revisions, Bazel targets, and downstream V8 patches for the new release. - Expose the pinned llvm-libc heade

软件更新依赖升级V8引擎

中文介绍 OpenAI Codex发布`rusty-v8-v150.4.0`版本，主要更新了其核心依赖。该版本将Rust `v8` crate升级至`150.4.0`，并将Bazel V8源更新至`15.0.245.2`。同时，为配合新版本，还刷新了预构建归档、校验和、LLVM源修订等相关组件。

rust-v0.146.0-alpha.16

07/29 07:32

Release 0.146.0-alpha.16

软件发布测试版OpenAI

中文介绍 OpenAI Codex发布了`rust-v0.146.0-alpha.16`版本。这是该项目在`0.146.0`主版本之前的一个第16个alpha测试版本，旨在逐步引入新功能并进行内部测试和验证。

今日主题

今日AI圈焦点汇聚于模型能力突破、智能体应用拓展及行业挑战。Claude Opus 5实现游戏原型一键生成，AI代理通过“记忆教练”提升长任务稳定性，同时OpenAI和MoonPay加速企业级和金融AI服务落地。然而，AI滥用对网络安全和艺术版权的冲击，以及欧盟对AI互动的披露新规，提醒行业在技术普惠与伦理监管之间寻求平衡。

模型发布/更新

Model Releases 44 篇

AirLLM 优化大模型推理，4GB显存可运行700亿参数模型

开源项目GitHub Trending

AirLLM 项目致力于解决大型语言模型（LLM）推理的硬件瓶颈，使其能够在仅有4GB显存的单张GPU上高效运行700亿参数级别的模型。该项目通过优化模型加载、量化和计算方法，显著降低了LLM推理所需的硬件资源，为在本地设备、低成本硬件或边缘设备上部署复杂AI应用提供了切实可行的解决方案，极大地拓展了LLM的部署场景和可及性。

LLM推理显存优化开源

antirez 发布 DeepSeek 4 本地高性能推理引擎 ds4

开源项目GitHub Trending

知名开发者 antirez 推出的 `ds4` 项目是一款高性能的本地推理引擎，专为 DeepSeek 4 Flash 和 PRO 系列大型语言模型而设计。该引擎利用先进的硬件加速技术，全面支持 Apple 的 Metal、NVIDIA 的 CUDA 以及 AMD 的 ROCm 平台。它使用户能够在个人设备上高效运行 DeepSeek 4 模型，为对数据隐私、运行成本或离线可用性有严格要求的开发者和研究人员提供了本地化LLM推理的强大能力。

LLM本地推理开源

Meta AI 引入“记忆教练”代理，提升长任务处理稳定性

研究聚合The Decoder

Meta AI 正在探索一项创新机制，即利用第二个AI智能体充当“记忆教练”，以防止主智能体在处理复杂、长时间任务时遗忘已诊断的错误并重复失败步骤。这个独立的记忆智能体负责维护一个结构化的记忆库，并能在必要时提醒主智能体，显著提升了其任务完成的表现和稳定性。这项研究成果为开发更鲁棒、更具上下文感知能力的AI代理提供了新的思路。

AI智能体AI模型长期记忆

Claude Opus 5 实现单提示词生成完整3D游戏原型

研究聚合The Decoder

Anthropic 发布的 Claude Opus 5 展示了惊人的能力，能够仅凭单个提示词就生成完整的 3D 游戏原型，涵盖了第一人称射击、卡丁车竞速和《我的世界》克隆版等多种类型。这款AI模型无需任何外部素材，能直接生成游戏的几何结构、纹理、物理效果乃至部分音乐代码，并可在浏览器中运行。这标志着AI在游戏开发领域取得了从概念到可玩原型的飞跃，极大地降低了游戏创作的门槛。

AI游戏生成式AI大模型

产品发布/更新

Product 44 篇

MoonPay 推出专为AI设计的支付保管库 PayBox

X·KOLX 推文 (AttentionVC)

金融科技公司 MoonPay 正式推出 PayBox，这是首个专为AI设计的支付保管库。该产品旨在赋予用户的AI代理管理和执行支付的能力，实现法币与数字资产间的无缝价值转移。PayBox 的推出进一步推动了AI在金融领域的应用深度与广度，解决了AI代理进行金融操作的信任和安全问题，为未来AI驱动的经济模式奠定了基础设施。

AI支付金融科技产品发布

Zinley：您的个人AI代表，助力日常事务管理

产品榜单Product Hunt

Zinley 是一款创新的个人AI代表工具，旨在全方位协助用户处理电话、电子邮件和日常任务。它作为智能助理，能够自动化和优化用户的个人沟通与工作流程，从而显著提升效率和管理能力。Zinley 为用户提供全面的AI支持，简化了繁琐的日常运营，让用户能够更专注于核心工作，是寻求提升个人生产力的理想选择。

AI助手效率工具自动化

Lumichats：免费离线代码助手，替代 Claude Code

产品榜单Product Hunt

Lumichats 是一款免费的离线代码助手，定位为 Claude Code 的替代品，专为不习惯使用终端的开发者设计。它提供了一个无需命令行即可进行代码交互和辅助的图形化环境，旨在极大提升开发的便利性和效率。作为一款离线工具，Lumichats 还能确保用户代码和数据的隐私安全，是寻求便捷、私密编码辅助的开发者的理想选择。

代码工具AI助手离线应用

OpenAI Presence 助力企业级AI智能体落地生产

研究聚合The Decoder

OpenAI 推出新的企业级产品 Presence，旨在帮助企业将AI智能体顺利投入实际生产环境，特别适用于客户服务和内部工作流程的自动化。与现有的 Workspace Agents 不同，Presence 主要面向外部部署场景。对于复杂的实施案例，OpenAI 的工程师团队还将提供专业的介入支持，确保企业能够高效、稳定地利用AI智能体提升运营效率和用户体验。

OpenAIAI智能体企业服务

行业动态

Industry 44 篇

AI时代优秀软件仍有空间，App Store涌现“隐藏佳作”

综合资讯TechCrunch

尽管有预测认为AI智能体可能取代传统应用程序，但App Store上仍以空前速度涌现出大量创新软件。文章指出，这些“隐藏佳作”包括更智能的书签工具、邻里市场、数字笔友和自然日记等，有力证明了在AI技术蓬勃发展的时代，优秀软件依然拥有广阔的发展空间。这表明，AI并非终结传统软件，而是催生了更多创新和优化用户体验的可能性。

App StoreAI影响软件开发

艺术家报酬能否平息对生成式AI版权争议？

综合资讯The Verge

多年来，插画师们对生成式人工智能初创公司未经许可使用艺术家作品训练模型的行为表达强烈担忧，认为此举无异于盗窃。文章探讨了向艺术家支付报酬是否足以让他们接受AI技术。这一讨论的核心在于如何平衡版权保护与技术创新之间的关系，以及经济补偿能否真正解决艺术家们在伦理和经济层面的顾虑，推动AI艺术创作的健康发展。

AI艺术版权伦理

欧盟新规促使欧洲民众深切感知AI融入日常生活

综合资讯Wired AI

欧盟出台的新规要求民众在与AI互动或接触由AI生成/编辑的内容时必须被明确告知，此举可能让欧洲人前所未有地意识到AI在其日常生活中根深蒂固的程度。然而，该规定也引发了“披露疲劳”的担忧，即用户可能因频繁收到提示而感到厌烦。这一政策旨在提升透明度，但也带来了如何有效传达信息而不造成用户体验负担的新挑战。

欧盟AI监管政策

AI滥用致苹果漏洞赏金计划受阻，真实漏洞报告被淹没

研究聚合The Decoder

苹果公司的漏洞赏金计划因充斥大量由AI生成的虚假报告而陷入困境，导致该公司不得不限制每位研究人员的提交数量。结果，意大利初创公司 Bynario 最初未能成功提交一个价值20万美元的严重 macOS 漏洞。这一事件凸显了AI技术滥用对网络安全报告流程造成的负面影响，提醒行业需要有效策略来过滤AI生成内容，确保真实且重要的安全威胁能够得到及时处理。

网络安全AI滥用苹果

技巧与观点

Tips & Takes 44 篇

微软发布“AI for Beginners”：全面AI入门课程

开源项目GitHub Trending

Microsoft 推出的“AI-For-Beginners”是一个为期12周、包含24节课程的全面人工智能学习路径。该课程旨在普及AI知识，面向所有希望入门AI的学习者，涵盖了人工智能的核心概念、机器学习基础、深度学习、自然语言处理和计算机视觉等关键领域。它通过实践项目和清晰的讲解，帮助初学者逐步建立AI知识体系和动手能力，有效降低了传统AI学习门槛，是理想的系统学习资源。

AI教程学习资源机器学习

Agent-Reach 赋予AI代理全网“观察”能力

开源项目GitHub Trending

Agent-Reach 项目赋予AI代理“观察”整个互联网的能力，使其能够无缝阅读和搜索来自 Twitter、Reddit、YouTube、GitHub、Bilibili、小红书等主流平台的公开信息。该项目通过简洁的命令行界面（CLI），帮助AI代理以低成本方式获取实时网络数据，克服了传统API限制和高昂费用。它解决了AI代理信息获取的瓶颈问题，极大地扩展了代理的应用场景，适用于需要广泛网络数据支持的研究和内容分析。

AI代理数据获取开源

8G显卡本地AI生图实测：42种风格效果详尽分析

X·KOLX 创作者 (AttentionVC)

一位博主深入实测了使用8G显卡在本地生成未审查AI图片的可行性，并详细分享了42种不同风格的生图效果与性能表现。这项测试为低配置硬件用户提供了本地AI绘画的实用参考，证明了即使显存有限，用户仍能通过优化配置和选择合适的模型，实现高质量的AI图像生成。该实测分享了宝贵的经验和数据，帮助用户更好地利用现有硬件进行AI创作。

AI绘画本地部署显卡优化

英伟达研究：AI超越模仿，寻求深层学习机制

大咖博客Two Minute Papers

英伟达（NVIDIA）的人工智能研究指出，仅模仿人类行为不足以实现某些高级AI目标。该研究可能深入探讨了AI在处理复杂任务时，需要超越简单的行为复制，发展更深层次的学习或决策机制，以应对现实世界的多变性和不确定性。这揭示了AI发展中，对智能自主性和创造性解决问题能力的更高要求，预示着未来AI模型将更加注重内在逻辑和推理能力而非表面模仿。

人工智能机器学习AI研究

今日产品趋势

今天产品发布的核心脉络是 AI Agent 生态系统的深化与工具链的完善。新产品涵盖了赋予 Agent 更强感知与执行能力的底层技术，以及面向特定领域和本土化需求的创新应用，同时 Agent 市场化趋势也初现端倪，预示着 AI 助手将走向更细分、更智能的服务阶段。

今日必看

Must See 22 款

Agent-Reach — 赋予 AI 代理浏览互联网的能力

开源项目GitHub Trending

Agent-Reach 为 AI 代理提供了「观察」整个互联网的能力，使其能无缝访问 Twitter、Reddit、YouTube、GitHub 等主流平台的公开信息。该项目通过一个简洁的命令行界面 (CLI)，帮助 AI 代理以低成本方式获取实时网络数据，克服了传统 API 限制和高昂费用。它解决了 AI 代理信息获取的瓶颈问题，极大地扩展了代理的应用场景，对于需要广泛网络数据支持的研究、内容分析及趋势监测类 AI 应用开发者尤其有用。

AI代理数据获取开源

DeepSeek-Reasonix — 终端原生 AI 编程助手

开源项目GitHub Trending

DeepSeek-Reasonix 是一个专为终端用户设计的 DeepSeek 原生 AI 编码代理。它旨在为开发者在命令行界面提供智能编程辅助，例如代码生成、补全或问题解答。该项目特别强调了其「前缀缓存稳定性」工程优化，确保代理可以长时间运行并保持高效，减少重复计算。这使得开发者能够无缝地在终端内获取 AI 协助，从而提升编码效率和开发体验，尤其适合习惯命令行工作流的专业人士。

AI编程LLM终端工具

开发者工具

Dev Tools 22 款

Openwork — 开源自部署 AI 协作平台

开源项目GitHub Trending

Openwork 是一个开源项目，旨在作为类似 Claude Cowork 的 AI 协作助手的替代方案。它提供了一个可定制和私有化部署的平台，帮助团队进行内容创作、信息总结、任务管理等，解决了对专有 AI 工具数据隐私、厂商锁定及定制化不足的担忧。主要面向希望构建自主可控 AI 协作环境的企业、开发者或寻求增强数据安全性的团队，支持集成各类开源大语言模型，提供灵活的AI协作解决方案。

AI协作开源企业工具

Lumichats — 免费离线代码辅助工具

产品榜单Product Hunt

Lumichats 是一款免费的离线代码助手，定位为 Claude Code 的替代品。它专为不习惯使用终端的用户设计，提供了一个无需命令行即可进行代码交互和辅助的环境。该工具支持离线操作，确保数据隐私，并通过直观的用户界面降低了 AI 辅助编程的门槛。它旨在提升非命令行用户的开发便利性和效率，使更多人能够轻松利用 AI 进行代码开发和调试。

代码工具AI助手离线应用

创作与效率

Creative & Productivity 44 款

Zinley — 您的个人 AI 代表

产品榜单Product Hunt

Zinley 是一款个人 AI 代表，能够协助用户处理电话、电子邮件和日常任务。它作为智能助理，旨在自动化和优化个人沟通及工作流程，提升效率和管理能力，提供全面的 AI 支持，简化日常运营。Zinley 通过理解用户意图并执行相应操作，将人们从重复性工作中解放出来，从而让他们专注于更重要的事务，是个人生产力提升的强大工具。

AI助手效率工具自动化

Zen Whisper — Mac 设备端语音输入

产品榜单Product Hunt

Zen Whisper 是一款专为 Mac 设计的设备端听写工具。它支持在任何应用程序中进行语音输入，将用户的口述内容直接转换为文本，无需依赖云服务，确保了隐私性和快速响应。这款工具利用本地 AI 模型，提供了卓越的识别准确性和低延迟，极大地提升了 Mac 设备上的文本输入效率，特别适合需要频繁进行文本输入同时关注数据安全和速度的用户。

语音输入Mac应用效率工具

Finamie — 语音驱动的消费洞察助手

产品榜单Product Hunt

Finamie 是一款个人财务管理工具，允许用户通过语音记录日常开销。它能即时提供消费洞察，帮助用户更好地了解和管理自己的资金流向，实现智能财务追踪，从而提升个人理财效率。该产品通过结合语音识别和 AI 分析，简化了记账流程，让用户能更直观、便捷地掌控财务状况，是追求智能便捷理财体验的理想选择。

财务管理语音输入消费分析

NudgeForMe — AI 邮件跟进代理

产品榜单Product Hunt

NudgeForMe 是一款 AI 邮件跟进代理，旨在帮助用户处理错过的邮件机会，确保重要信息得到及时关注和回复，从而提升沟通效率。它能智能识别需要跟进的邮件，并自动发送提醒或起草回复，大大减轻了用户手动处理邮件的负担。这款 AI 代理特别适合那些邮件量大、容易错过重要信息，希望提升邮件管理和响应速度的专业人士。

AI代理邮件工具效率工具

新鲜实验

Emerging 22 款

Kopai — AI 代理服务市场

产品榜单Product Hunt

Kopai 是一个专注于 AI 代理的市场平台。用户可以在此分享他们的专业知识，并由平台的智能代理代为创造收益。该平台旨在连接专业人士与 AI 技术，实现知识变现，通过提供一个发现、购买和部署 AI 代理的中心化市场，预示着 AI 代理经济的兴起。Kopai 为那些希望通过 AI 技术拓展业务或寻求自动化解决方案的企业和个人提供了新的商业模式和机会。

AI代理AI市场商业模式

Bolcho AI — 印度本土化语音 AI 代理构建平台

产品榜单Product Hunt

Bolcho AI 平台致力于帮助用户构建能够真实「讲印度语」的语音 AI 代理。它专注于提供适应印度多种语言、口音及文化背景的 AI 语音解决方案，旨在增强 AI 代理在印度市场的本土化交流能力和用户体验。该平台通过定制化的语言模型和语音合成技术，解决了通用 AI 语音产品在特定文化语境下的不足，对于希望在印度市场部署智能语音应用的开发者和企业具有重要意义。

AI语音本地化印度市场

→ 查看产品库