Issue 37 | Show HN: I built a Cargo-like build tool for C/C++
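Today's Summary
OpenAI Blog: A pilot program to support independent safety and alignment research and develop the next generation of talent
OpenAI Blog: Explore our ambitious, people-first industrial policy ideas for the AI era, focused on expanding opportunity, sharing prosperity, a…
OpenAI Engineering: The OpenAI engineering listing shows the Responses API gaining an agent computer environment, which suggests model calls are moving toward a more complete agent runtime
OpenAI Engineering: OpenAI RSS Model Spec agent
Anthropic Engineering: Anthropic on agentic coding benchmarks: the variation can even exceed the gap between models on the leaderboard, which bears on agent eval
Summary + Opinion: Anthropic harness: design questions around agent runtime, context, and safety boundaries. | Opinion: From Harness design for long-running application…
Summary + Opinion: Karpathy 2025 LLM RLVR reasoning test-time compu… | Opinion: More than the surface-level parameters, what needs watching in 2025 LLM Year in Review is whether, in reasoning quality, …
Summary + Opinion: OpenAI outlines the next phase of enterprise AI,… | Opinion: For The next phase of enterprise AI, what really matters is whether it will…
Summary + Opinion: AI agents running research on single-GPU nanocha… | Opinion: For karpathy/autoresearch, what really matters is whether it will affect a team's model selection, …
Summary + Opinion: Public repository for Agent Skills | Opinion: The point of anthropics/skills is not novelty but whether it can improve engineering efficiency, deployment stability…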
Announcing the OpenAI Safety Fellowship
Tags: #ai_engineering_blogs #core
Author:
Original: A pilot program to support independent safety and alignment research and develop the next generation of talent
Link: https://openai.com/index/introducing-openai-safety-fellowship
Industrial policy for the Intelligence Age
Tags: #ai_engineering_blogs #core
Author:
Original: Explore our ambitious, people-first industrial policy ideas for the AI era, focused on expanding opportunity, sharing prosperity, and building resilient institutions as advanced intelligence evolves.
Link: https://openai.com/index/industrial-policy-for-the-intelligence-age
From model to agent: Equipping the Responses API with a computer environment
Tags: #uncategorized #core
Author:
Original: The OpenAI engineering listing shows the Responses API gaining an agent computer environment, which suggests model calls are moving toward a more complete agent runtime
Link: https://openai.com/index/equip-responses-api-computer-environment/
Inside our approach to the Model Spec
Tags: #uncategorized #core
Author:
Original: OpenAI RSS Model Spec agent
Quantifying infrastructure noise in agentic coding evals
Tags: #uncategorized #core
Author:
Original: Anthropic on agentic coding benchmarks: the variation can even exceed the gap between models on the leaderboard, which bears on agent eval
Link: https://www.anthropic.com/engineering/infrastructure-noise
Harness design for long-running application development
Tags: #uncategorized #core
Author:
Original: Anthropic harness: design questions around agent runtime, context, and safety boundaries.
Link: https://www.anthropic.com/engineering/harness-design-long-running-apps
2025 LLM Year in Review
Tags: #uncategorized #core
Author:
Original: Karpathy 2025 LLM RLVR reasoning test-time compute
The next phase of enterprise AI
Tags: #ai_engineering_blogs #core
Author:
Original: OpenAI outlines the next phase of enterprise AI, as adoption accelerates across industries with Frontier, ChatGPT Enterprise, Codex, and company-wide AI agents.
karpathy/autoresearch
Tags: #github_orgs #extended
Author:
Original: AI agents running research on single-GPU nanochat training automatically
anthropics/skills
Tags: #github_orgs #extended
Author:
Original: Public repository for Agent Skills
openai/skills
karpathy/KarpathyTalk
Tags: #github_orgs #extended
Author:
Original: A positive developer community for builders and agents.
anthropics/claude-cookbooks
Tags: #github_orgs #extended
Author:
Original: A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
openai/evals
Tags: #github_orgs #extended
Author:
Original: Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
anthropics/claude-plugins-official
Tags: #github_orgs #extended
Author:
Original: Official, Anthropic-managed directory of high quality Claude Code Plugins.
Research-Driven Agents: What Happens When Your Agent Reads Before It Codes
Tags: #research_community #core
Author:
Original: The SkyPilot team describes "research-driven agents": the agent does a literature review and background research before writing code.
Show HN: I built a Cargo-like build tool for C/C++
Tags: #research_community #core
Author:
Original: A Cargo-like build tool for C/C++ that aims to bring the streamlined dependency-management experience of the Rust ecosystem to C/C++.
Escaping the Fork: How Meta Modernized WebRTC Across 50+ Use Cases
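Tags: #engineering_ai_infra_blogs #extended
Author:
Original: Meta describes how it merged the WebRTC forks that had diverged across 50+ products back into mainline, reducing the long-term maintenance burden.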
Emperor penguin and Antarctic fur seal now endangered
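Tags: #research_community #core
Author:
Original: The IUCN has listed the emperor penguin and the Antarctic fur seal as endangered, mainly because climate change is degrading their habitat.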
Deep Agents Deploy: an open alternative to Claude Managed Agents
Tags: #ai_engineering_blogs #core
Author:
Original: LangChain's Deep Agents Deploy has entered beta, positioned as a model-agnostic open-source agent harness and an alternative to Claude Managed Agents.
Link: https://blog.langchain.com/deep-agents-deploy-an-open-alternative-to-claude-managed-agents/