第4期 | Liberate your OpenClaw
今日摘要
GitHub karpathy:Karpathy 新发布的最小 ChatGPT 复现项目,训练到推理的完整栈只有几千行可读代码,目标是把“百美元跑一个 ChatGPT”压到个人可动手的范围。
GitHub karpathy:Karpathy 早期的教学级 GPT 实现,代码短到可以一口气读完,长期用作理解 Transformer 训练与推理最短路径的入口。
GitHub anthropics:Anthropic 公开其内部工程师 take-home 面试题,可作为理解他们工程品味和评估标准的一手材料。
GitHub openai:OpenAI 新开源的多 agent 编排框架,重点不是写代码的 coding agent,而是任务隔离、委派与团队级协作。
GitHub openai:OpenAI 官方示例库更新,通常折射出他们希望开发者优先采用的新模式(tool use、structured output、responses API 等)。
总结 + 观点:Karpathy 用 Rust 重写的 BPE tokenizer 训练器,把 tiktoken…|中文观点:rustbpe 补上了 tokenizer 训练这块的“黑盒”:它让 tokenizer…
总结 + 观点:OpenAI 官方 Python SDK 更新,通常先于公告暴露出新接口细节、参数变化或默认路径…|中文观点:官方 SDK 的 commit 经常是 API 方向的早期指示灯,对做集成和多模型平台的团…
总结 + 观点:Anthropic 官方的 Claude Agent SDK 示例仓库,覆盖代码 agent、文…|中文观点:demos 仓库往往比文档更早暴露 SDK 的边界和推荐模式,对正在选型 agent 栈的…
总结 + 观点:Anthropic 的官方交互式 prompt 工程教程,沿用他们内部训练素材的结构,适合团队系…|中文观点:它价值不在炫技,而在把 prompt 工程从“艺术”收敛成“可教可测”。对刚上手 Clau…
总结 + 观点:OpenAI Full Fan Mode 比赛规则页面,覆盖参赛条件、评判、奖项等。|中文观点:这类 marketing 页值得收录的理由只有一个:它暴露 OpenAI 把产品往哪种用户…
karpathy/nanochat
标签:#github_orgs #extended
作者:
原文:Karpathy 新发布的最小 ChatGPT 复现项目,训练到推理的完整栈只有几千行可读代码,目标是把“百美元跑一个 ChatGPT”压到个人可动手的范围。
karpathy/minGPT
标签:#github_orgs #extended
作者:
原文:Karpathy 早期的教学级 GPT 实现,代码短到可以一口气读完,长期用作理解 Transformer 训练与推理最短路径的入口。
anthropics/original_performance_takehome
标签:#github_orgs #extended
作者:
原文:Anthropic 公开其内部工程师 take-home 面试题,可作为理解他们工程品味和评估标准的一手材料。
链接:https://github.com/anthropics/original_performance_takehome
openai/symphony
标签:#github_orgs #extended
作者:
原文:OpenAI 新开源的多 agent 编排框架,重点不是写代码的 coding agent,而是任务隔离、委派与团队级协作。
openai/openai-cookbook
标签:#github_orgs #extended
作者:
原文:OpenAI 官方示例库更新,通常折射出他们希望开发者优先采用的新模式(tool use、structured output、responses API 等)。
karpathy/rustbpe
标签:#github_orgs #extended
作者:
原文:Karpathy 用 Rust 重写的 BPE tokenizer 训练器,把 tiktoken 里不透明的训练流程变成可学习、可实验的代码。
openai/openai-python
标签:#github_orgs #extended
作者:
原文:OpenAI 官方 Python SDK 更新,通常先于公告暴露出新接口细节、参数变化或默认路径调整。
anthropics/claude-agent-sdk-demos
标签:#github_orgs #extended
作者:
原文:Anthropic 官方的 Claude Agent SDK 示例仓库,覆盖代码 agent、文件编辑、工具链编排等典型用法。
anthropics/prompt-eng-interactive-tutorial
标签:#github_orgs #extended
作者:
原文:Anthropic 的官方交互式 prompt 工程教程,沿用他们内部训练素材的结构,适合团队系统补齐 prompt 基础功。
链接:https://github.com/anthropics/prompt-eng-interactive-tutorial
OpenAI Full Fan Mode Contest: Terms Conditions
标签:#ai_engineering_blogs #core
作者:
原文:OpenAI Full Fan Mode 比赛规则页面,覆盖参赛条件、评判、奖项等。
链接:https://openai.com/index/full-fan-mode-contest-terms-conditions
A New Framework for Evaluating Voice Agents (EVA)
标签:#ai_engineering_blogs #core
作者:
原文:A New Framework for Evaluating Voice Agents (EVA)
How Kensho built a multi-agent framework with LangGraph to solve trusted financial data retrieval
标签:#ai_engineering_blogs #core
作者:
原文:Discover how Kensho, S&P Global’s AI innovation engine, leveraged LangGraph to create its Grounding framework–a unified agentic access layer solving fragmented financial data retrieval at enterprise scale.
How we build evals for Deep Agents
标签:#ai_engineering_blogs #core
作者:
原文:TLDR: The best agent evals directly measure an agent behavior we care about. Here's how we source data, create metrics, and run well-scoped, targeted experiments over time to make agents more accurate and reliable. Evals shape agent behavior We've been curating evaluations to measure and
链接:https://blog.langchain.com/how-we-build-evals-for-deep-agents/
Agent Evaluation Readiness Checklist
标签:#ai_engineering_blogs #core
作者:
原文:A practical checklist for agent evaluation: error analysis, dataset construction, grader design, offline online evals, and production readiness.
链接:https://blog.langchain.com/agent-evaluation-readiness-checklist/
Liberate your OpenClaw
标签:#ai_engineering_blogs #core
作者:
原文:Liberate your OpenClaw
AI for American-Produced Cement and Concrete
标签:#engineering_ai_infra_blogs #extended
作者:
原文:Meta is continuing its long-term roadmap to help the construction industry leverage AI to produce high-quality and more sustainable concrete mixes, as well as those exclusively produced in the United States. Concurrent with the 2026 American Concrete Institute (ACI) Spring Convention, Meta is releasing a new AI model for designing concrete mixes Bayesian Optimization Read More... The post AI for American-Produced Cement and Concrete appeared first on Engineering at Meta
Latest open artifacts (#20): New orgs! New types of models!
标签:#hidden_high_value #hidden_high_value
作者:
原文:New orgs! New types of models! With Nemotron Super, Sarvam, Cohere Transcribe, others
链接:https://www.interconnects.ai/p/latest-open-artifacts-20-new-orgs
Announcing the LangChain MongoDB Partnership: The AI Agent Stack That Runs On The Database You Already Trust
标签:#ai_engineering_blogs #core
作者:
原文:Build production AI agents on MongoDB Atlas with vector search, persistent memory, natural-language querying, and end-to-end observability built in.
Meta Adaptive Ranking Model: Bending the Inference Scaling Curve to Serve LLM-Scale Models for Ads
标签:#engineering_ai_infra_blogs #extended
作者:
原文:Meta continues to lead the industry in utilizing groundbreaking AI Recommendation Systems (RecSys) to deliver better experiences for people, and better results for advertisers. To reach the next frontier of performance, we are scaling Meta’s Ads Recommender runtime models to LLM-scale & complexity to further a deeper understanding of people’s interests and intent. This increase Read More... The post Meta Adaptive Ranking Model: Bending the Inference Scaling Curve to Serve LLM-Scale Models for Ads appeared first on Engineering at Meta
Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents
标签:#ai_engineering_blogs #core
作者:
原文:Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents