Short-term Memory(短期记忆)
返回概述
Memory(记忆)是一个记住之前交互信息的系统。对于 AI Agent 来说,记忆至关重要,因为它让它们能够记住之前的交互、从反馈中学习并适应用户偏好。随着 Agent 处理更复杂的任务和大量用户交互,这种能力对于效率和用户满意度都变得必不可少。
Short-term Memory(短期记忆)让你的应用程序能够在单个 thread 或对话中记住之前的交互。
Conversation History(对话历史)是最常见的短期记忆形式。长对话对当今的 LLM 构成挑战;完整的历史可能不适合 LLM 的 context window,导致 context loss 或错误。
即使你的模型支持完整的 context length,大多数 LLM 在长上下文中仍然表现不佳。它们会被过时的或离题的内容”分散注意力”,同时还会遭受更慢的响应时间和更高的成本。
Chat Models 使用 messages 接受 context,其中包括指令(system message)和输入(human messages)。在 chat 应用中,消息在 human inputs 和 model responses 之间交替,导致消息列表随时间增长。由于 context windows 有限,许多应用可以从使用技术来移除或”忘记”过时信息中受益。
使用
要为 Agent 添加短期记忆(thread-level persistence),你需要在创建 Agent 时指定 checkpointer。
通过将这些存储在 graph 的 state 中,Agent 可以访问给定对话的完整 context,同时保持不同 threads 之间的分离。
State 使用 checkpointer 持久化到数据库(或内存),因此 thread 可以随时恢复。
当调用 Agent 或完成步骤(如 tool call)时,短期记忆会更新,并且在每个步骤开始时读取 state。
:::python
from langchain.agents import create_agent
from langgraph.checkpoint.memory import InMemorySaver # [!code highlight]
agent = create_agent(
"gpt-5",
tools=[get_user_info],
checkpointer=InMemorySaver(), # [!code highlight]
)
agent.invoke(
{"messages": [{"role": "user", "content": "Hi! My name is Bob."}]},
{"configurable": {"thread_id": "1"}}, # [!code highlight]
)
::: :::js
import { createAgent } from "langchain";
import { MemorySaver } from "@langchain/langgraph";
const checkpointer = new MemorySaver();
const agent = createAgent({
model: "claude-sonnet-4-6",
tools: [],
checkpointer,
});
await agent.invoke(
{ messages: [{ role: "user", content: "hi! i am Bob" }] },
{ configurable: { thread_id: "1" } }
);
:::
在生产环境中
在生产环境中,使用由数据库支持的 checkpointer:
:::python
pip install langgraph-checkpoint-postgres
from langchain.agents import create_agent
from langgraph.checkpoint.postgres import PostgresSaver # [!code highlight]
DB_URI = "postgresql://postgres:postgres@localhost:5442/postgres?sslmode=disable"
with PostgresSaver.from_conn_string(DB_URI) as checkpointer:
checkpointer.setup() # 自动在 PostgresSql 中创建表
agent = create_agent(
"gpt-5",
tools=[get_user_info],
checkpointer=checkpointer, # [!code highlight]
)
::: :::js
import { PostgresSaver } from "@langchain/langgraph-checkpoint-postgres";
const DB_URI = "postgresql://postgres:postgres@localhost:5442/postgres?sslmode=disable";
const checkpointer = PostgresSaver.fromConnString(DB_URI);
:::
自定义 Agent Memory
:::python
默认情况下,Agent 使用 @[AgentState] 来管理短期记忆,特别是通过 messages 键管理对话历史。
你可以扩展 @[AgentState] 来添加额外字段。自定义 state schemas 使用 @[state_schema] 参数传递给 @[create_agent]。
from langchain.agents import create_agent, AgentState
from langgraph.checkpoint.memory import InMemorySaver
class CustomAgentState(AgentState): # [!code highlight]
user_id: str # [!code highlight]
preferences: dict # [!code highlight]
agent = create_agent(
"gpt-5",
tools=[get_user_info],
state_schema=CustomAgentState, # [!code highlight]
checkpointer=InMemorySaver(),
)
# 自定义 state 可以在 invoke 中传递
result = agent.invoke(
{
"messages": [{"role": "user", "content": "Hello"}],
"user_id": "user_123", # [!code highlight]
"preferences": {"theme": "dark"} # [!code highlight]
},
{"configurable": {"thread_id": "1"}})
:::
:::js
你可以通过创建带有 state schema 的自定义 middleware 来扩展 Agent state。自定义 state schemas 可以使用 middleware 中的 stateSchema 参数传递。最好使用 StateSchema 类进行 state 定义(也支持普通 Zod 对象)。
import { createAgent, createMiddleware } from "langchain";
import { StateSchema, MemorySaver } from "@langchain/langgraph";
import * as z from "zod";
const CustomState = new StateSchema({ # [!code highlight]
userId: z.string(), # [!code highlight]
preferences: z.record(z.string(), z.any()), # [!code highlight]
}); # [!code highlight]
const stateExtensionMiddleware = createMiddleware({
name: "StateExtension",
stateSchema: CustomState, # [!code highlight]
});
const checkpointer = new MemorySaver();
const agent = createAgent({
model: "gpt-5",
tools: [],
middleware: [stateExtensionMiddleware], # [!code highlight]
checkpointer,
});
# 自定义 state 可以在 invoke 中传递
const result = await agent.invoke({
messages: [{ role: "user", content: "Hello" }],
userId: "user_123", # [!code highlight]
preferences: { theme: "dark" }, # [!code highlight]
});
:::
常见模式
启用 short-term memory 后,长对话可能超出 LLM 的 context window。常见的解决方案是:
这允许 Agent 跟踪对话而不超出 LLM 的 context window。
Trim Messages(修剪消息)
大多数 LLM 都有最大支持的 context window(以 tokens 计)。
:::python
决定何时截断消息的一种方法是计算消息历史中的 tokens,并在接近该限制时截断。如果你使用 LangChain,你可以使用 trim messages utility 并指定要从列表中保留的 tokens 数量,以及用于处理边界的 strategy(例如保留最后 max_tokens)。
:::
:::js
决定何时截断消息的一种方法是计算消息历史中的 tokens,并在接近该限制时截断。如果你使用 LangChain,你可以使用 trim messages utility 并指定要从列表中保留的 tokens 数量,以及用于处理边界的 strategy(例如保留最后 maxTokens)。
:::
:::python
要在 Agent 中修剪消息历史,使用 @[@before_model] middleware decorator:
from langchain.messages import RemoveMessage
from langgraph.graph.message import REMOVE_ALL_MESSAGES
from langgraph.checkpoint.memory import InMemorySaver
from langchain.agents import create_agent, AgentState
from langchain.agents.middleware import before_model
from langgraph.runtime import Runtime
from langchain_core.runnables import RunnableConfig
from typing import Any
@before_model
def trim_messages(state: AgentState, runtime: Runtime) -> dict[str, Any] | None:
"""Keep only the last few messages to fit context window."""
messages = state["messages"]
if len(messages) <= 3:
return None # No changes needed
first_msg = messages[0]
recent_messages = messages[-3:] if len(messages) % 2 == 0 else messages[-4:]
new_messages = [first_msg] + recent_messages
return {
"messages": [
RemoveMessage(id=REMOVE_ALL_MESSAGES),
*new_messages
]
}
agent = create_agent(
your_model_here,
tools=your_tools_here,
middleware=[trim_messages],
checkpointer=InMemorySaver(),
)
config: RunnableConfig = {"configurable": {"thread_id": "1"}}
agent.invoke({"messages": "hi, my name is bob"}, config)
agent.invoke({"messages": "write a short poem about cats"}, config)
agent.invoke({"messages": "now do the same but for dogs"}, config)
final_response = agent.invoke({"messages": "what's my name?"}, config)
final_response["messages"][-1].pretty_print()
"""
================================== Ai Message ==================================
Your name is Bob. You told me that earlier.
If you'd like me to call you a nickname or use a different name, just say the word.
"""
:::
:::js
要在 Agent 中修剪消息历史,使用带有 beforeModel hook 的 @[createMiddleware]:
import { RemoveMessage } from "@langchain/core/messages";
import { createAgent, createMiddleware } from "langchain";
import { MemorySaver, REMOVE_ALL_MESSAGES } from "@langchain/langgraph";
const trimMessages = createMiddleware({
name: "TrimMessages",
beforeModel: (state) => {
const messages = state.messages;
if (messages.length <= 3) {
return; // No changes needed
}
const firstMsg = messages[0];
const recentMessages =
messages.length % 2 === 0 ? messages.slice(-3) : messages.slice(-4);
const newMessages = [firstMsg, ...recentMessages];
return {
messages: [
new RemoveMessage({ id: REMOVE_ALL_MESSAGES }),
...newMessages,
],
};
},
});
const checkpointer = new MemorySaver();
const agent = createAgent({
model: "gpt-4.1",
tools: [],
middleware: [trimMessages],
checkpointer,
});
:::
Delete Messages(删除消息)
你可以从 graph state 中删除消息来管理消息历史。
当你想移除特定消息或清除整个消息历史时,这很有用。
:::python
要从 graph state 中删除消息,你可以使用 RemoveMessage。
要使 RemoveMessage 工作,你需要使用带有 @[add_messages] reducer 的 state key。
默认的 @[AgentState] 提供此功能。
要移除特定消息:
from langchain.messages import RemoveMessage # [!code highlight]
def delete_messages(state):
messages = state["messages"]
if len(messages) > 2:
# 移除最早的两条消息
return {"messages": [RemoveMessage(id=m.id) for m in messages[:2]]} # [!code highlight]
要移除所有消息:
from langgraph.graph.message import REMOVE_ALL_MESSAGES # [!code highlight]
def delete_messages(state):
return {"messages": [RemoveMessage(id=REMOVE_ALL_MESSAGES)]} # [!code highlight]
:::
:::js
要从 graph state 中删除消息,你可以使用 RemoveMessage。要使 RemoveMessage 工作,你需要使用带有 @[messagesStateReducer][messagesStateReducer] reducer 的 state key,如 MessagesValue。
要移除特定消息:
import { RemoveMessage } from "@langchain/core/messages";
const deleteMessages = (state) => {
const messages = state.messages;
if (messages.length > 2) {
# 移除最早的两条消息
return {
messages: messages
.slice(0, 2)
.map((m) => new RemoveMessage({ id: m.id })),
};
}
};
:::
* 某些 provider 期望消息历史以 `user` 消息开始
* 大多数 provider 要求带有 tool calls 的 `assistant` 消息后跟相应的 `tool` 结果消息。
:::python
from langchain.messages import RemoveMessage
from langchain.agents import create_agent, AgentState
from langchain.agents.middleware import after_model
from langgraph.checkpoint.memory import InMemorySaver
from langgraph.runtime import Runtime
from langchain_core.runnables import RunnableConfig
@after_model
def delete_old_messages(state: AgentState, runtime: Runtime) -> dict | None:
"""Remove old messages to keep conversation manageable."""
messages = state["messages"]
if len(messages) > 2:
# 移除最早的两条消息
return {"messages": [RemoveMessage(id=m.id) for m in messages[:2]]}
return None
agent = create_agent(
"gpt-5-nano",
tools=[],
system_prompt="Please be concise and to the point.",
middleware=[delete_old_messages],
checkpointer=InMemorySaver(),
)
config: RunnableConfig = {"configurable": {"thread_id": "1"}}
for event in agent.stream(
{"messages": [{"role": "user", "content": "hi! I'm bob"}]},
config,
stream_mode="values",
):
print([(message.type, message.content) for message in event["messages"]])
for event in agent.stream(
{"messages": [{"role": "user", "content": "what's my name?"}]},
config,
stream_mode="values",
):
print([(message.type, message.content) for message in event["messages"]])
[('human', "hi! I'm bob")]
[('human', "hi! I'm bob"), ('ai', 'Hi Bob! Nice to meet you. How can I help you today? I can answer questions, brainstorm ideas, draft text, explain things, or help with code.')]
[('human', "hi! I'm bob"), ('ai', 'Hi Bob! Nice to meet you. How can I help you today? I can answer questions, brainstorm ideas, draft text, explain things, or help with code.'), ('human', "what's my name?")]
[('human', "hi! I'm bob"), ('ai', 'Hi Bob! Nice to meet you. How can I help you today? I can answer questions, brainstorm ideas, draft text, explain things, or help with code.'), ('human', "what's my name?"), ('ai', 'Your name is Bob. How can I help you today, Bob?')]
[('human', "what's my name?"), ('ai', 'Your name is Bob. How can I help you today, Bob?')]
:::
:::js
import { RemoveMessage } from "@langchain/core/messages";
import { createAgent, createMiddleware } from "langchain";
import { MemorySaver } from "@langchain/langgraph";
const deleteOldMessages = createMiddleware({
name: "DeleteOldMessages",
afterModel: (state) => {
const messages = state.messages;
if (messages.length > 2) {
# 移除最早的两条消息
return {
messages: messages
.slice(0, 2)
.map((m) => new RemoveMessage({ id: m.id! })),
};
}
return;
},
});
const agent = createAgent({
model: "gpt-4.1",
tools: [],
systemPrompt: "Please be concise and to the point.",
middleware: [deleteOldMessages],
checkpointer: new MemorySaver(),
});
const config = { configurable: { thread_id: "1" } };
const streamA = await agent.stream(
{ messages: [{ role: "user", content: "hi! I'm bob" }] },
{ ...config, streamMode: "values" }
);
for await (const event of streamA) {
const messageDetails = event.messages.map((message) => [
message.getType(),
message.content,
]);
console.log(messageDetails);
}
const streamB = await agent.stream(
{
messages: [{ role: "user", content: "what's my name?" }],
},
{ ...config, streamMode: "values" }
);
for await (const event of streamB) {
const messageDetails = event.messages.map((message) => [
message.getType(),
message.content,
]);
console.log(messageDetails);
}
[[ "human", "hi! I'm bob" ]]
[[ "human", "hi! I'm bob" ], [ "ai", "Hello, Bob! How can I assist you today?" ]]
[[ "human", "hi! I'm bob" ], [ "ai", "Hello, Bob! How can I assist you today?" ]]
[[ "human", "hi! I'm bob" ], [ "ai", "Hello, Bob! How can I assist you today?" ], ["human", "what's my name?" ]]
[[ "human", "hi! I'm bob" ], [ "ai", "Hello, Bob! How can I assist you today?" ], ["human", "what's my name?"], [ "ai", "Your name is Bob, as you mentioned. How can I help you further?" ]]
[[ "human", "what's my name?" ], [ "ai", "Your name is Bob, as you mentioned. How can I help you further?" ]]
:::
Summarize Messages(总结消息)
如上所示,修剪或移除消息的问题是你可能会因剔除消息队列而丢失信息。 因此,一些应用受益于使用 chat model 总结消息历史的更复杂方法。

:::python
要在 Agent 中总结消息历史,使用内置的 SummarizationMiddleware:
from langchain.agents import create_agent
from langchain.agents.middleware import SummarizationMiddleware
from langgraph.checkpoint.memory import InMemorySaver
from langchain_core.runnables import RunnableConfig
checkpointer = InMemorySaver()
agent = create_agent(
model="gpt-4.1",
tools=[],
middleware=[
SummarizationMiddleware(
model="gpt-4.1-mini",
trigger=("tokens", 4000),
keep=("messages", 20)
)
],
checkpointer=checkpointer,
)
config: RunnableConfig = {"configurable": {"thread_id": "1"}}
agent.invoke({"messages": "hi, my name is bob"}, config)
agent.invoke({"messages": "write a short poem about cats"}, config)
agent.invoke({"messages": "now do the same but for dogs"}, config)
final_response = agent.invoke({"messages": "what's my name?"}, config)
final_response["messages"][-1].pretty_print()
"""
================================== Ai Message ==================================
Your name is Bob!
"""
有关更多配置选项,请参阅 SummarizationMiddleware。
:::
:::js
要在 Agent 中总结消息历史,使用内置的 summarizationMiddleware:
import { createAgent, summarizationMiddleware } from "langchain";
import { MemorySaver } from "@langchain/langgraph";
const checkpointer = new MemorySaver();
const agent = createAgent({
model: "gpt-4.1",
tools: [],
middleware: [
summarizationMiddleware({
model: "gpt-4.1-mini",
trigger: { tokens: 4000 },
keep: { messages: 20 },
}),
],
checkpointer,
});
const config = { configurable: { thread_id: "1" } };
await agent.invoke({ messages: "hi, my name is bob" }, config);
await agent.invoke({ messages: "write a short poem about cats" }, config);
await agent.invoke({ messages: "now do the same but for dogs" }, config);
const finalResponse = await agent.invoke({ messages: "what's my name?" }, config);
console.log(finalResponse.messages.at(-1)?.content);
// Your name is Bob!
有关更多配置选项,请参阅 summarizationMiddleware。
:::
访问 Memory
你可以通过多种方式访问和修改 Agent 的短期记忆(state):
Tools
在 Tool 中读取短期记忆
使用 runtime 参数(类型为 ToolRuntime)在 tool 中访问短期记忆(state)。
runtime 参数对 tool signature 隐藏(因此模型看不到它),但 tool 可以通过它访问 state。
:::python
from langchain.agents import create_agent, AgentState
from langchain.tools import tool, ToolRuntime
class CustomState(AgentState):
user_id: str
@tool
def get_user_info(
runtime: ToolRuntime
) -> str:
"""Look up user info."""
user_id = runtime.state["user_id"]
return "User is John Smith" if user_id == "user_123" else "Unknown user"
agent = create_agent(
model="gpt-5-nano",
tools=[get_user_info],
state_schema=CustomState,
)
result = agent.invoke({
"messages": "look up user information",
"user_id": "user_123"
})
print(result["messages"][-1].content)
# > User is John Smith.
::: :::js
import { createAgent, tool, type ToolRuntime } from "langchain";
import { StateSchema } from "@langchain/langgraph";
import * as z from "zod";
const CustomState = new StateSchema({
userId: z.string(),
});
const getUserInfo = tool(
async (_, config: ToolRuntime<typeof CustomState.State>) => {
const userId = config.state.userId;
return userId === "user_123" ? "John Doe" : "Unknown User";
},
{
name: "get_user_info",
description: "Get user info",
schema: z.object({}),
}
);
const agent = createAgent({
model: "gpt-5-nano",
tools: [getUserInfo],
stateSchema: CustomState,
});
const result = await agent.invoke(
{
messages: [{ role: "user", content: "what's my name?" }],
userId: "user_123",
},
{
context: {},
}
);
console.log(result.messages.at(-1)?.content);
// Outputs: "Your name is John Doe."
:::
从 Tools 写入短期记忆
要在执行期间修改 Agent 的短期记忆(state),你可以直接从 tools 返回 state updates。
这对于持久化中间结果或使信息对后续 tools 或 prompts 可访问很有用。
:::python
from langchain.tools import tool, ToolRuntime
from langchain_core.runnables import RunnableConfig
from langchain.messages import ToolMessage
from langchain.agents import create_agent, AgentState
from langgraph.types import Command
from pydantic import BaseModel
class CustomState(AgentState): # [!code highlight]
user_name: str
class CustomContext(BaseModel):
user_id: str
@tool
def update_user_info(
runtime: ToolRuntime[CustomContext, CustomState],
) -> Command:
"""Look up and update user info."""
user_id = runtime.context.user_id
name = "John Smith" if user_id == "user_123" else "Unknown user"
return Command(update={ # [!code highlight]
"user_name": name,
# 更新消息历史
"messages": [
ToolMessage(
"Successfully looked up user information",
tool_call_id=runtime.tool_call_id
)
]
})
@tool
def greet(
runtime: ToolRuntime[CustomContext, CustomState]
) -> str | Command:
"""Use this to greet the user once you found their info."""
user_name = runtime.state.get("user_name", None)
if user_name is None:
return Command(update={
"messages": [
ToolMessage(
"Please call the 'update_user_info' tool it will get and update the user's name.",
tool_call_id=runtime.tool_call_id
)
]
})
return f"Hello {user_name}!"
agent = create_agent(
model="gpt-5-nano",
tools=[update_user_info, greet],
state_schema=CustomState, # [!code highlight]
context_schema=CustomContext,
)
agent.invoke(
{"messages": [{"role": "user", "content": "greet the user"}]},
context=CustomContext(user_id="user_123"),
)
:::
:::js
import { tool, createAgent, ToolMessage, type ToolRuntime } from "langchain";
import { Command, StateSchema } from "@langchain/langgraph";
import * as z from "zod";
const CustomState = new StateSchema({
userId: z.string().optional(),
});
const updateUserInfo = tool(
async (_, config: ToolRuntime<typeof CustomState.State>) => {
const userId = config.state.userId;
const name = userId === "user_123" ? "John Smith" : "Unknown user";
return new Command({
update: {
userName: name,
# 更新消息历史
messages: [
new ToolMessage({
content: "Successfully looked up user information",
tool_call_id: config.toolCall?.id ?? "",
}),
],
},
});
},
{
name: "update_user_info",
description: "Look up and update user info.",
schema: z.object({}),
}
);
const greet = tool(
async (_, config) => {
const userName = config.context?.userName;
return `Hello ${userName}!`;
},
{
name: "greet",
description: "Use this to greet the user once you found their info.",
schema: z.object({}),
}
);
const agent = createAgent({
model: "openai:gpt-5-mini",
tools: [updateUserInfo, greet],
stateSchema: CustomState,
});
const result = await agent.invoke({
messages: [{ role: "user", content: "greet the user" }],
userId: "user_123",
});
console.log(result.messages.at(-1)?.content);
// Output: "Hello! I'm here to help — what would you like to do today?"
:::
Prompt
在 middleware 中访问短期记忆(state)以基于对话历史或自定义 state 字段创建动态 prompts。
:::python
from langchain.agents import create_agent
from typing import TypedDict
from langchain.agents.middleware import dynamic_prompt, ModelRequest
class CustomContext(TypedDict):
user_name: str
def get_weather(city: str) -> str:
"""Get the weather in a city."""
return f"The weather in {city} is always sunny!"
@dynamic_prompt
def dynamic_system_prompt(request: ModelRequest) -> str:
user_name = request.runtime.context["user_name"]
system_prompt = f"You are a helpful assistant. Address the user as {user_name}."
return system_prompt
agent = create_agent(
model="gpt-5-nano",
tools=[get_weather],
middleware=[dynamic_system_prompt],
context_schema=CustomContext,
)
result = agent.invoke(
{"messages": [{"role": "user", "content": "What is the weather in SF?"}]},
context=CustomContext(user_name="John Smith"),
)
for msg in result["messages"]:
msg.pretty_print()
================================ Human Message =================================
What is the weather in SF?
================================== Ai Message ==================================
Tool Calls:
get_weather (call_WFQlOGn4b2yoJrv7cih342FG)
Call ID: call_WFQlOGn4b2yoJrv7cih342FG
Args:
city: San Francisco
================================= Tool Message =================================
Name: get_weather
The weather in San Francisco is always sunny!
================================== Ai Message ==================================
Hi John Smith, the weather in San Francisco is always sunny!
::: :::js
import * as z from "zod";
import { createAgent, tool, dynamicSystemPromptMiddleware } from "langchain";
const contextSchema = z.object({
userName: z.string(),
});
type ContextSchema = z.infer<typeof contextSchema>;
const getWeather = tool(
async ({ city }) => {
return `The weather in ${city} is always sunny!`;
},
{
name: "get_weather",
description: "Get user info",
schema: z.object({
city: z.string(),
}),
}
);
const agent = createAgent({
model: "gpt-5-nano",
tools: [getWeather],
contextSchema,
middleware: [
dynamicSystemPromptMiddleware<ContextSchema>((_, config) => {
return `You are a helpful assistant. Address the user as ${config.context?.userName}.`;
}),
],
});
const result = await agent.invoke(
{
messages: [{ role: "user", content: "What is the weather in SF?" }],
},
{
context: {
userName: "John Smith",
},
}
);
for (const message of result.messages) {
console.log(message);
}
/**
* HumanMessage {
* "content": "What is the weather in SF?",
* // ...
* }
* AIMessage {
* // ...
* "tool_calls": [
* {
* "name": "get_weather",
* "args": {
* "city": "San Francisco"
* },
* "type": "tool_call",
* "id": "call_tCidbv0apTpQpEWb3O2zQ4Yx"
* }
* ],
* // ...
* }
* ToolMessage {
* "content": "The weather in San Francisco is always sunny!",
* "tool_call_id": "call_tCidbv0apTpQpEWb3O2zQ4Yx"
* // ...
* }
* AIMessage {
* "content": "John Smith, here's the latest: The weather in San Francisco is always sunny!\n\nIf you'd like more details (temperature, wind, humidity) or a forecast for the next few days, I can pull that up. What would you like?",
* // ...
* }
*/
:::
Before Model
:::python
在 @[@before_model] middleware 中访问短期记忆(state)以在 model calls 之前处理消息。
:::
%%{
init: {
"fontFamily": "monospace",
"flowchart": {
"curve": "basis"
},
"themeVariables": {"edgeLabelBackground": "transparent"}
}
}%%
graph TD
S(["\_\_start\_\_"])
PRE(before_model)
MODEL(model)
TOOLS(tools)
END(["\_\_end\_\_"])
S --> PRE
PRE --> MODEL
MODEL -.-> TOOLS
MODEL -.-> END
TOOLS --> PRE
classDef blueHighlight fill:#DBEAFE,stroke:#2563EB,color:#1E3A8A;
classDef neutral fill:#F3F4F6,stroke:#9CA3AF,stroke-width:2px,color:#374151;
class S blueHighlight;
class END blueHighlight;
class PRE,MODEL,TOOLS neutral;
:::python
from langchain.messages import RemoveMessage
from langgraph.graph.message import REMOVE_ALL_MESSAGES
from langgraph.checkpoint.memory import InMemorySaver
from langchain.agents import create_agent, AgentState
from langchain.agents.middleware import before_model
from langchain_core.runnables import RunnableConfig
from langgraph.runtime import Runtime
from typing import Any
@before_model
def trim_messages(state: AgentState, runtime: Runtime) -> dict[str, Any] | None:
"""Keep only the last few messages to fit context window."""
messages = state["messages"]
if len(messages) <= 3:
return None # No changes needed
first_msg = messages[0]
recent_messages = messages[-3:] if len(messages) % 2 == 0 else messages[-4:]
new_messages = [first_msg] + recent_messages
return {
"messages": [
RemoveMessage(id=REMOVE_ALL_MESSAGES),
*new_messages
]
}
agent = create_agent(
"gpt-5-nano",
tools=[],
middleware=[trim_messages],
checkpointer=InMemorySaver()
)
config: RunnableConfig = {"configurable": {"thread_id": "1"}}
agent.invoke({"messages": "hi, my name is bob"}, config)
agent.invoke({"messages": "write a short poem about cats"}, config)
agent.invoke({"messages": "now do the same but for dogs"}, config)
final_response = agent.invoke({"messages": "what's my name?"}, config)
final_response["messages"][-1].pretty_print()
"""
================================== Ai Message ==================================
Your name is Bob. You told me that earlier.
If you'd like me to call you a nickname or use a different name, just say the word.
"""
::: :::js
import { RemoveMessage } from "@langchain/core/messages";
import { createAgent, createMiddleware, trimMessages } from "langchain";
import { MemorySaver } from "@langchain/langgraph";
import { REMOVE_ALL_MESSAGES } from "@langchain/langgraph";
const trimMessageHistory = createMiddleware({
name: "TrimMessages",
beforeModel: async (state) => {
const trimmed = await trimMessages(state.messages, {
maxTokens: 384,
strategy: "last",
startOn: "human",
endOn: ["human", "tool"],
tokenCounter: (msgs) => msgs.length,
});
return {
messages: [new RemoveMessage({ id: REMOVE_ALL_MESSAGES }), ...trimmed],
};
},
});
const checkpointer = new MemorySaver();
const agent = createAgent({
model: "gpt-5-nano",
tools: [],
middleware: [trimMessageHistory],
checkpointer,
});
:::
After Model
:::python
在 @[@after_model] middleware 中访问短期记忆(state)以在 model calls 之后处理消息。
:::
%%{
init: {
"fontFamily": "monospace",
"flowchart": {
"curve": "basis"
},
"themeVariables": {"edgeLabelBackground": "transparent"}
}
}%%
graph TD
S(["\_\_start\_\_"])
MODEL(model)
POST(after_model)
TOOLS(tools)
END(["\_\_end\_\_"])
S --> MODEL
MODEL --> POST
POST -.-> END
POST -.-> TOOLS
TOOLS --> MODEL
classDef blueHighlight fill:#DBEAFE,stroke:#2563EB,color:#1E3A8A;
classDef greenHighlight fill:#DCFCE7,stroke:#16A34A,color:#14532D;
classDef neutral fill:#F3F4F6,stroke:#9CA3AF,stroke-width:2px,color:#374151;
class S blueHighlight;
class END blueHighlight;
class POST greenHighlight;
class MODEL,TOOLS neutral;
:::python
from langchain.messages import RemoveMessage
from langgraph.checkpoint.memory import InMemorySaver
from langchain.agents import create_agent, AgentState
from langchain.agents.middleware import after_model
from langgraph.runtime import Runtime
@after_model
def validate_response(state: AgentState, runtime: Runtime) -> dict | None:
"""Remove messages containing sensitive words."""
STOP_WORDS = ["password", "secret"]
last_message = state["messages"][-1]
if any(word in last_message.content for word in STOP_WORDS):
return {"messages": [RemoveMessage(id=last_message.id)]}
return None
agent = create_agent(
model="gpt-5-nano",
tools=[],
middleware=[validate_response],
checkpointer=InMemorySaver(),
)
::: :::js
import { RemoveMessage } from "@langchain/core/messages";
import { createAgent, createMiddleware } from "langchain";
import { REMOVE_ALL_MESSAGES } from "@langchain/langgraph";
const validateResponse = createMiddleware({
name: "ValidateResponse",
afterModel: (state) => {
const lastMessage = state.messages.at(-1)?.content;
if (
typeof lastMessage === "string" &&
lastMessage.toLowerCase().includes("confidential")
) {
return {
messages: [
new RemoveMessage({ id: REMOVE_ALL_MESSAGES }),
],
};
}
return;
},
});
const agent = createAgent({
model: "gpt-5-nano",
tools: [],
middleware: [validateResponse],
});
:::