让 GPT 学会使用工具，拓展技能

作者：蔡超

2023-04-22
北京
本文字数：2352 字
阅读完需：约 8 分钟

引言

ChatGPT 无疑是现在最热门的话题，但我们都清楚 GPT 的训练数据基于过去的信息，且或许并未涵盖您所需场景的所有相关数据。另外，某些数据信息需要在特定环境中实时获取，例如：掌握集群当前的工作状态等。这些都大大限制了 GPT 的应用场景，甚至你无法让 GPT 告诉你根据今天的天气穿什么衣服更合适，更无法解决你的环境中跟实时事件和数据有关的问题。因此，赋予 GPT 使用工具、执行任务以及获取外部数据的功能将极大地拓展其应用范围。值得注意的是，当前备受关注的 AutoGPT 项目也是利用这一相关技术。

让 GPT 选择正确的工具（方法）

首先，我们都知道 GPT 是 LLM（Large Language Model）具有很强大的语言理解能力，所以我们只要简单告诉 GPT 哪个工具是干什么的，他就可以在不同情况下选择正确的工具。

下面的示例使用了 LangChain，所以我们要先安装：

pip install openaipip install langchain

复制代码

from langchain.agents import initialize_agent, Toolfrom langchain.llms import AzureOpenAI
llm = AzureOpenAI( model_name="gpt-4",temperature=0, engine="gpt4")
def get_node_info(input: str) -> str:    return "The disk is full."
def get_pod_info(input: str) -> str:    return "The pod's is running well."
tools = [    Tool(        name = "Get Node Info",func=get_node_info,         description="By invoking it, you can get the node info, also the node here is the (VM) virtual machine"    ),    Tool(name="Get Pod Info", func=get_pod_info,          description="By invoking it, you can get the pod info. Your application is always running in the pods."    )]agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True)

复制代码

好了运行完以上代码后我们来看看效果，当然不要忘记了给程序提供你的 OPEN_AI_KEY, 上面的程序中我是通过 Azure 提供的 openai 服务来完成的。

接下来我们来验证一下：

print(agent.run("What's wrong with the machine (hostname:chao)?"))

复制代码

运行上面程序后，返回如下结果：

> Entering new AgentExecutor chain...I need to check the node info to see if there are any issues with the virtual machine.Action: Get Node InfoAction Input: chaoObservation: The disk is full.Thought:I now know the final answer.Final Answer: The issue with the machine (hostname: chao) is that its disk is full.
> Finished chain.The issue with the machine (hostname: chao) is that its disk is full.

复制代码

可以看到 GPT 准确的调用了 get_node_info 这个方法。上面的程序中我们利用了 LangChain 中的 Tool 和 Agent，可以看到在 Tool 的定义中我们为每个 Tool（方法）都加入一个描述（description），这样 GPT 通过对于 description 的理解就可以根据上下文正确的调用工具。

我们再来测试一个：

print(agent.run("How's my application(DSP)?"))

复制代码

这次返回的结果是

Entering new AgentExecutor chain...I need to check the pod information to see the status of the application.Action: Get Pod InfoAction Input: DSP
Observation: The pod's is running well.Thought:I now know the final answer.Final Answer: Your application (DSP) is running well.
> Finished chain.Your application (DSP) is running well.

复制代码

到这里你一定在想 LangChain 的 Tool 是怎么实现的这个魔术的呢？还有似乎看着这里有一个逐步推理的到得出结论的过程（Thought-Action-Observation-Thougt -），这个过程实际被称为 LLM 的 ReAct （Reasoning+Acting），关于 ReAct 的应用和解密我会放到我的下一篇文章，这里我们先揭秘这个方法选择。

揭秘神奇的提示词

其实 LangChain Tool 里并没有什么 Rocket Science，依然还是提示词工程，我们来看一下下面这段提示词：

prompt = """  According to user requirement and operation description, select the operation. Please, return the operation's name,   and if there's no suitable method, then return "none"  1. operation name:  get_cluster_info, description: get the info about the cluster, and the info can be used to detect the problems of the cluster;  2. operation name:  get_node_info, description: get the info about the node, and the info can be used to detect the problems of the node; , in our context node is equals to machine or virtual machine/VM  3. operation name:  get_pod_info, description: get the info about the pod, and the info can be used to detect the problems of the pod; """

复制代码

通过阅读提示词你会发现，我们做的只是简单告诉 GPT，根据当前用户的要求及 operation 的描述，返回 operation 的名字，就这么简单。下面我们来测试一下。

response = openai.ChatCompletion.create(    engine="gpt4", # engine = "deployment_name".    messages=[        {"role": "system", "content": "You are an operator."+prompt},        {"role": "user", "content": "我的机器怎么了"}]  )print(response.choices[0].message.content)

复制代码

上面调用的输出是：

get_node_info

复制代码

到这里我想你已经了解他的原理。

结束语

上面我们通过提示词就扩展了 GPT 的能力，当然你可通过方法来让 GPT 获取实时的事件和数据，甚至让他来执行语言模型做不了的事情，如调用其他模型来生成一张图片。在很多 AI 专业人士眼中，这些场景的扩展可能需要通过对模型 fine tuning 来完成，实际在大语言模型时代我们只需通过提示词工程（Prompt Engineering）就可以做到了。LLM 的成功，需要我们重新思考很多曾经的应用构建方式及 AI 模型的扩展方式。

发布于: 刚刚阅读数: 2

原文链接:【http://xie.infoq.cn/article/fbc519f5c49067dbd766f66e1】。

蔡超

关注

程序员是我的终身职业 2017-10-19 加入

蔡超，SpotMax创始人，Mobvista 集团副总裁/首席架构师，前亚马逊（中国）首席架构师，前HP（中国）软件架构师

发布

暂无评论

创作场景

让 GPT 学会使用工具，拓展技能

引言

让 GPT 选择正确的工具（方法）

揭秘神奇的提示词

结束语

蔡超

评论