ERNIE Bot Agent开发框架 & function calling 开发实践

原创

用户4080233

发布于 2024-06-17 00:26:15

2150

发布于 2024-06-17 00:26:15

1 背景

生成式AI的等级划分:

L1 Tool 人类完成所有⼯作，没有任何明显的AI辅助。

L2 Chatbot 人类直接完成绝⼤部份⼯作。人类向AI询问意见，了解信息AI提供信息和建议但不直接处理工作。

L3 Copilot 人类和4I进⾏协作，工作量相当。AI根据人类要求完成⼯作初稿，人类进行目标设定，修改调整，最后确认。

L4 Agent AI完成绝⼤部分⼯作，人类负责设定⽬标、提供资源和监督结果。AI完成任务拆分，工具选择，进度控制，实现目标后自助结束工作。

L5 Intelligence 完全⽆需人类监督，AI⾃主拆解⽬标，寻找资源，选择并使⽤⼯具，完成全部⼯作，人类只需给出初始⽬标。

2 ERNIE Bot Agent开发框架

ERNIE Bot Agent 基于文心模型的 Function Calling（下⾯简称FC）能力实现了多工具编排和自动调度功能，并且允许工具、插件、知识库等不同组件的混合编排。

# ⾸先安装Ernie Bot 
!pip install ./ERNIE-SDK/erniebot 
# 然后安装ERNIE Bot Agent 
!pip install ./ERNIE-SDK/erniebot-agent # 安装核⼼模块 
#pip install './erniebot-agent/.[all]' # 也可以加上[all]⼀次性安装所有模块，包括gradio 
等依赖库

2.1 定义第⼀个工具：word_helper_tool

功能：根据用户提供的单词，获取一个具体的单词描述、翻译以及例句

# 第⼀个类其实就是描述word这个变量，这是这个⼯具接受的参数。需要注意的是，需要继承官⽅给的类： 
ToolParameterView 
class ChatWithEBInputView(ToolParameterView): 
    word: str = Field(description="需要获取详细信息的单词") 

# 第⼆个类就是描述response这个变量，这是这个⼯具输出的参数。 
class ChatWithEBOutputView(ToolParameterView): 
    response: str = Field(description="单词的详细释义") 

# 第三个类将输⼊和输出，以及⼯具的描述组合起来，调⽤⽂⼼⼤模型实现对单词的翻译，以及近义词等 
class ChatWithEB(Tool): 
    description: str = "ChatWithEB是⼀款根据⽤户的提供的单词，获取⼀个具体的单词描述、翻译以 
及例句的⼯具" 
input_type: Type[ToolParameterView] = ChatWithEBInputView
ouptut_type: Type[ToolParameterView] = ChatWithEBOutputView
def __init__(self, llm: ERNIEBot):
    self.llm = llm
    async def __call__(self, word: str) -> Dict[str, str]:
        prompt = f"请告诉我{word}的详细信息，以下⾯的格式输出（其中单词翻译需要是中⽂，其他为
英语）" "<单词翻译>:\n<近义词>:\n<反义词>:\n<例句>:\n"
response = await self.llm.chat([HumanMessage(prompt)])
return {"response": response.content}

# 这是设置⼤模型的⻆⾊，按官⽅的⽅式设置就⾏了，没啥可多讲的
SYSTEM_MESSAGE = "你是⼀个英语⽼师，当⽤户给你⼀个单词的时候，请你专业⽽通俗的返回英语的<单词翻
译><近义词><反义词><例句>"
llm = ERNIEBot(model="ernie-4.0", system=SYSTEM_MESSAGE)
# 将这个⼯具实例化，等待调⽤
word_helper_tool = ChatWithEB(llm)

2.2 定义第二个工具：add_word_tool

功能：添加单词以及它的详细解释到词库当中

# 和第⼀个⼯具⼀样的⽅法，在此不赘述
class AddWordInput(ToolParameterView):
    word: str = Field(description="待添加的单词")
    des: str = Field(description="待添加的单词的详细释义,含有<单词翻译><近义词><反义词><例
句>")
                     
class AddWordOutput(ToolParameterView):
    result: str = Field(description="表示是否成功将单词成功添加到词库当中")

class AddWordTool(Tool): 
    description: str = "添加单词以及它的详细解释到词库当中" 
    input_type: Type[ToolParameterView] = AddWordInput 
    ouptut_type: Type[ToolParameterView] = AddWordOutput 
    def __init__(self) -> None: 
        self.word_books = {} 
    super().__init__() 
    
    async def __call__(self, word: str, des: str) -> Dict[str, Any]: 
        if word in self.word_books: 
            return {"result": f"<{word}>单词已经存在，⽆需添加"} 
        self.word_books[word] = des 
        words = "\n".join(list(self.word_books.keys())) 
        return {"result": f"<{word}>单词已添加成功, 当前单词本中有如下单词：{words}，释义 
            如下{des}"}

add_word_tool = AddWordTool()

2.3 第三个工具：ocr_tool

这是⼀个在线的，从星河社区直接调⽤的，识别图片中文字的工具

# 远程调⽤⼀个图⽚识别⼯具 
ocr_tool = RemoteToolkit.from_aistudio("highacc-ocr").get_tools()[0]

2.4 创建agent

memory = SlidingWindowMemory(max_round=5) 
llm_final = ERNIEBot(model="ernie-4.0", api_type="aistudio", enable_multi_step_tool_call=True) 
agent_all = FunctionAgent(llm=llm_final, tools=[word_helper_tool, add_word_tool, ocr_tool], memory=memory, max_steps=5)

2.5 运行agent

file_manager = GlobalFileManagerHandler().get() 
file = await file_manager.create_file_from_path(r"./road_sign.jpeg") 
response = await agent_all.run("请识别图⽚中的第⼀个英语单词，进⾏解释后添加到单词本中", files=[file]) 
print(response.text) 
add_word_tool.word_books

2.6 运行结果

INFO - [Run][Start] FunctionAgent is about to start running with input: 
    请识别图⽚中的第⼀个英语单词，进⾏解释后添加到单词本中 
INFO - [LLM][Start] ERNIEBot is about to start running with input: 
    role: user 
    content: 请识别图⽚中的第⼀个英语单词，进⾏解释后添加到单词本中 
<file>file-local-d1e1d2f4-f54e-11ee-9e25-fa2600005a22</file> 
INFO - [LLM][End] ERNIEBot finished running with output: 
    role: assistant 
    function_call: 
{ 
    "name": "highacc-ocr/v1.8/OCR", 
    "thoughts": "⽤户需要我识别图⽚中的第⼀个英语单词，并进⾏解释后添加到单词本中。这个任务可以拆 
    解为两个⼦任务：1）使⽤OCR⼯具识别图⽚中的⽂字；2）将识别到的英语单词添加到单词本中。任务拆解： 
    [sub-task1: 使⽤OCR⼯具识别图⽚中的⽂字，sub-task2: 将识别到的英语单词添加到单词本中]。⾸先， 
我需要调⽤[highacc-ocr/v1.8/OCR]⼯具来识别图⽚中的⽂字。", 
"arguments": "{\"image\":\"file-local-d1e1d2f4-f54e-11ee-9e25- 
fa2600005a22\",\"language_type\":\"ENG\"}" 
} 
INFO - [Tool][Start] RemoteTool is about to start running with input: 
    { 
        "image": "file-local-d1e1d2f4-f54e-11ee-9e25-fa2600005a22", 
        "language_type": "ENG" 
    } 
INFO - [Tool][End] RemoteTool finished running with output: 
    { 
        "words_result": [ 
            "BEWARE", 
            "OF FALLING", 
            "COCONUTS" 
        ], 
        "words_result_num": 3 
    } 
INFO - [LLM][Start] ERNIEBot is about to start running with input: 
    role: function 
    name: highacc-ocr/v1.8/OCR 
    content: {"words_result": ["BEWARE", "OF FALLING", "COCONUTS"], 
              "words_result_num": 3} 
INFO - [LLM][End] ERNIEBot finished running with output: 
    role: assistant 
    function_call: 
{ 
    "name": "AddWordTool", 
    "thoughts": "我已经成功识别了图⽚中的⽂字，现在我需要调⽤[AddWordTool]⼯具将识别到的英语单词
添加到单词本中。",
 "arguments": "{\"word\":\"BEWARE\",\"des\":\"BEWARE的详细释义\"}"
} 
INFO - [Tool][Start] AddWordTool is about to start running with input:
{
 "word": "BEWARE",
 "des": "BEWARE的详细释义"
}
INFO - [Tool][End] AddWordTool finished running with output:
{
 "result": "<BEWARE>单词已添加成功, 当前单词本中有如下单词：BEWARE，释义如下BEWARE的详细释
义"
}
INFO - [LLM][Start] ERNIEBot is about to start running with input:
 role: function 
 name: AddWordTool 
 content: {"result": "<BEWARE>单词已添加成功, 当前单词本中有如下单词：BEWARE，释义如下
BEWARE的详细释义"} 
INFO - [LLM][End] ERNIEBot finished running with output:
 role: assistant 
 content: 根据您提供的图⽚，我们已经成功识别了第⼀个英语单词"BEWARE"，并且将其详细释义添加到了单
词本中。⽬前单词本中有如下单词：BEWARE，释义为BEWARE的详细释义。如果您还有其他需求或者问题，
可... 
INFO - [Run][End] FunctionAgent finished running.
根据您提供的图⽚，我们已经成功识别了第⼀个英语单词"BEWARE"，并且将其详细释义添加到了单词本中。⽬前
单词本中有如下单词：BEWARE，释义为BEWARE的详细释义。如果您还有其他需求或者问题，可以随时告诉我，
我会尽⼒为您提供帮助。
{'BEWARE': 'BEWARE的详细释义'}

原创声明：本文系作者授权腾讯云开发者社区发表，未经许可，不得转载。

如有侵权，请联系 cloudcommunity@tencent.com 删除。

LLM

原创声明：本文系作者授权腾讯云开发者社区发表，未经许可，不得转载。

如有侵权，请联系 cloudcommunity@tencent.com 删除。

LLM

登录后参与评论

0 条评论

热度