Build Autonomous AI Agents with Function Calling | by Julian Yip | Apr, 2024

Transform your chatbot into an agent that can interact with external APIs

Function Calling is not something new. In July 2023, OpenAI launched Function Calling for their GPT models, a feature now being adopted by competitors. Google's Gemini API recently started supporting it, and Anthropic is integrating it into Claude. Function Calling is becoming essential for large language models (LLMs), enhancing their capabilities. All the more useful to learn this technique!

With this in mind, I aim to write a comprehensive tutorial covering Function Calling beyond basic introductions (there are already plenty of tutorials for that). The focus will be on practical implementation: building a fully autonomous AI agent and integrating it with Streamlit for a ChatGPT-like interface. Although OpenAI is used for demonstration, this tutorial can be easily adapted for other LLMs supporting Function Calling, such as Gemini.

Function Calling allows developers to describe functions (aka tools; you can think of these as actions for the model to take, like performing a calculation or placing an order), and have the model intelligently choose to output a JSON object containing the arguments to call those functions. In simpler terms, it allows for:

  • Autonomous decision making: Models can intelligently choose tools to answer questions.
  • Reliable parsing: Responses are in JSON format, instead of the more typical dialogue-like response. It might not seem like much at first glance, but this is what allows LLMs to connect to external systems, say via APIs with structured inputs.
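
For illustration, instead of a free-text reply, the model may return a structured tool call along these lines (illustrative only; OpenAI's exact format appears in Step 3):

{
  "name": "get_items",
  "arguments": {"categories": "Food and beverages"}
}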

It opens up numerous possibilities:

  • Autonomous AI assistants: Bots can interact with internal systems for tasks like customer orders and returns, beyond providing answers to enquiries
  • Personal research assistants: Say you are planning your trip; assistants can search the web, crawl content, compare options, and summarize results in Excel.
  • IoT voice commands: Models can control devices or suggest actions based on detected intents, such as adjusting the AC temperature.

Borrowing from Gemini's Function Calling documentation, Function Calling has the structure below, which works the same way in OpenAI

Image from Gemini's Function Calling documentation
  1. User issues a prompt to the application
  2. Application passes the user-provided prompt and the Function Declaration(s), which is a description of the tool(s) the model could use
  3. Based on the Function Declaration, the model suggests the tool to use, and the relevant request parameters. Notice the model outputs the suggested tool and parameters ONLY, WITHOUT actually calling the functions
  4. & 5. Based on the response, the application invokes the relevant API
  6. & 7. The response from the API is fed into the model again to output a human-readable response
  8. Application returns the final response to the user, then repeats from 1. (see the pseudocode sketch right after this list)
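
Stripped of provider specifics, the flow can be sketched as pseudocode (the helper names call_model, invoke_api and tool_message are made up; the concrete version is built step by step below):

# Pseudocode of the Function Calling loop; helper names are made up
messages = [system_prompt, user_prompt]                  # Steps 1 & 2
response = call_model(messages, tools=declarations)      # Step 3: model suggests tool + params
while response.is_tool_call:
    result = invoke_api(response.tool_name, response.tool_args)  # Steps 4 & 5
    messages.append(tool_message(result))                # Step 6: feed the API response back
    response = call_model(messages, tools=declarations)  # Step 7: human-readable answer
reply_to_user(response.content)                          # Step 8: return the final response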

This might sound convoluted, but the concept will be illustrated in detail with examples

Before diving into the code, a few words about the demo application's architecture

Solution

Here we build an assistant for tourists visiting a hotel. The assistant has access to the following tools, which allow it to reach external applications.

  • get_items, purchase_item: Connect to the product catalog stored in a database via API, for retrieving the item list and making a purchase respectively
  • rag_pipeline_func: Connect to a document store with Retrieval Augmented Generation (RAG) to obtain information from unstructured texts, e.g. the hotel's brochures

Tech stack

Now let's begin!

Preparation

Head over to GitHub to clone my code. The contents below can be found in the function_calling_demo Notebook.

Please also create and activate a virtual environment, then pip install -r requirements.txt to install the required packages

Initialization

We first connect to OpenRouter. Alternatively, using the original OpenAIChatGenerator without overwriting the api_base_url would also work, provided you have an OpenAI API key

import os
from dotenv import load_dotenv
from haystack.components.generators.chat import OpenAIChatGenerator
from haystack.utils import Secret
from haystack.dataclasses import ChatMessage
from haystack.components.generators.utils import print_streaming_chunk

# Set your API key as environment variable before executing this
load_dotenv()
OPENROUTER_API_KEY = os.environ.get('OPENROUTER_API_KEY')

chat_generator = OpenAIChatGenerator(api_key=Secret.from_env_var("OPENROUTER_API_KEY"),
                                     api_base_url="https://openrouter.ai/api/v1",
                                     model="openai/gpt-4-turbo-preview",
                                     streaming_callback=print_streaming_chunk)

Then we test whether the chat_generator can be successfully invoked

chat_generator.run(messages=[ChatMessage.from_user("Return this text: 'test'")])
---------- The response should look like this ----------
{'replies': [ChatMessage(content="'test'", role=<ChatRole.ASSISTANT: 'assistant'>, name=None, meta={'model': 'openai/gpt-4-turbo-preview', 'index': 0, 'finish_reason': 'stop', 'usage': {}})]}

Step 1: Establish data stores

Here we establish connections between our application and the two data sources: the document store for unstructured texts, and the application database via API

Index Documents with a Pipeline

We provide sample texts in documents for the model to perform Retrieval Augmented Generation (RAG). The texts are turned into embeddings and stored in an in-memory document store

from haystack import Pipeline, Document
from haystack.document_stores.in_memory import InMemoryDocumentStore
from haystack.components.writers import DocumentWriter
from haystack.components.embedders import SentenceTransformersDocumentEmbedder

# Sample documents
documents = [
    Document(content="Coffee shop opens at 9am and closes at 5pm."),
    Document(content="Gym room opens at 6am and closes at 10pm.")
]

# Create the document store
document_store = InMemoryDocumentStore()

# Create a pipeline to turn the texts into embeddings and store them in the document store
indexing_pipeline = Pipeline()
indexing_pipeline.add_component(
    "doc_embedder", SentenceTransformersDocumentEmbedder(model="sentence-transformers/all-MiniLM-L6-v2")
)
indexing_pipeline.add_component("doc_writer", DocumentWriter(document_store=document_store))

indexing_pipeline.connect("doc_embedder.documents", "doc_writer.documents")

indexing_pipeline.run({"doc_embedder": {"documents": documents}})

It should output this, corresponding to the sample documents we created

{'doc_writer': {'documents_written': 2}}

Spin up the API server

An API server made with Flask is created under db_api.py to connect to SQLite. Please spin it up by running python db_api.py in your terminal

This would be shown in the terminal, if successfully executed

Also notice that some initial data has been added in db_api.py

Sample data in the database
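
db_api.py itself is in the repo, but to make the idea concrete, here is a hypothetical, stripped-down sketch of the endpoints it exposes (the endpoint shapes are assumed from how the client functions in Step 2 call them; the real file is backed by SQLite rather than an in-memory list):

# Hypothetical, stripped-down sketch of db_api.py; the real file is in the repo
from flask import Flask, request, jsonify

app = Flask(__name__)

# In-memory stand-in for the SQLite tables
ITEMS = [{"id": "1", "name": "Coffee", "category": "Food and beverages", "price": 5}]

@app.route('/category')
def get_categories():
    # Return the distinct item categories
    return jsonify(sorted({item["category"] for item in ITEMS}))

@app.route('/item')
def get_items():
    # Optionally filter by comma-separated ids / categories passed as query params
    ids = request.args.get('id')
    categories = request.args.get('category')
    results = [i for i in ITEMS
               if (not ids or i["id"] in ids.split(','))
               and (not categories or i["category"] in categories.split(','))]
    return jsonify(results)

@app.route('/item/purchase', methods=['POST'])
def purchase_item():
    # Record the purchase and echo it back
    order = request.get_json()
    return jsonify({"id": order["id"], "quantity": order["quantity"], "status": "purchased"})

if __name__ == '__main__':
    app.run()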

Step 2: Define the functions

Here we prepare the actual functions for the model to invoke AFTER Function Calling (Steps 4–5 as described in The Structure of Function Calling)

RAG function

Specifically the rag_pipeline_func. This is for the model to provide an answer by searching through the texts stored in the document store. We first define the RAG retrieval as a Haystack pipeline

from haystack.components.embedders import SentenceTransformersTextEmbedder
from haystack.components.retrievers.in_memory import InMemoryEmbeddingRetriever
from haystack.components.builders import PromptBuilder
from haystack.components.generators import OpenAIGenerator

template = """
Answer the questions based on the given context.

Context:
{% for doc in documents %}
    {{ doc.content }}
{% endfor %}
Question: {{ question }}
Answer:
"""
rag_pipe = Pipeline()
rag_pipe.add_component("embedder", SentenceTransformersTextEmbedder(model="sentence-transformers/all-MiniLM-L6-v2"))
rag_pipe.add_component("retriever", InMemoryEmbeddingRetriever(document_store=document_store))
rag_pipe.add_component("prompt_builder", PromptBuilder(template=template))
# Note: We use OpenAIGenerator, not OpenAIChatGenerator, because the latter expects a list of chat messages and cannot accept prompt_builder's str output
rag_pipe.add_component("llm", OpenAIGenerator(api_key=Secret.from_env_var("OPENROUTER_API_KEY"),
                                              api_base_url="https://openrouter.ai/api/v1",
                                              model="openai/gpt-4-turbo-preview"))

rag_pipe.connect("embedder.embedding", "retriever.query_embedding")
rag_pipe.connect("retriever", "prompt_builder.documents")
rag_pipe.connect("prompt_builder", "llm")

Test if the function works

query = "When does the coffee shop open?"
rag_pipe.run({"embedder": {"text": query}, "prompt_builder": {"question": query}})

This should yield the following output. Notice that the answer the model gave is drawn from the sample documents we provided before

{'llm': {'replies': ['The coffee shop opens at 9am.'],
'meta': [{'model': 'openai/gpt-4-turbo-preview',
'index': 0,
'finish_reason': 'stop',
'usage': {'completion_tokens': 9,
'prompt_tokens': 60,
'total_tokens': 69,
'total_cost': 0.00087}}]}}

We can then turn the rag_pipe into a function, which provides the reply only, without the other details

def rag_pipeline_func(query: str):
    result = rag_pipe.run({"embedder": {"text": query}, "prompt_builder": {"question": query}})

    return {"reply": result["llm"]["replies"][0]}
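
A quick sanity check of the wrapper (assuming the indexing pipeline above has run; the exact wording of the reply varies between runs):

rag_pipeline_func("When does the gym room open?")
# e.g. {'reply': 'The gym room opens at 6am.'}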

API calls

We define the get_items and purchase_item functions for interacting with the database

# Flask's default local URL, change it if necessary
db_base_url = 'http://127.0.0.1:5000'

# Use requests to get the data from the database
import requests
import json

# get_categories is supplied as part of the prompt, it is not used as a tool
def get_categories():
    response = requests.get(f'{db_base_url}/category')
    data = response.json()
    return data

def get_items(ids=None, categories=None):
    params = {
        'id': ids,
        'category': categories,
    }
    response = requests.get(f'{db_base_url}/item', params=params)
    data = response.json()
    return data

def purchase_item(id, quantity):
    headers = {
        'Content-type': 'application/json',
        'Accept': 'application/json'
    }

    data = {
        'id': id,
        'quantity': quantity,
    }
    response = requests.post(f'{db_base_url}/item/purchase', json=data, headers=headers)
    return response.json()
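
With db_api.py running, these can be smoke-tested directly; the exact output depends on the seed data in db_api.py:

print(get_categories())                            # e.g. ['Food and beverages', ...]
print(get_items(categories='Food and beverages'))  # matching items from the database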

Define the tool list

Now that we have defined the functions, we need to let the model recognize them and instruct it on how they are used, by providing descriptions for them.

Since we are using OpenAI here, the tools are formatted as below, following the format required by OpenAI

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_items",
            "description": "Get a list of items from the database",
            "parameters": {
                "type": "object",
                "properties": {
                    "ids": {
                        "type": "string",
                        "description": "Comma separated list of item ids to fetch",
                    },
                    "categories": {
                        "type": "string",
                        "description": "Comma separated list of item categories to fetch",
                    },
                },
                "required": [],
            },
        }
    },
    {
        "type": "function",
        "function": {
            "name": "purchase_item",
            "description": "Purchase a particular item",
            "parameters": {
                "type": "object",
                "properties": {
                    "id": {
                        "type": "string",
                        "description": "The given product ID, product name is not accepted here. Please obtain the product ID from the database first.",
                    },
                    "quantity": {
                        "type": "integer",
                        "description": "Number of items to purchase",
                    },
                },
                "required": [],
            },
        }
    },
    {
        "type": "function",
        "function": {
            "name": "rag_pipeline_func",
            "description": "Get information from hotel brochure",
            "parameters": {
                "type": "object",
                "properties": {
                    "query": {
                        "type": "string",
                        "description": "The query to use in the search. Infer this from the user's message. It should be a question or a statement",
                    }
                },
                "required": ["query"],
            },
        },
    }
]

Step 3: Putting it all together

We now have the necessary inputs to test Function Calling! Here we do a few things:

  1. Provide the initial prompt to the model, to give it some context
  2. Provide a sample user-generated message
  3. Most importantly, we pass the tool list to the chat generator in tools
# 1. Initial prompt
context = f"""You are an assistant to tourists visiting a hotel.
You have access to a database of items (which includes {get_categories()}) that tourists can buy, you also have access to the hotel's brochure.
If the tourist's question cannot be answered from the database, you can refer to the brochure.
If the tourist's question cannot be answered from the brochure, you can ask the tourist to ask the hotel staff.
"""
messages = [
    ChatMessage.from_system(context),
    # 2. Sample message from user
    ChatMessage.from_user("Can I buy a coffee?"),
]

# 3. Pass the tools list and invoke the chat generator
response = chat_generator.run(messages=messages, generation_kwargs={"tools": tools})
response

---------- Response ----------
{'replies': [ChatMessage(content='[{"index": 0, "id": "call_AkTWoiJzx5uJSgKW0WAI1yBB", "function": {"arguments": "{\"categories\":\"Food and beverages\"}", "name": "get_items"}, "type": "function"}]', role=<ChatRole.ASSISTANT: 'assistant'>, name=None, meta={'model': 'openai/gpt-4-turbo-preview', 'index': 0, 'finish_reason': 'tool_calls', 'usage': {}})]}

Now let's examine the response. Notice how Function Calling returns both the function chosen by the model, and the arguments for invoking the chosen function.

function_call = json.loads(response["replies"][0].content)[0]
function_name = function_call["function"]["name"]
function_args = json.loads(function_call["function"]["arguments"])
print("Function Name:", function_name)
print("Function Arguments:", function_args)
---------- Response ----------
Function Name: get_items
Function Arguments: {'categories': 'Food and beverages'}

When presented with another question, the model will use another tool that is more relevant

# Another question
messages.append(ChatMessage.from_user("Where's the coffee shop?"))

# Invoke the chat generator, passing the tools list
response = chat_generator.run(messages=messages, generation_kwargs={"tools": tools})
function_call = json.loads(response["replies"][0].content)[0]
function_name = function_call["function"]["name"]
function_args = json.loads(function_call["function"]["arguments"])
print("Function Name:", function_name)
print("Function Arguments:", function_args)

---------- Response ----------
Function Name: rag_pipeline_func
Function Arguments: {'query': "Where's the coffee shop?"}

Again, notice that no actual function is invoked here; that is what we'll do next!

Calling the function

We can then feed the arguments into the chosen function

## Find the corresponding function and call it with the given arguments
available_functions = {"get_items": get_items, "purchase_item": purchase_item, "rag_pipeline_func": rag_pipeline_func}
function_to_call = available_functions[function_name]
function_response = function_to_call(**function_args)
print("Function Response:", function_response)
---------- Response ----------
Function Response: {'reply': 'The provided context does not specify a physical location for the coffee shop, only its operating hours. Therefore, I cannot determine where the coffee shop is located based on the given information.'}

The response from rag_pipeline_func can then be passed as context to the chat by appending it under the messages, for the model to provide the final answer

messages.append(ChatMessage.from_function(content=json.dumps(function_response), name=function_name))
response = chat_generator.run(messages=messages)
response_msg = response["replies"][0]

print(response_msg.content)

---------- Response ----------
For the location of the coffee shop within the hotel, I recommend asking the hotel staff directly. They will be able to guide you to it accurately.

We have now completed the chat cycle!

Step 4: Turn it into an interactive chat

The code above shows how Function Calling can be done, but we want to go a step further by turning it into an interactive chat

Here I showcase two methods to do it, from the more primitive input() that prints the dialogue into the notebook itself, to rendering it through Streamlit to provide a ChatGPT-like UI

input() loop

The code is copied from Haystack's tutorial, which allows us to quickly test the model. Note: This application is created to demonstrate the idea of Function Calling, and is NOT meant to be perfectly robust, e.g. supporting the ordering of multiple items at the same time, preventing hallucination, etc.

import json
from haystack.dataclasses import ChatMessage, ChatRole

response = None
messages = [
    ChatMessage.from_system(context)
]

while True:
    # if OpenAI response is a tool call
    if response and response["replies"][0].meta["finish_reason"] == "tool_calls":
        function_calls = json.loads(response["replies"][0].content)

        for function_call in function_calls:
            ## Parse function calling information
            function_name = function_call["function"]["name"]
            function_args = json.loads(function_call["function"]["arguments"])

            ## Find the corresponding function and call it with the given arguments
            function_to_call = available_functions[function_name]
            function_response = function_to_call(**function_args)

            ## Append function response to the messages list using `ChatMessage.from_function`
            messages.append(ChatMessage.from_function(content=json.dumps(function_response), name=function_name))

    # Regular conversation
    else:
        # Append assistant messages to the messages list
        if not messages[-1].is_from(ChatRole.SYSTEM):
            messages.append(response["replies"][0])

        user_input = input("ENTER YOUR MESSAGE 👇 INFO: Type 'exit' or 'quit' to stop\n")
        if user_input.lower() == "exit" or user_input.lower() == "quit":
            break
        else:
            messages.append(ChatMessage.from_user(user_input))

    response = chat_generator.run(messages=messages, generation_kwargs={"tools": tools})

Running interactive chats in the IDE

While it works, we might want something that looks nicer.

Streamlit interface

Streamlit turns data scripts into shareable web apps, which provides a neat UI for our application. The code shown above is adapted into a Streamlit application under the streamlit folder of my repo
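
The full app is in the repo; as a rough idea of its shape, a heavily simplified, hypothetical sketch of app.py might look like the below (assuming context, chat_generator and tools are defined as in the notebook; the real app also runs the tool-call loop shown above):

# Hypothetical, simplified sketch of streamlit/app.py; the real app also handles tool calls
import streamlit as st
from haystack.dataclasses import ChatMessage

st.title("Hotel Assistant")

# Keep the conversation across Streamlit reruns
if "messages" not in st.session_state:
    st.session_state.messages = [ChatMessage.from_system(context)]

# Render the dialogue so far, skipping the system prompt
for msg in st.session_state.messages[1:]:
    st.chat_message(msg.role.value).write(msg.content)

# On each user turn, invoke the chat generator with the tools list
if user_input := st.chat_input("Ask me anything about the hotel"):
    st.session_state.messages.append(ChatMessage.from_user(user_input))
    response = chat_generator.run(messages=st.session_state.messages,
                                  generation_kwargs={"tools": tools})
    st.session_state.messages.append(response["replies"][0])
    st.rerun()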

You can run it by:

  1. If you have not done so already, spin up the API server with python db_api.py
  2. Set the OPENROUTER_API_KEY as an environment variable, e.g. export OPENROUTER_API_KEY='@REPLACE WITH YOUR API KEY', assuming you are on Linux / executing with git bash
  3. Navigate to the streamlit folder in the terminal with cd streamlit
  4. Run Streamlit with streamlit run app.py. A new tab should automatically open in your browser running the application

That's basically it! I hope you enjoy this article.

Streamlit UI

*Unless otherwise noted, all images are by the author
