Module ai.agent

ballerinax/ai.agent Ballerina library

Deprecated0.5.0

Overview

This module provides the functionality required to build ReAct agent using Large Language Models (LLMs).

Prerequisites

Before using this module in your Ballerina application, complete the following:

Create an OpenAI account.
Obtain an API key by following these instructions.

Alternatively, it is possible to use an Azure OpenAI account by completing the following steps.

Create an Azure account.
Create an Azure OpenAI resource.
Obtain the tokens. Refer to the Azure OpenAI Authentication guide to learn how to generate and use tokens.

Tool

A tool refers to a single action used to retrieve, process, or manipulate data. It can be a function or an API call, which may require certain inputs following a specific input schema.

Function as a Tool

When using a Ballerina function as a tool, the function should adhere to the following template:


isolated function functionName(record parameters) returns anydata|error {
    // function body 
}

In this template, record parameters represents a Ballerina record that contains the input parameters for the function. If the function doesn't require any inputs, it can be defined without any parameters. The function has the flexibility to return any data type or an error. It is important to note that the function needs to be an isolated function to ensure concurrency safety.

To define a tool using the above function, you can use the following syntax:


agent:Tool exampleTool = {
    name: "exampleTool", // used as an identifier 
    description: "defines the purpose of the function", // provides information about the behavior
    inputSchema: {
        // a JSON schema that defines the inputs to the function (if applicable)
    },
    caller: functionName // a pointer to the function
}

HTTP Resource as a Tool

To use an API resource as a tool, an HTTP tool definition can be created as follows.


agent:HttpTool httpResourceTool = {
    name: "exampleTool", // used as an identifier 
    description: "defines the purpose of the API resource", // provides information about the behavior
    path: "/path/resourceA/" // path to the resource
    method: "get" // the HTTP request method (e.g., GET, POST, DELETE, PUT, etc.)
    queryParameters: {
        // a JSON schema defining the query parameters of the HTTP resource
    }
    pathParameters: {
        // a JSON schema defining path parameters of the HTTP resource
    }
    requestBody: {
        // a JSON schema defining the request body of the HTTP resource
    }
}

Tools from Interface Definition Languages (IDLs)

You can automatically extract tools from a valid OpenAPI specification (3.x) file using the extractToolsFromOpenApiSpecFile function, as demonstrated below:


string openApiPath = "<PATH TO THE JSON/YAML FILE>"
agent:HttpTool[] tools = extractToolsFromOpenApiSpecFile(openApiPath)

The file containing the OpenAPI specification should be in either JSON or YAML format. To load them using a map<json> field, use extractToolsFromOpenApiJsonSpec instead of the above.

Tool Input Schema

The tool utilizes a JSON schema to define the input schema. This schema specifies the expected structure of the Ballerina record required by the Ballerina function, as well as the parameters (query/path) and payload for an HTTP API call.

For example, the input schema for a Ballerina record can be defined as follows:

Ballerina record:


type SendEmailInput record {|
    string recipient = "<DEFAULT EMAIL>"; // should be an email address from the contacts
    string subject;
    string messageBody;
    string contentType?;
|};

JSON input schema:


agent:InputSchema schema = {
       'type: agent:OBJECT,
       properties: {
           recipient: {
               'type: agent:STRING, 
               description: "should be an email address from the contacts", 
               default: "<DEFAULT EMAIL>"
           },
           subject: {'type: agent:STRING},
           messageBody: {'type: agent:STRING},
           contentType: {'const: "text/plain"} // a constant value 
       }

ToolKit

A Toolkit is a highly valuable asset when it comes to organizing a collection of tools that share common attributes. Not only does it provide organization, but it also offers the flexibility to extend and define new types of tools.

To illustrate this point, let's consider an HTTP service that encompasses multiple resources. Typically, these resources share the same service URL and client configurations. In such cases, utilizing an HttpServiceToolKit allows for the convenient grouping of all the HttpTool records associated with the resources of that specific service.

Furthermore, the HttpServiceToolKit extends the definition of a Tool to encompass HttpTool specifics, effectively encapsulating HTTP-related details. By interpreting an HttpTool as a Tool, the HttpServiceToolKit eliminates the need for additional effort in writing separate Tools for HTTP services. This streamlined interpretation simplifies the development process and saves valuable time.


agent:HttpTool resource1 = {
    // defines resource 1
}

...

agent:HttpTool resourceN = {
    // defines resource N
}
agent:HttpServiceToolKit serviceAToolKit = check new (
    serviceUrl, 
    [resource1,...,resourceN], 
    httpClientConfigs, 
    httpHeaders
);

Model

This is a large language model (LLM) instance. Currently, the agent module has support for the following LLM APIs.

OpenAI GPT3


agent:Gpt3Model model = check new ({auth: {token: <OPENAI API KEY>}});

OpenAI ChatGPT (e.g. GPT3.5, GPT4)


agent:ChatGptModel model = check new ({auth: {token: <OPENAI API KEY>}});

Azure OpenAI GPT3


agent:AzureGpt3Model model = check new ({auth: {apiKey: <AZURE OPENAI API KEY>}}, string serviceUrl, string deploymentId, string apiVersion);

Azure OpenAI ChatGPT (e.g. GPT3.5, GPT4)


agent:AzureChatGptModel model = check new ({auth: {apiKey: <AZURE OPENAI API KEY>}}, string serviceUrl, string deploymentId, string apiVersion);

Extending `LlmModel` for Custom Models

This module offers extended support for utilizing other LLMs by extending the LlmModel as demonstrated below:


isolated class NewLlmModel {
    *agent:LlmModel; // extends LlmModel

    // Implement the init method to initialize the connection with the new LLM (if required)

    public isolated function generate(agent:PromptConstruct prompt) returns string|error {
        // Utilize utilities to create a completion prompt (or chat prompt) if applicable
        string completionPrompt = agent:createCompletionPrompt(prompt);
        // Add logic to call the LLM with the completionPrompt
        // Return the generated text from the LLM
    }
}

By extending LlmModel, the NewLlmModel gains the ability to interface with other LLMs seamlessly. To utilize NewLlmModel, you can follow a similar approach as with other built-in LLM models. This allows you to harness the power of custom LLMs while maintaining compatibility with existing functionality.

Agent

The agent facilitates the execution of natural language (NL) commands by leveraging the reasoning and text generation capabilities of LLMs (Language Models). It follows the ReAct framework:

To create an agent, you need an LLM model and a set of Tool (or ToolKit) definitions.


(agent.Tool|agent.BaseToolKit)[] tools = [
    //tools and toolkits
]
agent.Agent agent = check new (model, ...tools);

There are multiple ways to utilize the agent.

1. Agent.run() for Batch Execution

The agent can be executed without interruptions using Agent.run(). It attempts to fully execute the given NL command and returns the results at each step.


agent:ExecutionStep[] execution = agent.run("<NL COMMAND>", maxIter = 10);

2. `AgentIterator` for `foreach` Execution

The agent can also act as an iterator, providing reasoning and output from the tool at each step while executing the command.


agent:AgentIterator agentIterator = agent.getIterator("<NL COMMAND>");
foreach agent:ExecutionStep|error step in agentIterator{
    // logic goes here
    // can decide whether to continue/rollback/exit the loop based on the observation from the tool
}

3. `AgentExecutor` for Reason-Act Interface

The AgentExecutor offers enhanced flexibility for running agents through its reason() and act(string thought) methods. This separation of reasoning and acting enables developers to obtain user confirmation before executing actions based on the agent's reasoning. This feature is particularly valuable for verifying, validating, or refining the agent's reasoning by incorporating user intervention or feedback as new observations, which can be achieved using the update(ExecutionStep step) method of AgentExecutor.

Additionally, this approach empowers users to manipulate the execution trace of the agent based on specific requirements by modifying the records of previous execution steps. This capability becomes handy in situations where certain steps need to be excluded during execution (e.g., unsuccessful or outdated steps). Moreover, manual execution can be performed selectively, such as handling specific errors or acquiring user inputs. The AgentExecutor allows you to customize the execution trace to suit your needs effectively.


string QUERY = "<NL COMMAND>";
agent.AgentExecutor agentExecutor = agent.getExecutor(QUERY);
while(agentExecutor.hasNext()){
    string|error thought = agentExecutor.reason(); // reasoning step
    if thought is error {
        // reasoning fails due to LLM error. Handle appropriately
        break;
    }
    // <OPTIONAL> based on the reasoning user can decide whether to proceed with the action
    // possible to validate the thought, improve it, or get user confirmation to proceed with the action
    any|error observation = agentExecutor.act(thought); // acting step
    if observation is error {
        // error returned by the tool. Handle appropriately
        // handle the error using another tool if needed tool
        
        // <OPTIONAL> restart the execution after manipulating the trace
        agent.ExecutionStep[] trace = agentExecutor.getPromptConstruct().history;
        // manipulate the traces if required (e.g. remove unnecessary steps, add manual steps)
        agentExecutor = agent.getExecutor(QUERY, trace); // restarts the execution from the last step
        break;
    }
}

Quickstart

Let's walk through the usage of the ai.agent library using this sample. The example demonstrates the use of two types of tools:

To send a Google email, we utilize the sendMessage function from the ballerinax/googleapis.gmail connector as a tool.
HttpTool records are used to create and list WiFi accounts through the GuestWiFi HTTP service.
- List available WiFi accounts:GET /guest-wifi-accounts/{ownerEmail}
- Create a new WiFi account: POST /guest-wifi-accounts

By following the four steps below, we can easily configure and run an agent:

Step 1 - Import Library


import ballerinax/ai.agent;
import ballerinax/googleapis.gmail;

Step 2 - Defining Tools for the Agent

To begin, we need to define a gmail->sendMessage function as a tool. However, it's not possible to define a tool for a remote function directly without a wrapper function. If you attempt to do so, you won't be able to obtain the pointer for the remote function. Therefore, we start by creating the sendEmail function, which wraps the connector action gmail->sendMessage.


isolated function sendEmail(gmail:MessageRequest messageRequest) returns string|error {
    gmail:Client gmail = check new ({auth: {token: gmailToken}});
    gmail:Message|error sendMessage = gmail->sendMessage(messageRequest);
    if sendMessage is gmail:Message {
        return sendMessage.toString();
    }
    return "Error while sending the email" + sendMessage.message();
}

Now that we have the sendEmail function defined, we can proceed with creating the tool that utilizes this function. To define the inputSchema for the tool, we inspect the structure of the gmail:MessageRequest record and include only the necessary fields required for our task. Since the rest of the fields are not mandatory for the tool's execution, we can safely ignore them.


agent:Tool sendEmailTool = {
    name: "Send mail",
    description: "useful to send emails to a given recipient",
    inputSchema: {
        properties: {
            recipient: {'type: agent:STRING},
            subject: {'type: agent:STRING},
            messageBody: {'type: agent:STRING},
            contentType: {'const: "text/plain"}
        }
    },
    caller: sendMail
};

Next, define a HttpTool record for the resources of the GuestWiFi HTTP service. Then use HttpServiceToolKit to create a toolkit for that HTTP service. While creating the HttpTool record, there is no need to explicitly define pathParameters since the Agent can automatically extract them from the provided path.


agent:HttpTool listWifiHttpTool = {
    name: "List wifi",
    path: "/guest-wifi-accounts/{ownerEmail}",
    method: agent:GET,
    description: "useful to list the guest wifi accounts."
};

agent:HttpTool createWifiHttpTool = {
    name: "Create wifi",
    path: "/guest-wifi-accounts",
    method: agent:POST,
    description: "useful to create a guest wifi account.",
    requestBody: {
        'type: agent:OBJECT,
        properties: {
            email: {'type: agent:STRING},
            username: {'type: agent:STRING},
            password: {'type: agent:STRING}
        }
    }
};   

agent:HttpServiceToolKit wifiServiceToolKit = check new (wifiServiceUrl, [listWifiHttpTool, createWifiHttpTool], {
    auth: {
        tokenUrl: wifiServiceTokenUrl,
        clientId: wifiServiceClientId,
        clientSecret: wifiServiceClientSecret
    }
});

Note that when creating the HttpServiceToolKit for the GuestWiFi service, we provide the service URL and authentication configurations to the HttpServiceToolKit initializer to establish the connection with the service.

Step 3 - Create the Agent

To create the agent, we first need to initialize a LLM (e.g., Gpt3Model, ChatGptModel). In this example, we initialize the agent with the ChatGptModel model as follows:


agent:ChatGptModel model = check new ({auth: {token:  <OPENAI API KEY>}});
agent:Agent agent = check new (model, wifiServiceToolKit, sendEmailTool);

Step 4 - Run the Agent

Now we can run the agent with NL commands from the user. Note that in this case, we use a query template and pass unknowns as interpolations to the queryTemplate.


string queryTemplate = string`create a new guest WiFi account for email ${wifiOwnerEmail} with user ${wifiUsername} and password ${wifiPassword}. Send the available list of WiFi accounts for that email to ${recipientEmail}`;
agent:ExecutionStep[] run = agent.run(query);

Output

Let's examine the output produced by the above example. Assuming the following natural language (NL) command is given to the agent:

NL Command: "create a new guest WiFi account for email johnny@wso2.com with user guest123 and password john123. Send the available list of WiFi accounts for that email to alexa@wso2.com"

The agent will proceed with multiple reasoning-action iterations as follows to execute the given command.

Agent creates a new WiFi account for owner johnny@wso2.com:

Reasoning iteration: 1
Thought: We need to create a new guest WiFi account with the given username and password, and then list the available WiFi accounts for the email owner and send it to a specified recipient. 
Action: 
```
{
    "tool": "Create wifi",
    "tool_input": {
        "requestBody": {
        "email": "johnny@wso2.com",
        "username": "guest123",
        "password": "john123"
        }
    }
}
```
Observation: Successfully added the wifi account

Agent finds existing guest WiFi accounts under the owner johnny@wso2.com:

Reasoning iteration: 2
Thought: Now we need to use the "List wifi" tool to get the available list of wifi accounts for the email "alexa@wso2.com".
Action:
```
{
    "tool": "List wifi",
    "tool_input": {
        "pathParameters": {
            "ownerEmail": "johnny@wso2.com"
        }
    }
}
```
Observation: ["guest123.guestOf.johnny","newGuest.guestOf.johnny"]

Agent sends an email to alexa@wso2.com with the information about the existing accounts:

In this step, the agent is responsible for generating the email subject and message body as well. The user provides only the recipient's email.

Reasoning iteration: 3
Thought: Finally, we need to send the available wifi list to the specified recipient.
Action:
```
{
    "tool": "Send mail",
    "tool_input": {
        "recipient": "alexa@wso2.com",
        "subject": "Available Wifi List",
        "messageBody": "The available wifi accounts for johnny@wso2.com are: guest123.guestOf.johnny, newGuest.guestOf.johnny"
    }
}
```
Observation: {"threadId":"1884d1bda3d2c286","id":"1884d1bda3d2c286","labelIds":["SENT"]}

Agent concludes the task:

Reasoning iteration: 4
Thought: I now know the final answer
Final Answer: Successfully created a new guest wifi account with username "guest123" and password "john123" for the email owner "johnny@wso2.com". The available wifi accounts for "johnny@wso2.com" are "guest123.guestOf.johnny" and "newGuest.guestOf.johnny", and this list has been sent to the specified recipient "alexa@wso2.com".

As a result, alexa@wso2.com will receive an email generated by the agent with the subject "Available WiFi List" and the message body "The available WiFi accounts for johnny@wso2.com are: guest123.guestOf.johnny, newGuest.guestOf.johnny".

Functions

createChatPrompt

Isolated Function

function createChatPrompt(PromptConstruct prompt) returns ChatMessage[]

Generate a ReAct prompt for chat LLMs (e.g. ChatGPT, GPT4)

Parameters

prompt PromptConstruct - Prompt construct

Return Type

ChatMessage[] - ReAct prompt for chat LLMs

createCompletionPrompt

Isolated Function

function createCompletionPrompt(PromptConstruct prompt) returns string

Generate a ReAct prompt for completion LLMs (e.g. GPT3)

Parameters

prompt PromptConstruct - Prompt construct

Return Type

string - ReAct prompt for completion LLMs

extractToolsFromOpenApiJsonSpec

Isolated Function

function extractToolsFromOpenApiJsonSpec(map<json> openApiSpec, *AdditionInfoFlags additionInfoFlags) returns HttpApiSpecification & readonly|error

Extracts the Http tools from the given OpenAPI specification as a JSON

Parameters

openApiSpec map<json> - A valid OpenAPI specification in JSON format

additionInfoFlags *AdditionInfoFlags - Flags to extract additional information from the OpenAPI specification

Return Type

HttpApiSpecification & readonly|error - A record with the list of extracted tools and the service URL (if available)

extractToolsFromOpenApiSpecFile

Isolated Function

function extractToolsFromOpenApiSpecFile(string filePath, *AdditionInfoFlags additionInfoFlags) returns HttpApiSpecification & readonly|error

Extracts the Http tools from the given OpenAPI specification file.

Parameters

filePath string - Path to the OpenAPI specification file (should be JSON or YAML)

additionInfoFlags *AdditionInfoFlags - Flags to extract additional information from the OpenAPI specification

Return Type

HttpApiSpecification & readonly|error - A record with the list of extracted tools and the service URL (if available)

Classes

ai.agent: Agent

Isolated

ReAct Agent implementation to execute actions with LLMs.

Constructor

Initialize an Agent.

init (LlmModel model, (BaseToolKit|Tool)... tools)

model LlmModel - LLM model instance

tools (BaseToolKit|Tool)... -

getExecutor

Isolated Function

function getExecutor(string query, ExecutionStep[] previousSteps, string|map<json> context) returns AgentExecutor

Initialize the agent executor for a given query. Agent executor is useful for streaming-like execution of the agent or to make use of reason-act interface of the agent.

Parameters

query string - User's query

previousSteps ExecutionStep[] (default []) - Execution steps perviously taken by the agent for the query given

context string|map<json> (default {}) - Context information to be used by the LLM

Return Type

AgentExecutor - AgentExecutor instance

getIterator

Isolated Function

function getIterator(string query, string|map<json> context) returns AgentIterator

Initialize the agent iterator for a given query. Agent executor is useful for foreach execution of the agent.

Parameters

query string - User's query

context string|map<json> (default {}) - Context information to be used by the LLM

Return Type

AgentIterator - AgentIterator instance

run

Isolated Function

function run(string query, int maxIter, string|map<json> context, boolean verbose) returns ExecutionStep[]

Execute the agent for a given user's query.

Parameters

query string - Natural langauge commands to the agent

maxIter int (default 5) - No. of max iterations that agent will run to execute the task

context string|map<json> (default {}) - Context values to be used by the agent to execute the task

verbose boolean (default true) - If true, then print the reasoning steps

Return Type

ExecutionStep[] - Returns the execution steps tracing the agent's reasoning and outputs from the tools

ai.agent: AgentExecutor

hasNext

Isolated Function

function hasNext() returns boolean

Checks whether agent has more steps to execute.

Return Type

boolean - True if agent has more steps to execute, false otherwise

reason

Isolated Function

function reason() returns string|error

Reason the next step of the agent.

Return Type

string|error - Thought to be executed by the agent or an error if the reasoning failed

act

Isolated Function

function act(string thought) returns any|error

Execute the next step of the agent.

Parameters

thought string - Thought to be executed by the agent

Return Type

any|error - Observations from the tool can be any|error|null

update

Isolated Function

function update(ExecutionStep step)

Update the agent with the latest exectuion step.

Parameters

step ExecutionStep - Latest step to be added to the history

Isolated Function

function next() returns record {| value ExecutionStep|error |}?

Execute the next step of the agent.

Return Type

record {| value ExecutionStep|error |}? - A record with ExecutionStep or error

ai.agent: AgentIterator

iterator

function iterator() returns object {
        public function next() returns record {|ExecutionStep|error value;|}?;
    }

Iterate over the agent's execution steps.

Return Type

object { public function next() returns record {|ExecutionStep|error value;|}?; } - a record with the execution step or an error if the agent failed

ai.agent: AzureChatGptModel

Isolated

Constructor

Initializes the ChatGPT model with the given connection configuration and model configuration.

init (ConnectionConfig connectionConfig, string serviceUrl, string deploymentId, string apiVersion, ChatModelConfig modelConfig)

connectionConfig ConnectionConfig - Connection Configuration for OpenAI chat client

serviceUrl string - Service URL for Azure OpenAI service

deploymentId string - Deployment ID for Azure OpenAI model instance

apiVersion string - API version for Azure OpenAI model instance

modelConfig ChatModelConfig - Model Configuration for OpenAI chat client

chatComplete

Isolated Function

function chatComplete(ChatMessage[] messages, string? stop) returns string|error

Completes the given prompt using the ChatGPT model.

Parameters

messages ChatMessage[] - Messages to be completed

stop string? (default ()) - Stop sequence to stop the completion

Return Type

string|error - Completed message or error if the completion fails

generate

Isolated Function

function generate(PromptConstruct prompt) returns string|error

Generate ReAct response for the given prompt

Parameters

prompt PromptConstruct - Prompt construct

Return Type

string|error - ReAct response

Fields

modelConfig ChatModelConfig -

ai.agent: AzureGpt3Model

Isolated

Constructor

Initializes the GPT-3 model with the given connection configuration and model configuration.

init (ConnectionConfig connectionConfig, string serviceUrl, string deploymentId, string apiVersion, CompletionModelConfig modelConfig)

connectionConfig ConnectionConfig - Connection Configuration for Azure OpenAI text client

serviceUrl string - Service URL for Azure OpenAI service

deploymentId string - Deployment ID for Azure OpenAI model instance

apiVersion string - API version for Azure OpenAI model instance

modelConfig CompletionModelConfig {} - Model Configuration for Azure OpenAI text client

complete

Isolated Function

function complete(string prompt, string? stop) returns string|error

Completes the given prompt using the GPT3 model.

Parameters

prompt string - Prompt to be completed

stop string? (default ()) - Stop sequence to stop the completion

Return Type

string|error - Completed prompt or error if the completion fails

generate

Isolated Function

function generate(PromptConstruct prompt) returns string|error

Generate ReAct response for the given prompt

Parameters

prompt PromptConstruct - Prompt construct

Return Type

string|error - ReAct response

Fields

modelConfig CompletionModelConfig -

ai.agent: ChatGptModel

Isolated

Constructor

Initializes the ChatGPT model with the given connection configuration and model configuration.

init (ConnectionConfig connectionConfig, ChatModelConfig modelConfig)

connectionConfig ConnectionConfig - Connection Configuration for OpenAI chat client

modelConfig ChatModelConfig {} - Model Configuration for OpenAI chat client

chatComplete

Isolated Function

function chatComplete(ChatMessage[] messages, string? stop) returns string|error

Completes the given prompt using the ChatGPT model.

Parameters

messages ChatMessage[] - Messages to be completed

stop string? (default ()) - Stop sequence to stop the completion

Return Type

string|error - Completed message or error if the completion fails

generate

Isolated Function

function generate(PromptConstruct prompt) returns string|error

Generate ReAct response for the given prompt

Parameters

prompt PromptConstruct - Prompt construct

Return Type

string|error - ReAct response

Fields

modelConfig ChatModelConfig -

ai.agent: Gpt3Model

Isolated

Constructor

Initializes the GPT-3 model with the given connection configuration and model configuration.

init (ConnectionConfig connectionConfig, CompletionModelConfig modelConfig)

connectionConfig ConnectionConfig - Connection Configuration for OpenAI text client

modelConfig CompletionModelConfig {} - Model Configuration for OpenAI text client

complete

Isolated Function

function complete(string prompt, string? stop) returns string|error

Completes the given prompt using the GPT3 model.

Parameters

prompt string - Prompt to be completed

stop string? (default ()) - Stop sequence to stop the completion

Return Type

string|error - Completed prompt or error if the completion fails

generate

Isolated Function

function generate(PromptConstruct prompt) returns string|error

Generate ReAct response for the given prompt

Parameters

prompt PromptConstruct - Prompt construct

Return Type

string|error - ReAct response

Fields

modelConfig CompletionModelConfig -

ai.agent: HttpServiceToolKit

Isolated

Defines a HTTP tool kit. This is a special type of tool kit that can be used to invoke HTTP resources. Require to initialize the toolkit with the service url and http tools that are belongs to a singel API.

Constructor

Initializes the toolkit with the given service url and http tools.

init (string serviceUrl, HttpTool[] httpTools, ClientConfiguration clientConfig, HttpHeader headers)

serviceUrl string - The url of the service to be called

httpTools HttpTool[] - The http tools to be initialized

clientConfig ClientConfiguration {} - The http client configuration associated to the tools

headers HttpHeader {} - The http headers to be used in the requests

getTools

Isolated Function

function getTools() returns Tool[]|error

Method included from *BaseToolKit

Enums

ai.agent: HttpMethod

Supported HTTP methods.

Members

GET

POST

DELETE

PUT

PATCH

HEAD

OPTIONS

ai.agent: InputType

Supported input types by the Tool schemas.

Members

STRING

INTEGER

FLOAT

BOOLEAN

NUMBER

OBJECT

ARRAY

ai.agent: ROLE

Roles for the chat messages.

Members

SYSTEM_ROLE

USER_ROLE

Records

ai.agent: AdditionInfoFlags

Closed record

Defines additional information to be extracted from the OpenAPI specification.

Fields

extractDescription boolean(default false) - Flag to extract description of parameters and schema attributes from the OpenAPI specification

extractDefault boolean(default false) - Flag to extract default values of parameters and schema attributes from the OpenAPI specification

ai.agent: AllOfInputSchema

Closed record

Defines an allOf input field in the schema. Follows OpenAPI 3.x specification.

Fields

allOf ObjectInputSchema[] - List of possible input types

ai.agent: AnyOfInputSchema

Closed record

Defines an anyOf input field in the schema. Follows OpenAPI 3.x specification.

Fields

anyOf ObjectInputSchema[] - List of possible input types

ai.agent: ArrayInputSchema

Closed record

Defines an array input field in the schema.

Fields

Fields Included from * BaseInputTypeSchema

type InputType
description string
default json

'type ARRAY(default ARRAY) - Input data type. Should be ARRAY.

items JsonSubSchema - Schema of the array items

default json[]? - Default value for the array

ai.agent: ArrayTypeParameterSchema

Closed record

Defines a HTTP parameter schema for Array type parameters.

Fields

Fields Included from * ArrayInputSchema

type "array"
items JsonSubSchema
default json[]
description string

items PrimitiveInputSchema|ConstantValueSchema - Array item type

default PrimitiveType[]? - Default value of the parameter

ai.agent: BaseInputTypeSchema

Closed record

Defines a base input type schema.

Fields

'type InputType - Input data type

description string? - Description of the input

default json? - Default value of the input

ai.agent: ChatMessage

Closed record

Chat message record

Fields

role ROLE - Role of the message

content string - Content of the message

ai.agent: ChatModelConfig

Read OnlyClosed record

Chat model configurations.

Fields

model string(default GPT3_5_MODEL_NAME) - Model type to be used for the completion. Default is gpt-3.5-turbo

temperature decimal(default DEFAULT_TEMPERATURE) - Temperature value to be used for the completion. Default is 0.7.

ai.agent: CompletionModelConfig

Read OnlyClosed record

Completion model configurations.

Fields

model string(default GPT3_MODEL_NAME) - Model type to be used for the completion. Default is davinci.

temperature decimal(default DEFAULT_TEMPERATURE) - Temperature value to be used for the completion. Default is 0.7.

max_tokens int(default DEFAULT_MAX_TOKEN_COUNT) - Maximum number of tokens to be generated for the completion. Default is 512.

ai.agent: ConstantValueSchema

Closed record

Defines a constant value field in the schema.

Fields

'const json - The constant value.

ai.agent: ExecutionStep

Closed record

Prompt to be given to the LLM.

Fields

thought string - Thought produced by the LLM during the reasoning

observation any|error? - Observations produced by the tool during the execution

ai.agent: HttpApiSpecification

Closed record

Provides extracted tools and service URL from the OpenAPI specification.

Fields

serviceUrl string? - Extracted service URL from the OpenAPI specification if there is any

tools HttpTool[] - Extracted Http tools from the OpenAPI specification

ai.agent: HttpHeader

Read Only

Provide definition to an HTTP header

Fields

string|string[]... - Rest field

ai.agent: HttpOutput

Closed record

Defines an HTTP output record for requests.

Fields

code int - HTTP status code of the response

payload string? - Content of the response

ai.agent: HttpTool

Closed record

Defines an HTTP tool. This is a special type of tool that can be used to invoke HTTP resources.

Fields

name string - Name of the Http resource tool

description string - Description of the Http resource tool used by the LLM

method HttpMethod - Http method type (GET, POST, PUT, DELETE, PATCH, HEAD, OPTIONS)

path string - Path of the Http resource

queryParameters ParameterSchema? - Query parameters definitions of the Http resource

pathParameters ParameterSchema? - Path parameter definitions of the Http resource

requestBody JsonInputSchema? - Request body definition of the Http resource

ai.agent: NotInputSchema

Closed record

Defines a not input field in the schema. Follows OpenAPI 3.x specification.

Fields

not JsonSubSchema - Schema that is not accepted as an input

ai.agent: ObjectInputSchema

Closed record

Defines an object input field in the schema.

Fields

Fields Included from * BaseInputTypeSchema

type InputType
description string
default json

'type OBJECT(default OBJECT) - Input data type. Should be OBJECT.

required string[]? - List of required properties

properties map<JsonSubSchema> - Schema of the object properties

ai.agent: OneOfInputSchema

Closed record

Defines an oneOf input field in the schema. Follows OpenAPI 3.x specification.

Fields

oneOf JsonSubSchema[] - List of possible input types

ai.agent: ParameterSchema

Closed record

Defines a HTTP parameter schema (can be query parameter or path parameters).

Fields

required string[]? - A list of mandatory parameters

properties map<ParameterType> - A map of parameter names and their types

ai.agent: PrimitiveInputSchema

Closed record

Defines a primitive input field in the schema.

Fields

Fields Included from * BaseInputTypeSchema

type InputType
description string
default json

'type STRING|INTEGER|NUMBER|FLOAT|BOOLEAN - Input data type. Should be one of STRING, INTEGER, NUMBER, FLOAT, or BOOLEAN.

format string? - Format of the input. This is not applicable for BOOLEAN type.

pattern string? - Pattern of the input. This is only applicable for STRING type.

'enum string[]? - Enum values of the input. This is only applicable for STRING type.

default PrimitiveType? - Default value of the input

ai.agent: PromptConstruct

Closed record

Prompt construct record

Fields

instruction string - Instructions in the prompt

query string - Query to the prompt

history ExecutionStep[] - Execution history to the prompt

ai.agent: Tool

Closed record

Defines a tool. This is the only tool type directly understood by the agent. All other tool types are converted to this type using toolkits.

Fields

name string - Name of the tool

description string - A description of the tool. This is used by the LLMs to understand the behavior of the tool.

parameters JsonInputSchema?(default ()) - Input schema expected by the tool. If the tool doesn't expect any input, this should be null.

caller function() () - Pointer to the function that should be called when the tool is invoked.

Object types

ai.agent: BaseToolKit

Distinct

Allows implmenting custom toolkits by extending this type. Toolkits can help to define new types of tools so that agent can understand them.

ai.agent: LlmModel

Distinct

Extendable LLM model object that can be used for completion tasks. Useful to initialize the agents.

generate

Isolated Function

function generate(PromptConstruct prompt) returns string|error

Parameters

prompt PromptConstruct -

Union types

ai.agent: JsonInputSchema

JsonInputSchema

Defines a json input schema

ai.agent: JsonSubSchema

JsonInputSchema|PrimitiveInputSchema|ConstantValueSchema

JsonSubSchema

Defines a json sub schema

ai.agent: ParameterType

ConstantValueSchema|PrimitiveInputSchema|ArrayTypeParameterSchema

ParameterType

Define parameter types for HTTP parameters.

ai.agent: PrimitiveType

int|string|boolean|float|decimal

PrimitiveType

Primitive types supported by the Tool schemas.

Import

import ballerinax/ai.agent;

Metadata

Released date: 9 months ago

Version: 0.5.0

License: Apache-2.0

Compatibility

Platform: any

Ballerina version: 2201.7.1

GraalVM compatible: Yes

Pull count

Total: 900

Current verison: 204

Weekly downloads

Source repository

Keywords

AI/Agent

Cost/Freemium

Contributors

Other versions

0.7.6 0.7.5 0.7.4 0.7.3 0.7.2

Dependencies

ballerinax/azure.openai.chat/1.0.2 ballerinax/openai.text/1.0.4 ballerina/regex/1.4.3

Cookie policy

Delete policy

functions

classes

enums

records

objectTypes

unionTypes

ballerinax/ai.agent Ballerina library

Overview

Prerequisites

Tool

Function as a Tool

HTTP Resource as a Tool

Tools from Interface Definition Languages (IDLs)

Tool Input Schema

ToolKit

Model

Extending LlmModel for Custom Models

Agent

1. Agent.run() for Batch Execution

2. AgentIterator for foreach Execution

3. AgentExecutor for Reason-Act Interface

Quickstart

Step 1 - Import Library

Step 2 - Defining Tools for the Agent

Step 3 - Create the Agent

Step 4 - Run the Agent

Output

Functions

createChatPrompt

Parameters

Return Type

createCompletionPrompt

Parameters

Return Type

extractToolsFromOpenApiJsonSpec

Parameters

Return Type

extractToolsFromOpenApiSpecFile

Parameters

Return Type

Classes

ai.agent: Agent

Constructor

getExecutor

Parameters

Return Type

getIterator

Parameters

Return Type

run

Parameters

Return Type

ai.agent: AgentExecutor

hasNext

Return Type

reason

Return Type

act

Parameters

Return Type

update

Parameters

next

Return Type

ai.agent: AgentIterator

iterator

Return Type

ai.agent: AzureChatGptModel

Constructor

chatComplete

Parameters

Return Type

generate

Parameters

Return Type

Fields

ai.agent: AzureGpt3Model

Constructor

complete

Parameters

Extending `LlmModel` for Custom Models

2. `AgentIterator` for `foreach` Execution

3. `AgentExecutor` for Reason-Act Interface