Actions

Active Agent Actions render views, just like Action Controller or Action Mailer. Similar to Action Mailer that renders Mail objects with Messages, Active Agent renders Prompt objects with Messages.

Prompt

The prompt method is used to render the action's content as a message in a prompt. The prompt method is similar to mail in Action Mailer or render in Action Controller, it allows you to specify the content type and view template for the action's response.

These Prompt objects contain the context Messages and available Actions. These actions are the interface that agents can use to interact with tools through text and JSON views or interact with users through text and HTML views.

Actions can be used to render Prompt objects with :assistant Messages back to a user or :tool Messages to provide the result of an action back to the Agent.

Defining Actions

You can define actions in your agent class that can be used to interact with the agent.

translation_agent.rbtranslate.json.jbuildertranslate.text.erb

ruby

class TranslationAgent < ApplicationAgent
  generate_with :openai, instructions: "Translate the given text from one language to another."

  def translate
    prompt
  end
end

ruby

json.type :function
json.function do
  json.name action_name
  json.description "This action takes params locale and message and returns a translated message."
  json.parameters do
    json.type :object
    json.properties do
      json.locale do
        json.type :string
        json.description "The target language for translation."
      end
      json.message do
        json.type :string
        json.description "The text to be translated."
      end
    end
  end
end

erb

translate: <%= params[:message] %>; to <%= params[:locale] %>

Call to Actions

These actions can be invoked by the agent to perform specific tasks and receive the results or in your Rails app's controllers, models, or jobs to prompt the agent for generation with a templated prompt message. By default, public instance methods defined in the agent class are included in the context as available actions. You can also define actions in a separate concern module and include them in your agent class.

test/agents/translation_agent_test.rb:6..8

ruby

translate_prompt = TranslationAgent.with(message: "Hi, I'm Justin", locale: "japanese").translate

assert_equal "translate: Hi, I'm Justin; to japanese\n", translate_prompt.message.content
assert_equal "Translate the given text from one language to another.", translate_prompt.instructions

Action params

Agent Actions can accept parameters, which are passed as a hash to the action method. You pass arguments to agent's using the with method and access parameters using the params method, just like Mailer Actions.

Using Actions to prompt the Agent with a templated message

You can call these actions directly to render a prompt to the agent directly to generate the requested object.

test/agents/translation_agent_test.rb:15..16

ruby

response = TranslationAgent.with(message: "Hi, I'm Justin", locale: "japanese").translate.generate_now
assert_equal "こんにちは、私はジャスティンです。", response.message.content

Using Agents to call Actions

You can also provide an Agent with a prompt context that includes actions and messages. The agent can then use these actions to perform tasks and generate responses based on the provided context.

ruby

parameterized_agent = TravelAgent.with(message: "I want to book a hotel in Paris")
agent_text_prompt_context = parameterized_agent.text_prompt

agent_text_prompt_context.generate_now

In this example, the TravelAgent will use the provided message as context to determine which actions to use during generation. The agent can then call the search action to find hotels, book action to initialize a hotel booking, or confirm action to finalize a booking, as needed based on the prompt context.

The prompt takes the following options:

content_type: Specifies the type of content to be rendered (e.g., :text, :json, :html).
message: The message.content to be displayed in the prompt.
messages: An array of messages objects to be included in the prompt's context.
template_name: Specifies the name of the template to be used for rendering the action's response.

app/agents/travel_agent.rb

ruby

class TravelAgent < ActiveAgent::Agent
  def search
    Place.search(params[:location])
    prompt(content_type: :text, template_name: 'search_results')
  end

  def book
    Place.book(hotel_id: params[:hotel_id], user_id: params[:user_id])
    prompt(content_type: :json, template_name: 'booking_confirmation')
  end

  def confirm
    Place.confirm(params[:booking_id])
    prompt(content_type: :html, template_name: 'confirmation_page')
  end
end

Action View Templates & Partials

While partials can be used in the JSON views the action's json view should primarily define the tool schema, then secondarily define the tool's output using a partial to render results of the tool call all in a single JSON action view template. Use the JSON action views for tool schema definitions and results, and use the text or HTML action views for rendering the action's response to the user.

Runtime options

content_type: Specifies the type of content to be rendered (e.g., :text, :json, :html).
view: Specifies the view template to be used for rendering the action's response. This can be a string representing the view file name or a symbol representing a predefined view.
stream: If set to true, the response will be streamed to the user in real-time. This is useful for long-running actions or when you want to provide immediate feedback to the user.
options: Additional options that can be passed to the generation provider, such as model parameters or generation settings.

How Agents use Actions

The agent receives a request from the user, which may include a message or an action to be performed.
The agent processes the request and determines if an action needs to be invoked.
If an action is invoked, the agent calls the corresponding method and passes the parameters to it.
The action method executes the logic defined in the agent and may interact with tools or perform other tasks.
The action method returns a response, which can be a rendered view, JSON data, or any other content type specified in the prompt method.
The agent updates the context with the action's result and prepares the response to be sent back to the user.

How Agents handle responses

The agent receives a response from the generation provider, which includes the generated content and any actions that need to be performed.
The agent processes the response
If there are no requested_actions then response is sent back to the user.
If the response includes actions, then agent executes them and updates the context accordingly.
If the resulting context requested_actions includes reiterate, then context is updated with the new messages, actions, and parameters, and the cycle continues.

Respond to User

You provide the model with a prompt or conversation history, along with a set of tools.
Based on the context, the model may decide to call a tool.
If a tool is called, it will execute and return data.
This data can then be passed to a view for rendering to the user.

Respond to Agent

The user interacts with UI elements connected to Action Controllers that call Agent's to generate content or the user enters a message in the chat UI.
The message is sent to the controller action
In your controller action, the language model generates tool calls during the generate_* call.
All tool calls are persistent in context and renderable to the User
Tools are executed using their process action method and their results are forwarded to the context.
You can return the tool result from the after_generate callback.
Tool calls that require user interactions can be displayed in the UI. The tool calls and results are available as tool invocation parts in the parts property of the last assistant message.
When the user interaction is done, render tool result can be used to add the tool result to the chat or UI view element.
When there are tool calls in the last assistant message and all tool results are available, this flow is reiterated.

Features

Automatically included in the agent's context as tools.
Schema rendering for tool definitions.
Support for multiple Action View template content types (e.g., text, JSON, HTML).
Customizable actions with dynamic view templates for Retrieval Augmented Generation (RAG).
Prompt method to render the action's content in the prompt context.

Tool Definitions

Tool schema definitions are also view templates that can be rendered to the agent. They are used to define the structure and parameters of the tools that the agent can use. Tool definitions are typically defined in JSON format and can include properties, required fields, and descriptions. They can be represented in various formats, such as jbuilder, JSON, or ERB templates, to provide a structured way to define the tools available to the agent.

erb

<%= {
  type: :function,
  function: {
    name: action_name,
    description: "This action takes no params and gets a random cat image and returns it as a base64 string.",
    parameters: {
      type: :object,
      properties: {
        param_name: {
          type: :string,
          description: "The param_description"
        }
      }
    }
  }
}.to_json.html_safe %>

Actions ​

Prompt ​

Defining Actions ​

Call to Actions ​

Action params ​

Using Actions to prompt the Agent with a templated message ​

Using Agents to call Actions ​

Action View Templates & Partials ​

Runtime options ​

How Agents use Actions ​

How Agents handle responses ​

Respond to User ​

Respond to Agent ​

Features ​

Tool Definitions ​

Actions

Prompt

Defining Actions

Call to Actions

Action params

Using Actions to prompt the Agent with a templated message

Using Agents to call Actions

Action View Templates & Partials

Runtime options

How Agents use Actions

How Agents handle responses

Respond to User

Respond to Agent

Features

Tool Definitions