eino-examples

You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

12 KiB

Raw Blame History

title
Chapter 4: Tool and Filesystem Access

The goal of this chapter is to add Tool capabilities to the Agent, enabling it to access the filesystem.

Why We Need Tools

In the first three chapters, the Agent we implemented could only have conversations — it couldn't perform actual operations.

Agent limitations:

Can only generate text replies
Cannot access external resources (files, APIs, databases, etc.)
Cannot execute actual tasks (calculations, queries, modifications, etc.)

The role of Tools:

Tools are capability extensions for the Agent: They enable the Agent to perform concrete operations
Tools encapsulate specific implementations: The Agent doesn't care how a Tool works internally, only about its input and output
Tools are composable: An Agent can have multiple Tools and choose which to call as needed

Simple analogy:

Agent = "intelligent assistant" (can understand instructions, but needs tools to execute)
Tool = "toolbox" (file operations, network requests, database queries, etc.)

Why We Need Filesystem Access

This example is ChatWithDoc (chat with documentation), with the goal of helping users learn the Eino framework and write Eino code. So, what is the best documentation?

The answer is: the Eino repository's code itself.

Code: Source code shows the framework's actual implementation
Comments: Code comments provide design rationale and usage instructions
Examples: Example code demonstrates best practices

With filesystem access capabilities, the Agent can directly read Eino source code, comments, and examples, providing users with the most accurate and up-to-date technical support.

Key Concepts

Tool Interface

Tool is the interface in Eino that defines executable capabilities:

// BaseTool provides tool metadata that ChatModel uses to decide whether and how to call the tool
type BaseTool interface {
    Info(ctx context.Context) (*schema.ToolInfo, error)
}

// InvokableTool is a tool that can be executed by ToolsNode
type InvokableTool interface {
    BaseTool
    // InvokableRun executes the tool; arguments are a JSON-encoded string, returns a string result
    InvokableRun(ctx context.Context, argumentsInJSON string, opts ...Option) (string, error)
}

// StreamableTool is the streaming variant of InvokableTool
type StreamableTool interface {
    BaseTool
    // StreamableRun executes the tool in streaming mode, returns a StreamReader
    StreamableRun(ctx context.Context, argumentsInJSON string, opts ...Option) (*schema.StreamReader[string], error)
}

Interface hierarchy:

BaseTool: Base interface, only provides metadata
InvokableTool: Executable tool (extends BaseTool)
StreamableTool: Streaming tool (extends BaseTool)

Backend Interface

Backend is an abstract interface in Eino for filesystem operations:

type Backend interface {
    // List file information in a directory
    LsInfo(ctx context.Context, req *LsInfoRequest) ([]FileInfo, error)

    // Read file content, supports line offset and limits
    Read(ctx context.Context, req *ReadRequest) (*FileContent, error)

    // Search for matching content in files
    GrepRaw(ctx context.Context, req *GrepRequest) ([]GrepMatch, error)

    // Match files based on glob patterns
    GlobInfo(ctx context.Context, req *GlobInfoRequest) ([]FileInfo, error)

    // Write file content
    Write(ctx context.Context, req *WriteRequest) error

    // Edit file content (string replacement)
    Edit(ctx context.Context, req *EditRequest) error
}

LocalBackend

LocalBackend is the local filesystem implementation of Backend, directly accessing the operating system's filesystem:

import localbk "github.com/cloudwego/eino-ext/adk/backend/local"

backend, err := localbk.NewBackend(ctx, &localbk.Config{})

Features:

Directly accesses the local filesystem, implemented using Go standard library
Supports all Backend interface methods
Supports executing shell commands (ExecuteStreaming)
Path safety: Requires absolute paths to prevent directory traversal attacks
Zero configuration: Works out of the box with no additional setup

Implementation: Using DeepAgent

This chapter uses the DeepAgent prebuilt Agent, which provides first-class configuration for Backend and StreamingShell, making it easy to register filesystem-related tools.

From ChatModelAgent to DeepAgent: When to Switch?

Previous chapters used ChatModelAgent, which can handle multi-turn conversations. But to access the filesystem, we need to switch to DeepAgent.

ChatModelAgent vs DeepAgent comparison:

Capability	ChatModelAgent	DeepAgent
Multi-turn conversation	Yes	Yes
Add custom Tools	Yes, manually register each Tool	Yes, manual or automatic registration
Filesystem access (Backend)	No, must manually create and register all file tools	Yes, first-class config, auto-registered
Command execution (StreamingShell)	No, must manually create	Yes, first-class config, auto-registered
Built-in task management	No	Yes, `write_todos` tool
Sub-Agent support	No	Yes

Selection guide:

Pure conversation scenarios (no external access) -> Use ChatModelAgent
Need filesystem access or command execution -> Use DeepAgent

Why Use DeepAgent?

Compared to using ChatModelAgent directly, DeepAgent's advantages:

First-class configuration: Backend and StreamingShell are first-class config options — just pass them in
Automatic tool registration: Configuring a Backend automatically registers filesystem tools, no manual creation needed
Built-in task management: Provides the write_todos tool for task planning and tracking
Sub-Agent support: Can configure specialized sub-Agents for specific tasks
More powerful: Integrates filesystem, command execution, and many other capabilities

Code Implementation

import (
    localbk "github.com/cloudwego/eino-ext/adk/backend/local"
    "github.com/cloudwego/eino/adk/prebuilt/deep"
)

// Create LocalBackend
backend, err := localbk.NewBackend(ctx, &localbk.Config{})

// Create DeepAgent, automatically registers filesystem tools
agent, err := deep.New(ctx, &deep.Config{
    Name:           "Ch04ToolAgent",
    Description:    "ChatWithDoc agent with filesystem access via LocalBackend.",
    ChatModel:      cm,
    Instruction:    instruction,
    Backend:        backend,        // Provides filesystem operation capabilities
    StreamingShell: backend,        // Provides command execution capabilities
    MaxIteration:   50,
})

Tools Automatically Registered by DeepAgent

When Backend and StreamingShell are configured, DeepAgent automatically registers the following tools:

read_file: Read file content
write_file: Write file content
edit_file: Edit file content
glob: Find files based on glob patterns
grep: Search for content in files
execute: Execute shell commands

Code Location

Entry code: cmd/ch04/main.go

Prerequisites

Same as Chapter 1: you need to configure an available ChatModel (OpenAI or Ark).

This chapter also requires setting PROJECT_ROOT (optional, see running instructions below).

Running

In the examples/quickstart/chatwitheino directory, run:

# Optional: Set the root directory path for the Eino core library
# When not set, the Agent defaults to using the current working directory (the chatwitheino directory) as root
# To let the Agent search the complete Eino codebase, point this to the eino core library root
export PROJECT_ROOT=/path/to/eino

# Verify the path is correct (you should see directories like adk, components, compose, etc.)
ls $PROJECT_ROOT

go run ./cmd/ch04

PROJECT_ROOT explanation:

When not set: PROJECT_ROOT defaults to the current working directory (the directory containing chatwitheino), and the Agent can only access files in this example project. This is sufficient for quick experimentation.
When set: Points to the Eino core library root directory, and the Agent can search the complete Eino framework codebase (core library, extension library, examples library). This is the full ChatWithEino usage scenario.

Recommended three-repository directory structure (for the full experience):

eino/                    # PROJECT_ROOT (Eino core library)
├── adk/
├── components/
├── compose/
├── ext/                 # eino-ext (extension components, e.g., OpenAI, Ark implementations)
├── examples/            # eino-examples (this repository, where this example is located)
│   └── quickstart/
│       └── chatwitheino/
└── ...

You can use the dev_setup.sh script to automatically set up the above directory structure:

# Run in the eino root directory to automatically clone extension and example repos to the correct locations
bash scripts/dev_setup.sh

Output example:

you> List the files in the current directory
[assistant] Let me list the files in the current directory...
[tool call] glob(pattern: "*")
[tool result] Found 5 files:
- main.go
- go.mod
- go.sum
- README.md
- cmd/

you> Read the contents of main.go
[assistant] Let me read the main.go file...
[tool call] read_file(file_path: "main.go")
[tool result] File contents:
...

Note: If you encounter Tool errors during execution that cause the Agent to stop, don't panic — this is normal. Tool errors are common, such as parameter errors, file not found, etc. How to gracefully handle Tool errors will be covered in detail in the next chapter.

Tool Call Flow

When the Agent needs to call a Tool:

+------------------------------------------+
|  User: List the files in the current dir  |
+------------------------------------------+
                   |
        +------------------------+
        |  Agent analyzes intent  |
        |  Decides to call glob   |
        +------------------------+
                   |
        +------------------------+
        |  Generate Tool Call     |
        |  {"pattern": "*"}       |
        +------------------------+
                   |
        +------------------------+
        |  Execute Tool           |
        |  glob("*")              |
        +------------------------+
                   |
        +------------------------+
        |  Return Tool Result     |
        |  {"files": [...]}       |
        +------------------------+
                   |
        +------------------------+
        |  Agent generates reply  |
        |  "Found 5 files..."     |
        +------------------------+

Chapter Summary

Tool: Capability extensions for the Agent, enabling it to perform concrete operations
Backend: Abstract interface for filesystem operations, providing unified file operation capabilities
LocalBackend: Local filesystem implementation of Backend, directly accessing the OS filesystem
DeepAgent: A prebuilt advanced Agent with first-class configuration for Backend and StreamingShell
Automatic tool registration: Configuring a Backend automatically registers filesystem tools
Tool call flow: Agent analyzes intent -> Generates Tool Call -> Executes Tool -> Returns result -> Generates reply

Further Thinking

Other Tool types:

HTTP Tool: Call external APIs
Database Tool: Query databases
Calculator Tool: Perform calculations
Code Executor Tool: Run code

Other Backend implementations:

Other storage backends can be implemented based on the Backend interface
For example: cloud storage, database storage, etc.
LocalBackend already provides complete filesystem operation capabilities

Custom Tool creation:

If you need to create custom Tools, you can use utils.InferTool to automatically infer from functions. See:

12 KiB Raw Blame History