As AI Agents use lot of concepts and models in Generative-AI, Let's first understand what is Generative-AI?
what makes Generative AI different from traditional AI concepts in nutshell.
So basically, Artificial Intelligence (AI) is generalized domain which contains all the methods and techniques which helps to mimic human brain.
While Generative – AI is specialized AI which is particularly used for generating creative content / information for users. Day to day life and very well-known example for Gen-AI is ChatGPT.
So, its function is to create new data or content that is similar to what it was trained on, often indistinguishable from human-made content.
Okay!!! so let’s move forward.
Let's understand what is AI Agent?
So, Agent is basically entire system or entity which perform tasks Autonomously /Independently Based on its environment and input data.
It may use or may not use AI techniques for performing specific tasks.
They can be rule-based or decision tree based.
There are some standard characteristics of Agents:
Now, the question arises that if this is the agent, it seems approximately same as program, application and algorithm. In Computer Science. How we can differentiate then the agent different from Application OR Program OR Algorithm as all these do same things. Right?
Here are the simple definitions:
Now Let's understand the basic Architecture of Agents:
Let's start with Model.
Toolsbridge the gap between External World and Model, empowering agents to interact with external data and services while unlocking a wider range of actions beyond that of the underlying model alone.
Tools can take a variety of forms and have varying depths of complexity but typically align with common web API methods like GET, POST, PATCH, and DELETE. Which are very easy to understand.
3. Orchestration Layer –
The orchestration layer describes a cyclical process that governs how the agent takes in information, performs some internal reasoning, and uses that reasoning to inform its next action or decision.
Glimpses of Difference between Model and Agents:
Model | Agent |
Knowledge of Model is limited as it is trained on specific corpus of data. | Knowledge can be extended as Agent is always connected with external world. |
Process involves single prediction based on user query. Unless there is no externally managed system for history retention. | It has inbuilt history management system which retains history of all predictions from user query and all the decisions made in Orchestration layer. So the data can be fetched anytime. |
No native implementation of tools. | Tools are naively implemented in Agent Architecture. |
No native logic layer implemented | Native cognitive architecture that uses reasoning frameworks like CoT, ReAct, or other pre-built agent frameworks like LangChain. |
Now here, we have covered the most conceptual part of the document!!! The next part involves some definitions of some Concepts which we often use in AI Agents. Each Concept is explained with very simple example –
Let's go!!!
1] Extensions –
It's simply a bridge between API and AI agents
Example – Booking a flight using the Google Flights API via an extension that understands inputs like departure and destination cities.
2] Sample Extensions –
Pre-built extensions provided by Google for common tasks such as data fetching.
Example – Using a code interpreter extension to generate Python code for inverting a binary tree.
3] Functions
Use Cases of Functions
5] Data Stores
6] Implementation of Data Stores
7] Tools Recap
8] Enhancing Model Performance with Targeted Learning
9] Agent Quick Start with LangChain
10] Production Applications with Vertex AI Agents