Wednesday, July 9, 2025

AI AGENT VIDEO

 

Agents, agentic workflows, agents, agents, agent, agent, agent, agent, agentic...

All right. Most explanations of AI agents are either too technical or too basic. This video is meant for people like myself: you have zero technical background, but you use AI tools regularly, and you want to learn just enough about AI agents to see how they affect you. In this video we'll follow a simple one-two-three learning path by building on concepts you already understand, like ChatGPT, then moving on to AI workflows, and finally AI agents, all the while using examples you will actually encounter in real life. And believe me when I tell you, those intimidating terms you see everywhere, like RAG or ReAct, are a lot simpler than you think. Let's get started.

Kicking things off at level one: large language models. Popular AI chatbots like ChatGPT, Google Gemini, and Claude are applications built on top of large language models (LLMs), and they're fantastic at generating and editing text. Here's a simple visualization: you, the human, provide an input, and the LLM produces an output based on its training data. For example, if I were to ask ChatGPT to draft an email requesting a coffee chat, my prompt is the input, and the resulting email, which is way more polite than I would ever be in real life, is the output. So far so good, right? Simple stuff.

But what if I asked ChatGPT when my next coffee chat is? Even without seeing the response, both you and I know ChatGPT is going to fail, because it doesn't know that information. It doesn't have access to my calendar. This highlights two key traits of large language models. First, despite being trained on vast amounts of data, they have limited knowledge of proprietary information, like our personal information or internal company data. Second, LLMs are passive: they wait for our prompt and then respond. Keep these two traits in mind moving forward.

Moving to level two: AI workflows. Let's build on our example. What if I, a human, told the LLM, "Every time I ask about a personal event, perform a search query and fetch data from my Google Calendar before providing a response"?
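The rule quoted above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `FAKE_CALENDAR`, `PERSONAL_EVENT_WORDS`, `search_calendar`, `fake_llm`, and the date are all made up for the example, and a real workflow would call a calendar service and a model instead of these stand-ins.

```python
from datetime import date

# Hypothetical stand-in for a Google Calendar lookup. The entry and its
# date are invented for illustration only.
FAKE_CALENDAR = {"coffee chat with elon husky": date(2025, 7, 14)}

# Hypothetical keywords the human uses to define "a personal event".
PERSONAL_EVENT_WORDS = ("coffee chat", "meeting", "appointment")


def search_calendar(question: str) -> list[str]:
    """Return calendar entries whose title appears in the question."""
    q = question.lower()
    return [
        f"{title}: {day.isoformat()}"
        for title, day in FAKE_CALENDAR.items()
        if title in q
    ]


def fake_llm(prompt: str) -> str:
    """Placeholder for a model call; it only echoes what it was given."""
    return f"[LLM answer based on] {prompt}"


def answer(question: str) -> str:
    # The human-written rule: personal-event questions hit the calendar first.
    if any(word in question.lower() for word in PERSONAL_EVENT_WORDS):
        events = search_calendar(question)
        return fake_llm(f"Question: {question}\nCalendar data: {events}")
    # Everything else goes straight to the model with no extra data.
    return fake_llm(f"Question: {question}")


print(answer("When is my coffee chat with Elon Husky?"))
```

The branch inside `answer` was written by a person, not chosen by the model: the model only ever sees whatever data that fixed rule decides to fetch.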
With this logic implemented, the next time I ask, "When is my coffee chat with Elon Husky?" I'll get the correct answer, because the LLM will now first go into my Google Calendar to find that information. But here's where it gets tricky. What if my follow-up question is, "What will the weather be like that day?" The LLM will now fail to answer the query, because the path we told it to follow is to always search my Google Calendar, which does not have information about the weather. This is a fundamental trait of AI workflows: they can only follow predefined paths set by humans. And if you want to get technical, this path is also called the control logic.

Pushing my example further, what if I added more steps to the workflow by allowing the LLM to access the weather via an API and then, just for fun, use a text-to-audio model to speak the answer: "The weather forecast for seeing Elon Husky is sunny, with a chance of being a good boy." Here's the thing: no matter how many steps we add, this is still just an AI workflow. Even if there were hundreds or thousands of steps, if a human is the decision maker, there is no AI agent involved.

Pro tip: retrieval-augmented generation, or RAG, is a fancy term that's thrown around a lot. In simple terms, RAG is a process that helps AI models look things up before they answer, like accessing my calendar or the weather service. Essentially, RAG is just a type of AI workflow.

By the way, I have a free AI toolkit that cuts through the noise and helps you master essential AI tools and workflows. I'll leave a link to that down below.

Here's a real-world example. Following Helena Liu's amazing tutorial, I created a simple AI workflow using make.com. Here you can see that, first, I'm using Google Sheets to do something: specifically, I'm compiling links to news articles in a Google Sheet, and this is that Google Sheet. Second, I'm using Perplexity to summarize those news articles. Then, using a prompt that I wrote, I'm asking Claude to draft a LinkedIn post and an Instagram post. Finally, I can schedule this to run automatically every day at 8 a.m. As you can see, this is an AI workflow because it follows a predefined path set by me: step one, you do this; step two, you do this; step three, you do this; and finally, remember to run daily at 8 a.m.

One last thing. If I test this workflow and I don't like the final output of the LinkedIn post (for example, as you can see right here, it's not funny enough, and I'm naturally hilarious, right?), I'd have to manually go back and rewrite the prompt for Claude. This trial-and-error iteration is currently being done by me, a human, so keep that in mind moving forward.

All right, level three: AI agents. Continuing the make.com example, let's break down what I've been doing so far as the human decision maker. With the goal of creating social media posts based on news articles, I need to do two things. First, reason, or think about the best approach: I need to first compile the news articles, then summarize them, then write the final posts. Second, take action using tools: I need to find those news articles and link to them in Google Sheets, use Perplexity for real-time summarization, and then use Claude for copywriting.

So, and this is the most important sentence in this entire video, the one massive change that has to happen for this AI workflow to become an AI agent is for me, the human decision maker, to be replaced by an LLM.

In other words, the AI agent must reason: "What's the most efficient way to compile these news articles? Should I copy and paste each article into a Word document? No, it's probably easier to compile links to those articles and then use another tool to fetch the data. Yes, that makes more sense."

The AI agent must also act, aka do things via tools: "Should I use Microsoft Word to compile links? No, inserting links directly into rows is way more efficient. What about Excel? Hmm, the user has already connected their Google account with make.com, so Google Sheets is a better option."

Pro tip: because of this, the most common configuration for AI agents is the ReAct framework. All AI agents must reason and act, so: ReAct. Sounds simple once we break it down, right?

A third key trait of AI agents is their ability to iterate. Remember when I had to manually rewrite the prompt to make the LinkedIn post funnier? I, the human, probably need to repeat this iterative process a few times to get something I'm happy with, right? An AI agent would be able to do the same thing autonomously. In our example, the AI agent would autonomously add in another LLM to critique its own output: "Okay, I've drafted V1 of a LinkedIn post. How do I make sure it's good? Oh, I know: I'll add another step where an LLM critiques the post based on LinkedIn best practices, and let's repeat this until the best-practice criteria are all met." After a few cycles of that, we have the final output.

That was a hypothetical example, so let's move on to a real-world AI agent example. Andrew Ng is a preeminent figure in AI, and he created this demo website that illustrates how an AI agent works. I'll link the full video down below, but when I search for a keyword like "skier" and hit Enter, the AI vision agent in the background is first reasoning about what a skier looks like (a person on skis going really fast in snow, for example; I'm not sure), and then it's acting by looking at clips in the video footage, trying to identify what it thinks a skier is, indexing that clip, and then returning that clip to us.

Although this might not feel impressive, remember that an AI agent did all that, instead of a human reviewing the footage beforehand, manually identifying the skier, and adding tags like "skier," "mountain," "ski," and "snow." The programming is obviously a lot more technical and complicated than what we see in the front end, but that's the point of this demo, right? The average user like myself wants a simple app that just works, without having to understand what's going on in the back end.

Speaking of examples, I'm also building my very own basic AI agent using n8n, so let me know in the comments what type of AI agent you'd like me to make a tutorial on next.

To wrap up, here's a simplified visualization of the three levels we covered today.

Level one: we provide an input and the LLM responds with an output. Easy.

Level two: for AI workflows, we provide an input and tell the LLM to follow a predefined path that may involve retrieving information from external tools. The key trait here is that the human programs a path for the LLM to follow.

Level three: the AI agent receives a goal, and the LLM performs reasoning to determine how best to achieve the goal, takes action using tools to produce an interim result, observes that interim result and decides whether iterations are required, and produces a final output that achieves the initial goal. The key trait here is that the LLM is the decision maker in the workflow.

If you found this helpful, you might want to learn how to build a prompts database in Notion. See you in the next video. In the meantime, have a great one!
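The level-three loop (reason, act, observe, iterate) can be sketched in Python as follows. This is a minimal sketch, not how any particular agent product is built. Every name here (`draft_post`, `critique_post`, `decide_next_action`, `run_agent`) is hypothetical, and `decide_next_action` is a rule-based stand-in for the LLM so the example runs without a model. In a real agent that function would prompt an LLM with the goal and the current state, which is exactly the piece that makes it an agent rather than a workflow.

```python
from typing import Callable


def draft_post(state: dict) -> dict:
    """Hypothetical writing tool: produces the next version of the post."""
    version = state.get("version", 0) + 1
    return {"version": version, "draft": f"LinkedIn post v{version}", "approved": None}


def critique_post(state: dict) -> dict:
    """Hypothetical critic: pretends the criteria are met from version 3 on."""
    return {**state, "approved": state["version"] >= 3}


TOOLS: dict[str, Callable[[dict], dict]] = {
    "draft": draft_post,
    "critique": critique_post,
}


def decide_next_action(goal: str, state: dict) -> str:
    """Stand-in for the LLM's reasoning step (assumption: rules, not a model)."""
    if state.get("approved") is True:
        return "finish"      # the goal is met, stop iterating
    if "draft" not in state or state.get("approved") is False:
        return "draft"       # nothing written yet, or the critic rejected it
    return "critique"        # a fresh draft exists but has not been reviewed


def run_agent(goal: str, max_steps: int = 10) -> tuple[dict, list[str]]:
    state: dict = {}
    history: list[str] = []
    for _ in range(max_steps):
        action = decide_next_action(goal, state)  # reason
        if action == "finish":
            break
        state = TOOLS[action](state)              # act, then observe the result
        history.append(action)
    return state, history


final_state, actions = run_agent("Write a LinkedIn post from today's news")
print(final_state["draft"], actions)
```

The `max_steps` cap is a design choice for the sketch: since the decision maker chooses when to stop, a hard limit keeps the loop from running forever if the stopping criteria are never met.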
