November 8, 2023

How to tackle big projects with AI: An intro to multi-agent prompting

By making a bunch of AI tools with unique skill sets work in harmony, you can tackle really ambitious projects using AI as a copilot.
November 8, 2023

How to tackle big projects with AI: An intro to multi-agent prompting

By making a bunch of AI tools with unique skill sets work in harmony, you can tackle really ambitious projects using AI as a copilot.
November 8, 2023
Briana Brownell
In this article
Start editing audio & video
This makes the editing process so much faster. I wish I knew about Descript a year ago.
Matt D., Copywriter
Sign up

What type of content do you primarily create?

Videos
Podcasts
Social media clips
Transcriptions
Start editing audio & video
This makes the editing process so much faster. I wish I knew about Descript a year ago.
Matt D., Copywriter
Sign up

What type of content do you primarily create?

Videos
Podcasts
Social media clips
Transcriptions

AI tools are everywhere right now. There’s a tool to help you write your novel, to design your brand assets, to invent new drugs, even to talk to dead celebrities. While the pop-culture conception of artificial intelligence has always been one intelligence — a smart, probably evil robot who can do everything in the world better than humans — the current state of AI is more like an infinite number of intelligences that are only good at one thing. 

But nothing’s stopping you from making those AIs work together. That’s where multi-agent prompting comes in.

What is multi-agent prompting?

Imagine a bunch of AI tools, each with its own skill set, working in harmony. It's like an orchestra of instruments where every musician contributes to the song by playing their own unique part.

By using this approach, you can tackle really ambitious projects using AI as a copilot. For instance, creating an entire website from scratch requires coding, graphic design, copywriting, search engine optimization, and promotion. There’s a tool for each of those tasks, and you can create a workflow that makes them all work in concert.

But getting to that point takes a thoughtful setup, a little bit of knowhow, and a lot of practice. Here’s how to do it.

1. Learn the nuances of different tools

Before you can compose your masterpiece, you've got to know your instruments—or in this case, your AI tools, which we’ll also call “agents.” Each tool has its limitations and strengths. Working with them before tackling your grand ambition can help you figure out where each excels and falls short. 

Check out the limitations of these tools—they all have specific constraints that might impact their performance, such as ChatGPT's context window or Midjourney's challenges with generating full-length character renders (Describe the shoes. Always describe the shoes). Figure out what they do best, and consider their best role in your project.

2. Choose your workstyle

Next, decide how you want to work with the tools. This involves considering your preferred workflow and the way you want to achieve the goals. For instance, are you a Centaur, who delegates specific tasks to the AI while doing some on your own, or are you a Cyborg, who works continuously and iteratively with the AI? 

(Don't know which you are? Figure out your style here.)

Experiment with different workstyles to see how you like them.

3. Articulate your goals

When you're dealing with one tool, goal-setting is relatively straightforward. But when you're working with multiple agents, things get way more complicated. It pays to sit down and map out what you hope to achieve. You may even benefit from flowcharts or process maps to visualize how each tool will contribute at various stages.

Think about what parts can be done in parallel and which ones need others to be worked out first. For example, if you're doing a rebranding exercise, you need to articulate your brand's voice and identify your ideal customer before you start doing any copywriting. But you might not need to have finalized your design for the "About Us" website page before you start on a "Services" page.

This roadmap is essential: it's easy to get lost when using so many tools. Put it front and center and use it throughout the process. If you're stuck, ask one of the AI tools for help. 

image
Framework the author used for a corporate website. One AI acted as a brand expert, one as a copywriter, and three more as customer personas (as generated by the brand expert). Each AI had its own task that fed into the other roles.

4. Organize, organize, organize

It's easy to get overwhelmed by the sheer volume of content you'll generate, especially when you're going through multiple revisions and working iteratively with the tools to get the output right. Things will get lost if they’re not formatted and named correctly.

Set up an organizational framework that includes the in-progress and final versions of each tool’s output. Make sure you organize it as you go. If you're using text-generating AI, you might consider whether you want to specify the format of the output from the tool explicitly in your prompts or in your custom instructions. For instance, if you're generating code, ChatGPT likes to summarize it—if you want to put it directly into your file, you have to explicitly ask for it to write everything out.

If you're generating lots of different files, systematically name them so that you remember key information about each one and so you can find it later. If you haven't already, this is a great time to start creating your own prompt library.

Making these decisions early will help you avoid a mid-project meltdown.

5. Assemble your band

Once you know your goals, you can start "auditioning" your agents. You might choose a diverse ensemble of tools like ChatGPT for language tasks and Midjourney for images. Or maybe you'll go for a homogenous group, like multiple instances of ChatGPT each programmed to perform specific roles—think business strategist, website designer, or even customer persona.

Most likely your AI tools will need to be supported by other pieces of software and the work of real humans. You'll need to decide who does what when.

And if one of your tools is underperforming, you might need to kick it out of the band. 

image
Custom Instructions

6. Get them talking

Communication between your agents is key. Most likely, you'll be linking them indirectly by feeding one's output into another's input. This won’t happen automatically, but on the plus side, this lets you review the results before they go into the next step. 

Often, this means that you’re going to have to explicitly generate output that can be used as input to another tool. Basically, you're asking a tool to create a whole or part of the prompt for another AI. But be careful: make sure you're using prompting best practices. For instance, ChatGPT doesn't always do a great job of generating Midjourney prompts, so while they might be a good first start, you’ll probably have to edit them.

For really big jobs, your agents can interact directly through APIs (Application Programming Interfaces). There are also specialized extensions that can help coordinate multi-agent activities. 

Whatever the method, the goal is synergy. You want to maximize the work you’re doing that's necessarily human-led—like deciding whether the output meets your needs or requires edits or a redo—rather than fussing around with formatting or copy-pasting (don't forget you can set up an agent to do the formatting for you).

7. Add a TL;DR tool

Even when you think you've set it up perfectly, you're going to encounter bumps along the way. Say you're deep into a content-generation task, and suddenly, the output starts going off-topic or you hit the context window limit. Or maybe the AI starts giving you results that are technically correct but miss the emotional tone you're aiming for. 

You need a summarizer tool. What's that? It's a prompt that generates a succinct few lines of text you can use if you need to "restart" a tool mid-way through.

Prompt template:

Please act as a [persona]. We are [describe task].

Here is [content 1]: [summary of content 1]

Here is [content 2]: [summary of content 2]

etc.

[Write your actual prompt]

For example:

Please act as a professional copywriter. We are creating a website for a brand.

Here is the brand voice: The brand voice is engaging, upbeat, conversational, approachable. It uses simple language, short sentences, and humor. 

Here is a description of the service for the page we're working on: The service is….

Here is the copy we wrote for another service page: 

Please write the website copy for the service page that matches the brand voice….

Getting into the habit of periodically asking for summaries or high-level project information makes it much easier to organize big projects.

8. Conduct your symphony

Finally, it's showtime! With all your agents in place and communicating well, you're ready to set them into action. But this phase isn’t just about hitting “start” and watching the magic unfold; it requires active management and possibly some on-the-fly adjustments.

Remember to stay flexible. For instance, if ChatGPT produces a product description that doesn't quite align with the visual created by Midjourney, you may need to tweak the input prompt for one or both tools. 

Having a good organizational method can make this much easier (you did make one, didn't you?) 

In the process, you might keep a spreadsheet that tracks which agent is responsible for what task, and the status of each. It acts like your conductor's score, guiding you through the performance.

Conclusion

Conducting a multi-agent AI symphony isn't just for tech savants—it's for anyone willing to understand the capabilities of different tools and how to harmonize them for more complex tasks. It might take a bit of time to get used to the baton, but once you do, you'll wonder how you ever got along with a solo act. So go ahead, assemble your band of AI agents, and start making some beautiful music.

Briana Brownell
Briana Brownell is a Canadian data scientist and multidisciplinary creator who writes about the intersection of technology and creativity.
Share this article
Start creating—for free
Sign up
Join millions of others creating with Descript

How to tackle big projects with AI: An intro to multi-agent prompting

AI tools are everywhere right now. There’s a tool to help you write your novel, to design your brand assets, to invent new drugs, even to talk to dead celebrities. While the pop-culture conception of artificial intelligence has always been one intelligence — a smart, probably evil robot who can do everything in the world better than humans — the current state of AI is more like an infinite number of intelligences that are only good at one thing. 

But nothing’s stopping you from making those AIs work together. That’s where multi-agent prompting comes in.

Descript marries the easy capture and sharing of screen recording with easy, powerful editing tools.
Try our script-based editor that makes editing video as easy as editing a doc.

What is multi-agent prompting?

Imagine a bunch of AI tools, each with its own skill set, working in harmony. It's like an orchestra of instruments where every musician contributes to the song by playing their own unique part.

By using this approach, you can tackle really ambitious projects using AI as a copilot. For instance, creating an entire website from scratch requires coding, graphic design, copywriting, search engine optimization, and promotion. There’s a tool for each of those tasks, and you can create a workflow that makes them all work in concert.

But getting to that point takes a thoughtful setup, a little bit of knowhow, and a lot of practice. Here’s how to do it.

1. Learn the nuances of different tools

Before you can compose your masterpiece, you've got to know your instruments—or in this case, your AI tools, which we’ll also call “agents.” Each tool has its limitations and strengths. Working with them before tackling your grand ambition can help you figure out where each excels and falls short. 

Check out the limitations of these tools—they all have specific constraints that might impact their performance, such as ChatGPT's context window or Midjourney's challenges with generating full-length character renders (Describe the shoes. Always describe the shoes). Figure out what they do best, and consider their best role in your project.

2. Choose your workstyle

Next, decide how you want to work with the tools. This involves considering your preferred workflow and the way you want to achieve the goals. For instance, are you a Centaur, who delegates specific tasks to the AI while doing some on your own, or are you a Cyborg, who works continuously and iteratively with the AI? 

(Don't know which you are? Figure out your style here.)

Experiment with different workstyles to see how you like them.

3. Articulate your goals

When you're dealing with one tool, goal-setting is relatively straightforward. But when you're working with multiple agents, things get way more complicated. It pays to sit down and map out what you hope to achieve. You may even benefit from flowcharts or process maps to visualize how each tool will contribute at various stages.

Think about what parts can be done in parallel and which ones need others to be worked out first. For example, if you're doing a rebranding exercise, you need to articulate your brand's voice and identify your ideal customer before you start doing any copywriting. But you might not need to have finalized your design for the "About Us" website page before you start on a "Services" page.

This roadmap is essential: it's easy to get lost when using so many tools. Put it front and center and use it throughout the process. If you're stuck, ask one of the AI tools for help. 

image
Framework the author used for a corporate website. One AI acted as a brand expert, one as a copywriter, and three more as customer personas (as generated by the brand expert). Each AI had its own task that fed into the other roles.

4. Organize, organize, organize

It's easy to get overwhelmed by the sheer volume of content you'll generate, especially when you're going through multiple revisions and working iteratively with the tools to get the output right. Things will get lost if they’re not formatted and named correctly.

Set up an organizational framework that includes the in-progress and final versions of each tool’s output. Make sure you organize it as you go. If you're using text-generating AI, you might consider whether you want to specify the format of the output from the tool explicitly in your prompts or in your custom instructions. For instance, if you're generating code, ChatGPT likes to summarize it—if you want to put it directly into your file, you have to explicitly ask for it to write everything out.

If you're generating lots of different files, systematically name them so that you remember key information about each one and so you can find it later. If you haven't already, this is a great time to start creating your own prompt library.

Making these decisions early will help you avoid a mid-project meltdown.

5. Assemble your band

Once you know your goals, you can start "auditioning" your agents. You might choose a diverse ensemble of tools like ChatGPT for language tasks and Midjourney for images. Or maybe you'll go for a homogenous group, like multiple instances of ChatGPT each programmed to perform specific roles—think business strategist, website designer, or even customer persona.

Most likely your AI tools will need to be supported by other pieces of software and the work of real humans. You'll need to decide who does what when.

And if one of your tools is underperforming, you might need to kick it out of the band. 

image
Custom Instructions

6. Get them talking

Communication between your agents is key. Most likely, you'll be linking them indirectly by feeding one's output into another's input. This won’t happen automatically, but on the plus side, this lets you review the results before they go into the next step. 

Often, this means that you’re going to have to explicitly generate output that can be used as input to another tool. Basically, you're asking a tool to create a whole or part of the prompt for another AI. But be careful: make sure you're using prompting best practices. For instance, ChatGPT doesn't always do a great job of generating Midjourney prompts, so while they might be a good first start, you’ll probably have to edit them.

For really big jobs, your agents can interact directly through APIs (Application Programming Interfaces). There are also specialized extensions that can help coordinate multi-agent activities. 

Whatever the method, the goal is synergy. You want to maximize the work you’re doing that's necessarily human-led—like deciding whether the output meets your needs or requires edits or a redo—rather than fussing around with formatting or copy-pasting (don't forget you can set up an agent to do the formatting for you).

7. Add a TL;DR tool

Even when you think you've set it up perfectly, you're going to encounter bumps along the way. Say you're deep into a content-generation task, and suddenly, the output starts going off-topic or you hit the context window limit. Or maybe the AI starts giving you results that are technically correct but miss the emotional tone you're aiming for. 

You need a summarizer tool. What's that? It's a prompt that generates a succinct few lines of text you can use if you need to "restart" a tool mid-way through.

Prompt template:

Please act as a [persona]. We are [describe task].

Here is [content 1]: [summary of content 1]

Here is [content 2]: [summary of content 2]

etc.

[Write your actual prompt]

For example:

Please act as a professional copywriter. We are creating a website for a brand.

Here is the brand voice: The brand voice is engaging, upbeat, conversational, approachable. It uses simple language, short sentences, and humor. 

Here is a description of the service for the page we're working on: The service is….

Here is the copy we wrote for another service page: 

Please write the website copy for the service page that matches the brand voice….

Getting into the habit of periodically asking for summaries or high-level project information makes it much easier to organize big projects.

8. Conduct your symphony

Finally, it's showtime! With all your agents in place and communicating well, you're ready to set them into action. But this phase isn’t just about hitting “start” and watching the magic unfold; it requires active management and possibly some on-the-fly adjustments.

Remember to stay flexible. For instance, if ChatGPT produces a product description that doesn't quite align with the visual created by Midjourney, you may need to tweak the input prompt for one or both tools. 

Having a good organizational method can make this much easier (you did make one, didn't you?) 

In the process, you might keep a spreadsheet that tracks which agent is responsible for what task, and the status of each. It acts like your conductor's score, guiding you through the performance.

Conclusion

Conducting a multi-agent AI symphony isn't just for tech savants—it's for anyone willing to understand the capabilities of different tools and how to harmonize them for more complex tasks. It might take a bit of time to get used to the baton, but once you do, you'll wonder how you ever got along with a solo act. So go ahead, assemble your band of AI agents, and start making some beautiful music.

Featured articles:

No items found.

Articles you might find interesting

Video

How to edit videos faster by recording right the first time

Just pressing record and starting to talk is the best way to waste a ton of your time. The better your recordings are, the less time you’ll spend editing.

Product Updates

New: Wistia integration for easier, better video marketing

It’s true that anybody with a smartphone can make and publish a video. But if you want that video to promote your business, it’s not so simple. Just shooting it and posting it on your social media accounts probably won’t accomplish much. If you’re going to invest the time and money, you probably want more from your video.

For Business

How to Record A Lecture: 7 Tips to Enhance the Learning Experience

The pandemic has forced educators to adopt virtual teaching forms, like record lectures. Luckily, good recordings don’t require much more than software, a webcam and microphone, and a good strategy.

Podcasting

12 best practices for remote recording podcasts

You can capture high-quality audio from anywhere in the world that sounds like all of the speakers were sitting in the same recording studio. Here are some best practices to keep in mind.

Video

The 7 best video recording software picks for your project (2024)

Video recording software can mean the difference between a pixelated video that takes days to edit and a high-resolution masterpiece that’s ready in no time. Figure out the right software for your project with our recommendation.s

For Business

How Cloudinary uses Descript to make customer education video that looks and sounds great

Cloudinary needs to make their customers understand how to use their software, and video is their primary tool. Here's how they use Descript to make video that looks and sounds great—without all the hassle.

Related articles:

Share this article

Get started for free →