RIGHTBRAIN BLOG

Comparing different task versions in Rightbrain

How to build and compare different task configurations, including new prompt versions and alternative models

The Compare feature enables you to experiment with different task configurations and continuously improve performance. You can access the Compare view by selecting two different versions from the Task view. If you select only one, it will automatically be compared against your active task.

What Can I Compare?

In the Compare view, you see two task configurations side-by-side. You can configure any of the core components that make up a task, such as:

  • User Prompt: Defines the task for the LLM.

  • System Prompt: Provides additional context and constraints.

  • Temperature: Adjusts the variation in responses.

  • Model: Choose from leading proprietary or open-source models.

  • Output Format: Specify structured formats that integrate seamlessly with your database.
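
As a concrete illustration of what a structured output format buys you, consider a task that extracts invoice data. The schema below is hypothetical (the field names are not a Rightbrain-defined format); it simply shows how a declared shape lets you validate a task's response before writing it to your database:

```python
# Hypothetical structured output format for an invoice-extraction task.
# The field names and schema shape are illustrative, not Rightbrain-specific.
INVOICE_OUTPUT_FORMAT = {
    "vendor": str,
    "total": float,
    "currency": str,
    "line_items": list,
}

def validate_output(payload: dict, schema: dict) -> list:
    """Return a list of problems; an empty list means the payload matches."""
    problems = []
    for field, expected_type in schema.items():
        if field not in payload:
            problems.append(f"missing field: {field}")
        elif not isinstance(payload[field], expected_type):
            problems.append(f"wrong type for {field}: {type(payload[field]).__name__}")
    return problems
```

A check like this at the boundary means a model swap or prompt change that breaks the output shape fails loudly instead of corrupting stored rows.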

After you’ve decided on the two versions you’d like to compare, simply provide the input data you’d like to run both on. Your tasks will run simultaneously and produce two distinct responses.
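
The "same input, two versions" pattern can be sketched as building one run request per revision, both carrying identical input. The payload fields and revision IDs below are assumptions for illustration, not the actual Rightbrain API shape:

```python
# Sketch of fanning one input out to two task revisions for comparison.
# The payload fields and revision IDs are placeholders for illustration;
# the real Rightbrain request format may differ.
def build_compare_payloads(task_input: dict, revision_a: str, revision_b: str) -> list:
    """Build one run request per revision, both carrying identical input."""
    return [
        {"revision_id": rev, "input": task_input}
        for rev in (revision_a, revision_b)
    ]

payloads = build_compare_payloads({"text": "Summarise this ticket"}, "rev-123", "rev-456")
```

Because both payloads share the exact same input, any difference between the two responses is attributable to the configuration change alone.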

Testing a New Prompt and Outputs

Let’s try comparing a new prompt that provides more detail for the LLM and specifies exactly how we’d like the outputs formatted. You can instantly see the difference in the response.

Evaluating a New Model

Curious about testing a different model? You can instantly compare models—for example, Google’s Gemini 2.0 Flash versus Anthropic’s Claude 3.7 Sonnet. This allows you to continuously evaluate and deploy new models, uncovering opportunities for both performance and efficiency gains.

Task Revisions

Every time you update your task configuration, you have the option to save it as a new revision. If you run a new version prior to saving, it will create a test revision.

These revisions can be:

  • Tested side by side with other versions.

  • Promoted directly into your active production pipeline.

Each revision is assigned a unique ID, which you can use for version logging and A/B testing.
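
One common way to use those revision IDs for A/B testing is deterministic hash-based routing: each user is consistently bucketed into one revision or the other. The revision IDs below are placeholders, and this routing scheme is a general pattern rather than anything Rightbrain prescribes:

```python
import hashlib

# Deterministic A/B routing between two task revisions, keyed on a user ID.
# Revision IDs are placeholders; the routing scheme is a common pattern,
# not a Rightbrain-prescribed mechanism.
def pick_revision(user_id: str, revision_a: str, revision_b: str, split: float = 0.5) -> str:
    """Send roughly `split` of users to revision_a, the rest to revision_b."""
    digest = hashlib.sha256(user_id.encode()).digest()
    bucket = digest[0] / 255.0  # map the first digest byte to [0, 1]
    return revision_a if bucket < split else revision_b
```

Hashing the user ID (rather than choosing randomly per request) guarantees each user always hits the same revision, which keeps their experience consistent and makes the resulting logs easy to segment.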

If your task is currently running in a staging or production environment and you’d like to promote a new revision, simply click “Promote.” This action immediately replaces the existing version in your pipeline. In other words, the API endpoint for running the task will now use the new active version.
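
The promote semantics described above can be modelled in a few lines: a task holds many revisions, but its run endpoint always resolves to whichever one is currently active. This is a minimal in-memory sketch of that behaviour, not the Rightbrain SDK:

```python
# Minimal in-memory model of promote semantics: the endpoint stays stable,
# and only the revision behind it changes. Illustrative only; this is not
# the Rightbrain SDK or API.
class Task:
    def __init__(self, active_revision: str):
        self.active_revision = active_revision

    def promote(self, revision_id: str) -> None:
        """Immediately replace the active revision, as clicking Promote does."""
        self.active_revision = revision_id

    def run(self, task_input: dict) -> dict:
        # Callers never change their request; the active revision resolves here.
        return {"revision": self.active_revision, "input": task_input}
```

The practical upshot is that clients calling the task's endpoint need no code changes when you promote: the same call simply starts being served by the new revision.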
