This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

Welcome

Welcome to merge.ai official documentation site.

DOCS SECTION UNDER DEVELOPMENT!

About merge.ai

merge.ai is your gateway to the latest advancements in generative AI technology. Our app offers seamless access to cutting-edge models from top-tier AI providers, including OpenAI, Anthropic, Stability.ai, and more.

With merge.ai, you can effortlessly interact with the newest generative AI tools available. Whether you’re looking to enhance your productivity with sophisticated chat completions or create compelling visual content through image and video generation, our platform provides an extensive suite of tools designed to meet your needs.

merge.ai gives you access to the latest models available without the need to individually subscribe to every provider and pay monthly fees. With our pay-as-you-go strategy you only pay for the tokens/credits you actually use.

Experience the future of AI today with merge.ai, where innovation meets convenience.

Documentation

We’re working hard to ensure that our documentation keeps up with our growing community. If you have a question we encourage you to start with the documentation (right here!). If you can’t find what you’re looking for, please visit one of our support channels.

Getting support

Users can get support through the following channels:

Where should I go next?

1 - Release Notes

Release 1.5.0

March 6, 2025

Changes

  • Changed Components Library style to New-York
  • Added option to start chat from splash screen
  • Improved screen area usage (specially at mobile)
  • Added support to OpenAI GPT-4.5 Preview
  • Added support to OpenAI o1 and o3-mini
  • Added support to Gemini model 2.0 flash
  • Updated Prices
  • Updated image models to version 3.5
  • Minor improvements on UI/UX
  • Bug fixing.

Release 1.4.0

February 27, 2025

Changes

  • Added persistent Model selection per Thread.
  • Added new voices for Text to Speech.
  • Minor improvements and bug fixing.

Release 1.3.3

February 24, 2025

Changes

  • Added x.Ai Grok models.
  • Added new model Claude 3.7 Sonnet from Anthropic.
  • Multiple package updates.
  • Minor improvements and bug fixing.

Release 1.3.2

February 7, 2025

Changes

  • Added DeepSeek V3 and R1 models.
  • Added Google Gemini Flash 2.0 model.
  • Improved way to select Model at Chat Completions.
  • Added Export Thread Messages button.
  • Minor improvements and bug fixing.

Release 1.3.1

November 26, 2024

Changes

  • New Chat Bubble UI.
  • Multiple major UI/UX improvements.
  • Minor improvements and bug fixing.

Release 1.3.0

November 20, 2024

Changes

  • Added support to OpenAI o1-preview and o1-mini.
  • Added support to Anthropic Claude 3.5 Haiku.
  • Minor improvements and bug fixing.

Release 1.2.0

October 18, 2024

Changes

  • Added NEW FEATURE Task Planner.
  • Updated token prices.
  • Minor improvements and bug fixing.

Release 1.1.0

September 15, 2024

Changes

  • Added support to Google Gemini Models.
  • Added support to edit and export chat messages.
  • Minor improvements and bug fixing.

Release 1.0.0

August 1, 2024

Initial release of merge.ai.

2 - Getting Started

This section explains how to access merge.ai. We recommend that you read the entire section to ensure the best user experience.

The following pages contain information to help you get started using merge.ai:

2.1 - Sign up for an Account

If you do not have a user account, follow these steps to create one.

  1. At the website top Banner, click the Sign in button.

  2. Click the Sign up link in ‘Don’t have an account? Sign Up’.

  3. Enter your details on the following screens.

  4. Click the Continue button to create your account.

  1. Once your account is created and verified you will be able to interact with models and features.

Where should I go next?

2.2 - Logging in to merge.ai

Follow these steps to log in to your account.

  1. At the website top Banner, click the Sign in button.

  2. Enter your e-mail address and click Continue.

  3. Enter your password and click Continue.

  4. You will be redirected to the initial page (Chat Completions).

Cannot remember your password?

  1. Click Can’t log in to your account?

  2. Enter your enrolled email address and you will receive instructions in your inbox on how to reset your password.

Where should I go next?

2.3 - Exploring the merge.ai Workspace

The Chat Completions area is the first page you see after logging in to merge.ai.

  • The navigation bar (at the left of the screen) is the same on every screen in merge.ai. It contains links which give you quick access to many of merge.ai’s areas.

  • The header bar (at the top of the screen) contains area-specific functions plus the Logout and Light/Dark Theme buttons.

Chat Completions Prompt Patterns Explorer
Chat with Documents Documentation
Text to Speech Pricing
Image Generation Account
Video Generation

Header bar Icons

Area-specific Configuration
Light/Dark Theme button
Logout button

Where should I go next?

2.4 - Using merge.ai on a Mobile Device

When you view a merge.ai page on a mobile device, such as an iPhone or an Android phone, merge.ai will display an optimised version of the page.

What does merge.ai look like on a mobile device?

Chat Completions
Threads
Configuration
Image Generation

What can you do in merge.ai on a mobile device?

The merge.ai mobile interface has been designed to give users access to all features available on the desktop interface.

3 - Using merge.ai

This section explains how to use merge.ai many features.

The following pages contain information to help you understand and use merge.ai:

3.1 - Chat Completions

Chat Completions is where you interact via text, or image (when available) with the most advanced LLMs (Large Language Models) on the market.

  • The easiest way to start is by selecting a Thread (by clicking on it) and start typing your messages. If you don’t have any Threads, create one by using the “add” button (1).

  • If you want to change the model or advanced options click on the “gear” (2). More details at Configuring Chat Completions.

Threads

Threads are a group of Chat Messages. You can think of Threads being the subject of a conversation, holding all related messages that belongs to that subejct.

You can create up to 100 Threads but only the first 50 will be shown at the Threads browser on the left side of the screen.

Chat Messages

Each chat message has a role (either system, user, or assistant) and content.

  • The system message is optional and can be used to set the behavior of the assistant

  • The user messages provide requests or comments for the assistant to respond to

  • Assistant messages is the answer from the model

To customize the system message and other advanced options check Configuring Chat Completions.

Model

You can check the selected LLM Model by looking at the Selected Model Chip (7).

Differently from the system message wich is unique per Thread, the LLM Model can be changed at any time. That means you can start a chat with Model “A” and continue the same chat with Model “B”.

To customize the LLM Model and other advanced options check Configuring Chat Completions.

Pricing

Chat Completions pricing is based on token usage. You can think of tokens as pieces of words used for natural language processing. For more details visit What is a token?.

For price details per Model visit Pricing.

Where should I go next?

3.1.1 - Configuring Chat Completions

The Configuration menu is displayed when you click on the “gear” icon at the Header Bar.

Model

Merge.ai is powered by a diverse set of models with different capabilities and price points. You can choose them using the drop-down menu (1).

For price details per Model visit Pricing.

Temperature

Temperature value controls the output balance between coherence and creativity. Lower values for temperature result in more consistent outputs (e.g. 0.2), while higher values generate more diverse and creative results (e.g. 1.0). The default value is 1.

For more details visit What is Temperature?.

Top P

Top P or nucleus sampling is a setting that decides how many possible words to consider. A high “Top P” value means the model looks at more possible words, even the less likely ones, which makes the generated text more diverse.

For more details visit What is Top P?.

System Message

The System Message is a special input provided to the model to steer its behavior and set the context for its interactions. Its primary purpose is to define guidelines, tone, or instructions on how the model should respond to user inputs. This helps ensure that the generated outputs are aligned with the desired objectives and constraints.

The default message is “You are a helpful assistant” and there are two ways of changing it:

  • Typing your desired System Message at the input text field

  • Choose a pre-defined Prompt Pattern from the “Prompt Patterns” drop-down menu

For more details on Prompt Patterns visit Prompt Patterns Explorer.

Where should I go next?

3.2 - Chat with Documents

3.3 - Text to Speech

3.4 - Image Generation

3.5 - Video Generation

3.6 - Prompt Patterns Explorer

3.7 - Account

4 - Frequently Asked Questions

Search the most frequently asked questions about merge.ai.

4.1 - What is a token?

Tokens are pieces of words used for natural language processing.

For English text, 1 token is approximately 4 characters or 0.75 words.

As a point of reference, the collected works of Shakespeare are about 900,000 words or 1.2M tokens.

To learn more about how tokens work and estimate your usage you can visit OpenAI Tokenizer tool.

Where should I go next?

4.2 - What is Temperature?

Temperature is a parameter that controls randomness when picking words during text creation. Low values of temperature make the text more predictable and consistent, while high values let more freedom and creativity into the mix, but can also make things less consistent. Temperature can vary from 0 to 1.

  • Temperature closer to 0: Responses are very predictable, always choosing the next most likely word. This is great for answers where facts and accuracy are really important.

  • Temperature closer to 1: The model takes more chances, picking words that are less likely, which can lead to more creative but unpredictable answers.

Examples of Temperature

  • Temperature = 0: If you ask, “What are the benefits of exercising?”, with a temperature of 0, the model might say: “Exercising improves heart health and muscle strength, lowers the chance of chronic diseases, and helps manage weight.”

  • Temperature = 1: With the same question on exercise and a temperature of 1, you might get: “Exercise is the alchemist turning sweat into a miracle cure, a ritual dancing in the flames of effort and reward.”

Where should I go next?

4.3 - What is Top P?

Top P or nucleus sampling is a parameter that decides how many possible words to consider. A high “Top P” value means the model looks at more possible words, even the less likely ones, which makes the generated text more diverse. Top P can vary from 0 to 1.

  • Top P = 0.5: The model considers words that together add up to at least 50% of the total probability, leaving out the less likely ones and keeping a good level of varied responses.

  • Top P = 0.9: The model includes a lot more words in the choice, allowing for more variety and originality.

Examples of Top P

  • Top P = 0.5: If you ask for a title for an adventure book, with a top-p of 0.5, the model might come up with: “The Mystery of the Blue Mountain.”

  • Top P = 0.9: For the same adventure book title and a top-p of 0.9, the model might create: “Voices from the Abyss: A Portrait of the Brave.”

Where should I go next?

4.4 - Mixing Temperature and Top P

Mixing Temperature and Top P can give a wide range of text styles. A low Temperature with a high Top P can lead to coherent text with creative touches. On the other hand, a high Temperature with a low Top P might give you common words put together in unpredictable ways.

Low Temperature and High Top P

Model outputs are usually logical and consistent because of the low Temperature, but they can still have rich vocabulary and ideas due to the high Top P. This setup is good for educational or informative texts where clarity is crucial, but you also want to keep the reader’s interest.

High Temperature and Low Top P

Model outputs often results in texts where sentences may make sense on their own but as a whole seem disconnected or less logical. The high Temperature allows more variation in sentence building, while the low Top P limits word choices to the most likely ones. This can be useful in creative settings where you want unexpected results or to spark new ideas with unusual concept combinations.

Where should I go next?