The following pages contain information to help you understand and use merge.ai:
This is the multi-page printable view of this section. Click here to print.
Using merge.ai
- 1: Chat Completions
- 2: Chat with Documents
- 3: Text to Speech
- 4: Image Generation
- 5: Video Generation
- 6: Prompt Patterns Explorer
- 7: Account
1 - Chat Completions
Chat Completions is where you interact via text, or image (when available) with the most advanced LLMs (Large Language Models) on the market.
-
The easiest way to start is by selecting a Thread (by clicking on it) and start typing your messages. If you don’t have any Threads, create one by using the “add” button (1).
-
If you want to change the model or advanced options click on the “gear” (2). More details at Configuring Chat Completions.

If the selected Model is capable of ingest images, the “attach file (4)” option will be available.
Threads
Threads are a group of Chat Messages. You can think of Threads being the subject of a conversation, holding all related messages that belongs to that subejct.
You can create up to 100 Threads but only the first 50 will be shown at the Threads browser on the left side of the screen.
Chat Messages
Each chat message has a role (either system, user, or assistant) and content.
-
The system message is optional and can be used to set the behavior of the assistant
-
The user messages provide requests or comments for the assistant to respond to
-
Assistant messages is the answer from the model
To customize the system message and other advanced options check Configuring Chat Completions.
Model
You can check the selected LLM Model by looking at the Selected Model Chip (7).
Differently from the system message wich is unique per Thread, the LLM Model can be changed at any time. That means you can start a chat with Model “A” and continue the same chat with Model “B”.
To customize the LLM Model and other advanced options check Configuring Chat Completions.
Pricing
Chat Completions pricing is based on token usage. You can think of tokens as pieces of words used for natural language processing. For more details visit What is a token?.
For price details per Model visit Pricing.
Where should I go next?
1.1 - Configuring Chat Completions
The Configuration menu is displayed when you click on the “gear” icon at the Header Bar.

Model
Merge.ai is powered by a diverse set of models with different capabilities and price points. You can choose them using the drop-down menu (1).
For price details per Model visit Pricing.
Temperature
Temperature value controls the output balance between coherence and creativity. Lower values for temperature result in more consistent outputs (e.g. 0.2), while higher values generate more diverse and creative results (e.g. 1.0). The default value is 1.
For more details visit What is Temperature?.
Top P
Top P or nucleus sampling is a setting that decides how many possible words to consider. A high “Top P” value means the model looks at more possible words, even the less likely ones, which makes the generated text more diverse.
For more details visit What is Top P?.
System Message
The System Message is a special input provided to the model to steer its behavior and set the context for its interactions. Its primary purpose is to define guidelines, tone, or instructions on how the model should respond to user inputs. This helps ensure that the generated outputs are aligned with the desired objectives and constraints.
The default message is “You are a helpful assistant” and there are two ways of changing it:
-
Typing your desired System Message at the input text field
-
Choose a pre-defined Prompt Pattern from the “Prompt Patterns” drop-down menu
For more details on Prompt Patterns visit Prompt Patterns Explorer.