Skip to content

Azure OpenAI Chat Model#

Use the Azure OpenAI Chat Model node to use OpenAI's chat models with conversational agents.

On this page, you'll find the node parameters for the Azure OpenAI Chat Model node, and links to more resources.

Credentials

You can find authentication information for this node here.

Parameter resolution in sub-nodes

Sub-nodes behave differently to other nodes when processing multiple items using an expression.

Most nodes, including root nodes, take any number of items as input, process these items, and output the results. You can use expressions to refer to input items, and the node resolves the expression for each item in turn. For example, given an input of five name values, the expression {{ $json.name }} resolves to each name in turn.

In sub-nodes, the expression always resolves to the first item. For example, given an input of five name values, the expression {{ $json.name }} always resolves to the first name.

Node parameters#

Model: the model to use to generate the completion.

Node options#

  • Frequency Penalty: increase this to reduce the chance of the model repeating itself.
  • Maximum Number of Tokens: the completion length, in characters.
  • Response Format: choose Text or JSON. JSON ensures the model returns valid JSON.
  • Presence Penalty: increase this to increase the chance of the model talking about new topics.
  • Sampling Temperature: controls the randomness of the sampling process. A higher temperature creates more diverse sampling, but increases the risk of hallucinations.
  • Timeout: maximum request time in milliseconds.
  • Max Retries: maximum number of times to retry a request.
  • Top P: use a lower value to ignore less probable options.

Templates and examples#

Browse Azure OpenAI Chat Model integration templates, or search all templates

Refer to LangChains's Azure OpenAI documentation for more information about the service.

View n8n's Advanced AI documentation.

  • completion: Completions are the responses generated by a model like GPT.
  • hallucinations: Hallucination in AI is when an LLM (large language model) mistakenly perceives patterns or objects that don't exist.
  • vector database: A vector database stores mathematical representations of information. Use with embeddings and retrievers to create a database that your AI can access when answering questions.
  • vector store: A vector store, or vector database, stores mathematical representations of information. Use with embeddings and retrievers to create a database that your AI can access when answering questions.