LLM Configuration#

Louie supports a growing number of generative and embedding models. That includes using multiple models simultaneously and in air-gapped environments, and control of which model is used when.

Louie supports all providers implementing the OpenAI chat protocol: follow the OpenAI instructions in that case. Additionally, it natively supports a growing number of LLMs and providers, including OpenAI, Azure OpenAI, Anthropic Claude, and Groq.

Key concepts#

Configuration phases#

You can configure:

Model provider connection: Securely connect to one or more providers
Model registration: Automatically or manually register individual models from a provider with a model profile name
Model selection: Use the same model everywhere, or configure preferences on which model to use with which agent or agentic task

The same system is used for both generative chat models and embedding models.

Recommended models#

We generally recommend using models at the OpenAI GPT-4 level or better. This includes coverage for the following tasks:

Smart and fast. Ex: gpt-4o for instruction following and code generation
Smart, fast, scalable, and cheap. Ex: gpt-4o-mini for processing large amounts of data
Large context. Ex: gpt-4.1

Model provider authentication#

You must provide connection information for your model provider(s), including potential authentication details.

Depending on the complexity of deployment, this is done in:

data/custom.env: For a single global model provider
data/<provider>/auth.env: When multiple providers, but at most one of each type
data/<provider>/auth/profile123.env: When providers of the same type, such as due to differing model availabilities and new model testing

When multiple authentication profiles for the same provider, set distinct AUTH_PROFILE_NAME=... names.

Model registration#

Louie automatically detects the models exposed by some providers, such as for OpenAI. Other providers, such as Azure OpenAI, require you to explicitly register or configure your model.

Depending on the complexity of deployment, this is done in:

data/custom.env: For registering a single model, or a provider that supports automatic multi-model registration
data/<provider>/model.env: When multiple providers, but at most one of each type, and registering a single model or a model provider that supports automatic multi-model registration
data/<provider>/model/profile123.env: When multiple models from the same provider and that provider does not support automatic multi-model registration

When multiple model profiles for the same provider, set distinct MODEL_PROVIDER_PROFILE_NAME=... names.

Model selection#

You configure which models are used by default. Optionally, you can further configure which models are used for specific agents and their tasks.

Model names are specified by the profile in the registration step.

Testing#

When starting Louie, you can see which model provider connections are established and which models are registered.

You can dynamically inspect what configurations are used, as a system administrator, via /api/docs endpoints /api/account and /api/capabilities.

Contact staff for planning assistance#

Please contact staff for assistance with AI configuration. Discussions benefit from knowledge around:

Model service provider, e.g., Azure OpenAI vs OpenAI vs a local serving framework
Available models
Hardware availa7bility
Target workloads, e.g., number of active analysts and whether any batch/continuous processing is needed

Model selection#

After connecting to your model provider and registering its models (see below), you can specify which models are used by default and for specific tasks via your data/custom.env:

Default model selection#

We recommend setting the below. Generally, stick to gpt-4o+ grade models. We encourage experimentation with new models such as o-series for reasoning, and performing real testing before committing to a model.

# Required
LLM_MODEL=gpt-4o

# Optional
LLM_MODEL_FAST=gpt-4o-mini
LLM_MODEL_SMART=gpt-4o
LLM_MODEL_REASONING=gpt-4o
LLM_MODEL_LARGE_CONTEXT=gpt-4o

Agent model selection#

Each agent, and key tasks in agents, may be configured to use specific models:

System agent selection#

LLM_MODEL_LOUIE=
LLM_MODEL_LOUIE_MEMORY=
LLM_MODEL_MAKE_EXAMPLE=
LLM_MODEL_MAKE_RECIPE=
LLM_MODEL_TABLEQA=
LLM_MODEL_TABLEQA_ASK=

Plugin agent selection#

LLM_MODEL_CHART=
LLM_MODEL_CODE=
LLM_MODEL_CYPHER=
LLM_MODEL_DATABRICKS=
LLM_MODEL_DATABRICKS_CONTEXT=
LLM_MODEL_GRAPHISTRY=
LLM_MODEL_OPENSEARCH=
LLM_MODEL_OPENSEARCH_CONTEXT=
LLM_MODEL_PANDAS=
LLM_MODEL_SPLUNK=
LLM_MODEL_SQL=
LLM_MODEL_SQL_CONTEXT=

Embedding model selection#

Embedding models are registered just like generative models. You may also specify which embedding model to use in general, or for specific indexes. Louie ships with out-of-the-box serving of sentence-transformers, and most model service providers also serve embedding models.

LLM_EMBEDDING_MODEL=sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2

OPENSEARCH_INDEX_EMBEDDING_MAPPING='{"myindex1": "text-embedding-ada-002", "myindex2":"matryoska"}'

LLM Provider connection and registration#

Louie LLM connection configuration is currently via environment variables. Depending on deployment complexity, place your connection configuration in:

data/custom.env: Single model provider
data/<provider>/auth.env: Multiple model providers, but at most one of each type
data/<provider>/auth/profile123.env: Multiple model providers of the same type, such as due to differing model availabilities and new model testing

Models from providers like OpenAI are automatically registered, but for other providers, you must manually register the models:

data/custom.env: Single model to register, or a provider that supports automatic multi-model registration
data/<provider>/model.env: Multiple model providers, but at most one of each type, and registering a single model or a model provider that supports automatic multi-model registration
data/<provider>/model/profile123.env: Multiple models from the same provider and that provider does not support automatic multi-model registration

After configuring your model provider connection and registering your models, restart the Louie server to apply the changes:

Restart the Louie server:

cd /var/louie
./dc up -d --force-recreate louie api

OpenAI#

Get an OpenAI key
Provider connection

In data/custom.env, data/openai/auth.env, or data/openai/auth/profile123.env:

OPENAI_API_KEY=...

# OPTIONAL
OPENAI_ORGANIZATION=...
OPENAI_PROXY=...

Model registration

You may manually register models, but typically unnecessary, in data/custom.env, data/openai/model.env, or data/openai/models/profile123.env.

Azure OpenAI#

Create Model Deployments

Create the desired Model Deployments in Azure OpenAI Studio.
Provider connection

In data/custom.env, data/azureopenai/auth.env, or data/azureopenai/auth/profile123.env:

AZURE_OPENAI_API_KEY=...
AZURE_OPENAI_API_ENDPOINT=...

Model registration

In data/custom.env, data/azureopenai/model.env, or data/azureopenai/model/profile123.env:

# Azure-configured name
AZURE_DEPLOYMENT_NAME=gpt-4-latest-10Ktpm

# You can define a shorter nickname for internal use
AZURE_MODEL_NAME=gpt-4

# When using model.env or model/<profile>.env, set a unique MODEL_PROVIDER_PROFILE_NAME when using multiple models or providers to prevent name conflicts
MODEL_PROVIDER_PROFILE_NAME=azure-openai-prod-gpt-4

Anthropic#

Get API Key

Obtain an API key from your Anthropic account.
Provider connection

In data/custom.env, data/anthropic/auth.env, or data/anthropic/auth/profile123.env:

ANTHROPIC_API_KEY=...

# OPTIONAL
ANTHROPIC_API_URL=...

Model registration

In data/custom.env, data/anthropic/model.env, or data/anthropic/model/profile123.env:

# OPTIONAL
MODEL_PROVIDER_PROFILE_NAME=...

Groq#

Get API Key

Obtain an API key from your Groq account.
Provider connection

In data/custom.env, data/groq/auth.env, or data/groq/auth/profile123.env:

GROQ_API_KEY=...

# OPTIONAL
GROQ_API_BASE=...
GROQ_PROXY=...

Model registration

In data/custom.env, data/groq/model.env, or data/groq/model/profile123.env:

GROQ_CUSTOM_MODEL_NAME_OVERRIDE='cllama3-70b-8192'

Vertex AI#

To use Gemini chat and embedding models via Vertex AI:

Create a Service Account with Required Roles

Assign the following IAM roles to your service account:

roles/aiplatform.user
roles/aiplatform.serviceAgent
roles/generativelanguage.user
roles/generativeaiprojects.user
roles/storage.objectViewer

Configure Louie

In your custom.env:

VERTEX_SERVICE_ACCOUNT_JSON_PATH=/custom/path/service-account.json  # Default path /opt/host_data/vertex/service-account.json
VERTEX_PROJECT=<YOUR_PROJECT_ID>
VERTEX_LOCATION=us-central1

Sentence Transformers#

Louie comes with several popular sentence transformer embedding models preloaded. See system start or capabilities/ API for list of available models. The default recommendation is sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2.

For larger deployments with many users, consider using a dedicated model server. Ex: Contact staff about configuring Nvidia Triton and other popular OSS options.

LLM Configuration#

Key concepts#

Configuration phases#

Recommended models#

Model provider authentication#

Model registration#

Model selection#

Testing#

Contact staff for planning assistance#

Model selection#

Default model selection#

Agent model selection#

System agent selection#

Plugin agent selection#

Embedding model selection#

LLM Provider connection and registration#

OpenAI#

Azure OpenAI#

Anthropic#

Groq#

Vertex AI#

Sentence Transformers#

This Page