Class Ollama

Class that represents the Ollama language model. It extends the base LLM class and implements the OllamaInput interface.

Example

const ollama = new Ollama({
  baseUrl: "http://api.example.com",
  model: "llama2",
});

// Streaming translation from English to German
const stream = await ollama.stream(
  `Translate "I love programming" into German.`
);

const chunks = [];
for await (const chunk of stream) {
  chunks.push(chunk);
}

console.log(chunks.join(""));

Hierarchy

LLM<OllamaCallOptions>
- Ollama

Implements

OllamaInput

Constructors

constructor

new Ollama(fields): Ollama
Parameters
- fields: OllamaInput & BaseLLMParams
Returns Ollama
Overrides LLM.constructor
- Defined in docs/api_refs/langchain/src/llms/ollama.ts:109

Properties

CallOptions

CallOptions: OllamaCallOptions

ParsedCallOptions

ParsedCallOptions: Omit<OllamaCallOptions, never>

baseUrl

baseUrl: string = "http://localhost:11434"

caller

caller: AsyncCaller

The async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.

model

model: string = "llama2"

verbose

verbose: boolean

Whether to print out response text.

`Optional` cache

cache?: BaseCache<Generation[]>

`Optional` callbacks

callbacks?: Callbacks

`Optional` embeddingOnly

embeddingOnly?: boolean

`Optional` f16KV

f16KV?: boolean

`Optional` format

format?: StringWithAutocomplete<"json">

`Optional` frequencyPenalty

frequencyPenalty?: number

`Optional` logitsAll

logitsAll?: boolean

`Optional` lowVram

lowVram?: boolean

`Optional` mainGpu

mainGpu?: number

`Optional` metadata

metadata?: Record<string, unknown>

`Optional` mirostat

mirostat?: number

`Optional` mirostatEta

mirostatEta?: number

`Optional` mirostatTau

mirostatTau?: number

`Optional` numBatch

numBatch?: number

`Optional` numCtx

numCtx?: number

`Optional` numGpu

numGpu?: number

`Optional` numGqa

numGqa?: number

`Optional` numKeep

numKeep?: number

`Optional` numThread

numThread?: number

`Optional` penalizeNewline

penalizeNewline?: boolean

`Optional` presencePenalty

presencePenalty?: number

`Optional` repeatLastN

repeatLastN?: number

`Optional` repeatPenalty

repeatPenalty?: number

`Optional` ropeFrequencyBase

ropeFrequencyBase?: number

`Optional` ropeFrequencyScale

ropeFrequencyScale?: number

`Optional` stop

stop?: string[]

`Optional` tags

tags?: string[]

`Optional` temperature

temperature?: number

`Optional` tfsZ

tfsZ?: number

`Optional` topK

topK?: number

`Optional` topP

topP?: number

`Optional` typicalP

typicalP?: number

`Optional` useMLock

useMLock?: boolean

`Optional` useMMap

useMMap?: boolean

`Optional` vocabOnly

vocabOnly?: boolean

Accessors

callKeys

get callKeys(): string[]
Keys that the language model accepts as call options.

Returns string[]
Inherited from LLM.callKeys
- Defined in langchain-core/dist/language_models/base.d.ts:113

Methods

batch

batch(inputs, options?, batchOptions?): Promise<string[]>
Default implementation of batch, which calls invoke N times. Subclasses should override this method if they can batch more efficiently.
Parameters
- inputs: BaseLanguageModelInput[]
  
  Array of inputs to each batch call.
- Optional options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]
  
  Either a single call options object to apply to each batch call or an array for each call.
- Optional batchOptions: RunnableBatchOptions & {
  returnExceptions?: false;
  }
Returns Promise<string[]>
An array of RunOutputs, or mixed RunOutputs and errors if batchOptions.returnExceptions is set
Inherited from LLM.batch
- Defined in langchain-core/dist/runnables/base.d.ts:71
batch(inputs, options?, batchOptions?): Promise<(string | Error)[]>
Parameters
- inputs: BaseLanguageModelInput[]
- Optional options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]
- Optional batchOptions: RunnableBatchOptions & {
  returnExceptions: true;
  }
Returns Promise<(string | Error)[]>
Inherited from LLM.batch
- Defined in langchain-core/dist/runnables/base.d.ts:74
batch(inputs, options?, batchOptions?): Promise<(string | Error)[]>
Parameters
- inputs: BaseLanguageModelInput[]
- Optional options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]
- Optional batchOptions: RunnableBatchOptions
Returns Promise<(string | Error)[]>
Inherited from LLM.batch
- Defined in langchain-core/dist/runnables/base.d.ts:77

bind

bind(kwargs): Runnable<BaseLanguageModelInput, string, OllamaCallOptions>
Bind arguments to a Runnable, returning a new Runnable.
Parameters
- kwargs: Partial<OllamaCallOptions>
Returns Runnable<BaseLanguageModelInput, string, OllamaCallOptions>
A new RunnableBinding that, when invoked, will apply the bound args.
Inherited from LLM.bind
- Defined in langchain-core/dist/runnables/base.d.ts:28

call

call(prompt, options?, callbacks?): Promise<string>
Convenience wrapper for generate that takes in a single string prompt and returns a single string output.
Parameters
- prompt: string
- Optional options: string[] | OllamaCallOptions
- Optional callbacks: Callbacks
Returns Promise<string>
Inherited from LLM.call
- Defined in langchain-core/dist/language_models/llms.d.ts:71

generate

generate(prompts, options?, callbacks?): Promise<LLMResult>
Run the LLM on the given prompts and input, handling caching.
Parameters
- prompts: string[]
- Optional options: string[] | OllamaCallOptions
- Optional callbacks: Callbacks
Returns Promise<LLMResult>
Inherited from LLM.generate
- Defined in langchain-core/dist/language_models/llms.d.ts:67

generatePrompt

generatePrompt(promptValues, options?, callbacks?): Promise<LLMResult>
This method takes prompt values, options, and callbacks, and generates a result based on the prompts.
Parameters
- promptValues: BasePromptValue[]
  
  Prompt values for the LLM.
- Optional options: string[] | OllamaCallOptions
  
  Options for the LLM call.
- Optional callbacks: Callbacks
  
  Callbacks for the LLM call.
Returns Promise<LLMResult>
An LLMResult based on the prompts.
Inherited from LLM.generatePrompt
- Defined in langchain-core/dist/language_models/llms.d.ts:50

getNumTokens

getNumTokens(content): Promise<number>
Parameters
- content: MessageContent
Returns Promise<number>
Inherited from LLM.getNumTokens
- Defined in langchain-core/dist/language_models/base.d.ts:130

invocationParams

invocationParams(options?): {
    format: undefined | StringWithAutocomplete<"json">;
    model: string;
    options: {
        embedding_only: undefined | boolean;
        f16_kv: undefined | boolean;
        frequency_penalty: undefined | number;
        logits_all: undefined | boolean;
        low_vram: undefined | boolean;
        main_gpu: undefined | number;
        mirostat: undefined | number;
        mirostat_eta: undefined | number;
        mirostat_tau: undefined | number;
        num_batch: undefined | number;
        num_ctx: undefined | number;
        num_gpu: undefined | number;
        num_gqa: undefined | number;
        num_keep: undefined | number;
        num_thread: undefined | number;
        penalize_newline: undefined | boolean;
        presence_penalty: undefined | number;
        repeat_last_n: undefined | number;
        repeat_penalty: undefined | number;
        rope_frequency_base: undefined | number;
        rope_frequency_scale: undefined | number;
        stop: undefined | string[];
        temperature: undefined | number;
        tfs_z: undefined | number;
        top_k: undefined | number;
        top_p: undefined | number;
        typical_p: undefined | number;
        use_mlock: undefined | boolean;
        use_mmap: undefined | boolean;
        vocab_only: undefined | boolean;
    };
}
Get the parameters used to invoke the model
Parameters
- Optional options: Omit<OllamaCallOptions, never>
Returns {
    format: undefined | StringWithAutocomplete<"json">;
    model: string;
    options: {
        embedding_only: undefined | boolean;
        f16_kv: undefined | boolean;
        frequency_penalty: undefined | number;
        logits_all: undefined | boolean;
        low_vram: undefined | boolean;
        main_gpu: undefined | number;
        mirostat: undefined | number;
        mirostat_eta: undefined | number;
        mirostat_tau: undefined | number;
        num_batch: undefined | number;
        num_ctx: undefined | number;
        num_gpu: undefined | number;
        num_gqa: undefined | number;
        num_keep: undefined | number;
        num_thread: undefined | number;
        penalize_newline: undefined | boolean;
        presence_penalty: undefined | number;
        repeat_last_n: undefined | number;
        repeat_penalty: undefined | number;
        rope_frequency_base: undefined | number;
        rope_frequency_scale: undefined | number;
        stop: undefined | string[];
        temperature: undefined | number;
        tfs_z: undefined | number;
        top_k: undefined | number;
        top_p: undefined | number;
        typical_p: undefined | number;
        use_mlock: undefined | boolean;
        use_mmap: undefined | boolean;
        vocab_only: undefined | boolean;
    };
}
- format: undefined | StringWithAutocomplete<"json">
- model: string
- options: {
      embedding_only: undefined | boolean;
      f16_kv: undefined | boolean;
      frequency_penalty: undefined | number;
      logits_all: undefined | boolean;
      low_vram: undefined | boolean;
      main_gpu: undefined | number;
      mirostat: undefined | number;
      mirostat_eta: undefined | number;
      mirostat_tau: undefined | number;
      num_batch: undefined | number;
      num_ctx: undefined | number;
      num_gpu: undefined | number;
      num_gqa: undefined | number;
      num_keep: undefined | number;
      num_thread: undefined | number;
      penalize_newline: undefined | boolean;
      presence_penalty: undefined | number;
      repeat_last_n: undefined | number;
      repeat_penalty: undefined | number;
      rope_frequency_base: undefined | number;
      rope_frequency_scale: undefined | number;
      stop: undefined | string[];
      temperature: undefined | number;
      tfs_z: undefined | number;
      top_k: undefined | number;
      top_p: undefined | number;
      typical_p: undefined | number;
      use_mlock: undefined | boolean;
      use_mmap: undefined | boolean;
      vocab_only: undefined | boolean;
  }
  - embedding_only: undefined | boolean
  - f16_kv: undefined | boolean
  - frequency_penalty: undefined | number
  - logits_all: undefined | boolean
  - low_vram: undefined | boolean
  - main_gpu: undefined | number
  - mirostat: undefined | number
  - mirostat_eta: undefined | number
  - mirostat_tau: undefined | number
  - num_batch: undefined | number
  - num_ctx: undefined | number
  - num_gpu: undefined | number
  - num_gqa: undefined | number
  - num_keep: undefined | number
  - num_thread: undefined | number
  - penalize_newline: undefined | boolean
  - presence_penalty: undefined | number
  - repeat_last_n: undefined | number
  - repeat_penalty: undefined | number
  - rope_frequency_base: undefined | number
  - rope_frequency_scale: undefined | number
  - stop: undefined | string[]
  - temperature: undefined | number
  - tfs_z: undefined | number
  - top_k: undefined | number
  - top_p: undefined | number
  - typical_p: undefined | number
  - use_mlock: undefined | boolean
  - use_mmap: undefined | boolean
  - vocab_only: undefined | boolean
Overrides LLM.invocationParams
- Defined in docs/api_refs/langchain/src/llms/ollama.ts:154

invoke

invoke(input, options?): Promise<string>
This method takes an input and options, and returns a string. It converts the input to a prompt value and generates a result based on the prompt.
Parameters
- input: BaseLanguageModelInput
  
  Input for the LLM.
- Optional options: OllamaCallOptions
  
  Options for the LLM call.
Returns Promise<string>
A string result based on the prompt.
Inherited from LLM.invoke
- Defined in langchain-core/dist/language_models/llms.d.ts:35

map

map(): Runnable<BaseLanguageModelInput[], string[], OllamaCallOptions>
Return a new Runnable that maps a list of inputs to a list of outputs, by calling invoke() with each input.

Returns Runnable<BaseLanguageModelInput[], string[], OllamaCallOptions>
Inherited from LLM.map
- Defined in langchain-core/dist/runnables/base.d.ts:33

pipe

pipe<NewRunOutput>(coerceable): RunnableSequence<BaseLanguageModelInput, Exclude<NewRunOutput, Error>>
Create a new runnable sequence that runs each individual runnable in series, piping the output of one runnable into another runnable or runnable-like.
Type Parameters
- NewRunOutput
Parameters
- coerceable: RunnableLike<string, NewRunOutput>
  
  A runnable, function, or object whose values are functions or runnables.
Returns RunnableSequence<BaseLanguageModelInput, Exclude<NewRunOutput, Error>>
A new runnable sequence.
Inherited from LLM.pipe
- Defined in langchain-core/dist/runnables/base.d.ts:131

predict

predict(text, options?, callbacks?): Promise<string>
This method is similar to call, but it's used for making predictions based on the input text.
Parameters
- text: string
  
  Input text for the prediction.
- Optional options: string[] | OllamaCallOptions
  
  Options for the LLM call.
- Optional callbacks: Callbacks
  
  Callbacks for the LLM call.
Returns Promise<string>
A prediction based on the input text.
Inherited from LLM.predict
- Defined in langchain-core/dist/language_models/llms.d.ts:80

predictMessages

predictMessages(messages, options?, callbacks?): Promise<BaseMessage>
This method takes a list of messages, options, and callbacks, and returns a predicted message.
Parameters
- messages: BaseMessage[]
  
  A list of messages for the prediction.
- Optional options: string[] | OllamaCallOptions
  
  Options for the LLM call.
- Optional callbacks: Callbacks
  
  Callbacks for the LLM call.
Returns Promise<BaseMessage>
A predicted message based on the list of messages.
Inherited from LLM.predictMessages
- Defined in langchain-core/dist/language_models/llms.d.ts:89

serialize

serialize(): SerializedLLM
Returns SerializedLLM

Deprecated
Return a json-like object representing this LLM.
Inherited from LLM.serialize
- Defined in langchain-core/dist/language_models/llms.d.ts:104

stream

stream(input, options?): Promise<IterableReadableStream<string>>
Stream output in chunks.
Parameters
- input: BaseLanguageModelInput
- Optional options: Partial<OllamaCallOptions>
Returns Promise<IterableReadableStream<string>>
A readable stream that is also an iterable.
Inherited from LLM.stream
- Defined in langchain-core/dist/runnables/base.d.ts:92

streamLog

streamLog(input, options?, streamOptions?): AsyncGenerator<RunLogPatch, any, unknown>
Stream all output from a runnable, as reported to the callback system. This includes all inner runs of LLMs, Retrievers, Tools, etc. Output is streamed as Log objects, which include a list of jsonpatch ops that describe how the state of the run has changed in each step, and the final state of the run. The jsonpatch ops can be applied in order to construct state.
Parameters
- input: BaseLanguageModelInput
- Optional options: Partial<OllamaCallOptions>
- Optional streamOptions: Omit<LogStreamCallbackHandlerInput, "autoClose">
Returns AsyncGenerator<RunLogPatch, any, unknown>
Inherited from LLM.streamLog
- Defined in langchain-core/dist/runnables/base.d.ts:151

toJSON

toJSON(): Serialized
Returns Serialized
Inherited from LLM.toJSON
- Defined in langchain-core/dist/load/serializable.d.ts:72

toJSONNotImplemented

toJSONNotImplemented(): SerializedNotImplemented
Returns SerializedNotImplemented
Inherited from LLM.toJSONNotImplemented
- Defined in langchain-core/dist/load/serializable.d.ts:73

transform

transform(generator, options): AsyncGenerator<string, any, unknown>
Default implementation of transform, which buffers input and then calls stream. Subclasses should override this method if they can start producing output while input is still being generated.
Parameters
- generator: AsyncGenerator<BaseLanguageModelInput, any, unknown>
- options: Partial<OllamaCallOptions>
Returns AsyncGenerator<string, any, unknown>
Inherited from LLM.transform
- Defined in langchain-core/dist/runnables/base.d.ts:139

withConfig

withConfig(config): RunnableBinding<BaseLanguageModelInput, string, OllamaCallOptions>
Bind config to a Runnable, returning a new Runnable.
Parameters
- config: BaseCallbackConfig
  
  New configuration parameters to attach to the new runnable.
Returns RunnableBinding<BaseLanguageModelInput, string, OllamaCallOptions>
A new RunnableBinding with a config matching what's passed.
Inherited from LLM.withConfig
- Defined in langchain-core/dist/runnables/base.d.ts:48

withFallbacks

withFallbacks(fields): RunnableWithFallbacks<BaseLanguageModelInput, string>
Create a new runnable from the current one that will try invoking other passed fallback runnables if the initial invocation fails.
Parameters
- fields: {
  fallbacks: Runnable<BaseLanguageModelInput, string, BaseCallbackConfig>[];
  }
  - fallbacks: Runnable<BaseLanguageModelInput, string, BaseCallbackConfig>[]
    
    Other runnables to call if the runnable errors.
Returns RunnableWithFallbacks<BaseLanguageModelInput, string>
A new RunnableWithFallbacks.
Inherited from LLM.withFallbacks
- Defined in langchain-core/dist/runnables/base.d.ts:55

withRetry

withRetry(fields?): RunnableRetry<BaseLanguageModelInput, string, OllamaCallOptions>
Add retry logic to an existing runnable.
Parameters
- Optional fields: {
  onFailedAttempt?: RunnableRetryFailedAttemptHandler;
  stopAfterAttempt?: number;
  }
  - Optional onFailedAttempt?: RunnableRetryFailedAttemptHandler
  - Optional stopAfterAttempt?: number
Returns RunnableRetry<BaseLanguageModelInput, string, OllamaCallOptions>
A new RunnableRetry that, when invoked, will retry according to the parameters.
Inherited from LLM.withRetry
- Defined in langchain-core/dist/runnables/base.d.ts:39

`Static` deserialize

deserialize(_data): Promise<BaseLanguageModel<any, BaseLanguageModelCallOptions>>
Parameters
- _data: SerializedLLM
Returns Promise<BaseLanguageModel<any, BaseLanguageModelCallOptions>>

Deprecated
Load an LLM from a json-like object describing it.
Inherited from LLM.deserialize
- Defined in langchain-core/dist/language_models/base.d.ts:154

`Static` isRunnable

isRunnable(thing): thing is Runnable<any, any, BaseCallbackConfig>
Parameters
- thing: any
Returns thing is Runnable<any, any, BaseCallbackConfig>
Inherited from LLM.isRunnable
- Defined in langchain-core/dist/runnables/base.d.ts:152

Class Ollama

Example

Hierarchy

Implements

Index

Constructors

Properties

Accessors

Methods

Constructors

constructor

Parameters

fields: OllamaInput & BaseLLMParams

Returns Ollama

Properties

CallOptions

ParsedCallOptions

baseUrl

caller

model

verbose

Optional cache

Optional callbacks

Optional embeddingOnly

Optional f16KV

Optional format

Optional frequencyPenalty

Optional logitsAll

Optional lowVram

Optional mainGpu

Optional metadata

Optional mirostat

Optional mirostatEta

Optional mirostatTau

Optional numBatch

Optional numCtx

Optional numGpu

Optional numGqa

Optional numKeep

Optional numThread

Optional penalizeNewline

Optional presencePenalty

Optional repeatLastN

Optional repeatPenalty

Optional ropeFrequencyBase

Optional ropeFrequencyScale

Optional stop

Optional tags

Optional temperature

Optional tfsZ

Optional topK

Optional topP

Optional typicalP

Optional useMLock

Optional useMMap

Optional vocabOnly

Accessors

callKeys

Returns string[]

Methods

batch

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]

Optional batchOptions: RunnableBatchOptions & { returnExceptions?: false; }

Returns Promise<string[]>

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]

Optional batchOptions: RunnableBatchOptions & { returnExceptions: true; }

Returns Promise<(string | Error)[]>

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]

Optional batchOptions: RunnableBatchOptions

Returns Promise<(string | Error)[]>

bind

Parameters

kwargs: Partial<OllamaCallOptions>

Returns Runnable<BaseLanguageModelInput, string, OllamaCallOptions>

`Optional` cache

`Optional` callbacks

`Optional` embeddingOnly

`Optional` f16KV

`Optional` format

`Optional` frequencyPenalty

`Optional` logitsAll

`Optional` lowVram

`Optional` mainGpu

`Optional` metadata

`Optional` mirostat

`Optional` mirostatEta

`Optional` mirostatTau

`Optional` numBatch

`Optional` numCtx

`Optional` numGpu

`Optional` numGqa

`Optional` numKeep

`Optional` numThread

`Optional` penalizeNewline

`Optional` presencePenalty

`Optional` repeatLastN

`Optional` repeatPenalty

`Optional` ropeFrequencyBase

`Optional` ropeFrequencyScale

`Optional` stop

`Optional` tags

`Optional` temperature

`Optional` tfsZ

`Optional` topK

`Optional` topP

`Optional` typicalP

`Optional` useMLock

`Optional` useMMap

`Optional` vocabOnly

`Optional` options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions & {
returnExceptions?: false;
}

`Optional` options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions & {
returnExceptions: true;
}

`Optional` options: Partial<OllamaCallOptions> | Partial<OllamaCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions

`Optional` options: string[] | OllamaCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | OllamaCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | OllamaCallOptions

`Optional` callbacks: Callbacks

`Optional` options: Omit<OllamaCallOptions, never>

`Optional` options: OllamaCallOptions