Reranking (NIM)

Model name: nim_reranking

About Reranking

Reranking is a method in text search that sorts results by relevance to make them more accurate. It gives scores to documents using cross-attention mechanisms, improving the initial search results.

Supported aidb operations

  • rerank_text

Supported models

NVIDIA NGC

  • nvidia/llama-3.2-nv-rerankqa-1b-v2 (default)

Creating the default model

SELECT aidb.create_model(
    'my_nim_reranker', 
    'nim_reranking',
    credentials=>'{"api_key": "<API_KEY_HERE>"'::JSONB
);

There is only one model, the default nvidia/nvclip, so we do not need to specify the model in the configuration.

Model configuration settings

The following configuration settings are available for CLIP models:

  • model - The NIM model to use. The default is nvidia/llama-3.2-nv-rerankqa-1b-v2 and is the only model available.
  • url - The URL of the model to use. This is optional and can be used to specify a custom model URL. Defaults to https://ai.api.nvidia.com/v1/retrieval.

Model credentials

The following credentials are required if executing inside NVIDIA NGC:

  • api_key - The NVIDIA Cloud API key to use for authentication.

Could this page be better? Report a problem or suggest an addition!