Reranking (NIM)
Model name: nim_reranking
About Reranking
Reranking is a method in text search that sorts results by relevance to make them more accurate. It gives scores to documents using cross-attention mechanisms, improving the initial search results.
Supported aidb operations
- rerank_text
Supported models
NVIDIA NGC
- nvidia/llama-3.2-nv-rerankqa-1b-v2 (default)
Creating the default model
SELECT aidb.create_model( 'my_nim_reranker', 'nim_reranking', credentials=>'{"api_key": "<API_KEY_HERE>"'::JSONB );
There is only one model, the default nvidia/nvclip
, so we do not need to specify the model in the configuration.
Model configuration settings
The following configuration settings are available for CLIP models:
model
- The NIM model to use. The default isnvidia/llama-3.2-nv-rerankqa-1b-v2
and is the only model available.url
- The URL of the model to use. This is optional and can be used to specify a custom model URL. Defaults tohttps://ai.api.nvidia.com/v1/retrieval
.
Model credentials
The following credentials are required if executing inside NVIDIA NGC:
api_key
- The NVIDIA Cloud API key to use for authentication.
Could this page be better? Report a problem or suggest an addition!