Skip to main content

llama-index packs infer retrieve rerank integration

Project description

Infer-Retrieve-Rerank LlamaPack

This is our implementation of the paper "In-Context Learning for Extreme Multi-Label Classification by Oosterlinck et al.

The paper proposes "infer-retrieve-rerank", a simple paradigm using frozen LLM/retriever models that can do "extreme"-label classification (the label space is huge).

  1. Given a user query, use an LLM to predict an initial set of labels.
  2. For each prediction, retrieve the actual label from the corpus.
  3. Given the final set of labels, rerank them using an LLM.

All of these can be implemented as LlamaIndex abstractions.

A full notebook guide can be found here.

CLI Usage

You can download llamapacks directly using llamaindex-cli, which comes installed with the llama-index python package:

llamaindex-cli download-llamapack InferRetrieveRerankPack --download-dir ./infer_retrieve_rerank_pack

You can then inspect the files at ./infer_retrieve_rerank_pack and use them as a template for your own project!

Code Usage

You can download the pack to a ./infer_retrieve_rerank_pack directory:

from llama_index.core.llama_pack import download_llama_pack

# download and install dependencies
InferRetrieveRerankPack = download_llama_pack(
    "InferRetrieveRerankPack", "./infer_retrieve_rerank_pack"
)

From here, you can use the pack, or inspect and modify the pack in ./infer_retrieve_rerank_pack.

Then, you can set up the pack like so:

# create the pack
pack = InferRetrieveRerankPack(
    labels,  # list of all label strings
    llm=llm,
    pred_context="<pred_context>",
    reranker_top_n=3,
    verbose=True,
)

The run() function runs predictions.

pred_reactions = pack.run(inputs=[s["text"] for s in samples])

You can also use modules individually.

# call the llm.complete()
llm = pack.llm
label_retriever = pack.label_retriever

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file llama_index_packs_infer_retrieve_rerank-0.5.1.tar.gz.

File metadata

File hashes

Hashes for llama_index_packs_infer_retrieve_rerank-0.5.1.tar.gz
Algorithm Hash digest
SHA256 bfacb68361075ed9a5e9f079a194c19234f58955877b87b739922a6b9730b0bf
MD5 45fe0752316915b244bf616be8bc242d
BLAKE2b-256 ac89b8b8ba89fe2f3d3e6d23a8e580fcd5ff218a5309d1ad629826e4e21e4980

See more details on using hashes here.

File details

Details for the file llama_index_packs_infer_retrieve_rerank-0.5.1-py3-none-any.whl.

File metadata

File hashes

Hashes for llama_index_packs_infer_retrieve_rerank-0.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 25067ebf2a002c447b80acb5e0cdb0807a1e44fe0a03c61a0b3a8f3811382f57
MD5 5edb4395a8a77d19e20f77b85f5a9afa
BLAKE2b-256 d154e46485c573cca7eaac4fa466cb06f61cb56282ea8d76b5123cb71d500f2a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page