BedrockEmbeddings
Amazon Bedrock 是一项完全托管的服务,通过单一 API 提供来自 AI21 Labs、Anthropic、Cohere、Meta、Stability AI 和 Amazon 等领先 AI 公司的高性能基础模型(FM)选择,并提供构建生成式 AI 应用所需的一整套功能,包括安全性、隐私保护和负责任的 AI 实践。
这将帮助您使用 LangChain 开始使用 Amazon
Bedrock 的嵌入模型。有关Bedrock功能和配置选项的详细文档,请参阅API 参考。
概览
集成详情
| 类别 | 包 | 本地支持 | Python 支持 | 包下载量 | 最新包版本 |
|---|---|---|---|---|---|
| Bedrock | @langchain/aws | ❌ | ✅ | ![]() | ![]() |
准备工作
要访问 Bedrock 嵌入模型,您需要创建一个 AWS 账户,获取 API 密钥,并安装@langchain/aws集成包。
前往AWS 文档注册 AWS 并设置您的凭证。您还需要为您的账户启用模型访问权限,可以通过按照这些说明操作来完成。
凭证
如果您希望自动追踪您的模型调用,也可以取消下面的注释来设置您的LangSmith API 密钥:
# export LANGSMITH_TRACING="true"
# export LANGSMITH_API_KEY="your-api-key"
安装
LangChain Bedrock 集成位于@langchain/aws包中:
:::提示 请参阅安装集成包的一般说明部分。 :::
- npm
- yarn
- pnpm
npm i @langchain/aws @langchain/core
yarn add @langchain/aws @langchain/core
pnpm add @langchain/aws @langchain/core
实例化
现在我们可以实例化我们的模型对象并嵌入文本。
有几种不同的方式可以与 AWS 进行身份验证——以下示例依赖于在环境变量中设置的访问密钥、秘密访问密钥和区域:
import { BedrockEmbeddings } from "@langchain/aws";
const embeddings = new BedrockEmbeddings({
region: process.env.BEDROCK_AWS_REGION!,
credentials: {
accessKeyId: process.env.BEDROCK_AWS_ACCESS_KEY_ID!,
secretAccessKey: process.env.BEDROCK_AWS_SECRET_ACCESS_KEY!,
},
model: "amazon.titan-embed-text-v1",
});
索引与检索
嵌入模型通常用于检索增强生成(RAG)流程中,既用于数据索引,也用于后续的数据检索。有关更详细的说明,请参阅我们位于使用外部知识教程下的 RAG 教程。
下面展示如何使用我们之前初始化的embeddings对象进行数据索引和检索。在此示例中,我们将使用演示版的MemoryVectorStore来索引和检索一个示例文档。
// Create a vector store with a sample text
import { MemoryVectorStore } from "langchain/vectorstores/memory";
const text =
"LangChain is the framework for building context-aware reasoning applications";
const vectorstore = await MemoryVectorStore.fromDocuments(
[{ pageContent: text, metadata: {} }],
embeddings
);
// Use the vector store as a retriever that returns a single document
const retriever = vectorstore.asRetriever(1);
// Retrieve the most similar text
const retrievedDocuments = await retriever.invoke("What is LangChain?");
retrievedDocuments[0].pageContent;
LangChain is the framework for building context-aware reasoning applications
直接使用
在底层,向量存储和检索器的实现分别通过调用
embeddings.embedDocument(...) 和 embeddings.embedQuery(...) 来为
fromDocuments 和检索器的 invoke 操作中使用的文本创建嵌入。
您可以直接调用这些方法,以用于您自己的用例。
嵌入单个文本
您可以使用 embedQuery
对查询进行嵌入,以用于搜索。这将生成针对该查询的向量表示:
const singleVector = await embeddings.embedQuery(text);
console.log(singleVector.slice(0, 100));
[
0.625, 0.111328125, 0.265625, -0.20019531, 0.40820312,
-0.010803223, -0.22460938, -0.0002937317, 0.29882812, -0.14355469,
-0.068847656, -0.3984375, 0.75, -0.1953125, -0.5546875,
-0.087402344, 0.5625, 1.390625, -0.3515625, 0.39257812,
-0.061767578, 0.65625, -0.36328125, -0.06591797, 0.234375,
-0.36132812, 0.42382812, -0.115234375, -0.28710938, -0.29296875,
-0.765625, -0.16894531, 0.23046875, 0.6328125, -0.08544922,
0.13671875, 0.0004272461, 0.3125, 0.12207031, -0.546875,
0.14257812, -0.119628906, -0.111328125, 0.61328125, 0.6875,
0.3671875, -0.2578125, -0.27734375, 0.703125, 0.203125,
0.17675781, -0.26757812, -0.76171875, 0.71484375, 0.77734375,
-0.1953125, -0.007232666, -0.044921875, 0.23632812, -0.24121094,
-0.012207031, 0.5078125, 0.08984375, 0.56640625, -0.3046875,
0.6484375, -0.25, -0.37890625, -0.2421875, 0.38476562,
-0.18164062, -0.05810547, 0.7578125, 0.04296875, 0.609375,
0.50390625, 0.023803711, -0.23046875, 0.099121094, 0.79296875,
-1.296875, 0.671875, -0.66796875, 0.43359375, 0.087890625,
0.14550781, -0.37304688, -0.068359375, 0.00012874603, -0.47265625,
-0.765625, 0.07861328, -0.029663086, 0.076660156, -0.32617188,
-0.453125, -0.5546875, -0.45703125, 1.1015625, -0.29492188
]
嵌入多个文本
您可以使用 embedDocuments
对多个文本进行索引嵌入。此方法的内部实现可能(但不一定)与嵌入查询的不同:
const text2 =
"LangGraph is a library for building stateful, multi-actor applications with LLMs";
const vectors = await embeddings.embedDocuments([text, text2]);
console.log(vectors[0].slice(0, 100));
console.log(vectors[1].slice(0, 100));
[
0.625, 0.111328125, 0.265625, -0.20019531, 0.40820312,
-0.010803223, -0.22460938, -0.0002937317, 0.29882812, -0.14355469,
-0.068847656, -0.3984375, 0.75, -0.1953125, -0.5546875,
-0.087402344, 0.5625, 1.390625, -0.3515625, 0.39257812,
-0.061767578, 0.65625, -0.36328125, -0.06591797, 0.234375,
-0.36132812, 0.42382812, -0.115234375, -0.28710938, -0.29296875,
-0.765625, -0.16894531, 0.23046875, 0.6328125, -0.08544922,
0.13671875, 0.0004272461, 0.3125, 0.12207031, -0.546875,
0.14257812, -0.119628906, -0.111328125, 0.61328125, 0.6875,
0.3671875, -0.2578125, -0.27734375, 0.703125, 0.203125,
0.17675781, -0.26757812, -0.76171875, 0.71484375, 0.77734375,
-0.1953125, -0.007232666, -0.044921875, 0.23632812, -0.24121094,
-0.012207031, 0.5078125, 0.08984375, 0.56640625, -0.3046875,
0.6484375, -0.25, -0.37890625, -0.2421875, 0.38476562,
-0.18164062, -0.05810547, 0.7578125, 0.04296875, 0.609375,
0.50390625, 0.023803711, -0.23046875, 0.099121094, 0.79296875,
-1.296875, 0.671875, -0.66796875, 0.43359375, 0.087890625,
0.14550781, -0.37304688, -0.068359375, 0.00012874603, -0.47265625,
-0.765625, 0.07861328, -0.029663086, 0.076660156, -0.32617188,
-0.453125, -0.5546875, -0.45703125, 1.1015625, -0.29492188
]
[
0.65625, 0.48242188, 0.70703125, -0.13378906, 0.859375,
0.2578125, -0.13378906, -0.0002670288, -0.34375, 0.25585938,
-0.33984375, -0.26367188, 0.828125, -0.23242188, -0.61328125,
0.12695312, 0.43359375, 1.3828125, -0.099121094, 0.3203125,
-0.34765625, 0.35351562, -0.28710938, 0.009521484, 0.083496094,
0.040283203, -0.25390625, 0.17871094, 0.044189453, -0.19628906,
0.45898438, 0.21191406, 0.67578125, 0.8359375, -0.29101562,
0.021118164, 0.13671875, 0.083984375, 0.34570312, 0.30859375,
-0.001625061, 0.31835938, -0.18164062, -0.0058288574, 0.22460938,
0.26757812, -0.09082031, 0.17480469, 1.4921875, -0.24316406,
0.36523438, 0.14550781, -0.609375, 0.33007812, 0.10595703,
0.3671875, 0.18359375, -0.62109375, 0.51171875, 0.024047852,
0.092285156, -0.44335938, 0.4921875, 0.609375, -0.48242188,
0.796875, -0.47851562, -0.53125, -0.66796875, 0.68359375,
-0.16796875, 0.110839844, 0.84765625, 0.703125, 0.8671875,
0.37695312, -0.0022888184, -0.30664062, 0.3671875, 0.16503906,
-0.59765625, 0.3203125, -0.34375, 0.08251953, 0.890625,
0.38476562, -0.24707031, -0.125, 0.00013160706, -0.69921875,
-0.53125, 0.052490234, 0.27734375, 0.42773438, -0.38867188,
-0.2578125, -0.25, -0.46875, 0.828125, -0.94140625
]
配置 Bedrock 运行时客户端
如果您想要自定义诸如
credentials(凭证)、region(区域)、retryPolicy(重试策略)等选项,可以传入自己实例化的
BedrockRuntimeClient。
import { BedrockRuntimeClient } from "@aws-sdk/client-bedrock-runtime";
import { BedrockEmbeddings } from "@langchain/aws";
const getCredentials = () => {
// do something to get credentials
};
const client = new BedrockRuntimeClient({
region: "us-east-1",
credentials: getCredentials(),
});
const embeddingsWithCustomClient = new BedrockEmbeddings({
client,
});
API 参考文档
有关所有 Bedrock 功能和配置的详细文档,请访问 API 参考页面: https://api.js.langchain.com/classes/langchain_aws.BedrockEmbeddings.html
Related
- Embedding model conceptual guide
- Embedding model how-to guides

