Tavily爬取
Tavily 是专为 AI 代理(LLMs)构建的搜索引擎,能快速提供实时、准确和基于事实的结果。Tavily 提供了四个主要端点,其中之一是 Crawl,它是一个基于图的网站遍历工具,可以并行探索数百条路径,并具有内置的提取和智能发现功能。
本指南简要介绍了如何开始使用 Tavily 工具。如需对 Tavily 工具进行更详细的介绍,请参阅API 参考文档。
概览
集成详情
| 类别 | 包 | Python 支持 | 最新版本 |
|---|---|---|---|
| TavilyMap | @langchain/tavily | ✅ | ![]() |
配置
该集成在@langchain/tavily包中,你可以通过以下方式安装:
:::提示 请参阅安装集成包的一般说明部分。 :::
- npm
- yarn
- pnpm
npm i @langchain/tavily @langchain/core
yarn add @langchain/tavily @langchain/core
pnpm add @langchain/tavily @langchain/core
凭据
在此处链接设置 API 密钥,并将其设置为名为TAVILY_API_KEY的环境变量。
process.env.TAVILY_API_KEY = "YOUR_API_KEY";
另外,设置LangSmith对于获得最佳观测能力也有帮助(非必需):
process.env.LANGSMITH_TRACING = "true";
process.env.LANGSMITH_API_KEY = "your-api-key";
实例化
你可以通过以下方式导入并实例化 TavilyCrawl 工具:
import { TavilyCrawl } from "@langchain/tavily";
const tool = new TavilyCrawl({
maxDepth: 3,
maxBreadth: 50,
// extractDepth: "basic",
// format: "markdown",
// limit: 100,
// includeImages: false,
// allowExternal: false,
});
调用
使用参数直接调用
Tavily 爬虫工具在调用时接受以下参数:
url(必填):一个自然语言搜索查询也可以在调用期间设置以下参数:
instructions、selectPaths、selectDomains、excludePaths、excludeDomains、allowExternal、categories。
await tool.invoke({
url: "https://docs.tavily.com",
});
使用工具调用调用
我们还可以使用模型生成的 ToolCall 来调用工具,在这种情况下,将返回一个
ToolMessage:
// This is usually generated by a model, but we'll create a tool call directly for demo purposes.
const modelGeneratedToolCall = {
args: {
url: "https://docs.tavily.com",
},
id: "1",
name: tool.name,
type: "tool_call",
};
await tool.invoke(modelGeneratedToolCall);
链式调用
我们可以通过首先将工具绑定到一个工具调用模型,然后调用它,从而在链式结构中使用我们的工具:
Pick your chat model:
- Groq
- OpenAI
- Anthropic
- Google Gemini
- FireworksAI
- MistralAI
- VertexAI
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/groq
yarn add @langchain/groq
pnpm add @langchain/groq
Add environment variables
GROQ_API_KEY=your-api-key
Instantiate the model
import { ChatGroq } from "@langchain/groq";
const llm = new ChatGroq({
model: "llama-3.3-70b-versatile",
temperature: 0
});
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/openai
yarn add @langchain/openai
pnpm add @langchain/openai
Add environment variables
OPENAI_API_KEY=your-api-key
Instantiate the model
import { ChatOpenAI } from "@langchain/openai";
const llm = new ChatOpenAI({
model: "gpt-4o-mini",
temperature: 0
});
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/anthropic
yarn add @langchain/anthropic
pnpm add @langchain/anthropic
Add environment variables
ANTHROPIC_API_KEY=your-api-key
Instantiate the model
import { ChatAnthropic } from "@langchain/anthropic";
const llm = new ChatAnthropic({
model: "claude-3-5-sonnet-20240620",
temperature: 0
});
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/google-genai
yarn add @langchain/google-genai
pnpm add @langchain/google-genai
Add environment variables
GOOGLE_API_KEY=your-api-key
Instantiate the model
import { ChatGoogleGenerativeAI } from "@langchain/google-genai";
const llm = new ChatGoogleGenerativeAI({
model: "gemini-2.0-flash",
temperature: 0
});
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/community
yarn add @langchain/community
pnpm add @langchain/community
Add environment variables
FIREWORKS_API_KEY=your-api-key
Instantiate the model
import { ChatFireworks } from "@langchain/community/chat_models/fireworks";
const llm = new ChatFireworks({
model: "accounts/fireworks/models/llama-v3p1-70b-instruct",
temperature: 0
});
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/mistralai
yarn add @langchain/mistralai
pnpm add @langchain/mistralai
Add environment variables
MISTRAL_API_KEY=your-api-key
Instantiate the model
import { ChatMistralAI } from "@langchain/mistralai";
const llm = new ChatMistralAI({
model: "mistral-large-latest",
temperature: 0
});
Install dependencies
- npm
- yarn
- pnpm
npm i @langchain/google-vertexai
yarn add @langchain/google-vertexai
pnpm add @langchain/google-vertexai
Add environment variables
GOOGLE_APPLICATION_CREDENTIALS=credentials.json
Instantiate the model
import { ChatVertexAI } from "@langchain/google-vertexai";
const llm = new ChatVertexAI({
model: "gemini-1.5-flash",
temperature: 0
});
import { HumanMessage } from "@langchain/core/messages";
import { ChatPromptTemplate } from "@langchain/core/prompts";
import { RunnableLambda } from "@langchain/core/runnables";
const prompt = ChatPromptTemplate.fromMessages([
["system", "You are a helpful assistant."],
["placeholder", "{messages}"],
]);
const llmWithTools = llm.bindTools([tool]);
const chain = prompt.pipe(llmWithTools);
const toolChain = RunnableLambda.from(async (userInput: string, config) => {
const humanMessage = new HumanMessage(userInput);
const aiMsg = await chain.invoke(
{
messages: [new HumanMessage(userInput)],
},
config
);
const toolMsgs = await tool.batch(aiMsg.tool_calls, config);
return chain.invoke(
{
messages: [humanMessage, aiMsg, ...toolMsgs],
},
config
);
});
const toolChainResult = await toolChain.invoke("https://docs.tavily.com");
const { tool_calls, content } = toolChainResult;
console.log(
"AIMessage",
JSON.stringify(
{
tool_calls,
content,
},
null,
2
)
);
代理
关于如何在代理中使用 LangChain 工具的指南,请参阅 LangGraph.js 文档。
API 参考文档
有关 Tavily 爬虫 API 所有功能和配置的详细文档,请访问 API 参考页面:
https://docs.tavily.com/documentation/api-reference/endpoint/crawl
相关内容
Related
- Tool conceptual guide
- Tool how-to guides
