AI驱动的网页抓取:Jina、Google Sheets和OpenAI简易方案
中级
这是一个AI领域的自动化工作流,包含 7 个节点。主要使用 SplitOut, HttpRequest, GoogleSheets, ManualTrigger, LmChatOpenAi 等节点,结合人工智能技术实现智能自动化。 AI驱动的网页抓取:Jina、Google Sheets和OpenAI简易方案
前置要求
- •可能需要目标 API 的认证凭证
- •Google Sheets API 凭证
- •OpenAI API Key
分类
工作流预览
可视化展示节点连接关系,支持缩放和平移
导出工作流
复制以下 JSON 配置到 n8n 导入,即可使用此工作流
{
"nodes": [
{
"id": "c3ef40df-084e-435c-9a11-3aa0a2f94f36",
"name": "当点击\"测试工作流\"时",
"type": "n8n-nodes-base.manualTrigger",
"position": [
740,
520
],
"parameters": {},
"typeVersion": 1
},
{
"id": "e0583472-a450-4582-83bc-84a014bea543",
"name": "拆分输出",
"type": "n8n-nodes-base.splitOut",
"position": [
1640,
520
],
"parameters": {
"options": {},
"fieldToSplitOut": "output.results"
},
"typeVersion": 1
},
{
"id": "b8aa573d-5b63-4669-900f-bcc915b6ad41",
"name": "保存到 Google Sheets",
"type": "n8n-nodes-base.googleSheets",
"position": [
1900,
520
],
"parameters": {
"columns": {
"value": {},
"schema": [
{
"id": "name",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "name",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "price",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "price",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "availability",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "availability",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "image",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "image",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "link",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "link",
"defaultMatch": false,
"canBeUsedToMatch": true
}
],
"mappingMode": "autoMapInputData",
"matchingColumns": [
"Book prices"
]
},
"options": {},
"operation": "append",
"sheetName": {
"__rl": true,
"mode": "list",
"value": 258629074,
"cachedResultUrl": "https://docs.google.com/spreadsheets/d/1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo/edit#gid=258629074",
"cachedResultName": "Sheet2"
},
"documentId": {
"__rl": true,
"mode": "list",
"value": "1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo",
"cachedResultUrl": "https://docs.google.com/spreadsheets/d/1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo/edit?usp=drivesdk",
"cachedResultName": "Book Prices"
}
},
"credentials": {
"googleSheetsOAuth2Api": {
"id": "GHRceL2SKjXxz0Dx",
"name": "Google Sheets account"
}
},
"typeVersion": 4.2
},
{
"id": "a63c3ab3-6aab-43b2-8af6-8b00e24e0ee6",
"name": "OpenAI 聊天模型",
"type": "@n8n/n8n-nodes-langchain.lmChatOpenAi",
"position": [
1300,
700
],
"parameters": {
"options": {}
},
"credentials": {
"openAiApi": {
"id": "5oYe8Cxj7liOPAKk",
"name": "Derek T"
}
},
"typeVersion": 1
},
{
"id": "40326966-0c46-4df2-8d80-fa014e05b693",
"name": "信息提取器",
"type": "@n8n/n8n-nodes-langchain.informationExtractor",
"position": [
1260,
520
],
"parameters": {
"text": "={{ $json.data }}",
"options": {
"systemPromptTemplate": "You are an expert extraction algorithm.\nOnly extract relevant information from the text.\nIf you do not know the value of an attribute asked to extract, you may omit the attribute's value.\nAlways output the data in a json array called results. Each book should have a title, price, availability and product_url, image_url"
},
"schemaType": "manual",
"inputSchema": "{\n \"results\": {\n \"type\": \"array\",\n \"items\": {\n \"type\": \"object\",\n \"properties\": {\n \"price\": {\n \"type\": \"string\"\n },\n \"title\": {\n \"type\": \"string\"\n },\n \"image_url\": {\n \"type\": \"string\"\n },\n \"product_url\": {\n \"type\": \"string\"\n },\n \"availability\": {\n \"type\": \"string\"\n } \n }\n }\n }\n}"
},
"typeVersion": 1
},
{
"id": "8ddca560-8da7-4090-b865-0523f95ca463",
"name": "Jina Fetch",
"type": "n8n-nodes-base.httpRequest",
"position": [
1020,
520
],
"parameters": {
"url": "https://r.jina.ai/http://books.toscrape.com/catalogue/category/books/historical-fiction_4/index.html",
"options": {
"allowUnauthorizedCerts": true
},
"authentication": "genericCredentialType",
"genericAuthType": "httpHeaderAuth"
},
"credentials": {
"httpHeaderAuth": {
"id": "ALBmOXmADcPmyHr1",
"name": "jina"
}
},
"typeVersion": 4.1
},
{
"id": "b1745cea-fdbe-4f14-b09c-884549beac7e",
"name": "便签说明5",
"type": "n8n-nodes-base.stickyNote",
"position": [
80,
320
],
"parameters": {
"color": 5,
"width": 587,
"height": 570,
"content": "## 从这里开始:分步 YouTube 教程 :star:"
},
"typeVersion": 1
}
],
"pinData": {},
"connections": {
"e0583472-a450-4582-83bc-84a014bea543": {
"main": [
[
{
"node": "b8aa573d-5b63-4669-900f-bcc915b6ad41",
"type": "main",
"index": 0
}
]
]
},
"8ddca560-8da7-4090-b865-0523f95ca463": {
"main": [
[
{
"node": "40326966-0c46-4df2-8d80-fa014e05b693",
"type": "main",
"index": 0
}
]
]
},
"a63c3ab3-6aab-43b2-8af6-8b00e24e0ee6": {
"ai_languageModel": [
[
{
"node": "40326966-0c46-4df2-8d80-fa014e05b693",
"type": "ai_languageModel",
"index": 0
}
]
]
},
"40326966-0c46-4df2-8d80-fa014e05b693": {
"main": [
[
{
"node": "e0583472-a450-4582-83bc-84a014bea543",
"type": "main",
"index": 0
}
]
]
},
"c3ef40df-084e-435c-9a11-3aa0a2f94f36": {
"main": [
[
{
"node": "8ddca560-8da7-4090-b865-0523f95ca463",
"type": "main",
"index": 0
}
]
]
}
}
}常见问题
如何使用这个工作流?
复制上方的 JSON 配置代码,在您的 n8n 实例中创建新工作流并选择「从 JSON 导入」,粘贴配置后根据需要修改凭证设置即可。
这个工作流适合什么场景?
中级 - 人工智能
需要付费吗?
本工作流完全免费,您可以直接导入使用。但请注意,工作流中使用的第三方服务(如 OpenAI API)可能需要您自行付费。
相关工作流推荐
自动化新闻到简报AI代理v13
AI新闻研究团队:24/7简报自动化,含Perplexity引用
Set
Code
Gmail
+
Set
Code
Gmail
37 节点Derek Cheung
产品
AI房地产经纪人:端到端运营自动化(网络、数据、语音)
AI房地产经纪人:端到端运营自动化(网络、数据、语音)
If
Set
Code
+
If
Set
Code
45 节点Sam Yassine
销售
API架构提取器
API架构提取器
If
Set
Code
+
If
Set
Code
88 节点Polina Medvedieva
工程
使用AI自动化品牌风格博客创建
使用AI自动化品牌风格博客创建
Set
Html
Limit
+
Set
Html
Limit
27 节点Jimleuk
人工智能
使用 Bright Data 抓取和 Google Gemini 自动化 Etsy 数据挖掘
使用 Bright Data 抓取和 Google Gemini 实现 Etsy 数据挖掘自动化
Set
Function
Split Out
+
Set
Function
Split Out
19 节点Ranjan Dailata
产品
使用 Browserflow 和 Google Sheets 自动化 LinkedIn
使用 Browserflow 和 Google Sheets 自动化 LinkedIn 请求与破冰消息
If
Set
Sort
+
If
Set
Sort
44 节点PollupAI
销售
工作流信息
难度等级
中级
节点数量7
分类1
节点类型7
作者
Derek Cheung
@derekcheungsaDerek is an engineer, instructor, and investor based in Canada. He is publicly building AI applications to solve real-world problems, leveraging his technical expertise to develop practical solutions.
外部链接
在 n8n.io 查看 →
分享此工作流