KI-gesteuertes Web-Scraping: Einfache Lösung mit Jina, Google Sheets und OpenAI
Dies ist ein AI-Bereich Automatisierungsworkflow mit 7 Nodes. Hauptsächlich werden SplitOut, HttpRequest, GoogleSheets, ManualTrigger, LmChatOpenAi und andere Nodes verwendet, kombiniert mit KI-Technologie für intelligente Automatisierung. KI-gesteuertes Web-Scraping: Einfache Lösung mit Jina, Google Sheets und OpenAI
- •Möglicherweise sind Ziel-API-Anmeldedaten erforderlich
- •Google Sheets API-Anmeldedaten
- •OpenAI API Key
Verwendete Nodes (7)
Kategorie
{
"nodes": [
{
"id": "c3ef40df-084e-435c-9a11-3aa0a2f94f36",
"name": "Bei Klick auf \"Workflow testen\"",
"type": "n8n-nodes-base.manualTrigger",
"position": [
740,
520
],
"parameters": {},
"typeVersion": 1
},
{
"id": "e0583472-a450-4582-83bc-84a014bea543",
"name": "Aufteilen",
"type": "n8n-nodes-base.splitOut",
"position": [
1640,
520
],
"parameters": {
"options": {},
"fieldToSplitOut": "output.results"
},
"typeVersion": 1
},
{
"id": "b8aa573d-5b63-4669-900f-bcc915b6ad41",
"name": "In Google Tabellen speichern",
"type": "n8n-nodes-base.googleSheets",
"position": [
1900,
520
],
"parameters": {
"columns": {
"value": {},
"schema": [
{
"id": "name",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "name",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "price",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "price",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "availability",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "availability",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "image",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "image",
"defaultMatch": false,
"canBeUsedToMatch": true
},
{
"id": "link",
"type": "string",
"display": true,
"removed": false,
"required": false,
"displayName": "link",
"defaultMatch": false,
"canBeUsedToMatch": true
}
],
"mappingMode": "autoMapInputData",
"matchingColumns": [
"Book prices"
]
},
"options": {},
"operation": "append",
"sheetName": {
"__rl": true,
"mode": "list",
"value": 258629074,
"cachedResultUrl": "https://docs.google.com/spreadsheets/d/1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo/edit#gid=258629074",
"cachedResultName": "Sheet2"
},
"documentId": {
"__rl": true,
"mode": "list",
"value": "1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo",
"cachedResultUrl": "https://docs.google.com/spreadsheets/d/1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo/edit?usp=drivesdk",
"cachedResultName": "Book Prices"
}
},
"credentials": {
"googleSheetsOAuth2Api": {
"id": "GHRceL2SKjXxz0Dx",
"name": "Google Sheets account"
}
},
"typeVersion": 4.2
},
{
"id": "a63c3ab3-6aab-43b2-8af6-8b00e24e0ee6",
"name": "OpenAI-Chat-Modell",
"type": "@n8n/n8n-nodes-langchain.lmChatOpenAi",
"position": [
1300,
700
],
"parameters": {
"options": {}
},
"credentials": {
"openAiApi": {
"id": "5oYe8Cxj7liOPAKk",
"name": "Derek T"
}
},
"typeVersion": 1
},
{
"id": "40326966-0c46-4df2-8d80-fa014e05b693",
"name": "Informationsextraktor",
"type": "@n8n/n8n-nodes-langchain.informationExtractor",
"position": [
1260,
520
],
"parameters": {
"text": "={{ $json.data }}",
"options": {
"systemPromptTemplate": "You are an expert extraction algorithm.\nOnly extract relevant information from the text.\nIf you do not know the value of an attribute asked to extract, you may omit the attribute's value.\nAlways output the data in a json array called results. Each book should have a title, price, availability and product_url, image_url"
},
"schemaType": "manual",
"inputSchema": "{\n \"results\": {\n \"type\": \"array\",\n \"items\": {\n \"type\": \"object\",\n \"properties\": {\n \"price\": {\n \"type\": \"string\"\n },\n \"title\": {\n \"type\": \"string\"\n },\n \"image_url\": {\n \"type\": \"string\"\n },\n \"product_url\": {\n \"type\": \"string\"\n },\n \"availability\": {\n \"type\": \"string\"\n } \n }\n }\n }\n}"
},
"typeVersion": 1
},
{
"id": "8ddca560-8da7-4090-b865-0523f95ca463",
"name": "Jina Fetch",
"type": "n8n-nodes-base.httpRequest",
"position": [
1020,
520
],
"parameters": {
"url": "https://r.jina.ai/http://books.toscrape.com/catalogue/category/books/historical-fiction_4/index.html",
"options": {
"allowUnauthorizedCerts": true
},
"authentication": "genericCredentialType",
"genericAuthType": "httpHeaderAuth"
},
"credentials": {
"httpHeaderAuth": {
"id": "ALBmOXmADcPmyHr1",
"name": "jina"
}
},
"typeVersion": 4.1
},
{
"id": "b1745cea-fdbe-4f14-b09c-884549beac7e",
"name": "Notiz5",
"type": "n8n-nodes-base.stickyNote",
"position": [
80,
320
],
"parameters": {
"color": 5,
"width": 587,
"height": 570,
"content": "## Start here: Step-by Step Youtube Tutorial :star:\n\n[](https://youtu.be/f3AJYXHirr8)\n\n[Google Sheet Example](https://docs.google.com/spreadsheets/d/1VDbfi2PpeheD2ZlO6feX3RdMeSsm0XukQlNVW8uVcuo/edit?usp=sharing)\n\n\n"
},
"typeVersion": 1
}
],
"pinData": {},
"connections": {
"e0583472-a450-4582-83bc-84a014bea543": {
"main": [
[
{
"node": "In Google Sheets speichern",
"type": "main",
"index": 0
}
]
]
},
"8ddca560-8da7-4090-b865-0523f95ca463": {
"main": [
[
{
"node": "40326966-0c46-4df2-8d80-fa014e05b693",
"type": "main",
"index": 0
}
]
]
},
"OpenAI Chat Model": {
"ai_languageModel": [
[
{
"node": "40326966-0c46-4df2-8d80-fa014e05b693",
"type": "ai_languageModel",
"index": 0
}
]
]
},
"40326966-0c46-4df2-8d80-fa014e05b693": {
"main": [
[
{
"node": "e0583472-a450-4582-83bc-84a014bea543",
"type": "main",
"index": 0
}
]
]
},
"c3ef40df-084e-435c-9a11-3aa0a2f94f36": {
"main": [
[
{
"node": "8ddca560-8da7-4090-b865-0523f95ca463",
"type": "main",
"index": 0
}
]
]
}
}
}Wie verwende ich diesen Workflow?
Kopieren Sie den obigen JSON-Code, erstellen Sie einen neuen Workflow in Ihrer n8n-Instanz und wählen Sie "Aus JSON importieren". Fügen Sie die Konfiguration ein und passen Sie die Anmeldedaten nach Bedarf an.
Für welche Szenarien ist dieser Workflow geeignet?
Fortgeschritten - Künstliche Intelligenz
Ist es kostenpflichtig?
Dieser Workflow ist völlig kostenlos. Beachten Sie jedoch, dass Drittanbieterdienste (wie OpenAI API), die im Workflow verwendet werden, möglicherweise kostenpflichtig sind.
Verwandte Workflows
Derek Cheung
@derekcheungsaDerek is an engineer, instructor, and investor based in Canada. He is publicly building AI applications to solve real-world problems, leveraging his technical expertise to develop practical solutions.
Diesen Workflow teilen