Website-URLs aus Sitemap.XML für SEO-Analyse extrahieren

Fortgeschritten

Dies ist ein Marketing-Bereich Automatisierungsworkflow mit 12 Nodes. Hauptsächlich werden Set, Xml, SplitOut, HttpRequest, ConvertToFile und andere Nodes verwendet. Extraktion von Website-URLs aus Sitemap.XML für SEO-Analysen

Voraussetzungen
  • Möglicherweise sind Ziel-API-Anmeldedaten erforderlich

Kategorie

Workflow-Vorschau
Visualisierung der Node-Verbindungen, mit Zoom und Pan
Workflow exportieren
Kopieren Sie die folgende JSON-Konfiguration und importieren Sie sie in n8n
{
  "id": "n2iZmshLmcXubEpo",
  "meta": {
    "instanceId": "e704ee1a5300eff6019042bec66778b3a26c1aec8f3e21c554a16dd7216a94ee",
    "templateCredsSetupCompleted": true
  },
  "name": "Extract Website URLs from Sitemap.XML for SEO Analysis",
  "tags": [
    {
      "id": "MePRktFsL1ttwWdT",
      "name": "website",
      "createdAt": "2025-05-12T18:47:34.764Z",
      "updatedAt": "2025-05-12T18:47:34.764Z"
    },
    {
      "id": "xutdortHHmV1yNZB",
      "name": "SEO",
      "createdAt": "2025-03-24T16:18:45.828Z",
      "updatedAt": "2025-03-24T16:18:45.828Z"
    }
  ],
  "nodes": [
    {
      "id": "6d91a84e-bf2b-4118-9e35-5baecda1b14b",
      "name": "XML",
      "type": "n8n-nodes-base.xml",
      "position": [
        340,
        -40
      ],
      "parameters": {
        "options": {}
      },
      "typeVersion": 1
    },
    {
      "id": "65f12b51-d34c-4e87-b581-e29370eb0554",
      "name": "Bei Klick auf 'Workflow testen'",
      "type": "n8n-nodes-base.manualTrigger",
      "position": [
        -320,
        -40
      ],
      "parameters": {},
      "typeVersion": 1
    },
    {
      "id": "b82a0bce-0dd4-4a64-b60a-64ea4021bee5",
      "name": "Aufteilen",
      "type": "n8n-nodes-base.splitOut",
      "position": [
        560,
        -40
      ],
      "parameters": {
        "options": {},
        "fieldToSplitOut": "sitemapindex.sitemap"
      },
      "typeVersion": 1
    },
    {
      "id": "0e97abde-ba13-4889-979d-0f0e5b085dcb",
      "name": "URL setzen",
      "type": "n8n-nodes-base.set",
      "notes": "Set full URL - not domain",
      "position": [
        -100,
        -40
      ],
      "parameters": {
        "options": {},
        "assignments": {
          "assignments": [
            {
              "id": "fa078c97-4c7c-4c08-a011-5527661997c6",
              "name": "Domain",
              "type": "string",
              "value": "https://phu.io.vn/"
            }
          ]
        }
      },
      "notesInFlow": true,
      "typeVersion": 3.4
    },
    {
      "id": "146a5e34-d64a-450b-8354-770c90547325",
      "name": "In Datei konvertieren",
      "type": "n8n-nodes-base.convertToFile",
      "position": [
        1440,
        -40
      ],
      "parameters": {
        "options": {},
        "binaryPropertyName": "={{ $json.loc }}"
      },
      "typeVersion": 1.1
    },
    {
      "id": "0c1f8d18-ca4b-4996-9928-abbc6d45b227",
      "name": "Sitemap crawlen",
      "type": "n8n-nodes-base.httpRequest",
      "notes": "or past sitemap URL at here",
      "position": [
        120,
        -40
      ],
      "parameters": {
        "url": "={{ $json.Domain }}sitemap.xml",
        "options": {
          "timeout": 10000
        },
        "responseFormat": "string"
      },
      "notesInFlow": true,
      "typeVersion": 1
    },
    {
      "id": "eaa43363-d059-4c66-8851-7e85d4fb5bd3",
      "name": "Sitemap crawlen 2",
      "type": "n8n-nodes-base.httpRequest",
      "position": [
        780,
        -40
      ],
      "parameters": {
        "url": "={{ $json.loc }}",
        "options": {
          "timeout": 10000
        },
        "responseFormat": "string"
      },
      "typeVersion": 1
    },
    {
      "id": "692efb13-a6ce-4667-842b-614cf9ee8315",
      "name": "XML 2",
      "type": "n8n-nodes-base.xml",
      "position": [
        1000,
        -40
      ],
      "parameters": {
        "options": {}
      },
      "typeVersion": 1
    },
    {
      "id": "88a15568-352f-4997-ae2b-522a2713843d",
      "name": "Aufteilen 2",
      "type": "n8n-nodes-base.splitOut",
      "position": [
        1220,
        -40
      ],
      "parameters": {
        "options": {},
        "fieldToSplitOut": "urlset.url"
      },
      "typeVersion": 1
    },
    {
      "id": "e0e0fb12-1dc5-4665-9530-6b53ed7dc593",
      "name": "Notiz",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        -160,
        -240
      ],
      "parameters": {
        "width": 440,
        "height": 360,
        "content": "## Set website URL at node 1 (or paste sitemap URL at node 2)"
      },
      "typeVersion": 1
    },
    {
      "id": "fc9d88ce-4ec0-4581-b1ff-7c007bdf5f0b",
      "name": "Notiz1",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        1340,
        -200
      ],
      "parameters": {
        "color": 4,
        "width": 300,
        "height": 320,
        "content": "## Download file at here\n(or replace this node = Gooogle sheet node)\n"
      },
      "typeVersion": 1
    },
    {
      "id": "39f13360-e94d-4b33-b258-5e8837daab4f",
      "name": "Notiz2",
      "type": "n8n-nodes-base.stickyNote",
      "position": [
        -980,
        -460
      ],
      "parameters": {
        "color": 6,
        "width": 600,
        "height": 940,
        "content": "# FAQ\n## Q: What happens if the sitemap is large or contains many sub-sitemaps?\n\nA: The workflow handles sitemap indexes by splitting and processing each sub-sitemap individually. For very large sitemaps, ensure your n8n instance has sufficient resources (memory and CPU) to avoid performance issues. See Scaling n8n for optimization tips.\n\n## Q: Can I use this workflow with a specific sitemap URL instead of a domain?\n\nA: Yes, in the Crawl sitemap node, replace the url parameter ({{ $json.Domain }}sitemap.xml) with the direct sitemap URL (e.g., https://example.com/sitemap.xml). Update the node’s notes for clarity.\n\n## Q: Why am I getting a timeout error?\n\nA: The HTTP Request nodes have a default timeout of 10 seconds. If the target server is slow, increase the timeout value in the options parameter of the Crawl sitemap or Crawl sitemap 2 nodes.\n\n## Q: How can I save the URLs to Google Sheets instead of a file?\n\nA: Replace the Convert to File node with a Google Sheets node. Configure it with your Google Sheets credentials and map the loc field from the Split Out 2 node to the desired spreadsheet column. Refer to the Google Sheets node documentation.\n\n## Q: Is this workflow compatible with older n8n versions?\n\nA: The workflow uses nodes compatible with n8n version 1.0 and later. For older versions, check for deprecated features (e.g., MySQL support) in the n8n v1.0 migration guide."
      },
      "typeVersion": 1
    }
  ],
  "active": false,
  "pinData": {},
  "settings": {
    "executionOrder": "v1"
  },
  "versionId": "d2ef17d5-a482-4b0d-b48a-83d5bd146b9f",
  "connections": {
    "6d91a84e-bf2b-4118-9e35-5baecda1b14b": {
      "main": [
        [
          {
            "node": "b82a0bce-0dd4-4a64-b60a-64ea4021bee5",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "692efb13-a6ce-4667-842b-614cf9ee8315": {
      "main": [
        [
          {
            "node": "88a15568-352f-4997-ae2b-522a2713843d",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "0e97abde-ba13-4889-979d-0f0e5b085dcb": {
      "main": [
        [
          {
            "node": "0c1f8d18-ca4b-4996-9928-abbc6d45b227",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "b82a0bce-0dd4-4a64-b60a-64ea4021bee5": {
      "main": [
        [
          {
            "node": "eaa43363-d059-4c66-8851-7e85d4fb5bd3",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "88a15568-352f-4997-ae2b-522a2713843d": {
      "main": [
        [
          {
            "node": "146a5e34-d64a-450b-8354-770c90547325",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "0c1f8d18-ca4b-4996-9928-abbc6d45b227": {
      "main": [
        [
          {
            "node": "6d91a84e-bf2b-4118-9e35-5baecda1b14b",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "146a5e34-d64a-450b-8354-770c90547325": {
      "main": [
        []
      ]
    },
    "eaa43363-d059-4c66-8851-7e85d4fb5bd3": {
      "main": [
        [
          {
            "node": "692efb13-a6ce-4667-842b-614cf9ee8315",
            "type": "main",
            "index": 0
          }
        ]
      ]
    },
    "65f12b51-d34c-4e87-b581-e29370eb0554": {
      "main": [
        [
          {
            "node": "0e97abde-ba13-4889-979d-0f0e5b085dcb",
            "type": "main",
            "index": 0
          }
        ]
      ]
    }
  }
}
Häufig gestellte Fragen

Wie verwende ich diesen Workflow?

Kopieren Sie den obigen JSON-Code, erstellen Sie einen neuen Workflow in Ihrer n8n-Instanz und wählen Sie "Aus JSON importieren". Fügen Sie die Konfiguration ein und passen Sie die Anmeldedaten nach Bedarf an.

Für welche Szenarien ist dieser Workflow geeignet?

Fortgeschritten - Marketing

Ist es kostenpflichtig?

Dieser Workflow ist völlig kostenlos. Beachten Sie jedoch, dass Drittanbieterdienste (wie OpenAI API), die im Workflow verwendet werden, möglicherweise kostenpflichtig sind.

Workflow-Informationen
Schwierigkeitsgrad
Fortgeschritten
Anzahl der Nodes12
Kategorie1
Node-Typen7
Schwierigkeitsbeschreibung

Für erfahrene Benutzer, mittelkomplexe Workflows mit 6-15 Nodes

Autor
Le Thua Phu

Le Thua Phu

@lethuaphu

I'm passionate about leveraging n8n to create seamless workflows that blend AI capabilities with data automation. Based in Vietnam, I specialize in building efficient, scalable solutions to streamline business processes and unlock the power of data. Excited to connect, share insights, and contribute to the n8n community!

Externe Links
Auf n8n.io ansehen

Diesen Workflow teilen

Kategorien

Kategorien: 34