Sitemap to URL Crawler — Extract Sitemap.xml URLs for RAG
Instantly extract all public URLs from any website sitemap.xml recursively. Handles nested sitemap indexes automatically. The fastest cheap way to build URL lists for RAG pipelines, LLM training datasets, SEO audits and content inventories. Zero-config, no proxy required.
Charged only on successful results.
Clean schema, ready for ETL.
41 users total
No setup — runs in the cloud on Apify.
What it returns
Instantly extract all public URLs from any website sitemap.xml recursively.
How to run it
- Open the actor on Apify (button above).
- Fill the input schema — most fields have sensible defaults.
- Click Run. Results land in the dataset within minutes.
- Export as JSON or CSV, or pull via the dataset API.
Guides for this scraper
Related scrapers
App Store Scraper — iOS App Data, Reviews & ASO API
App Store scraper for iOS app data, reviews, ratings, top charts, ASO keywords & privacy labels. No API key. Export CSV, JSON, Excel. 10 endpoints.
Apple Podcasts Scraper — Episodes, Audio URLs & RSS Data
Extract podcast shows and full episode lists from Apple Podcasts. Titles, descriptions, audio MP3 URLs, durations, publish dates, artwork, genres, transcripts. Via iTunes Search/Lookup API + RSS.
Binance API Scraper - Spot Prices for 3,500+ Pairs
Scrape Binance spot prices for 3,500+ pairs with no API key: symbol, last price, 24h change %, high/low, volume, bid/ask, trade count. Export CSV, JSON, Excel.