Data Extraction
Transform web content into structured data using our powerful extraction system. From simple CSS selectors to advanced AI-powered extraction, get exactly what you need.
- CSS and XPath selectors for precise targeting
- Multiple output formats (text, HTML, attributes)
- Bulk extraction for lists and collections
- AI-powered extraction using natural language
- CSS & XPath Selectors
Use familiar CSS selectors or powerful XPath expressions to target exactly the data you need.
- Bulk Extraction
Extract multiple items at once with our list extraction feature, perfect for product listings and search results.
- AI-Powered Extraction
Use natural language to extract data with our AI-powered extraction system (Beta).
Examples
Extract Data Your Way
Choose between traditional selector-based extraction for precise targeting or AI-powered extraction for natural language understanding.
Selector-based Extraction
Use CSS and XPath selectors to precisely target and extract specific data from web pages.
Request - Extraction Rules
{ "product": { "selector": ".product-card", "type": "list", "fields": { "title": { "selector": "h2.product-title", "output": "text" }, "price": { "selector": ".price-tag", "output": "text" }, "description": { "selector": ".description", "output": "text" }, "image": { "selector": "img.product-image", "output": "@src" }, "rating": { "selector": ".rating-stars", "output": "@data-rating" } } } }
200 - Extracted Data
{ "product": [ { "title": "iPhone 15 Pro Max", "price": "$999.99", "description": "The most powerful...", "image": "/images/iphone15.jpg", "rating": "4.8" }, { "title": "Samsung Galaxy S24", "price": "$899.99", "description": "Next generation...", "image": "/images/galaxy24.jpg", "rating": "4.7" } ] }
AI-Powered Extraction
Extract data using natural language prompts or structured schemas - our AI adapts to your needs.
Prompt-based Extraction
Extract data using natural language. No technical knowledge required - just describe what you need.
- Write in plain English
- Works with any page layout
- Extract complex data easily
Request - Prompt
Extract all products. For each product, I need: - The product name - Its price - A brief description - Any technical specifications mentioned
200 - Extracted Data
{ "products": [ { "name": "Ultra HD Smart TV", "price": "$899.99", "description": "55-inch 4K Smart LED TV with HDR", "specifications": { "display": "4K Resolution", "features": [ "HDR", "Smart TV OS", "Voice Control" ], "connectivity": "WiFi" } }, { "name": "Premium Wireless Headphones", "price": "$249.99", "description": "Noise-cancelling wireless headphones", "specifications": { "battery": "30hr Battery Life", "features": [ "Active Noise Cancellation" ], "connectivity": "Bluetooth 5.2" } } ] }
Schema-based Extraction
Define your data structure with JSON Schema for consistent, validated results.
- Structured data output
- Easy API integration
- Built-in validation
Request - JSON Schema
{ "type": "object", "properties": { "products": { "type": "array", "description": "List of products", "items": { "type": "object", "properties": { "name": { "type": "string", "description": "Name of the product" }, "price": { "type": "string", "description": "Price of the product" }, "description": { "type": "string", "description": "Description of the product" }, "stock": { "type": "string", "description": "Stock availability" }, "sku": { "type": "string", "description": "Product SKU" } } } } } }
200 - Extracted Data
{ "products": [ { "name": "Ultra HD Smart TV", "price": "$899.99", "description": "55-inch 4K Ultra HD Smart LED TV with HDR...", "stock": "15 units", "sku": "TV-UHD-55" }, { "name": "Premium Wireless Headphones", "price": "$249.99", "description": "Noise-cancelling wireless headphones...", "stock": "42 units", "sku": "AUDIO-WH-100" }, { "name": "Pro Laptop", "price": "$1299.99", "description": "High-performance laptop with 14-inch display...", "stock": "8 units", "sku": "PC-LT-PRO" } ] }
Get Started with BitFetcher
Data Extraction Today
Up to 1,000 extractions for free are waiting for you