Data Extraction

Transform web content into structured data using our powerful extraction system. From simple CSS selectors to advanced AI-powered extraction, get exactly what you need.

  • CSS and XPath selectors for precise targeting
  • Multiple output formats (text, HTML, attributes)
  • Bulk extraction for lists and collections
  • AI-powered extraction using natural language
CSS & XPath Selectors

Use familiar CSS selectors or powerful XPath expressions to target exactly the data you need.

Learn more

Bulk Extraction

Extract multiple items at once with our list extraction feature, perfect for product listings and search results.

Learn more

AI-Powered Extraction

Use natural language to extract data with our AI-powered extraction system (Beta).

Learn more

Examples

Extract Data Your Way

Choose between traditional selector-based extraction for precise targeting or AI-powered extraction for natural language understanding.

Selector-based Extraction

Use CSS and XPath selectors to precisely target and extract specific data from web pages.

Request - Extraction Rules

{
  "product": {
    "selector": ".product-card",
    "type": "list",
    "fields": {
      "title": {
        "selector": "h2.product-title",
        "output": "text"
      },
      "price": {
        "selector": ".price-tag",
        "output": "text"
      },
      "description": {
        "selector": ".description",
        "output": "text"
      },
      "image": {
        "selector": "img.product-image",
        "output": "@src"
      },
      "rating": {
        "selector": ".rating-stars",
        "output": "@data-rating"
      }
    }
  }
}

200 - Extracted Data

{
  "product": [
    {
      "title": "iPhone 15 Pro Max",
      "price": "$999.99",
      "description": "The most powerful...",
      "image": "/images/iphone15.jpg",
      "rating": "4.8"
    },
    {
      "title": "Samsung Galaxy S24",
      "price": "$899.99",
      "description": "Next generation...",
      "image": "/images/galaxy24.jpg",
      "rating": "4.7"
    }
  ]
}

AI-Powered Extraction

Extract data using natural language prompts or structured schemas - our AI adapts to your needs.

Prompt-based Extraction

Extract data using natural language. No technical knowledge required - just describe what you need.

  • Write in plain English
  • Works with any page layout
  • Extract complex data easily

Request - Prompt

Extract all products. For each product, I need:
- The product name
- Its price
- A brief description
- Any technical specifications mentioned

200 - Extracted Data

{
  "products": [
    {
      "name": "Ultra HD Smart TV",
      "price": "$899.99",
      "description": "55-inch 4K Smart LED TV with HDR",
      "specifications": {
        "display": "4K Resolution",
        "features": [
          "HDR",
          "Smart TV OS",
          "Voice Control"
        ],
        "connectivity": "WiFi"
      }
    },
    {
      "name": "Premium Wireless Headphones",
      "price": "$249.99",
      "description": "Noise-cancelling wireless headphones",
      "specifications": {
        "battery": "30hr Battery Life",
        "features": [
          "Active Noise Cancellation"
        ],
        "connectivity": "Bluetooth 5.2"
      }
    }
  ]
}

Schema-based Extraction

Define your data structure with JSON Schema for consistent, validated results.

  • Structured data output
  • Easy API integration
  • Built-in validation

Request - JSON Schema

{
  "type": "object",
  "properties": {
    "products": {
      "type": "array",
      "description": "List of products",
      "items": {
        "type": "object",
        "properties": {
          "name": {
            "type": "string",
            "description": "Name of the product"
          },
          "price": {
            "type": "string",
            "description": "Price of the product"
          },
          "description": {
            "type": "string",
            "description": "Description of the product"
          },
          "stock": {
            "type": "string",
            "description": "Stock availability"
          },
          "sku": {
            "type": "string",
            "description": "Product SKU"
          }
        }
      }
    }
  }
}

200 - Extracted Data

{
  "products": [
    {
      "name": "Ultra HD Smart TV",
      "price": "$899.99",
      "description": "55-inch 4K Ultra HD Smart LED TV with HDR...",
      "stock": "15 units",
      "sku": "TV-UHD-55"
    },
    {
      "name": "Premium Wireless Headphones",
      "price": "$249.99",
      "description": "Noise-cancelling wireless headphones...",
      "stock": "42 units",
      "sku": "AUDIO-WH-100"
    },
    {
      "name": "Pro Laptop",
      "price": "$1299.99",
      "description": "High-performance laptop with 14-inch display...",
      "stock": "8 units",
      "sku": "PC-LT-PRO"
    }
  ]
}

Get Started with BitFetcher

Data Extraction Today

Up to 1,000 extractions for free are waiting for you