Error Handling - Firecrawl

Proper error handling is essential for building robust applications with Firecrawl. This guide covers common errors, HTTP status codes, and best practices for handling failures.

HTTP Status Codes

Firecrawl uses standard HTTP status codes to indicate success or failure:

Status Code	Meaning	Description
`200`	OK	Request successful
`400`	Bad Request	Invalid request parameters
`401`	Unauthorized	Invalid or missing API key
`402`	Payment Required	Insufficient credits or payment required
`404`	Not Found	Resource not found (e.g., invalid job ID)
`408`	Request Timeout	Request took too long to complete
`429`	Too Many Requests	Rate limit exceeded
`500`	Internal Server Error	Unexpected server error

Common Errors

400 Bad Request

Invalid parameters or malformed request body.

{
  "success": false,
  "error": "Invalid input data."
}

Common causes:

Invalid URL format
Missing required parameters
Invalid parameter values
Malformed JSON schema

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR_API_KEY")

try:
    doc = app.scrape("not-a-valid-url", formats=["markdown"])
except Exception as e:
    print(f"Error: {e}")
    # Handle invalid URL

401 Unauthorized

Invalid or missing API key.

{
  "success": false,
  "error": "Unauthorized: Invalid API key"
}

Solutions:

Verify your API key is correct
Ensure the Authorization header is properly formatted: Bearer fc-YOUR_API_KEY
Check that your API key hasn’t been revoked

from firecrawl import Firecrawl

try:
    app = Firecrawl(api_key="invalid-key")
    doc = app.scrape("https://firecrawl.dev")
except Exception as e:
    if "Unauthorized" in str(e) or "401" in str(e):
        print("Invalid API key. Please check your credentials.")
    else:
        raise

402 Payment Required

Insufficient credits or payment method required.

{
  "success": false,
  "error": "Payment required to access this resource."
}

Solutions:

Add credits to your account at firecrawl.dev/billing
Upgrade your plan for higher limits
Check your credit usage at firecrawl.dev/usage

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR_API_KEY")

try:
    doc = app.scrape("https://example.com")
except Exception as e:
    if "402" in str(e) or "Payment required" in str(e):
        print("Insufficient credits. Please add credits to your account.")
        print("Visit: https://firecrawl.dev/billing")
    else:
        raise

404 Not Found

Resource not found (e.g., invalid job ID).

{
  "success": false,
  "error": "Crawl job not found."
}

Common causes:

Invalid or expired job ID
Job ID from a different account
Typo in the job ID

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR_API_KEY")

try:
    status = app.get_crawl_status("invalid-job-id")
except Exception as e:
    if "404" in str(e) or "not found" in str(e).lower():
        print("Job not found. Please check the job ID.")
    else:
        raise

408 Request Timeout

Request took too long to complete.

{
  "success": false,
  "error": "Request timed out"
}

Solutions:

Increase the timeout parameter in your request
For large sites, use the asynchronous crawl API instead of synchronous scrape
Split large batch scrapes into smaller batches

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR_API_KEY")

try:
    # Increase timeout to 60 seconds
    doc = app.scrape(
        "https://slow-site.com",
        formats=["markdown"],
        timeout=60000
    )
except Exception as e:
    if "408" in str(e) or "timeout" in str(e).lower():
        print("Request timed out. Consider using async crawl for large sites.")
    else:
        raise

429 Too Many Requests

Rate limit exceeded.

{
  "success": false,
  "error": "Request rate limit exceeded. Please wait and try again later."
}

Solutions:

Implement exponential backoff and retry logic
Reduce request frequency
Upgrade to a higher plan with increased rate limits
Use the delay parameter in crawl requests to respect rate limits

from firecrawl import Firecrawl
import time

app = Firecrawl(api_key="fc-YOUR_API_KEY")

def scrape_with_retry(url, max_retries=3):
    for attempt in range(max_retries):
        try:
            return app.scrape(url, formats=["markdown"])
        except Exception as e:
            if "429" in str(e) or "rate limit" in str(e).lower():
                if attempt < max_retries - 1:
                    wait_time = 2 ** attempt  # Exponential backoff
                    print(f"Rate limited. Waiting {wait_time}s before retry...")
                    time.sleep(wait_time)
                else:
                    print("Max retries reached. Rate limit still in effect.")
                    raise
            else:
                raise

doc = scrape_with_retry("https://example.com")

500 Internal Server Error

Unexpected server error.

{
  "success": false,
  "error": "An unexpected error occurred on the server."
}

Solutions:

Retry the request after a short delay
Check status.firecrawl.dev for service status
Contact support if the error persists

from firecrawl import Firecrawl
import time

app = Firecrawl(api_key="fc-YOUR_API_KEY")

try:
    doc = app.scrape("https://example.com")
except Exception as e:
    if "500" in str(e):
        print("Server error. Retrying in 5 seconds...")
        time.sleep(5)
        doc = app.scrape("https://example.com")  # Retry once
    else:
        raise

Per-Page Errors in Crawls

During crawls, individual pages may fail while the overall job succeeds. These errors are available in the metadata:

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR_API_KEY")

crawl_result = app.crawl("https://example.com", limit=50)

for doc in crawl_result.data:
    if doc.metadata.get('error'):
        print(f"Error on {doc.metadata.get('sourceURL')}: {doc.metadata.get('error')}")
    else:
        print(f"Success: {doc.metadata.get('sourceURL')}")

Retrieving Crawl Errors

You can fetch a detailed list of errors for a crawl job:

from firecrawl import Firecrawl

app = Firecrawl(api_key="fc-YOUR_API_KEY")

job = app.start_crawl("https://example.com", limit=100)

# Wait for completion...
status = app.get_crawl_status(job.id)

if status.status == "completed":
    # Get errors using the API directly
    import requests
    response = requests.get(
        f"https://api.firecrawl.dev/v2/crawl/{job.id}/errors",
        headers={"Authorization": f"Bearer {app.api_key}"}
    )
    errors = response.json()
    print(f"Total errors: {len(errors.get('data', []))}")
    for error in errors.get('data', []):
        print(f"- {error['url']}: {error['error']}")

Best Practices

Implement retry logic: Use exponential backoff for transient errors like rate limits and server errors.

Log errors with context: Include the URL, timestamp, and error details to help with debugging.

Monitor credit usage: Check your remaining credits regularly to avoid 402 errors mid-operation.

Validate inputs: Check URLs and parameters before making API calls to avoid 400 errors.

Don’t retry 400 (Bad Request) or 401 (Unauthorized) errors automatically. These indicate issues with your request that won’t be fixed by retrying.

Always handle the error field in page metadata during crawls, as individual pages may fail even if the overall crawl succeeds.

Getting Help

If you encounter persistent errors:

Check the Firecrawl status page for service issues
Review the API documentation for parameter requirements
Join the Discord community for community support
Contact support@firecrawl.dev for direct assistance

Documentation Index

​HTTP Status Codes

​Common Errors

​400 Bad Request

​401 Unauthorized

​402 Payment Required

​404 Not Found

​408 Request Timeout

​429 Too Many Requests

​500 Internal Server Error

​Per-Page Errors in Crawls

​Retrieving Crawl Errors

​Best Practices

​Getting Help

HTTP Status Codes

Common Errors

400 Bad Request

401 Unauthorized

402 Payment Required

404 Not Found

408 Request Timeout

429 Too Many Requests

500 Internal Server Error

Per-Page Errors in Crawls

Retrieving Crawl Errors

Best Practices

Getting Help