Website analysis - Current API Docs

Website analysis scrapes a company’s website and uses AI to extract structured data: people, addresses, contact information, industry classification, and more.

Enabling website analysis

Add `websiteAnalysis: true` to your request. Providing the website URL is strongly recommended: if it is omitted, the API attempts to discover the site automatically, which adds time.

{
  "name": "Shopify Inc",
  "website": "https://shopify.com",
  "websiteAnalysis": true
}
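
If you build requests in code, the body above can be assembled with a small helper. This is a minimal sketch: `build_analysis_request` is a hypothetical name, and the endpoint and authentication are not covered in this section, so only the JSON body is shown.

```python
import json
from typing import Optional


def build_analysis_request(name: str, website: Optional[str] = None) -> dict:
    """Build a request body with website analysis enabled (hypothetical helper)."""
    body = {"name": name, "websiteAnalysis": True}
    if website:
        # Passing the URL lets the API skip automatic discovery.
        body["website"] = website
    return body


payload = build_analysis_request("Shopify Inc", "https://shopify.com")
print(json.dumps(payload, indent=2))
```

Calling the helper without `website` leaves the field out, in which case the API falls back to automatic discovery.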

What gets extracted

Results appear in the `website` field of the response:

{
  "website": {
    "status": "success",
    "url": "https://shopify.com",
    "scrapedAt": "2024-01-15T10:30:00Z",
    "extracted": {
      "legalName": "Shopify Inc.",
      "descriptions": [
        "Shopify is a commerce platform that allows anyone to set up an online store and sell their products."
      ],
      "addresses": [
        {
          "label": "Headquarters",
          "street": "151 O'Connor Street",
          "city": "Ottawa",
          "province": "Ontario",
          "postalCode": "K2P 2L8",
          "country": "Canada"
        }
      ],
      "people": [
        { "name": "Tobias Lütke", "title": "Chief Executive Officer" }
      ],
      "contactInfo": {
        "emails": ["support@shopify.com"],
        "phoneNumbers": ["+18006578057"]
      },
      "registrations": [
        { "label": "GST/HST Number", "value": "123456789RT0001" }
      ]
    },
    "industry": {
      "naicsCode": "454110",
      "naicsDescription": "Electronic Shopping and Mail-Order Houses",
      "reasoning": "Shopify operates an e-commerce platform..."
    },
    "domain": {
      "url": "https://shopify.com",
      "createdAt": "1996-04-28T00:00:00Z"
    }
  }
}

Extracted fields

Field	Description
`extracted.legalName`	Legal name found on the website
`extracted.descriptions`	Business description text
`extracted.addresses`	Physical addresses found on the site
`extracted.people`	Named individuals with titles (executives, contacts)
`extracted.contactInfo.emails`	Contact email addresses
`extracted.contactInfo.phoneNumbers`	Phone numbers in E.164 format
`extracted.registrations`	Registration numbers mentioned on the site (GST, BN, etc.)
`industry.naicsCode`	NAICS industry classification code
`industry.naicsDescription`	Human-readable industry description
`domain.createdAt`	Domain registration date
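
One way to read these fields from a parsed response is sketched below. It assumes any field may be missing from a given response (this section does not say which ones are optional), so every lookup falls back to an empty value. `summarize_website` and `parse_timestamp` are hypothetical helper names, and the sample is a trimmed copy of the response shown earlier.

```python
from datetime import datetime
from typing import Optional


def parse_timestamp(value: Optional[str]) -> Optional[datetime]:
    """Parse an ISO 8601 timestamp such as "1996-04-28T00:00:00Z"."""
    if not value:
        return None
    # Older Python versions reject a trailing "Z", so normalize it first.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def summarize_website(response: dict) -> dict:
    """Pull commonly used values out of the `website` field."""
    website = response.get("website") or {}
    extracted = website.get("extracted") or {}
    contact = extracted.get("contactInfo") or {}
    industry = website.get("industry") or {}
    domain = website.get("domain") or {}
    return {
        "legal_name": extracted.get("legalName"),
        "emails": contact.get("emails", []),
        "phone_numbers": contact.get("phoneNumbers", []),
        "people": [(p.get("name"), p.get("title")) for p in extracted.get("people", [])],
        "naics_code": industry.get("naicsCode"),
        "domain_created_at": parse_timestamp(domain.get("createdAt")),
    }


sample = {
    "website": {
        "status": "success",
        "extracted": {
            "legalName": "Shopify Inc.",
            "people": [{"name": "Tobias Lütke", "title": "Chief Executive Officer"}],
            "contactInfo": {"emails": ["support@shopify.com"], "phoneNumbers": ["+18006578057"]},
        },
        "industry": {"naicsCode": "454110"},
        "domain": {"createdAt": "1996-04-28T00:00:00Z"},
    }
}

summary = summarize_website(sample)
print(summary["legal_name"], summary["naics_code"])
```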

Status values

Status	Meaning
`success`	Website was scraped and data extracted
`skipped`	Website analysis was not requested
`error`	Scraping failed (site unreachable, blocked, etc.)
`empty`	Site was reachable but no structured data could be extracted

People deduplication

People found on the website are merged with people found in registry data. The top-level `people` field in the response consolidates everyone across all sources, so the same person is not repeated across `website.extracted.people`, `people.directors`, and similar lists.

Website analysis adds 10–20 seconds to the response time. For the best results, provide the website URL directly rather than relying on automatic discovery.
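
Because of the extra 10–20 seconds, a client timeout tuned for fast lookups may be too short. The sketch below prepares a POST request with a more generous timeout using only the Python standard library. The endpoint URL and the 60-second value are assumptions for illustration, and authentication is omitted because it is not covered in this section.

```python
import json
import urllib.request

API_URL = "https://api.example.com/lookup"  # hypothetical endpoint, replace with the real one
TIMEOUT_SECONDS = 60  # assumption: comfortably above the 10-20 s analysis overhead


def build_http_request(payload: dict, api_url: str = API_URL) -> urllib.request.Request:
    """Prepare a JSON POST request without sending it."""
    return urllib.request.Request(
        api_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=TIMEOUT_SECONDS) as response:
        return json.load(response)


request = build_http_request(
    {"name": "Shopify Inc", "website": "https://shopify.com", "websiteAnalysis": True}
)
```

Call `send(request)` once the URL points at the real endpoint and any required credentials have been added to the headers.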
